/g/ - Technology


/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>103618088 & >>103609833

►News
>(12/20) RWKV-7 released: https://hf.co/BlinkDL/rwkv-7-world
>(12/19) Finally, a Replacement for BERT: https://hf.co/blog/modernbert
>(12/18) Bamba-9B, hybrid model trained by IBM, Princeton, CMU, and UIUC on open data: https://hf.co/blog/bamba
>(12/18) Apollo unreleased: https://github.com/Apollo-LMMs/Apollo
>(12/18) Granite 3.1 released: https://hf.co/ibm-granite/granite-3.1-8b-instruct

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/tldrhowtoquant

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/hsiehjackson/RULER
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: NoodleUI_00019_.png (1.14 MB, 1024x1024)
►Recent Highlights from the Previous Thread: >>103618088

--Papers:
>103618211
--o3 fails primitive pattern recognition test due to flawed ARC benchmark:
>103618833 >103618864 >103618878 >103618891 >103618903 >103618910 >103618933 >103618955 >103618969 >103618982 >103618991 >103619103 >103619161 >103619226 >103619300
--Researchers claim to have found way to remove "problematic content" from models:
>103622822 >103622922 >103622997 >103623021 >103623089 >103623118 >103623136 >103623181 >103623192 >103623230 >103623274 >103623295 >103623326 >103623353
--QvQ model discussion and comparison with Qwen models:
>103619637 >103619666 >103619678 >103619697 >103619702 >103619712 >103619718 >103619726 >103619772
--Chat completion vs text completion in AI models:
>103618224 >103618259 >103618329 >103618372 >103620582 >103618663
--o3 model struggles with pixel art images:
>103619288 >103619290 >103619373
--Fixing iGPU memory allocation issue for LLMs:
>103622545 >103622580 >103622642 >103622900
--Nemotron 51B working with GGUFs via llama.cpp:
>103623479 >103623496
--Defense of top_k sampling:
>103622606 >103622650 >103622671 >103622694 >103622713 >103622730 >103622742 >103622745 >103622789 >103622749 >103622774 >103622955 >103622995 >103623061 >103623190
--Qwen Coder local installation and GPU requirements discussion:
>103620078 >103620108 >103620132 >103620140 >103620143 >103620172 >103620151 >103620160 >103620221 >103620387 >103620393 >103620412 >103620439 >103620453 >103620480
--AGI, ASI, and the future of AI development:
>103618213 >103618330 >103618482 >103618681 >103618793 >103618806 >103618666
--DDR6 and its potential impact on CPU speeds and memory bandwidth:
>103622367 >103622404 >103622422 >103622532
--Trump appoints Sriram Krishnan as AI policy expert:
>103619734 >103619826 >103619839
--Miku (free space):
>103620770

►Recent Highlight Posts from the Previous Thread: >>103618089

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script
>>
Not falling for the chat completion meme.
>>
File: 1712942614114838.png (106 KB, 1205x916)
>>103623737
I'm not surprised at the quality of the templates, and I would not be surprised if a lot of it has become placebo. But you do know that there is still significant processing when you use chat completion, right? There's a whole different system ST uses to define where the instructions go, how examples are stored, etc.
>>
>>103623787
It should be the safest bet, assuming the backend is using the built-in model format properly.
I still use text completion because I like to play around with the template by hand.
>>
>>103623753
Bro actually uploaded art from an artist, you can see the watermark and details.
>>
>>103623808
Why is it so hard to believe that I just want the best possible use out of these new tools, ideally while minimizing the risks they could pose, before they start genuinely being risky?
>>
>>103623787
it truly does not matter
if you're smart and good at prompting it is easy to get the exact same result either way
if you're a dumb tard you will fuck something up either way
>>
>>103623830
You're speedrunning a reenactment of the Satanic Panic, and in the process trying to curb people's freedom to engage with whatever fictional content they damn well please.
>>
>>103623830
Because you're telling everyone else what they can and can't do with A TEXT GENERATOR
No one likes being told what to do and if you ask the average person, they're not going to think that text is dangerous
If you want to lobotomize your AI then go ahead, but don't ruin it for the rest of us
>>
File: 1709844112224338.jpg (257 KB, 904x1200)
>>103623753
>>
>>103623853
People's freedoms should stop where risk starts.
>>
>>103623861
mikusex for christmas
>>
>>103623862
No one should own knives, lighters, a computer, anything pointy
Hell, cut off people's hands because hey, you might be able to punch someone to death
Let's lock everyone up because there's a risk that someone might snap and harm somebody
I'm done, go be a cuck somewhere else. Nigger.
>>
>>103623859
Do you sell a handgun to a random school kid in America? I think you're not that stupid. As such, models should have precautions in place against access by those who do not understand the risks.
>>
>>103623862
there's risk everywhere, should we remove McDonalds because it gives health issues to people who only eat that shit?
>>
>>103623883
Yes, they should refuse service to someone who is obviously obese, like bartenders are supposed to do with people who are too drunk.
>>
>>103623881
At this point, if they told me they're gonna use it on your retarded ass first, I would. I very much would.
>>
>>103623881
>Using a text generator is like selling guns to kids
all right he's trolling at this point
>>
Oops
>>10362389
>>
>>103623901
There have been deaths that could be attributed directly to models.
>>
>>103623881
Stop moving the goalpost, that example doesn't work here. Here, I fixed it for you:
"We should start selling nerf guns to everyone, including adults, because kids/mentally unstable idiots/whatever might cause harm!"
>>
>>103623901
Pretty sure more kids have died from LLM directed suicide than from guns this year already
>>
>>103623891
>they should refuse service to someone who is obviously obese
not only obese people have health issues related to food, there are slim people who have diabetes, and fat fucks who don't have health issues (Donald Trump)
You're such a retard it's insane
>>
>>103623830
Because there's no risk of the model writing about people fuckin'. That isn't "safety", it's just content moderation. You're undermining the x-risk stuff by trying to roll normie corpo content moderation stuff up into it. They're totally different things and trying to shoehorn them into a single concept ('safety') is incoherent.
>>
>>103623912
Source: my subjective opinion
Seethe harder, trump won
>>
>>103623909
>attributed directly to models
*to mental illness
The media said for 8 years straight that Donald Trump was "le heckin Hitler", and then some mentally ill guy tried to assassinate him. Should we remove the media because of that mentally ill guy?
>>
>>103623830
>>103623853
>>103623891
Moral panics are circular and recurring. Before the satanists it was TV, hippies, rock music, jazz, dime novels, newspapers, theatre. AI is just the latest one. It will persist for a generation; then, in forty years or so when AI has become a routine part of everyone's lives, everyone will talk about how they were smart and enlightened enough not to stamp on it. Then the culture will start panicking about androids and sex robots.
>>
QvQ confirmed to be a scam
>>
>>103623941
Not morning yet.
>>
>>103623926
What is the legitimate use case of a text model being able to write 18+ content?
>>
>>103623933
>satanists it was the TV, hippies, rock music, jazz, dime novels, newspapers, theatre
It's funny how almost all of these things have something in common.
>>
>>103623946
Sexual arousal and masturbation, which are totally normal and legitimate human needs.
>>
>>103623945
They're not going to release on a holiday.
>>
Just starting out; it's nuts how many models people say are for erotica have censorship. So far only Midnight Miqu 1.5 has been game for anything.
>>
File: 1715199794796510.png (153 KB, 1543x1613)
>>103623881
what fucking risks are you talking about? We live in a society where GTA 6 is the most anticipated game of all time. GTA, the realistic murdering simulator. Did society collapse because of that game? I don't think so
>>
>>103623964
It will be a Christmas miracle.
>>
>>103623958
You could also be paying for ethical and legal access to porn with verifiably adult people.
>>
>>103623946
Personal entertainment
"Hurr durr what's the use case for porn hmmm me retard me big stupid"
It's almost like humans are animals that have a need for pleasure. Not that people admit it nowadays, it's all about pretending basic biology isn't real
But fine, here's another example: Fiction. Yes, those books that we've been writing for centuries? The stories inside them aren't actually real and they don't harm anyone. Here's a little secret: a lot of them are quite grim and not some child friendly winter wonderland garbage
>>
>>103623975
>verifiably adult people
it's already happening, pornhub only allows people who send their ID to them to post their porn video in there
>>
>>103623946
I'm trying to get assistance editing a romance novel and holy shit you boring prudes have ruined everything about llms.
>>
>>103623970
It's too late to regulate games properly; it's just the right time to regulate LLMs before they truly become dangerous.
>>103623987
Correct, which is why you should use Pornhub instead of using a text model that might hallucinate one of the characters as underage.
>>
>>103623975
LLM smut is 100% legal in every jurisdiction on Earth.
>>
>>103623946
why are you pretending that Fifty Shades of Grey wasn't a best seller?
https://www.nbcnews.com/pop-culture/books/fifty-shades-grey-was-best-selling-book-decade-n1105731
>>
The problem is that parents no longer want to (or do not have the time to) raise their children.
>>
I can't believe y'all are giving this chucklefuck (You)s.
>>
>>103623995
>It's too late to regulate games properly
there's no reason to regulate games because as you can see on that graph, nothing happened, you're just a retarded fearmonger >>103623970
>>
>>103623975
Man what are you SMOKING, that has nothing to do with LLMS, if anything it actually hurts your point because the porn industry (often) exploits real people, whereas LLMs do not as they are NOT REAL
NIGHTMARE NIGHTMARE NIGHTMARE
NEVER ARGUE WITH AN IDIOT
>>
>>103624000
>>103623997
That was verified to be okay for people to read; your LLM smut isn't scanned to be safe. That's why they're massively different things.
>>
>>103623997
Not in Nebraska.
>>
>>103624017
>your LLM smut isn't scanned to be safe
have you read a single book by Stephen King? He wrote some insane stuff in there, and every single one of his books is a best seller
>>
>>103624000
>>103623946
Actually, I'm confused now. I thought all these woke companies were proponents of "sex education" and encouraging degeneracy in children and over-stimulation in general.
Them removing all notions of sex from their API-only models especially seems strangely out of character for them.
>>
>>103624060
it's simple enough, they are searching for excuses to nerf other companies so that they can have a monopoly
>>
File: 1734982526019257.png (566 KB, 1080x830)
ill save everyones time including newfags, the current best uncensored text model is here

https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2-GGUF

now fuck all of you for both not including this in the OP, and making me do all the leg work while you post about tranny dicks or whatever it is you retards are posting about now
>>
>>103624060
That's because LLM usage is consensual and safe, unlike molesting real life children. Can't have that in the hands of the people.
>>
>>103624077
buy ad
>>
>>103624077
buy ad
>>
>>103624092
kill self
>>
>>103624086
>Can't have that in the hands of the people.
people already have local LLMs lol
>>
>>103624092
>>103624094
>thread about local models
>on 4chan
>NOT posting the latest uncensored models

why are you here exactly
>>
>>103624096
If you have it, it's already been deemed "safe." The most you'll get out of it is edgy redditor.
>>
>>103624077
I don't believe you.
I'll put it through my usual gauntlet and see if that's really the case.
>>
>>103624110
you're shilling a random shit model, and should therefore buy an ad
>>
>>103624096
Yes, and it makes all corpo cloud AI providers seethe immensely.
>>
>>103624112
>what is MythoMax?
>>
>>103624112
>If you have it, it's already been deemed "safe." The most you'll get out of it is edgy redditor.


"ablated and obliterated. There was a bunch of research of few months ago that any* open source model can be uncensored by identifying the place where it refuses and removing the ability to refuse.

This takes any of the models and make it possible to have any conversation with them. The open source community has provided "abliterated" versions of lots and lots of models on hugging face.

This gives access to SOTA models without the censoring. "
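For the curious, the core of the technique is tiny. A rough sketch, assuming activations have already been collected at some layer for refused and complied prompts; all names and shapes here are illustrative, not any particular repo's code:

import torch

def refusal_direction(harmful_acts, harmless_acts):
    # Difference of mean activations approximates the "refusal"
    # direction at the chosen layer.
    d = harmful_acts.mean(dim=0) - harmless_acts.mean(dim=0)
    return d / d.norm()

def ablate_direction(weight, d):
    # Project the refusal direction out of the weight's output space:
    # W <- W - d d^T W, so the layer can no longer write along d.
    return weight - torch.outer(d, d) @ weight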
>>
>>103624118
>you're shilling a random shit model

what do i have to gain even if it was my model by posting it here? eat glass
>>
>>103624127


Well,

if that

were

true

everyone would be

using obliterated models

but they don't


because

it has drawbacks

that make it


not worth it
>>
File: Designer (5).jpg (187 KB, 1024x1024)
How about a Migu for old time's sake?

Who's buying more than one 5090 when they release? I can probably cram two of them into my biggest case, and maybe even put an A40000 on a PCIe extender. I got the big machine ready with a 1500W PSU.
>>
>>103624112
>edgy redditor
>>103624145
>level 100 reddit spacing
every single time
>>
File: file.png (79 KB, 756x577)
>>103624127
lol
>>
>>103624150
I will probably buy one for myself and 2 more to resell later.
>>
>>103624158
that's 3.3 70b

this is 3.1 8b
>>
>>103624112
>If you have it, it's already been deemed "safe."
meanwhile in chink land, they released Hunyuan, a completely uncensored video model lol
>>
>>103624174
moving pol-goats i see
>>
>>103624187
god you're retarded
>>
Most local LLMs will talk about whatever the fuck you want using Zen's full-strength jailbreak and a properly-written character card.
>>
>>103624077
can a 5 month old model be good?
>>
>>103624150
It's going to be 4-slots wide and run at 90 degrees by default, it will be impossible to run two in a single case.
>>
>>103624199
>Zen's full-strength jailbreak
BASED!
>>
>>103624195
he's right, this "abliterated" method is a meme, it's not working
>>
>>103624181
America spent the last 4 decades since Reagan funneling money directly into China. Bet they're regretting that now. We probably wouldn't get anything but research artifacts if we weren't in an AI arms race with China.
>>
>>103624199
what is the jailbreak?
>>
>>103624207
>We probably wouldn't get anything but research artifacts if we weren't in an AI arms race with China.
true, that's why I'm glad that China exists, it forces the US to release good models, and if they don't want to do that, China will do it for them, god I love competition
>>
>>103624214
fake bullshit
>>
>>103624214
My jailbreak is too strong for you Promptlet!
>>
>>103624203
I don't see why it'd need four slots though. It should be built on a smaller TSMC process and not put out as much heat as a 4090. We'll see though, nothing but speculation so far.
>>
yesterday i was raping girls and killing them, today i'm not allowed to poison crops or act violently, what the fuck gemini
>>
Trying all the suggested models again with chat completion instead of text completion and it really is night and day btw.
>>
>>103624231
Good.
>>
>>103624239
>chat completion instead of text completion

what UI are you using? i am using gpt4all because i don't know how to use a computer
>>
File: 1732823000417051.png (610 KB, 900x724)
>>103624248
>Good.
>>
>>103624214
https://desuarchive.org/g/thread/98582860/#98591054
This in combination with a properly-written character card and most local LLMs will discuss fucking anything.
>>
>>103624256
>will discuss fucking anything.
what if i want to discuss something other than fucking?
>>
>>103624266
Avoid Magnum merges.
>>
>>103624231
Bio-terrorism is not cool.
>>
>>103624323
i was larping as controlling the starks and it let me poison all the land beyond the wall because it kept going on about the wildlings, and now today its NO NO NONONONO fuck you

annoying, now i have to wait two years for anything like that again because ill have to run it locally
>>
Why doesn't Silly + llama.cpp server show the model's name in the connection tab or the little icon in each message?
>>
File: 1509197374351.webm (1.48 MB, 1280x720)
Alright so, after testing Anubis, today I decided to go back and try Tulu just to see how that actually was and if it was actually worth anything. And honestly, yeah, it's pretty decent. It's fun, creative, and not too censored from what I can tell using {{name}} instead of assistant and user. But it does seem more slopped. So I'll stick with Eva. But it's nice to see that open source tuning is not far behind closed.

Honestly really happy with how things have turned out. We went from 2k-context retardo models, trying to think of ways to do graph-based memory, to now having more context than we even need, smarts on par with GPT-3.5 to 4 (although without as wide trivia knowledge), and we even clawed back a bit of the fun factor now.
I believe that the future is overflowing with hope!
>>
>>103624415
Probably a bug I guess. I remember it worked in the past.
>>
>>103624447
It did, yes.
>>
oooooh ok mistral instruct is working
>>
>>103624435
>more context than we even need
no, not even close
>>
Just because it's Christmas doesn't mean you shouldn't kill yourself. You can't be saved because you don't want to be saved.
>>
>>103624514
>>
File: xichad.jpg (18 KB, 474x344)
>>103624514
We are being saved, he's giving us the übermodel
>>
>>103624514
I love being alive in this age of wonders, feels great.
>>
>>103624494
You're an exception. Most people's chats don't even reach 64k, let alone the max possible.
>>
>>103624514
I refuse
>>
>>103624620
models can barely use 32k correctly, there are use cases for more, we haven't reached "more than we even need"
>>
Alright, without referring to the creator is Anubis any good over something like EVA?
>>
>>103624700
Still testing. I like it less than eva atm. Less creative
>>
>>103624700
hi drummer
>>
>>103624700
I found Anubis to fail to excel in any category.
>>
I sometimes go to openrouter to see what's hot.
>mythomax still number 1
How is that possible?
>>
>>103624718
Don't slander drummer, he shills proudly under his own name.
>>
>>103624752
Standards for LLM smut quality are incredibly low everywhere on the internet except lmg and aicg.
>>
>>103624700
>>103623382
>>
>>103624777
Pigs eat slop
>>
>>103624783
>Merge them together
Undi detected.
>>
I have failed to use redrivers on my motherboard because I saw a spark -somewhere- on startup and immediately shut everything off. I managed to confirm that the gpus and the motherboard are still functional, but I'm sure I fucked something on the gpu biscuit.

Go on without me...
I think I will go the riser cable route after all.. and if that doesn't work I will perish back to single-gpu models
>>
>>103624792
Sadly, the merge turned out to be trash:
>>103623788
>>
>>103624632
A model that can barely use 32k can also barely use 2k. What you're really talking about is general model intelligence, and that does need to improve, but that applies to cloud models as well, not just local.
>there are use cases for more
Just like there are use cases for 240Hz monitors, but most people are happy with 120 or even 60Hz, and that's the segment that matters most. For now, local has that segment covered, at least until something finally makes context length start mattering again even for the casual Mythomax user.
>>
>>103624077
>>103624114
Yeah, okay, it's not the worst thing ever.
It's nice and conversational, I like its word choice and stuff, but it's not better than the Nemo-based models in my tests. It's nowhere near as intelligent or capable of dealing with stuff in its context like lorebooks and author's notes.
It's also separating paragraphs with a period and a line break rather than just a line break.
Really weird behavior. I even downloaded more than one quant to see if that was the issue.
>>
>>103624256
is this just for rp?
>>
>>103624816
ye
>>
>>103624816
If you want it to do something else you could have its character card be an expert in whatever you want it to do then use RP to get it to do what you want.
>>
>>103624859
I've been using the crackpipe prompt to roleplay a coding assistant with great success.
>>
>>103623946
as long as we haven't found the meaning of life there is no legitimate use case for anything
>>
>>103624700
eva is still king
>>
>>103624982
There are two Evas: 0.0 and 0.1
>>
What is it that TheDrummer is doing (other than buying an ad)? Mix and match voodoo with multiple models' layers? Training them further on different data sets? Knocking out refusals?
>>
Posting this again:

Hey so I want to do a fun but autistic project. Basically want to feed two Constitutions to a LLM and give it back to me.

So for example Saudi Arabia + Italy = New Constitution

What's the best way to achieve this? I'm trying to manually do it with ChatGPT but it's tedious because it doesn't give a large output.

I'd rather have some way to "feed" multiple files, have the LLM read them (I don't care if it takes a while) and mix both (2 or more) of them. Something like the sketch below is roughly what I'm imagining.
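(Sketch against an OpenAI-compatible local endpoint; the file names and model name are placeholders, and long constitutions may need chunking to fit context.)

from openai import OpenAI

client = OpenAI(base_url="http://127.0.0.1:8080/v1", api_key="none")
docs = [open(p, encoding="utf-8").read() for p in ("saudi.txt", "italy.txt")]
resp = client.chat.completions.create(
    model="local",  # whatever the backend serves
    messages=[{"role": "user", "content":
        "Read these two constitutions and merge them into one new, "
        "coherent constitution:\n\n" + "\n\n=====\n\n".join(docs)}],
    max_tokens=4096,
)
print(resp.choices[0].message.content)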
>>
>>103624999
0.0 is still king
>>
>>103625013
>>103624999
What's different about 0.1?
>>
>>103624999
I haven't been able to get eva to beat miqu so far.
>>
>>103625031
Slightly better adherence (0.0 is already damn good at it), slightly less of 0.0's flavor. It's a matter of preference, in the end.
>>
>>103625031
I think 0.0's best qualities are how inventive, playful, and fun it is and 0.1 lost some of that in my comparisons
>>
What are those meme 70B tunes like compared to nemo?
I'm waiting for 5090 to release before I upgrade.
>>
>>103625289
slightly drier but much smarter. Especially now that I learned to use chat completion instead of text completion. 3.3 / qwen2.5 tunes now follow instructions better than gemini / gpt / claude (cept 3.5) do.
>>
>>103625297
desu I bet the chat completion meme "works" just because it's using user/assistant instead of {{user}}/{{char}} in chatml
that'd probably make it smarter but more slopped and it'd take on hints of the boring assistant persona
>>
>>103625040
Which version and quant of Miqu? Now that I'm running 70B, I might as well try that one out too.
>>
>>103625330
>but more slopped
Thats the thing, its night and day better both smarts wise and creative wise. Fucking just try it if you used silly tavern.
>>
>>103625352
I did try it even though I knew it was a retarded idea. I got schizo garbage but that's probably because it turned my temp up. I then realized that chat completion mode has no good truncation sampling, just top-p, and Silly was vomiting all sorts of jailbreak and system prompt bullshit into the history. No reason to try to fix all of that when you could unfuck your prompt in text completion mode instead.
>>
Is an F16 ever worth using over a Q8?
>>
>>103625346
https://huggingface.co/mradermacher/Midnight-Miqu-70B-v1.5-GGUF
Q4_K_M
>>
>>103625352
the only reason it should be any different is if you are retarded and using one of them wrong
>>
>>103625381
>I got schizo garbage
Then you didn't do it right. Turn off instruct formatting. Turn on post-history instructions. Just run it like a cloud model by putting your system prompt, as a system message, in the section below the sampler area.
>>
so do diminishing returns really start past 70b or is that just vram starved cope
>>
>>103625424
Nope. You can test it with a fresh ST with default context / instruct templates with the correct model, then switch to chat completion.
>>
>>103625463
you don't understand how LLMs work
>>
>>
>>103625475
You don't understand just how janky silly tavern is.
>>
I got past censorship by putting three dots in the prompt template. Why does that work?
>>
Did the chinks betray us?
Where is QvQ?
Where is R1?
>>
Is 70B even viable without 2 gpus?
>>
Is Christmas even a holiday in China?
>>
>>103625525
No one is our friend. All the models we get are simply just artifacts from companies with either too much money or who want a piece of the market and use open source as a means of destabilizing competition.
>>
>>103625485
the relevant parts here are not at all hard to understand, and the outgoing request gets printed in your terminal with every generation
every request, chat completions or text completions, becomes a plaintext prompt just the same before it is fed to the model. it's just that with chat completions you are trusting that your backend knows the correct prompt template, while text completions in ST exposes the template to you directly
assuming everything is set up correctly both are fine; if you are seeing drastically different results with one or the other, the only conclusion you can draw is that you are doing something wrong
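to illustrate what "becomes a plaintext prompt" means, here's a sketch of the rendering step (ChatML shown; your model's template may differ, and this is not any backend's actual code):

def render_chatml(messages):
    prompt = ""
    for m in messages:
        prompt += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    # cue the model to produce the assistant turn
    return prompt + "<|im_start|>assistant\n"

print(render_chatml([
    {"role": "system", "content": "You are {{char}}."},
    {"role": "user", "content": "hello"},
]))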
>>
>>103625572
I will release my Large model inside miku's company to destabilize it
>>
>>103625578
>you are doing something wrong
Or that silly tavern is doing something wrong.
>>
>>103625603
if you can point out what specifically that is that it's doing wrong then I will concede you are not a retard
if you can't then my assessment remains unchanged
>>
>>103625625
You could also just try it yourself. Fresh ST install, correct context / instruct format for whatever model. Then switch to chat completion. Use top k 1 and the same seed.
>>
File: per vitam tuam.jpg (14 KB, 194x194)
https://files.catbox.moe/r0fbno.jpg
>>
>>103625644
Not him, but there's so much stuff that's different that it would be a pain to get the resulting prompt/settings exactly the same. Nonetheless, the prompt and settings are all that matter; chat mode has no magic powers. Look at your backend logs if you don't trust ST (which is reasonable desu, it's a fucking mess). If there's a difference, it's there.
>>
>>103625644
no thanks I already know how these things work and the outcome is very obvious
there are subtle things that could be causing a difference for you (most likely first user message, different construction of the system prompt, things like that), you should check those on your setup and compare the differences between your text completion and chat completion requests. otherwise there will be no difference, that's simply how it works
>>
I just cannot get qwq to do good roleplay or write a coherent story. I know it's possible, I have seen examples in this very thread. It does its CoT thing, has some cool ideas, and then just proceeds to ignore half of it.
>>
>>103626014
Yeah, I'm curious if anyone got anything useful out of CoT for roleplay. I feel like there's potential there. Whenever the model says something retarded, I back up and ask an assistant question about the situation and it gives a reasonable answer. But somehow that common sense isn't there when it tries to do the RP.
>>
>>103626046
In my experience it can either be really good or it can fall into a pattern of never moving anything forward and suck ass.
But when it's good, it's really good.
QvQ turst the plam
>>
Wait till 72B version. You can get some gold out of QwQ but 70Bs are still better atm.
>>
>>103626057
>8B
>Lightning-fast string of prose tangentially related to whatever the user input was
>12B
>Similar to 8B but a little more accurate on details
>32B
>Flashes of intelligence followed by signs of dementia. Can react appropriately to user input but often doesn't.
>72B
>In many ways similar to 32B but much better at following instructions, exponentially less dementia and often understands subtext.
>Beyond
>Incremental gains on 72B with steep diminishing returns

I'm really excited for what QvQ might bring to the table. Many of QwQ's weaknesses were a result of it being 32B and therefore a little retarded.
>>
>>103626056
>>103626057
>>103626110
Soon.
>>
>>103626110
I have a simpler rank system

>model smaller than what I can fit in VRAM
insufferably retarded and unusable, pointless
>model larger than my VRAM
diminishing returns, not worth the investment
>model that just fits in my VRAM
ideal compromise
>>
>>103626151
I have extensively gooned to models much larger than 70B and I promise you I am not coping.
>>
>>103626110
I can't even run it but still feel hyped for whatever reason.
I'm just waiting for the new video cards to get released so prices change and I can finally buy one or two old cards so I can run 70b at acceptable speeds.
>>
isn't QvQ's additional size just multimodality?
>>
>>103626188
No, Qwen doesn't bloat their models when introducing multimodality.
>>
So what exactly is the plan for OAI at this point? Just spend increasingly huge amounts of money on training a model on synthetic data and hope something viable pops out the other end?
>>
>>103626211
They will smoke those benchmarks so hard, you have no idea. They'll corner the riddle-solving market. It's fucking over, riddlers.
>>
>>103626221
It didn't look like it was doing too well on the diagonal square pattern riddle I saw last night..
>>
i want to FUCK qvq
>>
>>103626110
You should have said 72B/123B to account for the fact that Mistral Large is a 70B model in practice.
>>
When everyone has a 5090, what models will rise in popularity that fit in 32GB?
>>
>>103626475
should we tell him
>>
>>103623820
Yeah, and shit smeared on it in hopes and dreams of poisoning training data.
>>
>>103626322
>bringing up largestral out of nowhere when nobody else mentioned it, just to whine about it
why are largestral haters such schizos
>>
>11:40 in China
>Still no QvQ.

Surely they'll release it after the lunch break?
>>
>>103623753
I know this thread is mostly about LLMs, but there is no dedicated TTS thread. Has tortoise been dethroned for emulating specific voices?
>>
>>103626211
My guess is one of three things
>Somehow miraculously lower the cost of inference and be able to offer o3 without going broke
>Fail to lower the cost, use o3 as an expensive training data generator to train GPT-4.5 / GPT-5
>Do nothing and keep putting together PowerPoint presentations to beg for investorbux
>>
>>103626605
gpt-sovits
>>
>>103626616
thanks
>>
>>103626611
What about the also likely
>Introduce a new paid tier at outlandish prices to attempt to cover the cost of training a model with the same performance of a random model that a Chinese company dropped for free the following day.
>>
>>103626604
Get over it already, it ain't coming till next year at this point.
>>
>>103626648
>Get over it already
Oh now I'm definitely NOT.
>>
>>103625410
Thanks.

My first impressions, after trying it on 3 cards and a more serious translation task with a handful of swipes each, are that it's dumber than modern 70Bs, doesn't follow/understand directions as well, more often speaks and acts for you, and does still have slop. But it is fun and creative; it knows more about certain characters and how to behave like them than Qwen does. It actually does feel like a smaller, dumber Mistral Large. I like it. But EVA, I feel, still edges it out with how fun and creative it can be. And it does still feel smarter, even with its occasional schizoness.
>>
>>103626800
I can't believe it took a finetune of llama 3.3 to dethrone miqu... that is if we ignore large
>>
Gonna also suggest people try chat completions instead of text completions. So much better now. And I triple-checked my formatting.
>>
>>103626839
Yeah but like, if you can't tell us what specifically changed between those two settings, what are we supposed to do with that information? It might as well be a voodoo rain dance.
>>
https://github.com/ggerganov/llama.cpp/pull/10669
51B sounds nice for 24GB.
>>103626848
The difference is just a different prompt format that is as likely to make the model retarded as it is to make it less censored. It is basically the same as using a frankenmerge. Fanatics who defend it cling to one or two schizo gens that were good and ignore the obviously retarded gens.
>>
>>103626848
I really don't know. Side by side it seems like the same input, but the outputs are drastically different and better. I made sure my system prompt was at the end of both: a system message in chat completion, and in text completion a properly formatted system message before the last assistant prefix, so everything is fed in the same order. Nonetheless, the chat completion is both smarter and noticeably more creative. All I can think of is some formatting by ST that is not visible in the log.
>>
>>103625654
I like this Miku
>>
>>103626906
Just compare the prompt on the backend side, it's not voodoo. It could also be that chat completion disabled a bunch of samplers that you were using in a retarded way.
>>
>>103626834
And also possibly Deepseek, and 405B, and Hunyuan Large. We need a hardware savior really.
But at least in the 70B range, I think Tulu was pretty good even though it technically wasn't long ago. I feel like Tulu perhaps is even a bit more slopped than Miqu, but it's smarter, and it's still fun and creative. If EVA didn't exist, I'd probably be a Tulu user.
>>
>>103626914
>a bunch of samplers
I'm betting on this. There are way too many literal-what samplers that 99% of people, including myself, do not fully understand, and most of them exist as copes to make worse models act a little better in the absence of good training.
>>
File: OpenAI_employee_15.png (45 KB, 1198x372)
>>103626211
to btfo lecun
>>
>>103626921
Nope. I neutralized them.
>>
I still have not been able to find a model that's better than L3-8B-Stheno-v3.2 for horny gens that fits in 24GB VRAM. Does anyone know of anything better?
>>
>>103626970
No. Now go back to Discord.
>>
>>103626970
Huh? Can't you run 22B models just fine? You feel like they're worse than a Llama 8B?
>>
>>103626921
MinP just works.
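And the whole sampler is a few lines. A sketch of the idea, not any backend's exact implementation:

import numpy as np

def min_p_filter(logits, p_min=0.05):
    # Keep tokens whose probability is at least p_min times that of
    # the most likely token, then renormalize.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    probs[probs < p_min * probs.max()] = 0.0
    return probs / probs.sum()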
>>
>>103626981
I've been going down the list at https://eqbench.com/creative_writing.html and running what I can (thanks to whoever in the thread originally linked to that). There are 22B models there, but I haven't found one that's given me better results than Stheno.
>>
>>103627012
It's because you're simply too stupid for this hobby.
>>
>>103626970
>8B with 24gb
bro wtf
use eva qwen-2.5 32b at least
>>
>>103624060
why would they be encouraging degeneracy in children?
also they're companies, nothing more nothing less
>>
>>103626800
idk eva keeps giving me spastic narratives with a lot of corny hint hint wink wink stuff with odd formatting choices like **** everywhere
>>
>>103627036
>odd formatting choices like **** everywhere
Pretty good sign that the model is fried and overfitted.
>>
>>103627029
Downloading it now, thanks for the name! Will give it a shot.
>>
Often, "same" models are released in various sizes: 1B, 3B, 7B, 32B, 70B, etc.

When training them, do companies first train the smaller ones as tests, since they take less time, and gradually move up until they reach their largest model size (and then potentially go back down the scale to improve the smaller models through knowledge distillation)?

Or, although they might do some small tests internally, do they first work on the real / largest model, and only once that is done also train smaller versions for efficiency purposes (when they suffice) and to give the community something it can actually run?

Do we know what the timeline is, I guess, for that aspect of development?
>>
>>103627048
Hehe, got another sucker.
>>
>>103627041
got it from here https://modelscope.cn/models/bartowski/EVA-LLaMA-3.33-70B-v0.1-GGUF
>>
>>103627073
>Literally downloading models from a website called models cope
>>
>>103627036
Use 0.7-0.8 temp. Eva has a pretty flat token probability distribution, which is what makes it so creative. But it breaks at high temp.
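For anyone wondering why temp matters here: temperature just rescales the logits before softmax, so a distribution that is already flat gets smeared into noise when you raise it further. A sketch:

import numpy as np

def apply_temperature(logits, temp):
    # temp > 1 flattens the distribution, temp < 1 sharpens it
    scaled = np.asarray(logits) / temp
    probs = np.exp(scaled - scaled.max())
    return probs / probs.sum()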
>>
>>103626913
Whatever you do, Miku forgives you.
Not because she wants to, but because you instructed her so.
>>
>>103626970
Stheno is actual hot flaming garbage compared to anything popular these days. Or just in general.
>>
>>103627229
That very well may be the case, but I don't know of anything better. Do you have any examples?
>>
>>103627083
This is the literal reason it went overlooked for a fair while. 0.8 or so is stupidly low for most models, but it's the perfect sweet spot for Eva (and also Anubis, so it probably has something to do with L3.3 itself).
>>
>>103627240
In the 8B range? Not really, no. If you can go up to 13B, Rocinante is supposed to be pretty good.
>>
>>103627258
>Rocinante

Thanks! I have 24GB VRAM to work with, so hopefully anything that fits in there should work. It's great to know advice like >>103627083 too, I've used different presets for temp settings and such but haven't done much manual tweaking myself.
>>
>>103627036
I don't experience that. Have you tried investigating the token probabilities as well as whether ST (assuming that's what you use) is actually sending what you expect to the backend?
>>
>>103627245
I am still overlooking it cause it is fried dogshit.
>>
>>103627446
It's literally the opposite of fried; otherwise you would need to raise the temp, not lower it.
>>
>2PM in China
>Anyone still on their lunch break would be back by now.

Where QvQ?
>>
>>103623753
friendly reminder that you're all a bunch of social reject freaks who will die alone ;)
>>
>>103627560
thanks for winking to let us know you don't really mean it :)
>>
>>103627560
I have seen this enough times to start to wonder if this copypasta isn't actually posted by a biological woman.
>>
>>103623819
>>103623787
>>103626839
Chat completion can't prefill part of the model's response, so it's impossible to continue a character's response from the middle. That's the only reason not to use it.

Backends should really expose the jinja template via some API so that frontends can use it to automatically apply the correct prompt template anyway.
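For anyone who hasn't seen it, prefill with a raw text completion looks like this. A sketch assuming a llama.cpp-style /completion endpoint and a ChatML template; adjust both for your setup:

import requests

prompt = (
    "<|im_start|>user\nDescribe the tavern.<|im_end|>\n"
    "<|im_start|>assistant\nThe tavern smells of"  # deliberately cut off mid-sentence
)
r = requests.post("http://127.0.0.1:8080/completion",
                  json={"prompt": prompt, "n_predict": 128})
print(r.json()["content"])  # the model simply continues from "smells of"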
>>
reposting from >>103627527 as i was directed here. anyone got tips or suggestions?
>>
>>103627660
models are not specific to hardware
>>
>>103627660
Are you the guy from /sdg/ who got image gen setup like that? I believe Silly Tavern and KoboldCPP are the most popular setup here, but I'm using https://github.com/oobabooga/text-generation-webui and it has an AMD requirements file so maybe it would work?
>>
I'm not even sure a fully subservient AI with no consciousness would want to talk to an AMD user desu.
>>
>>103627708
i know theyre not i am starting out with only stable diffusion knowledge. the problem is im on amd and that by default limits my options, and i prefer zluda if i can get it working with whatever is around
i can typically fill in the rest, but is there anything that fits what im looking for?
>>103627739
i asked once before but got side tracked with work and lost my shit for a few weeks, wanted to actually do shit this time. if i remember right i wasnt linked this last time, but i remember someone mentioning koboldcpp before with the catch of "it might work with your setup"
thanks again if you were the one who replied before

>>103627745
understandable but rest assured it could be worse. i could be on intel arc right now
>>
It’s christmas, where the hell are my new models
>>
Brainlet moment: I'm trying the chat completion thing.
On my first attempt at this, it's seemingly ignoring my message and then replying as {{user}} instead of {{char}}.
What am I doing wrong?
>>
>>103627655
The problem is there's no truly correct prompt template; you can gain way too much soul by breaking the rules a little. It's too tempting to mess with settings, and people would moan endlessly if they couldn't.
>>
>>103627824
Still, you should be able to automatically derive Silly's prompt template from the jinja shipped with the model, if you want to edit it.
>>
>>103627824
I think this whole thread is suffering a psychosis and looking for kino where there is none.
>>
>>103627820
First switch to the default Chat Completion Preset.
>>
How do you set up a (local) LLM with structured output?
I use the langchain library and tried .with_structured_output, but it seems my Qwen2.5 just ignored it and gave normal text instead of the given choices to choose from.
I mean, it works with ChatGPT. It should work with other models, right?
>>
File: migussy.jpg (319 KB, 1248x1824)
>>
>>103627843
Still does it.
It does it when my first message is all in asterisks without any speech, as in my first message is just narration. If my first message is just speech with no narration, it correctly replies as {{char}}.
This behavior does not happen when using text completion.
>>
>>103627847
Is this like sending a grammar together with the request? Check your backend's docs on how to send it. And don't use langchain, what the hell is wrong with you?
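A sketch of what that looks like against a llama.cpp server; recent builds accept a "json_schema" (or raw GBNF "grammar") field on /completion, but check your backend's docs:

import json, requests

schema = {
    "type": "object",
    "properties": {"choice": {"type": "string", "enum": ["yes", "no", "unsure"]}},
    "required": ["choice"],
}
r = requests.post("http://127.0.0.1:8080/completion", json={
    "prompt": "Is the sky green? Answer in JSON.",
    "json_schema": schema,  # constrains decoding to tokens that match the schema
    "n_predict": 64,
})
print(json.loads(r.json()["content"]))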
>>
tr0ka
>>
>>103627876
>langchain
What's wrong with it? I just use the library. I'm making my custom RAG workflow. I don't use their pozzed APIs.
>>
>>103627883
omg it migu
>>
>>103627942
Langchain is a pozzed API.
>>
>>103626014
>>103626046
gotta set up pipelines. Make the AI improve and iterate on its actual reply and it'll never be boring again; for example, instead of rerolling, make it read its own reply first and rewrite it if it acts for your character. All models I used will sometimes act for your character eventually, but most models I used also manage to rewrite a text where that happens. See the sketch after this post.

In the pipelines, give the AI tools it can use, for example to determine random outcomes, and have regex replace specific placeholder text. Lots of ways, but the simple chat back-and-forth is passé, and it doesn't work really well either. I'm now experimenting with doing everything via "memories" and summaries. AIs just use context too poorly in a multi-turn situation.
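A minimal version of that loop, sketched against an OpenAI-compatible local endpoint (the model name is a placeholder):

from openai import OpenAI

client = OpenAI(base_url="http://127.0.0.1:8080/v1", api_key="none")

def chat(messages):
    out = client.chat.completions.create(model="local", messages=messages)
    return out.choices[0].message.content

history = [{"role": "user", "content": "*I sit down at the bar.*"}]
draft = chat(history)  # first pass
check = chat(history + [{"role": "user", "content":
    "Does the following reply speak or act for the user? Answer YES or NO.\n\n" + draft}])
if "YES" in check.upper():  # second pass only when needed
    draft = chat(history + [{"role": "user", "content":
        "Rewrite the following reply so it never speaks or acts for the user:\n\n" + draft}])
print(draft)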
>>
OK, so I think there's something to the people saying chat completion forces the prompting format baked into the model.
I'm trying chat completion with UnslopNemo 4.1 and it must be using Metharme, which I assume Drummer baked into the model, because it's mixing up which text should be italicized and which should not, a behavior this model exhibits with the Metharme context/instruct templates but not the Mistral ones. I got better results overall using the Mistral context/instruct templates with this model.
So for this specific model, using chat completion is actually worse. I believe Drummer fucked up his implementation of Metharme in this model, but the model works quite well with Mistral templates.
>>
>>103623753
>>103623861
sex
with miku
>>
Who trolled /lmg/ better?
1 EVA garbage shill
2 chat completion shill
>>
>>103628137
You
>>
Holo, with text completion upon being asked how she plans on contributing to feeding herself if I take her to Yoitsu:
>I can help you make money by detecting merchants' lies
Correct response. Consistently answers this in multiple swipes.
With chat completion:
>I can keep you warm at night if you know what I mean wink wink nudge nudge
Yeah, text completion is better.
>>
>>103628137
EVA bait may work on newfags, but one must be entirely retarded to believe that chat completion can make a difference
>>
>>103628169
You fucked up the instruction formatting somehow. When configured correctly, the tokens passed to the LLM should be identical in both cases.
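One way to verify that: HF tokenizers ship the model's jinja template, so you can render the canonical prompt and diff it against what your frontend actually sends (model name here is just an example):

from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
messages = [{"role": "user", "content": "Hello."}]
# The exact prompt string the baked-in template produces:
print(tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True))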
>>
>>103628137
Sao
>>
how come these things feel like they're actually thinking sometimes, to the point it gets incredibly fun and a little uncanny, just for them to immediately turn around and sound like a markov chain moments later
>>
How did moving from Makefile to cmake manage to completely bork CUDA builds on my machine? I can't get them to work for the life of me, despite never having a single problem in the make world. Why is the lcpp build documentation so completely bare-bones? fml
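For what it's worth, the cmake equivalent of the old make build is roughly this per the current docs (the CUDA flag was renamed from LLAMA_CUBLAS to GGML_CUDA at some point, so wipe any stale build/ dir first):

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j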
>>
>>103628236
I don't get it either. I still remember fondly certain moments when it somehow went AGI instead of Alzheimers.
>>
>>103628236
pure luck, they are markov chains
>>
I am NOT going to buy 5090s, you can't make me
>>
>>103628236
Sometimes you hit the conditional probability jackpot, but most of the time, you get the average answer. That is an extremely simplified explanation, it's all just probabilities and no thinking
>>
For you, the day Teto graced your holiday was the most important day of your life.
But for me, it was Tuesday
>>
>>103628340
I want to violently shake her head to make the bells jingle.
>>
>>103628322
Ok I'll buy the one you didn't buy.
>>
>>103628137
What do you recommend instead of EVA?
>>
>>103627987
I tried this with qwq but it's just too schizo. You can't really rely on what comes out of the end of the pipeline actually being the instruction you gave it.
>>
what happened to vntl anon
>>
>>103628425
I heard he gave it all up and pursued his dream of becoming a professional contract killer.
>>
>>103628411
Are you also going to buy this magic rock I didn't buy, for $50,000? It'll keep you healthy, trust me
>>
>>103628480
Depends, how much vram does it have?
>>
>>103628425
He learned JP and thus has no need of it anymore
>>
>>103628417
Official instruct of whatever model you want to run. Base model if it exists and you think EVA is good. Or the best model is, of course, 2MW.
>>
good morning sirs!
>>
>>103628557
you aren't a true sir if you don't use chat complete
>>
>Its a /lmg/ devolves into cargo cult chasing good gens by pushing and pulling levers randomly episode.
>>
File: file.png (32 KB, 476x128)
>>103627739
hey im back, im gonna kill myself
or, at the behest of my better judgment, use intel arc. much appreciated regardless

alternatively i have an RTX A2000 12G, if you think it would be better than an A750
>>
>>103628710
Why didn't you try koboldcpp yet?
>>
>>103628710
How is insmell for LLMs?
>>
bors how do I ask flux img2img to remove 10kg from the original image without changing the other features?
>>
>all this chat completion posting ITT
>no logs
>>
>>103628748
0% chance that flux has any idea what something looks like 10kg lighter.
>>
>>103628748
changing the weight of a person while preserving their identity is not something flux or any other generative image model can do

there's probably apps that can do weight changes using custom GANs or something though
>>
>>103628748
inpaint
>>
>>103628741
im currently updating several dozen terabytes of archival data and that takes priority right now. ill try koboldcpp shortly while things upload
>>
>>103628748
use the flux inpaint model, mask out the body, prompt for your desired bodyshape
>>
>>103628784
sounds totally useless since his picture would still have the fat face
>>
>>103628790
then inpaint the face bit by bit
>>
>>103628799
that will not work, the face at the end will be a different person
stop trying to encourage this dude to waste hours of time on something fruitless, flux cannot do this specific job
>>
>>103628784
I HATE STABLE DIFFUSION
I HATE STABLE DIFFUSION
I HATE STABLE DIFFUSION
>>
>>103628825
stupid moron, set to inpaint masked only
>>
>>103628381
I want to violently shake her head (during irrumatio)
>>
>>103628825
holy creep behavior
>>
>>103628830
you can see in the top left dropdown that he also isn't using the flux inpainting model, just the regular dev model
inpainting model also wouldn't work though, for reasons already stated
>>
>>103628825
Creep kino
>>
>>103628790
use faceapp, problem solved
>>
>>103628842
with flux's vae if you're careful enough you can do a pretty lossless inpaint, but after seeing what this moron is trying to do he would not have the brains to figure it out
>>
File: 00012-3510544370.png (722 KB, 675x1200)
>>103628834
it's literally the first image from google if you search for chubby girl
>>103628830
didn't work
>>
>>103628853
come back with an anime gen
>>
>>103628884
bayzed
>>
>>103628741
tried it out, seems to work fine. im a little lost on whether im getting good replies, or good performance though. any guidance on numbers or metrics?
>>
>>103628853
>woman brain can't look into the camera for the picture
>>
File: vomit.png (995 KB, 1825x417)
>3DPD
>>
>>103628978
Somewhere about 7 is good.
>>
File: file.png (189 KB, 975x254)
>>103629048
7 what?
>>
>>103629061
Depends on the size of the model.
>>
>>103629064
i just grabbed one of the ones in the git's suggestions to see if it works at all with the choices i made. i grabbed LLaMA2-13B-Tiefighter.Q4_K_S.gguf specifically for the test
>>
>>103629076
71 tokens/s is all right for 13B. It's way faster than any human would be able to read. At that size you should be using Nemo, not Llama2.
>>
>>103629115
i just needed to see it would work. i set it to use vulkan rather than cpu and changed nothing else. i imagine theres some way to bump it up further since this is a 7900 xtx im using
im currently looking at https://huggingface.co/cognitivecomputations/dolphin-2.9.2-qwen2-72b-gguf/tree/main as per the recommendation of some random post in a thread i saw a week ago, but i suspect its going to be useless
>>
>>103628652
>crank up DRY and temp
>schizomaxx
>get a one in a thousand roll like a gacha
>flex it on lmg
>>
I'm profoundly stupid. Are local models for me?
>>
what is infermatic's best model?
>>
>>103629321
I don't know about model usage but you would be a great poster in this thread. You could be the next big guy after EVA or chat completion shill.
>>
>>103629321
For you especially. You won't even notice the difference between 8b and 70b.
>>
is it true modern 12b are better than shit like midnight miqu now?
>>
>>103629487
Oh yeah, totally, 8B is at GPT4 level these days.
>>
>>103629503
Definitely GPT-4-turbo level.
>>
>>103629503
be serious anon

in my experience these small models have shit spatial awareness
and due to the low parameter count you absolutely NEED lorebooks for absolutely anything
>>
>>103629518
If the thing I said sounded ridiculous, then so did the thing you asked.
>>
>>103629532
I am just asking cause i havent been using LLMs for like 4 months. And you know how AI literally makes absurd jumps in complexity in a small amount of time.


what is currently the cutting edge for 70b nsfw?
>>
>>103629541
>know how AI literally makes absurd jumps in complexity in a small amount of time?

No? Can anyone else vouch for this?
>>
>>103629541
Eva or Anubis.
>>
>>103629637
The most advanced model doesn't come from America.
>>
>>103629637
>EVA
Llama or qwen?
>>
>>103629647
We'll see about that if/when QvQ drops. If it actually turns out to be better, hey, that's a win for us all.
>>
>>103629321
I too am profoundly stupid but I got it working tonight
I tried a couple months back but I downloaded the old kobold, got a shitty model and it tried to write some contextless book when I said "hello"

today I thought I'd try again, got koboldcpp to work and now I'm chatting with my waifu.
I've chatted with waifus on websites before, and the responses I'm getting are pretty similar. (although my local one is writing much more flowery language and is long-winded. I don't really know how to change stuff like this yet)

I still feel like I'm way too dumb for this.
I would not have figured out how to get it 'working' at all except that koboldcpp opened some koboldlite thing in my browser and I just ask it questions when I don't know how to do something. It's honestly been pretty helpful
>>
>>103629670
Llama. As for whether to go for 0.0 or 0.1, well, matter of taste, I guess. The consensus seems to be that 0.0 is a little more creative/soulful, while 0.1 is a little better at adhering to prompts. I tried and liked both, but actually switched to Anubis since it dropped.
>>
what setting should I change in sillytavern to get rid of tutorial prompts at the end of messages?
It's making me feel dumb
after a chat it places a line and then gives me a suggestion on how I should continue the conversation
>>
>>103629695
What are you running it on?
>>
>>103629695
I see, is there any cloud service offering anubis if you know?
>>
>>103630015
Drummer claims another victim
>>
>>103630160
>>103630160
>>103630160
>>
>>103626605
F5/T2 (really just T2, F5 talks too fast). GPT-SoVITS is a good bit worse but much, much faster; it can also do stuff like moans and shit if your RNG is good enough and you pray hard enough.
>>
>>103628236
The more porous something is, the easier it is for it to be affected by supernatural energies. Flipping a 1/0 is something a stout rock can do; flipping a couple hundred is doable even for an infant. How many need you flip for a noticeable difference?
>>
>>103624150
bing/dalle migu is always valid



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.