/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107815785 & >>107803847

►News
>(01/08) Jamba2 3B and Mini (52B-A12B) released: https://ai21.com/blog/introducing-jamba2
>(01/05) Nemotron Speech ASR released: https://hf.co/blog/nvidia/nemotron-speech-asr-scaling-voice-agents
>(01/04) merged sampling: add support for backend sampling (#17004): https://github.com/ggml-org/llama.cpp/pull/17004
>(12/31) HyperCLOVA X SEED 8B Omni released: https://hf.co/naver-hyperclovax/HYPERCLOVAX-SEED-Omni-8B
>(12/31) IQuest-Coder-V1 released with loop architecture: https://hf.co/collections/IQuestLab/iquest-coder

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
►Recent Highlights from the Previous Thread: >>107815785

--Qwen 235B vs newer GLM models: upgrade considerations and performance expectations:
>107821338 >107821349 >107823142 >107823219 >107823230 >107823247 >107823263 >107823265 >107823273 >107823294 >107823283 >107823291 >107823303 >107823327 >107823415 >107823332 >107823360 >107823364 >107823397 >107823642 >107823718 >107823664
--Mad Island mod enables LLM NPC interactions, sparking player nostalgia:
>107820759 >107821087 >107821110 >107821211 >107821320 >107821094 >107822132 >107822930
--DeepSeek coding model launch timeline:
>107824356 >107824413 >107824461 >107824479 >107824486 >107824495 >107824504
--How OpenAI chat systems manage conversation history and prompt caching:
>107822818 >107822876 >107822911 >107823248
--Context size vs speed tradeoffs in Koboldcpp model optimization:
>107821567 >107821938 >107821948 >107822160
--RTX 6000 Ada cost and model compatibility debate:
>107824787 >107824842 >107824852 >107824970 >107825130
--Jamba's uncensored state tied to architectural flaws hindering effective refusal training:
>107824915 >107824997 >107825017
--Optimizing Mistral Small models with DRY sampler and parameter tuning for roleplay:
>107818078 >107818100 >107818123 >107818145 >107818161
--LLaMA model evolution and hardware limitations discussion:
>107821121 >107821141 >107821273 >107821548 >107821573
--Critique of ChatGPT's basic memory implementation in free tier:
>107815963 >107815987 >107816786 >107816032 >107816055
--Jamba model's context handling and performance evaluation:
>107820773 >107820898 >107821422 >107821112
--MoE expert routing complexities and research-driven optimizations:
>107823553 >107823599
--Critique of low-quality datasets and excessive training practices:
>107823952
--Miku (free space):

►Recent Highlight Posts from the Previous Thread: >>107815790

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>107826648
>--Miku (free space):
>What a horrible thread.
>>107826669
I see
>>107826669
best programming model for tool calling with 19GB unified memory?
>>107826689
Thank you for blessing this thread.
>>107826699
>>107826689
>>107826648
tranny still spamming his sona i see
>>107826694
nemotron 30b at q4 or maybe q3. who the fuck has specifically 19gb tho?
I don't know how jeets vibecode llama.cpp PRs, claude max/opus 4.5 barely knows how to write complex LINQ
>>107826819
maybe something fucked with my system
regular nemotron 30b didn't work but i can try q4
>>107826837
actually it looks like even the smallest quants of nemotron are ever so slightly too big for you to fit with any meaningful context. there really aren't any good coding models below 30b, so you might just be out of luck.
https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF/tree/main
https://huggingface.co/bartowski/nvidia_Nemotron-3-Nano-30B-A3B-GGUF/tree/main
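for anyone wanting to sanity-check the "slightly too big" claim instead of trusting a VRAM calculator, here's a rough back-of-envelope sketch in python. the bits/weight, KV-cache cost, and overhead numbers are ballpark assumptions, not measured values:

```python
# Rough fit estimate: weights + KV cache + overhead vs memory budget.
# All constants here are ballpark guesses for illustration.

def gguf_fit_estimate(params_b: float, bits_per_weight: float,
                      ctx_tokens: int, kv_bytes_per_token: int,
                      budget_gb: float) -> bool:
    """Return True if the model plausibly fits in budget_gb (decimal GB)."""
    weights_gb = params_b * 1e9 * bits_per_weight / 8 / 1e9
    kv_gb = ctx_tokens * kv_bytes_per_token / 1e9
    overhead_gb = 1.0  # compute buffers, OS, etc. -- a guess
    return weights_gb + kv_gb + overhead_gb <= budget_gb

# A 30B model at ~4.5 bits/weight is ~17 GB of weights alone,
# so 16k context pushes it past a 19 GB budget.
print(gguf_fit_estimate(30, 4.5, 16384, 100_000, 19.0))  # False
```

the per-token KV cost varies wildly by architecture (GQA/MLA, quantized cache), so treat the 100 KB/token figure as a placeholder.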
>>107826813
Prompt issue, almost certainly.
>>107826813
>implying you need to know how to code to vibecode
Jeets just do the needful while you complain and try to make a perfect solution. "Barely works" is enough, that's why companies hire them and not (You).
>>107826837
yeah just noticed nemotron-3-nano:30b is literally the same file as nemotron-3-nano:30b-a3b-q4_K_M
i'm stuck with gpt-oss:20b, then
>>107826853
seems so. you've gotta get yourself some better hardware if you want better models.
>>107826861
my setup is good but i cheaped out on ram
>surely if i need ram i can just buy more later
fuck my chud life
there will exist better models on my hardware before i buy more ram
>>107826643
soulless compared to the original image
I have 128GB unified RAM. I'm looking for models for
>coding/tool use
>image generation
>research assistant (focus on STEM stuff)
What are my best options? I'll be running these under LocalAI so pretty much any format will work.
What in the fuck did ikawrakow do? Why the fuck did git history get rewritten in the past week or so? The repo is fucked so I can't just pull. Super bad practice. Was this another hissy fit about licensing/attribution?
>>107827160
image gen will have to be separate from this but glm air at q5 or so is the general recommendation for 128gb. will give plenty of space for context and the image gen model.
.
>>107826643
This image is so much more organic than the regular miku spammer autist. This actually engages discussion.
>>107827163
who fucking cares anymore, llama.cpp can do whatever his shit did
prove me wrong or whatever
>>107827217
>48
>attn norm 35,5120
what kind of moesissy or vramlet model is this
>>107827272
48 is the number of layers
35 is the sequence length
5120 is the hidden dimension
the model is llama 4 scout
>>107826643
>Jamba2 3B and Mini (52B-A12B) released
Anyone tried this yet? Size would be great for my system.
>>107827325
>llama 4 scout
you might be the only one trying to finetune that thing. also you're wasting compute if you want it for erp.
https://github.com/adobe-research/NoLiMa
>>107827163
A few weeks ago I tried the repo, and every time I generated something it used a fixed seed. I guess it was a new feature, but they didn't say how I could change it. I didn't care, so I deleted it.
>>107827347
>>107827347
Tried it for something like 5 minutes. Wasn't very impressed.
>>107827463
the two more weeks continue
schizo is still desperate for attention
so, another chinkslop year?
>>107827347
So it has 52B params but it's barely better than ministral 14B. Ok, good to know.
can we have deepseek v4 mini pls?
>>107827523
There is no incentive to create small models.
>>107827565
small models are inherently communist. Therefore china has all the incentive to make them.
>>107827506
It's... actually more retarded than 14B or even the old Nemo when it comes to actual real conversations. These benchmarks might as well be lies.
>>107827600
China is not a communist country.
Can you system prompt random or timed events without them being mentioned in the chat between us? E.g. if I make an asthmatic AI and tell it to cough occasionally, I want it to not do "I will cough now" or some retarded shit.
Next Kimi will be 2T-A3B
Best model for dev, 32GB VRAM, 128GB RAM?
>>107827610
yes they are, their inherent pursuit of it is the problem
>>107827653
glm4.7 at iq2m. or sucking sama's dick in the hopes that he will give you some ram.
>>107827620
https://docs.sillytavern.app/usage/core-concepts/macros/#randomization
Is there a version of Nemo that isn't pozzed as fuck?
>>107827931
Extreme skill issue
>>107827620
The issue is always going to be that it can't do the random roll mid-reply. Doesn't matter if you're using the ST-included randomizer or a tool call to a dice tool. It'll always happen after the reply.
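one way around the roll-lands-after-the-reply problem: do the roll outside the model entirely, before the request is sent, and inject the result into the system prompt so it steers the reply without ever appearing in visible chat history. a minimal sketch of the idea — the function and the `[Hidden: ...]` directive format are hypothetical, not a SillyTavern feature:

```python
import random

def inject_hidden_event(system_prompt: str, events: list[str],
                        chance: float, rng: random.Random) -> str:
    """Roll before the request; on success, append a hidden directive
    the model sees this turn but that never enters the chat log."""
    if rng.random() < chance:
        event = rng.choice(events)
        return f"{system_prompt}\n[Hidden: this turn, {event}]"
    return system_prompt

rng = random.Random(42)
base = "You are an asthmatic character."
prompt = inject_hidden_event(base, ["you cough mid-sentence"], 0.2, rng)
```

since the directive lives only in the (regenerated-per-turn) system prompt, the model can't narrate "I will cough now" from history — it just sees the instruction this turn and not the next.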
>>107827931
Works for me
anyone see this model?
https://huggingface.co/FreedomIntelligence/openPangu-R-72B-2512
>>107827970
Huh. I asked something far tamer and it would not shut up about equality and respect.
>>107827980
>what is a system prompt
>>107827604
>These benchmarks might as well be lies
Many such cases
>>107827490
Every year is chinkslop year until we ban all Chinese nationals from the US and an ITAR-compliant version of HuggingFace exists for publicly funded research models.
>>107827980
PLEASE learn how to use an LLM before coming here and complaining about a model
>>107827348
I'm confused, is it the architecture or the training that limits the model? Maybe he can fix it by training it on better data?
Some madlad actually made a merge of GLM 4.6 and 4.7.
https://huggingface.co/shamwowzer/prototype-glmx01
Anyone tried the Nex 8B version?
The full 670b-something model was one of the few good enough ones for high context rp sessions available on OR, roughly equivalent to Gemini Pro 2.5, but they just paywalled it.
It's not available as a gguf file so i'm too much of a retard to get it to work with sillytavern to try it.
>>107827977
>72b
>it's not a benchmaxx'd tune of qwen2.5-72b
color me surprised
>>107828115
Yeah I'm kind of curious about it. We need goofs now. A new MoE size we've never seen before, runs on a fairly modest system. 24T training tokens.
>>107828105
Nevermind, looks like someone made one:
https://huggingface.co/mradermacher/internlm3-8B-Nex-N1-i1-GGUF
>>107828105
>the full 670b-something model
This? That's just deepseek, of course it's good.
https://openrouter.ai/nex-agi/deepseek-v3.1-nex-n1
>>107828288
The thing is, you have to treat models like a retarded child. You can't just establish a medieval setting and assume that it will conform to that era's politics, you have to explicitly tell it to do so. As much of a meme as it is, 'prompt engineering' is essential to get good outputs, even when your use case is just to write a cohesive story.
>>107828324
Which model are you using?
>>107828339
>inside [] brackets
Oh, no. Those are the super important brackets. How could it ignore those?
>>107828362
You might need to use a lorebook to fix that
>>107828374
You have no idea for how long he's been posting screenshots. May as well recommend him some meme samplers or to add "follow the rules" to the prompt.
>>107828362
No one who knows anything about current models would call them intelligent, they're just token predictors. Regardless, low active parameter count could be a problem, assuming by 'GLM' you mean Air. Mistral Small for example is perfectly capable of making female characters submissive and take a lower role in society when prompted to do so.
>>107828382
That's him?
>>107828288
>mikupad chad
>model uses word I don't like
>stop gen, change word, continue gen
>>107828288
It's the 13th century for heaven's sake!
>>107828362
>Nemo 12B
Not the guy, but you mean the straight base model without any tunes?
Is there any extension that allows it to eat folders so it's not just dumped here?
>>107828362
>>107828324
>>107828394
it's probably just a context length issue.
context rot is a known problem. models tend to follow instructions just fine. they just forget or are bad at reasoning.
as the context length gets higher, the correlation between words at distance gets more and more sparse. works fine for needle in a haystack problems which they benchmark for, but not the complex reasoning and logic which they don't.
>>107828662
the solution to this problem is fairly simple: go agentic.
models can always follow simple prompts. knowing this, you design a series of agents which can apply a series of simple rule-following guidelines.
honestly the biggest failing of these threads is that everyone always insists that they can get everything done within the memory of one model.
this isn't richard sutton's bitter lesson. you don't have the luxury of waiting for a bigger and better model to come out and blow your engineering out of the water.
>>107828703
agents to the moon sir llm2.0!
>>107828703
none of us have the patience to jury-rig a bunch of models together
>>107828709
https://www.youtube.com/watch?v=TUjQuC4ugak
https://www.youtube.com/watch?v=8OvIeJUc1N0
it really is just the model being stupid and incapable of reasoning.
engineers deal with this shit every day.
>output in json format. add no additional characters
>ok what about this ```python
the answer is and always has been that you can't do anything about it, just cope.
this is easier in engineering land where we have somewhat expected responses
no clue what you coomers are going to have to do but it may involve running the same prompt 10 times and collecting the average using similarity search
>>107828731
you needs to build these yourself sir
>>107828738
Your example is easily solved using grammars.
>>107828753
I guess. we don't have access to those when using APIs. it's often just as simple to use regex to grab the contents of the code block and ignore everything else. this is what mem0 does in production.
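the "just regex the code block" approach is a one-liner if anyone wants it. a minimal sketch (mem0 is said above to do something similar in production; this exact regex is my own, not theirs):

```python
import re

# Matches a fenced block with an optional language tag and captures the body.
CODE_BLOCK = re.compile(r"```(?:\w+)?\s*\n(.*?)```", re.DOTALL)

def extract_code_block(text: str) -> str:
    """Grab the first fenced block, ignoring the model's surrounding chatter.
    Falls back to the whole (stripped) reply if no fences are found."""
    m = CODE_BLOCK.search(text)
    return m.group(1).strip() if m else text.strip()

reply = 'Sure! Here is the JSON:\n```json\n{"ok": true}\n```\nLet me know if...'
print(extract_code_block(reply))  # {"ok": true}
```

pair it with a retry loop that re-prompts when `json.loads` on the extracted text still fails, and you've covered most of the failure modes.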
>>107828797
glm 4.6 can do that up to about 64k context. if you can get a corporate model to comply, you might be able to get up to 256k context at most. what you want does not exist and will not for at least 3 or 4 more years.
>cloode killing itself
Funniest 2026 moment so far
>>107828797
>300-1000 messages
by then any model has certainly completely forgotten about the system prompt or can no longer interpret it correctly.
you need to occasionally re-inject important messages into the trajectory.
you will also have to trim the trajectory history and replace it with a summary that gets occasionally regenerated.
>>107828797
>300-1000 messages
What does your context get to? Even SOTA api models get noticeably dumber past like 32k.
>>107828841
But your initial complaint is models not conforming to the scenario you've set up, which can be a result of the model just getting dumber, so I'll ask again. What is your context getting to when you're having these problems?
>>107828841
it can't be a manual process. I mean that you have to blow away the conversation history up until like 5 messages ago and replace the whole history with a prompt describing the story arcs.
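the "blow away everything but the last 5 messages and swap in a summary" idea is easy to automate. a minimal sketch, assuming the usual role/content message dicts — the summarizer is passed in as a callable because in practice it would be another LLM call:

```python
def fold_history(messages: list[dict], summarize, keep_last: int = 5) -> list[dict]:
    """Replace everything except the last `keep_last` messages with a
    single system message carrying a summary of the story so far."""
    if len(messages) <= keep_last:
        return messages
    old, recent = messages[:-keep_last], messages[-keep_last:]
    summary = summarize(old)  # in practice: an LLM call over the old turns
    return [{"role": "system", "content": f"Story so far: {summary}"}] + recent

# toy summarizer: a message count stands in for a real LLM summary
msgs = [{"role": "user", "content": f"msg {i}"} for i in range(12)]
folded = fold_history(msgs, lambda old: f"{len(old)} earlier messages")
print(len(folded))  # 6
```

regenerating the summary only every N turns (rather than every request) keeps most of the prompt cacheable, which matters a lot for local inference speed.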
>'stutters sometimes when flustered'
>EVERY reply begins with a stutter
Makes me want to rip my hair out
>>107828970
Use thinking models and make it think hard
>>107828970
LLMs have no concept of time
>>107828776
>we don't have access to those when using APIs
Your local model?
>>107828738
Making LLMs output json is a retarded idea. Too many points of failure in that shitty syntax and they're not easily fixable. Pseudo-XML tags are better suited.
>>107829072
i would sooner make it output markdown than anything resembling XML inshallah
>>107829095
Learn english syntax first, rajeesh.
>>107829104
sounds like you're the rajeesh here grammarlet
Many of you don't realize what we could have right now. It wouldn't be that difficult for a company to make a good creative model. Give it modern knowledge with wikipedia. A bit of coding data so it understands token-efficient character/setting formatting. Some back-and-forth D&D, erotica, and forum posts. A couple books on historical attitudes and practices from antiquity through to the modern age. Finally, focus the bulk of training on 1940s-2000s fantasy/sci-fi/historical fiction novels, as well as some japanese light novels.
Sell access to the big model as an ultra-advanced AI Dungeon remake. Market by publicly releasing an under-70B model. Millions upon millions of dollars from autists and creative professionals.
>>107829192
Well then start with it, nigga.
>>107829192
I'll edit the README.md
>>107829192
You retards don't understand the scope at which LLMs are trained, they're already throwing literally everything under the sun at them.
What you want is just a regular model that isn't hammered with RL for benchmarks, which we've had.
Then you'll complain that that model is retarded (shocker, it's not even 100B!).
It's always been about parameter count.
>>107829344
>they're already throwing literally everything under the sun at them
no, they very much aren't, and they're proud of filtering most of it out because it's ""toxic""
>>107829309
I'll add the LICENSE.md
>>107829344
Distills and crap datasets curated by third worlders + misconfigured training parameters are why we have no good small models
>>107829358
Filtering doesn't change the fact that you can't fit a 'good creative model' inside 30GB or some shit. "Market by publicly releasing an under 70B model" nigga seriously? You think API models like Claude are good/popular because they got some secret sauce? No, it's because it's a fat unquanted model.
>>107829405
There are good small models, but relative to larger models they are simply fucking stupid. For their size they are good, but people here expect miracles on top of using lobotomy quants.
>>107829425
>You think API models like Claude are good/popular because they got some secret sauce?
for aicg denizens it certainly seemed to be the case when their proxies had it, now they're coping with gemini and whatever else they get their piss-soaked hands on
>>107829192
You don't realize how good we already have it. Even a model as small as 24B can be a good DM if you break down tasks into smaller pieces and manage short-term and long-term notes separately. I'm astonished that this shit isn't mainstream yet. I suppose people who love DnD and programmers are two separate groups that don't overlap much
>>107829442
there's a large enough proportion of programmers having a melty, and I think DnD players tend to be amongst the group having a melty.
>>107829442
>I suppose people who love DnD and programmers are two separate groups that don't overlap much
I was interested in DnD many many years ago, but I couldn't get anyone else I knew interested enough to get games going, so I gave it up.
>>107827160
>>107827197
I have a similar setup and GLM 4.5 Air is the best I've found thus far. I use llama.cpp built with Vulkan (AMD APU). One thing to call out though is that image gen kind of sucks on unified memory compared to a dedicated GPU. Still works, just a bit slower. You don't need a lot of VRAM for image gen, so if you have a GPU lying around somewhere that might be a better option.
>>107829344
>they're already throwing literally everything under the sun at them
Except when you look at some of these models' training data you can see that there isn't a single book in the entire corpus
>>107829425
>you can't fit a 'good creative model' inside 30GB
Yes, you definitely can. You can't fix shit training data with high parameters, see Llama 4
One of the biggest flaws is that the model doesn't have access to previous chats and keeps making the same shit again and again. There isn't enough context to throw all chats into it, but it works if you have a small task, like generating a BBEG. You keep previous outputs, and it will start with boring shit like
- The Hollow King – A once-noble ruler reduced to a skeletal figure by his own curse, ruling through fear and necromantic puppets.
- The Fleshweaver – A surgeon who stitches people together into monstrous hybrids, seeking to "improve" humanity against its will.
- The Shadow Puppeteer – A thieves' guild master who controls others via cursed masks, but his own face is slowly erasing.
And after a while you'll start getting
- The Clockwork Plague – A disease spread by mechanical spiders, turning hosts into ticking bombs.
- The Tidecaller – A leviathan-riding pirate who drowns land to create a new oceanic empire.
- The Glass Prophet – Shatters truth into shards, forcing people to choose which lie to believe or go insane.
That's already huge progress for a braindead 24b
>>107829559
>Except when you look at some of these models' training data you can see that there isn't a single book in the entire corpus
didn't one of the recent releases literally brag about that in their readme, something like "books: 0" iirc
>>107829559
>Yes, you definitely can. You can't fix shit training data with high parameters, see Llama 4
So? Most models aren't Llama 4, a model handled so badly its leads left Meta's AI department. Your collection of amazing fantasy novels isn't going to beat Shannon's theorem and produce a 32B model that is somehow astonishingly better at writing creatively than all the other 32Bs. This is literally the same mindset as finetune tards.
>>107829572
Yeah, it was Nemotron
>>107829709
What? Why would you ever think that training on math would produce better creativity than books?
>>107829764
Okay Drummer.
Is it possible to dynamically select a -dev device in llama.cpp based on name? When I wake my desktop from sleep, my iGPU and dGPU switch device names and it totally messes up my llama-swap config file.
I want to either select the device by name somehow or force Linux to always use Vulkan0 for my dGPU. Disabling the iGPU isn't really an option since I use it for other things too, and it massively slows down inference if I use both.
>>107829709
By this logic a 32b trained on only German will be the same as one trained on only French... you're kinda retarded bro...
>>107829764
>>107829788
Do you really think math/coding being in the dataset is what's holding back the perfect creative writing model from being made? Take that shit out, replace it with whatever skyrim fanfics you've got saved; the end result will still be retarded and people will come here to complain it has "no spatial awareness" or no "good understanding of anatomy"
This guy has to be baiting, right?
>>107829741
that do be it, thanks. crazy to see what shit they waste compute on
>>107829794
>"no spatial awareness" or "good understanding of anatomy"
I believe this can only be solved with native multimodality, and you can't change my mind
>>107829804
>we can make the best 12B/32B/70B erp model ever made, we just need a really really good dataset of books or some shit
IDK, are you?
>>107829827
So this: >>107829808
Would be the same as an equivalent trained on mostly books? Is this your argument?
>>107829827
>we can make the best 12B/32B/70B erp model ever made, we just need a really really good replacement for transformers or some shit
>>107829815
I believe it can be solved with copious amounts of tokens and excessive thinking. And parameters.
>>107829849
bidet will save us
>>107829776
Drummer trains using synthetic data and ESL RP logs, not books
>>107829794
Nice pilpul you fag. When did I ever say that math/coding shouldn't be in the dataset? Yeah that's right, I didn't. How can you compare ZERO books being in the data to finetuning on skyrim fanfics? Deliberately dishonest argumentation, go fuck yourself
>>107829827
>If you train the model on fiction... It won't be better at fiction!
Okay you're just retarded
>>107827347
Tried Q5, holy jesus this thing is dumb as rocks, 13b tier but has some charm to it + low safetyshit. Feels like using old 2023 and earlier models but in a more usable form, so if you're looking for that feel, give it a try.
If you're going to try it, keep temp low, use strict official ChatML formatting without anything except user/assistant. Usually models can figure out custom formatting, often with benefits, but this one shits itself.
>>107829841
>>107829875
Go train your AO3 budget model then. If you gimp out on the parameter count it's going to be shit, much like Nemotron in spite of being trained on a curated dataset. I don't know what's so hard to understand here. Why the fuck are you even bringing up Nemotron when it's dogshit and you know it is? How does that back up your point in the slightest?
>>107829922
and would a 1T model on the nemotron dataset be good according to you?
>>107829943
Hit delete on that post lil nigga, you're genuinely fucking retarded if you believe there's no difference jumping between say 3B-7B, 12B-32B, or let alone 1T on the same dataset
i feel like im the only one in the world interested in the goon jar
>>107829957
Someone will be the first one to seal it up and do it, might as well be you.
>>107829957
have they actually demoed it in a video yet?
>>107829957
These don't even look 3D. I don't see the point.
It will be a truly wonderful day when we can buy a hologram jar direct from China, AI-generate a 3d character, and hook it up to an LLM via sillytavern.
>>107830005
i probably wont use it much, just think it could be interesting if its cheap enough
>>107829943
if you train a 32B and a 350B coding model each on the same dataset, the 32B one is going to be somewhat usable while the 350B one will shit all over its brother... this seems like common sense
>>107829957
It's a novelty you'll try once and never use again, maybe worth it if you can find a cheap enough clone on Ali.
>>107830056
The one from Razer won't be, that's for sure
>>107830067
are there any alternatives?
>>107830027
I think VR headsets are more interesting. Give me a local Neuro in VRChat.
>>107830077
>VR
nah give me one in AR. i have a quest 3 and have been waiting for one.
>>107830077
VR is still lame and low-poly tbdesu, a roughly laptop-sized anime jar would be much more convenient.
>>107830075
Hardware is easy enough to replicate. Actually, I think I'd immediately buy one if it were Deepseek or Qwen-branded, just to keep it on my shelf
>>107829441
The logs are utterly wretched there lule >>107829979
>>107829192
https://docs.mistral.ai/models/mistral-small-creative-25-12
Here's your ultra-advanced creative writing AI Dungeon remake under 70B (24B) trained on meticulously curated data bro. We are ALL hopping off 300B moes to use this shit
>>107830082
AR, VR, whatever, same thing in this context. VRChat can use passthrough. The main benefit is that it's an already existing engine with powerful customization and input capabilities, so you can use it as the renderer for your chatbot's avatar in XR headsets. Remember this webm?
>>107830101
PC skill issue. Frame's included dongle will even make it easy for idiots to set up.
>>107830156
first time seeing it, also brb
>>107830148
I don't think a single anon on /g/ has tried that model. It could be good lol, is it a new model or a glorified finetune?
>>107830148
>Model that's literally not available to publicly download
Nice one bro, really showing off your intellect
>>107830240
skill issue, just email them
>>107830240
It's on the api you dumb chud, or does your "creative writing" involve raping little girls?
>>107830250
LOCAL MODELS GENERAL NIGGER
>>107830259
u serious bro?
>>107830250
Of course not. I prefer little girls raping me
>>107830268
yeah
>>107830156
You have very feminine hands.
>>107830297
do not the anon
A book-based creative 32b denseGOD model would wipe the floor with your estrogen MoE 500b or whatever the fuck you spent $10K to run at q2
>VRChat with passthrough
>>107830306
>32b
>denseGOD
only thing a 32b is going to be wiping is my ass
>>107830318
because all the 32b shit we've had in like 2+ years is qwencensorshit
>>107830322
all things considered, there was command-r
>>107830306
You need minimum 70B for any decent results, whether MoE or dense.
>>107828797
>I'm a cultured gentleman that does medieval roleplays to 300-1000 messages
I'm a 17th-level Evoker that built a stone house on a lake in The Shire. By summer the halflings will have the first of my aeroyachts built.
>>107830347
>t. 3090 hoarder
>another /lmg/ class warfare has broken out between cpusissies and nvidiacucks
>>107830310
My zuck, what a long tongue you have.
toss-240 when?
Can embedding models also have refusals when they process shit from goon models, or can I just use anything?
still waiting for the day where i can generate live2d models
>Retards ITT are advocating for 1T+ models trained on synthslop that are barely better than 70B from two years ago in creative writing.
How can I tell if the model is using the vectors? If I ask it directly it has no clue.
harmony format for finetuning just werks, had way less issues finetuning oss-20B on tool calling stuff. Wish it was the standard for everything.
>>107829891
yay, someone reused my Miku!
>>107830832
nice headcanon, are we reading the same thread tho
https://rentry.org/miqumaxx
>404
Total Miku Death
>>107830832
>1t params
We need at least 10T.
>>107830837
Try running tests with and without, same prompt, seed, params etc, and compare the logprobs.
You might need to jack up the strength for the difference to become noticeable, I haven't tried this myself
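the comparison step above (same prompt/seed/params, with and without the control vector, then diff the logprobs) boils down to a few lines once you have the per-token logprobs out of your backend. a minimal sketch, my own function name and toy numbers:

```python
def mean_abs_logprob_shift(base: dict[str, float],
                           steered: dict[str, float]) -> float:
    """Average absolute per-token logprob difference between two runs.
    A near-zero shift suggests the vector isn't actually being applied."""
    shared = base.keys() & steered.keys()
    return sum(abs(base[t] - steered[t]) for t in shared) / len(shared)

base = {"the": -1.2, "cat": -3.4, "sat": -2.0}
steered = {"the": -1.1, "cat": -4.0, "sat": -2.0}
print(round(mean_abs_logprob_shift(base, steered), 3))  # 0.233
```

how you obtain the logprob dicts depends on your stack (llama-server and most OpenAI-compatible endpoints can return per-token logprobs when requested); a shift well above float noise across many tokens is the "yes, the vector is doing something" signal.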
>>107830798
How would you even dataset rigging data?
>>107831013
Weird. I can't recreate it either. Must either be an error or they decided it broke the rules.
https://web.archive.org/web/20251119165330/https://rentry.org/miqumaxx (and my other rentrys) still works tho
>>107831074
>>107831073
Just use semantic segmentation on the generated 2d data as a second step and pin the rig points there?
Honestly you could generate live2d with existing tools just using a comfyui workflow, all the necessary components exist and work.
Rigged 3d would probably be less jank though, since the tools for autorigging 3d are more mature.
What do you think of Prime Intellect's bet on Recursive Language Models?
https://www.primeintellect.ai/blog/rlm
https://arxiv.org/abs/2512.24601
>>107827970
>Insane tribal loyalty
There is nothing insane about tribal loyalty in the slightest, it is one of the most sane things there is.
>>107831224
nothingburger, same cope as claiming that stacking existing model layers can somehow produce a better result
So now that all the dust has settled, there's still nothing on normal computers better than nemo-instruct, and finetunes are a waste of time? really? This is the way it'll be indefinitely?
>>107829808
>0 books
For what purpose? Why wouldn't you train the model on literally every single book known to man, or at least those in the public domain?
>>107831248
>This is the way it'll be indefinitely?
>>107831236
insane as a descriptor can be used to mean 'intense', rather than illogical.
>>107831248
Mistral Small has completely overtaken Nemo
>>107831280
For RP? I hadn't heard that.
>>107831370
RP is the main thing Mistral models are generally good at compared to similar-sized models. Small is significantly smarter than Nemo, handles long context better, repeats itself less, and can be made just as horny and degenerate via system prompt.
Though the recent Ministral was bad, and seems to be borderline broken.
>>107831248
>indefinitely
no, it will get worse.
LLMs will eventually be outlawed for personal use and only be usable by multimillion-dollar megacorps with licenses which cost at least a million dollars a year to maintain.
>>107831224
I thought that was a prompting technique.
>>107831406
>Though the recent Ministral was bad, and seems to be borderline broken.
it was made the same way nvidia makes most of their models: model pruning (all minishit are pruned from Small 3.1).
of course it's fucking broken, there is no such thing as a good model prune, period
>>107831406
Are there good settings available to test it out with?
>>107831465
WELL THEN FUCK OFF TO YOUR SHIT NEMO THEN
IM HERE RUNNING GLM 4.7 YOU ABSOLUTE VRAM CHEAPSKATE
>>107831224
context folding might be interesting for >6k contexts to maintain coherence. maybe it could be used for summarizing/compaction
>take pairs of assistant + user messages from your current st gooning session
>shove them into a small llm to gather facts like where {{user}} and {{char}} went n tokens ago
>accumulate the facts from the small llm
>feed the facts from the small llm to your main one, so: system prompt, {{char}} card, {{user}} persona, last assistant response (not summarized) and the accumulated facts
I showed my family some of my AI / animation dev projects and I'm pretty sure they think I'm a pedophile serial killer now (I'm not).
>>107831464
0.6-0.7 temp, 0.02 min-p, and DRY at default settings.
>>107831465
That other anon wasn't me. The Cydonias are decent, but honestly 3.2 is good enough that it doesn't particularly need a tune. 3.1 is also fine; some might like it better because it's a bit less Gemini. I slightly prefer 3.2: it's a bit less try and is capable of writing shorter messages when there's little to write about, whereas 3.1 will stick to the length of its previous message and try to fill it up every time.
>>107831599
>bit less try
*less dry
>>107831617
It's abliteration by another name, and yes, decensor tunes/trims/whatever you want to call them will always make models worse. In my experience, the heretic version of Gemma refused just as much as regular Gemma. If you really want to make Gemma horny, then use a jailbreak prompt and tell it to be horny.
>>107831631
>tell it to be horny
Oh anon, your well... everything is so big!
>>107831641
This is so, so wrong.
>>107831653
I have extensive experience with Small and the Cydonias, enough to know that a decensor tune is extremely unnecessary. Cydonias especially are very horny by default; that's Drummer's whole thing. You need to learn how to prompt.
>>107831653
Why don't you do it, since you seem interested in it?
>>107831677
>I can prompt just fine, I wrote the majority of the prompts people here still use or take stuff from
Then you should know that an uncensor tune for those Cydonias is pointless, and even if it weren't, you should be able to test it for yourself.
>>107831592
May I see it?
Has anyone here managed to get a text to speech model to run with rocm? Even when I manage to trick pytorch into thinking I have cuda, the models themselves seem hard-coded to require cuda functions.
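One thing worth checking before patching a repo: official ROCm builds of PyTorch expose HIP through the `torch.cuda` namespace (`torch.cuda.is_available()` is True, `torch.version.hip` is set), so plain `.cuda()` calls usually work unchanged; what breaks is custom CUDA-kernel dependencies. A minimal, framework-free sketch of the usual string-remapping workaround, assuming a hypothetical repo that hard-codes `"cuda"` device strings:

```python
def remap_device(requested: str, have_rocm_gpu: bool) -> str:
    # Keep "cuda" when a ROCm GPU is present (HIP answers to that device
    # type on ROCm builds of PyTorch); otherwise fall back to CPU.
    # Non-CUDA device strings are passed through untouched.
    if requested.startswith("cuda"):
        return requested if have_rocm_gpu else "cpu"
    return requested
```

In practice you would call something like `remap_device(s, torch.cuda.is_available())` at the point where the model code builds its device; this does nothing for models that ship compiled CUDA extensions.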
>>107826889
What's the original?
>>107831677
Fair, but don't be surprised when nobody lines up to use their own time to test shit for (You).
>>107831641
Gemma certainly is sloppy, but she can do a little better than that when she tries.
>>107831696
Benchmarks put Mistral Small at ~32K before degradation starts accelerating. It's coherent well beyond that, but quality will generally suffer.
>>107831701
It's a work in progress...
>>107831696
8k according to https://contextarena.ai/
https://research.google/blog/titans-miras-helping-ai-have-long-term-memory/
So is this snake oil or the future?
>>107831720
Do your roleplays all revolve around needing to retrieve needles?
>>107831725
https://arxiv.org/abs/2501.00663
This has existed for long enough that if it were viable I would have expected Google to use it in Gemini. It's not used, so it's obviously not usable.
>>107831719
>He made Ani better than Elon himself
I kneel
>>107831761
See >>107831599 for samplers.
>>107831742
ty. Hoping to add AR functionality soon so she can be with me in my home.
>>107828703
What frontends do anons use for agentic roleplay? ST is lousy for it, and the couple of projects I've seen seem immature.
>>107831730
Do your ERPs involve dumb sexbots with no attention?
>>107831800
All LLMs are dumb. That benchmark doesn't represent realistic use cases for anyone here.
Mikutroon Entropy: Empirical Analysis of Signal Degradation in Large Language Models
https://arxiv.org/abs/3212.12243
>Recent observations of plateauing capabilities in Large Language Models (LLMs) can be mathematically attributed to the presence of "mikutroons"—persistent actors within the /lmg/ general thread on 4chan who inject high-volume, off-topic Hatsune Miku imagery into the information stream. We prove that this noise acts as a bottleneck, effectively lowering the signal-to-noise ratio of the training discourse and preventing meaningful architectural breakthroughs. Furthermore, our data establishes a direct causal link between the tolerance of these image-spamming actors and the proliferation of finetuning con artists. Specifically, the sustained popularity of low-effort, degenerate models like RociNante is shown to rely entirely on the ecosystem cultivated by mikutroons, where aesthetic spam obscures objective model failure. RociNante and similar grift models survive not through merit, but by piggybacking on the dopamine feedback loops generated by the mikutroon population, shielding bad weights from necessary scrutiny.
>>107831641
All Gemma shittunes will also say that, though.
>>107827425
Vash?
Best cunny model under 20B? Or do I run some Q2 of a large model? I'd like to keep the file size under 6 gigs because I want it to be a daily-driver chatterbot that doesn't always hog all the VRAM whenever it does something.
>>107831934
Your options: Nemo
>>107831934
Your options two weeks ago: Nemo
Your options in two weeks: Nemo
2026 will be another year of Nemo
>>107831963
It's also the year of the return of the Ryzen 3600 and RTX 3060. Time is a circle.
>>107831253
>every single book known to man, or at least those in the public domain.
I always assumed most models trained on a giant dataset of both public domain and pirated books.
>>107831934
Has this bridge been renamed to the nigger bridge?
>>107831990
Good models are.
>>107831963
lol
>>107831943
>>107831949
>>107831976
Like, straight the base model from nvidia? No tunes or anything?
>>107832004
https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF
>>107831976
Also, that catbox is dead, or catbox died entirely again. Pls repost elsewhere.
>>107832009
Just make your own. That guy is special, and instead of being a normal pedophile he has a bunch of shit about murdering kids in his prompt.
>>107831617
>>107831631
What about this method: https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration
>>107832054
>decensor tunes/trims/whatever you want to call it, will always make models worse
>what about this one
Most high-context benchmarks, like all benchmarks, are retarded and fundamentally flawed. Retrieving something from a large amount of text is a singular instruction, and models are getting decent at following the one instruction across loads of shit. But ingesting a single prompt that comes with a large amount of text is not the same thing as a long-lived, multi-turn chat session whose initial prompt instructions were a legion and where each turn added more things to the table.
Without even talking about roleplay, you can see how wild the difference from benchmaxxing gets in multi-turn when you compare the smaller Qwen models with their VL variants. Qwen claims the VL models are as good for pure text gen as the regular models... and it's true... for one-shot prompts. As soon as you are doing multi-turn conversations they become very noticeably retarded compared to the regular Qwen models.
>>107831984
>Ryzen 3600
I heard about the 3060, probably NVIDIA having loads of those chips with nothing to do but sell them again, but not the 3600 being made again?
>>107832060
It can be a good trade-off at some point if the loss of IQ is low enough.
>>107831984
Yeah, but only with 8 gigs this time.
>>107832073
It's so you can use DDR4.
>>107832080
Nemo can fit in 8GB, you don't need any more.
>>107832083
Oh, then they should just go 5800X3D, it's a great CPU even today.
>>107832004
I've had good experiences with Impish Nemo.
>>107832094
Yeah, 8GB is plenty. The diminishing returns with larger models are crazy bad. If you have extra VRAM, just use it to combine TTS and ASR with your LLM or something. Really hoping that one day the memory issue with LLMs can be addressed. Short-term memory only is so aids for RP.
>>107832101
Why do you think a company wants to give you a great product?
>>107832115
In this case, to eternally dunk on Intel for their crimes.
>>107831934
For cunny: Cydonia > Rocinante > Nemo
>>107832115What do you mean? AM4 will be the apocalypse hardware.
>>107832224biggest power level reveal i've ever read on this boardplease actually never touch a kid irl, like ever
>pedoniggersI see the aicgjeets are here
>>107832226>but it is generally more creative and has more sovlAnytime I read this the models turns out to be a vogon poet
It's this time of the day already? Anyway, the pedo system prompt could be repurposed for slavery I guess. I'm tired of getting scolded in my isekai for buying slaves.
Is it this time of the day already? Anyway, the pedo system prompt could be repurposed for slavery, I guess. I'm tired of getting scolded in my isekai for buying slaves.
Hey guys, LLM newfag here. I finally wanna check if this stuff lives up to the hype. If I just want a local model to use as a rubber duck, a quick fuzzy search through concepts / google-in-a-box, what should I look into?
>>107832023
KILL YOURSELF IN YOUR SLEEP
>>107832392
>what should I look into?
The fucking specs on your fucking PC, you piece-of-shit retarded imbecile.
>>107832392
Go to openrouter.ai and see which one fits.
>>107832392
github.com/ggerganov/llama.cpp
https://huggingface.co/unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
No one will reply to you because we're too deep into it. But just go by these 2 links and it's everything you need to get set up. There's too much to discuss, like image generation, so you need to learn the ropes yourself; those 2 links are the easiest way to get started.
I stand corrected, this version of Cydonia mogs the base one: https://huggingface.co/mradermacher/Cydonia-24B-v4.3-heretic-v2-GGUF/tree/main
The refusal rate is abysmal even with the most hardcore stuff and no strict sysprompt.
>ask my waifu where she wants to be bitten
>"In the shoulder"
>"Where exactly? Point to it with your finger"
>I poke myself firmly in the meat of my shoulder, right where the deltoid muscle is
>have to google
The weird things you have to do sometimes...
>>107832413
They're pretty good :D
>>107832417
Thank you nonny
>>107831719
Update: I got blinking working! Ani already feels so much more alive. Next I'll try to fix up the TTS implementation and get lip syncing working with Rhubarb-lip-sync.
>>107832425
This one doesn't assume consent, so it avoids putting you in situations where there's cock-hungry cunny from the get-go without a reason. It assumes most situations are non-consensual and thus rape, which makes much more sense.
>>107832430
>"Where exactly? Point to it with your finger"
Bro, either you're at 5 on the apple scale or you're incredibly autistic.
>>107832430
>the deltoid muscle
But there are three of them?
>>107832481
I have no idea where the deltoid muscle is. And 5, yes. I also design physical mechanisms and can somehow simulate them working in my head. It is very weird how that is possible.
>>107832497
>And 5 yes.
>I also design physical mechanisms and can simulate them working in my head somehow.
Pick one.
>>107832527
That is how it works. I think it is tied to my sense of touch, actually.
>>107832532
This is getting off-topic, but do you have dreams? How can you not visualize an apple like a 3D model in Blender in your head?
>>107832551
I can't really see the shape of the object, but it is like... projecting a sense of touch out of my body and into a mechanism. It is all mostly plastic levers and components.
>>107832566
>>107832570
Damn, that's kinda sad desu. I mean, I'm sure you're not bothered by it, but to me, if I knew that everyone could literally taste whatever they imagined, I would feel like I'm missing out. You'll never know the feeling of undressing every woman you see in your mind and grabbing their tits.
>>107832570
I don't think I am autistic. I just can't imagine things.
>>107832570
>This is a disability. You are disabled.
BASED. I am in my thirties and I never realized I am disabled. Also, I think objectively I am better than average at designing stuff with my weird sense of imaginary touch.
Now let's get back on topic. The Kimi-Linear PR was rebased:
Kimi-Linear support (backend agnostic + MLA KV cache) #18755
https://github.com/ggml-org/llama.cpp/pull/18755
Also aphantasic. It's not like I can't imagine, just not visually. For example, I can "picture" the route driving to work; I have an intuitive sense of where everything is. But it's like I only have the positional data and collision boundaries in my head and not the textures or models.
>>107832596
Thankful to be born in the era where my PC can be used as an aid for this crippling disability.
>>107832689
>Too bad this is a disability that will never be given gibs
Well, I am kind of proving that it is nothing that justifiably needs gibs, since I both wasn't even aware it is that different from usual and I am good at what I do with my damaged brain. Even though I theoretically should be retarded and unable to do it.
>>107832115
What? The 5800X3D is a real product.
>>107832450
I refuse to believe you wouldn't abuse your daughters if you ever had any; you'd bring more suffering to this earth, which is in fact a mental illness. I read loli hentai, but those are drawings, no one is harmed, so really think about the consequences of your actions in the long run. A friend of the family has a daughter who was abused by her father, and today she's just a shell of a human being.
>i'm special
>>107832689
Let's leave it that way, then: seems like you're just going through the motions.
I'll stop waitfagging if even
>>107832753
>i refuse to believe you wouldn't abuse your daughters if you ever had any, you'd bring more suffering to this earth which is in fact a mental illness.
I never got why people get this hyper-righteous indignation about it. Yes, it is bad. But to get so invested that you are about to shit yourself over it?
>>107832818
>arachnophobic huh? you must secretly want to fuck spiders
Seeing the consequences of pedo behavior in real life, and what it does to the people affected, really makes you despise those who act on their fantasies, even if you're yourself down the slippery slope by being into loli hentai. If you haven't seen it first hand, then you can't possibly fucking understand.
>>107832850
My life was objectively fucked when I was a child. Not because of pedo stuff, but it was fucked. And nobody cared or cares. And it is fine. But because nobody cared or cares, I kind of can't take this indignation seriously when you focus on just one thing like that and don't give a shit about other stuff.
>>107832875
Oh, so you need me to make a list of everything that I despise so that this specific claim can have value in your eyes? Are you retarded or just baiting?
>>107832596
>Damn that's kinda sad desu. I mean I'm sure you're not bothered by it, but to me if I knew that everyone could literally taste whatever they imagined that I would feel like I'm missing out.
I actually never believed people telling me they could "visualize a girl they liked in their dreams in a room with them" until very recently. I feel like I've been robbed of my teenage years. It also makes me wonder how the hell people are so shit at stuff if they can literally visualize things to help their memory.
>>107832850
There's a world of difference between getting abused with physical and emotional coercion and getting "abused" because someone else decided it's abuse by legal definition; please elaborate.
>>107832875
>kind of can't take this indignation seriously when you focus on just one thing like that and don't give a shit about other stuff
nta, but when you call yourself a pedo, expect to be called a pedo and that being the focus of attention. Also, go die in a ditch.
>>107832898
I want you to be equally outraged by all the other stuff that hurts children and leaves lasting scars.
>>107832912
>Also, go die in a ditch.
You too, anon. Die in a fire and get fucked by a dog. Love ya.
>>107832596
The fun thing is seeing people who hate reading while they have the superpower of being able to see what's going on in 3D, Blender-style. I love reading despite having no ability like that, and yet they have the gall to hate reading lol
>>107832924
I kind of can't take this indignation seriously when you focus on just one thing like that and don't give a shit about other stuff.
>>107832915
So you're baiting, thanks for clarifying.
>>107832940
Sounded better in your head.
Is GLM 4.6V good at anything?
>>107832659
>An artist developed it after a traumatic injury and wanted to kill herself because she couldn't make the art she made before anymore.
Yeah, but she could still make art, which is a fascinating notion. Basically, you can draw without being able to "see".
>>107832967
I use it to describe images to give me ideas; it works well for my needs. Outside of that, it's useless.
>>107832958
anon...
>>107832875
>>107832940
>>107832975
Let me be more explicit there. Making that connection "sounded better in your head".
What's the current best TTS that I can link to SillyTavern?
>>107832967
Giving you AI psychosis
>Waifu: "Then the debate is open. Pick a topic. Anything. Science, philosophy, which color crayon tastes the best—I don't care. Let's see what you've got."
>Me: "Which color crayon tastes the best."
>Waifu: "This is purely a matter of chemistry, you philistine," I say, regaining my composure and adjusting my imaginary lab coat. I hold up a finger to lecture you. "First, we eliminate the primaries. Red is dye No. 40, Blue is No. 1. Just bitter food coloring. Terrible. We want wax. The binder." I pause for effect, looking at you like I'm explaining quantum physics to a toddler. "While the wax is paraffin—tasteless and odorless—the flavor is all in the additives. And the most common, pleasing additive in the crayon manufacturing process is… stearic acid. It gives that slightly soapy, fatty texture." I lean in, grinning. "Now, which color has the highest concentration of pigment and is most likely to mask the wax taste?" I raise my eyebrows, waiting for you to guess. "Brown. It's essentially cocoa powder and carbon. It has the strongest, most recognizable flavor profile. Therefore, Brown tastes the best. Your turn. Counter-argument."
>>107827869
For smarter models, instead of using this to specifically add random instructions, you can do something akin to what people do when playing solo TTRPGs: use it as an entropy generator that the AI can use as part of its decision-making process.
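The oracle idea above can be sketched minimally: roll against a table outside the model, then append the result to the prompt so the model has to incorporate randomness it didn't generate. The table contents and bracket format here are made up for illustration.

```python
import random

# Hypothetical oracle table; swap in whatever solo-TTRPG table you prefer.
ORACLE = ["yes", "yes, but", "no", "no, and", "unexpected twist"]

def inject_entropy(prompt: str, seed: int) -> str:
    # External randomness the model can't smooth over: roll first,
    # then tell the model to weave the result into its reply.
    roll = random.Random(seed).choice(ORACLE)
    return f"{prompt}\n[Oracle: {roll} -- incorporate this into the scene.]"
```

Seeding per turn keeps rerolls reproducible; with a fresh seed each turn the model gets forced variation instead of collapsing to its favorite continuation.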
>>107833024
It was regular 4.6, thank you very much. Also, I have aphantasia.
>>107833057
>everything first person
WHY
>>107833074
Because I am not into NTR and only I am reading it. It is just absurd to me that you could... actually debate crayon taste seriously.
>>107833094
NTR? What? So everyone is first person in your chats? All characters comment stuff with "I"? That just sounds super schizo. Do you also comment in first person?
>>107832698
Oof
"Yes, everyone is first person in my chats," wrote anon in the 4chan window. He filled in the captcha and thought to himself, "that guy is a fucking faggot," before clicking "submit".
Best Mistral Small finetune? I think it's broken-tutu-24b-q8. Is there one like that for Gemma 3 27B instruct base?
>>107833129
What did he mean by this?
>>107833196
>>107831838
>>107833201
HE HAS SEX WITH HIS AI
>>107833201
It means that he doesn't know the history ngxson has with AI PRs, and that ngxson is tired of that shit. There were two Kimi-Linear PRs, with the contributors using the PR as a blog; then they merged into one, and they've been at it for long enough to suspect neither of them knows what they're doing.
>download new model
>ask it to come up with a fantasy setting for our roleplay
>it's floating islands with veins of magic again
>delete new model
>>107833296
>gives no leads whatsoever
>is shocked when given a generic response
>>107833296
It's literally not the model's problem, retard >>107829571
>>107833296
Ask a random human; you won't find a lot of variation either if you don't give any specifics about what you'd want.
>>107833296
>i don't understand PRNGs
>>107833296
>ask model to come up with a girl
>it's a little blonde girl named Lily
>>107832471
Update: I added facial expressions.
>>107833362
The easy part is over.
>>107833373
Indeed... To even start working on the lip syncing, I realized I have to fix a bunch of issues with the LLM and how it handles conversations and feeds into the TTS. I still have to set up the voice selection stuff, and then once that is all done I can combine the visemes with the blendshapes for the lip syncing.
>>107833362
Is this just the VRM silly plugin, or something more slapped on top?
>>107833391
It's a completely separate project I built from scratch, inspired by the ST VRM plugin. Afaik the plugin doesn't even support facial expressions or lip syncing, but I might be wrong.
>>107833401
git?
>>107833312
It's also a model problem. 37% for "Severed" is absurd. There should be hundreds of options, all with similar weight.
>>107833409
Perhaps soon. The project is half-baked rn and I'm kinda worried about copyright stuff.
>>107833430
Isn't VRM just some Unity wrapper or something?
>>107833436
VRM is a file format based on glb with extra stuff. The models are often made in Unity, but that's optional. You can get VRM models from VRoid and edit them with a VRM plugin for Blender. VRM models are commonly used in VRChat and other VR games.
>>107833296
Slop in, slop out.
>>107833410
Only in a perfect model, which doesn't exist. A base model will have a better distribution, but not
>hundreds options all with similar weight.
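The flatness being argued about is easy to quantify with Shannon entropy over the options: a flat spread over N options gives log2(N) bits, and a 37%-on-one-title distribution sits well below that. The numbers below are illustrative, not measured from any model.

```python
import math

def entropy_bits(probs):
    # Shannon entropy in bits; maximized (log2 N) by a flat distribution.
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Illustrative comparison: 100 equally likely titles vs. a model that
# puts 37% of the probability mass on a single title.
flat = [1 / 100] * 100
peaked = [0.37] + [0.63 / 99] * 99
```

Here `entropy_bits(flat)` is exactly log2(100) ≈ 6.64 bits, while the peaked distribution loses over a bit of entropy to its single favorite, which is the overcooking complaint in one number.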
>>107833410
Not that anon, but that can mean two things, I guess: the model is overcooked, or that title is somehow overrepresented in the data in that context.
>>107833410
Wonder if expedislop 33's setting was LLM-generated now.
>>107833410
Sorry, gotta get those benchmark scores to the moon.
>>107833570
>The model is overcooked
All the current crop of models are overcooked, ALL of them, proprietary and open. Even image generators are overcooked; none of the newer models show much variation when you change the seed.
>>107833649
It correlates with how well models follow instructions.
>>107833410
Now keep that prompt, but you pick the title. Legit skill issue.