/lmg/ - a general dedicated to the discussion and development of local language models.

Evil Teto Edition

Previous threads: >>107717246 & >>107709248

►News
>(12/31) Qwen-Image-2512 released: https://hf.co/Qwen/Qwen-Image-2512
>(12/29) HY-Motion 1.0 text-to-3D human motion generation models released: https://hf.co/tencent/HY-Motion-1.0
>(12/29) WeDLM-8B-Instruct diffusion language model released: https://hf.co/tencent/WeDLM-8B-Instruct
>(12/29) Llama-3.3-8B-Instruct weights leaked: https://hf.co/allura-forge/Llama-3.3-8B-Instruct
>(12/26) MiniMax-M2.1 released: https://minimax.io/news/minimax-m21
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>107722977
>Evil Teto Edition
I can see where you took the inspiration.
>>107722977
Can you use different Nvidia GPUs together with CUDA?
>>107723052
Yes.
>>107723052
of course not
>>107722992
Elarabros...
>>107723052
Sometimes
Is Ozone here to stay? I had totally forgotten it existed before AI. I remember being a kid when there was this huge scare about the ozone hole and global warming.
>>107722992
I get much better results when it randomly pulls from a list of substitutions.
>>107723052
As long as they are newer than Turing.
>>107723059
>>107723060
>>107723067
Guys, pls. Like, if I have a 16GB P5000 in an old workstation, would it make sense to add a 12GB 3060 Ti that's just gathering dust?
>>107723108
No. CUDA support is deprecated for Pascal.
>>107723108
Legit answer: no, because your old shit won't have drivers compatible with the new one.
>>107723102
Ah, shame.
>>107723082
Ozone, you say? You want the whole deal?
>>107723114
Uh, it still supports CUDA 12.
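For what it's worth, llama.cpp's CUDA backend does run across mixed GPU generations, and its `--tensor-split` flag takes per-GPU proportions. A toy sketch, assuming you simply weight by VRAM (the real flag accepts arbitrary ratios and normalizes them):

```python
def tensor_split(vram_gb):
    """Proportional per-GPU weights, in the style of llama.cpp's
    --tensor-split flag (which normalizes whatever ratios you pass)."""
    total = sum(vram_gb)
    return [round(v / total, 2) for v in vram_gb]

# e.g. a 16GB card plus a 12GB card -> roughly "--tensor-split 0.57,0.43"
print(tensor_split([16, 12]))
```

In practice you can just pass the raw VRAM numbers (`--tensor-split 16,12`) and let llama.cpp normalize them; the helper is only to show what the ratio means.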
>>107723133
Why even ask if you won't listen to the answers?
Does someone know a local uncensored chatbot model in the 12-24B range for a 5060 Ti 16GB?
>>107723152
Drummer.
So guys. We all bitch about slop and so on. But am I the only one who really likes talking to characters roleplayed by AI? Once you get past the shitty alignment issues, I really get the feeling that they are more interesting than most real people. Also, to be absolutely clear, I don't use any worthless shittunes.
>>107723152
Go for Nemo.
>>107723162
No. They're vapid and they all have one of like three personalities.
>>107723162
If you made them or selected them according to your interests, it's obvious you would find them interesting.
>>107723176
Too dumb.
>>107723180
Funny thing is, I didn't. And when I asked it to RP some waifus I liked... I actually didn't enjoy those as much as random waifus.
>>107723186
Well, it's the best you're going to get with that hardware.
>>107723146
Sorry, I'm dumb and incompetent.
>>107723212
>>107723118
►Recent Highlights from the Previous Thread: >>107717246

--Multimodal AI progress disparity: image/video vs text generation challenges:
>107721240 >107721272 >107721289 >107721382 >107721399 >107721415 >107721644 >107721489 >107721518 >107721552 >107721525 >107721777 >107721599
--Qwen-Image 2512 model release and developmental journey:
>107719430 >107719475 >107719500 >107719475 >107719481 >107719929 >107720284 >107720309 >107720339
--Deepseek model quant compatibility and bug troubleshooting:
>107720142 >107720169 >107720178 >107720214 >107720191 >107720279
--Quantization benchmarking methods for language models:
>107718638 >107718682
--Solar-Open-100B model release and community interest in uncensored variants:
>107719372 >107719424 >107719411
--New 500b MOE model announced, GGUF support questioned:
>107720510 >107720553 >107720624
--AI startup strategies in China: breakthrough models vs short-term gains:
>107722114 >107722383 >107722436 >107722461 >107722531
--96GB VRAM optimization strategies for large models:
>107717404 >107717410 >107717414 >107717490 >107720591
--Multilingual model VAETKI-112B-A10B with 112.2B parameters announced:
>107719493
--Exploring Hunyuan motion 1.0 with UE5 integration and VRAM needs:
>107718150 >107718171 >107718298
--Google's cautious approach to releasing powerful open AI models vs open-source competition:
>107718631 >107718739 >107718769 >107718982 >107718986
--Moonshot AI's K3 model scaling ambitions and market positioning:
>107722422 >107722488 >107722536 >107722906
--K-EXAONE-236B-A23B model announcement on Hugging Face:
>107719396
--Miku (free space):
>107717575 >107717643 >107718169 >107719149 >107719481 >107719742 >107719929 >107720110 >107722205 >107722934

►Recent Highlight Posts from the Previous Thread: >>107717250

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>107723152
It is scientifically impossible to make a model better than Nemo in that single-GPU range now. Everything is safety- and ScaleAI-maxxed now. You would need a radical shift in AI culture where it is suddenly OK to make models for coomers.
>>107723227
Why are they kissing? They are both girls.
>>107723218
Shame, but thanks.
>>107723152
https://huggingface.co/bartowski/Rocinante-12B-v1.1-GGUF
Get Q8.
>>107723176
>>107723233
The recommended models list has Mistral Small over Nemo; does that still work with 16GB?
>>107723252
Better than just Nemo?
>>107723284
No.
>>107723284
Yes.
>>107723284
Maybe.
>>107723284
I don't know.
>>107723318
>>107723313
>>107723307
>>107723295
Goys...
>>107723327
It's hornier. It's definitely not smarter.
>>107723284
No.
>>107723252
Die, faggot.
>>107723227
Lovely pic, recapanon, tho Teto's raised and frankly masculine hand makes her seem dominant, which we all understand isn't how it goes down.
Happy New Year /lmg/
>>107723333
Nemo is already a horndog.
Don't forget to upgrade before it's too late.
>>107723371
>5090 for 5000
That sounds like a joke.
>>107723371
Fucking paperwork BS literally made me unable to upgrade; been waiting for my money for 6 months now.....
>>107723371
People already pay more than that.
>>107723240
>They are both girls.
Teto is a chimera.
>>107723352
Are you sure? This is one of the gens I got while going for >>107712939
>>107723382
But you have a few more pieces of the special sand.
>>107723233
Ministral 3 14B seems OK and not safemaxxed... when it works.
Unfortunately, it's generally retarded for the first few messages even at low temperature, and it just wants to italicize everything and use its own dialogue format unless you keep editing messages until it eventually gets it. Character adherence is generally not good either; it turns even shy girls into sluts. I'm not sure what went wrong when Mistral trained the model(s). Hopefully they'll fix the issues in the next version.
What if they just want to get rid of 50 series stock before 60 drops?
>>107723417
blo...
>>107723417
>before 60 drops?
Anon... there will be no 60, only RTX PRO.
>>107713630
I want to go make my own quants. Can you spoonfeed me the command you use for making them?
>>107723397
>inverted roles erotic
Which one feels right to you?
Also curious how they were made; what is model vs postprocess?
They're really cool; they convey a lot with a limited palette and precise geometry. Could easily be album covers.
Now bend over.
GLM lost.
>b-but lmarena doesn't matter
Cope. LMArena is the only benchmark all the big players care about. Chinkcels fucked up big time with 4.7, just like they fucked up with DS 3.1.
>>107723382
Why is it outside your system?
>>107723544
Based bharati.
>>107723547
Flipping for profits.
>>107723544
LMArena is people who can't afford to run inference anywhere else. They do one drive-by question and run. If the model yaps lots of nonsense, the thirdies score it high.
Is this a good use of AI?
https://litter.catbox.moe/qeoazf77nhay7bx4.mp4
>>107723186
How much RAM do you have? You could run GLM 4.5 Air slowly if you have an absolute assload of system RAM. Gemma is much smarter at 27B (and way easier to run; it should run great), but it's safetycucked.
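Rough back-of-the-envelope for the "assload of system RAM" route; a sketch assuming weight size ≈ params × bits-per-weight / 8, ignoring KV cache and context growth, so treat the answer as optimistic:

```python
def fits(params_b, bpw, vram_gb, ram_gb, overhead_gb=4.0):
    """Very rough check: do quantized weights plus a fixed overhead
    fit in combined VRAM + system RAM? Ignores KV-cache growth."""
    size_gb = params_b * bpw / 8
    return size_gb + overhead_gb <= vram_gb + ram_gb

# GLM 4.5 Air is ~106B params; at ~4.5 bpw on 16GB VRAM + 64GB RAM
print(fits(106, 4.5, 16, 64))
```

If it fits only by spilling heavily into system RAM, expect the "slowly" part: token speed is roughly bounded by how fast the CPU side can stream the offloaded weights.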
>>107723233
This is the truth of the matter, sadly. There are barely any mid-size models out there anymore; they don't want people having access to anything that isn't either entirely unfeasible to run or braindead retarded.
>>107723382
This nigga does furry RP for sure. They're all millionaires for some reason.
>>107723517
Oh hey, that's also my gen.
>Which one feels right to you?
I'm sure they switch.
>Also curious how they were made what is model vs postprocess?
Just model.
noob vpred 1.0
(flat colors:1.2), silhouette, black background, red body, aqua body
outline in neg
>>107723574
>is this a good use
Better than some, because it's funny to witness the seethe.
>>107723371
Already way too poor to buy a GPU; I have to use runpod and rent one for $0.40/hour. Bonus is the code becomes deployable and I get to add Docker containers to my resume.
Software is done for in the next year or two; I'm switching to learning model design. We should probably have a thread for model design discussions--once we actually have enough local skill to do it.
>>107723574
This reminds me of the good old "the internet is for porn" machinima song. The world revolves around pussy; what can you do?
>>107723612
>Model design
Do we even have access to enough datasets to do any sort of real training? We've got Books3 and that's about it; "Open"AI made sure any others were annihilated or totally closed off.
>>107723595
Based.
>>107723574
Behold, the true face of humanity!
>>107723574
Let them SEETHE.
>>107723544
Yes sir, llama4 very good in the arena.
>>107723409
>{{char}} has giant tits, plump lips, a fat ass and likes giving head and swallowing cum
(700 tokens describing her underpants and private parts)
>{{char}} is shy
This is your card.
Nemo is like a retarded, perverted 80-year-old man with dementia. Use Mistral Small instead.
>>107723574
What the fuck is this shit at the end of the video?
>>107723689
>>107723633
There are a lot of datasets these days, but for something competing with OpenAI I don't really know. The skills imparted from improving existing models may be very valuable.
>>107723574
At the very least they should be categorized rating:g,s,q+ for user experience.
>>107723680
I do wish we had gotten the pre-release versions, not "Maverick Experimental", which has some safety baked in.
>>107723707
>There are a lot
Are there? Are you including synthetic ones? Because those won't help the problem; they're what created it in the first place. We need high-quality human data.
>>107723371
Damn, if I cared more about imgen/videogen I might've bought a spare 5090 just in case.
>>107723574
>>107723720
Man... so sloppy... I'd honestly take a Mixtral response over this any day.
>>107723574
No.
There's a gorillion pictures and videos of vanilla slop. "AI" ought to be used for niche fetishes where the amount of available material is low, especially if you try to find something that contains multiple of them at the same time.
>>107722977
>wednesday
>>107723755
This man's got a point.
>>107719372
uohhhhhhh! brother's soft round belly! erotic! ToT
>>107723684
Even after removing pretty much anything vaguely sex-related, Ministral 3 remains easily triggered compared to other official instruct models. A 3-word card is not a realistic use case.
>>107723755
Truth fucking nuke. Let me use it to make stomach-growling content; it's not like anyone else is gonna.
My brain is happy with 4.7. My penis is not sore and therefore disappointed.
>>107723574
Porn, porn, fat porn, gay porn, porn, porn, porn, political joke, big-ass porn, fat porn, gay porn, meme, porn, lesbian porn, random guy, giant goblin at the stadium, fat porn, political joke, gay porn, 2 random guys, porn, nazi porn, porn, porn, sportsball.
Pretty representative of what the people actually want.
>>107723755
The point is they are editing other people's pictures, uploaded to X, without consent.
>>107723574
Cherry-picked, blergh. I wanna see from x.ai what the top referenced gens are over a period.
>>107723668
>gooners DIYing what they want from the performers in public comments
Wow, that's gotta be demoralising; hope they realise and find a better path. Thx for all the training data, ladies.
>>107723768
Is this the sign of a truly depraved mind or a highly censored one? Hard to tell.
Also, what about EXAONE and that 500B?
>>107723755
TRVKE
I only use AI to generate footjob porn of anime characters wearing ribbed or frilled socks.
>>107723828
>Is this the sign of a truly depraved mind or a highly censored one? Hard to tell.
It's my personal opinion that a lightly censored (not lobotomized) model is better than any entirely uncensored one. It'll take things in unexpected lurid directions (like belly licking to avoid cock in this one), add tension and buildup as it hesitates on the lewd stuff before giving in, etc.
>>107723835
>add tension and buildup as it hesitates on the lewd stuff
Oh, fuck you. You reminded me of 2024 and all those models that were beating around the bush for 10k tokens.
>>107723689
The video at the top is actual leaked footage from an AI lab funded by DARPA. The first time that video was posted was before 2021.
>>107723828
>Footfag
>Stockings
Not obscure at all; get out, poser.
>>107723858
>>107723849
Well, that's why I said lightly censored. Just a little hesitation is good: enough that it'll go for sex without doing the 2024 cloud-model shit you mentioned, but not so little that it just hops immediately into SEEEEEEEEEX AHHHHH SEXO DA.
>>107723858
Yeah, amputee tentacle porn is the bare minimum to qualify.
>>107723877
If ribbed socks are all you can get off on, then I GUESS that's pretty obscure and you can join the club... I GUESS.
>>107723888
Feet are literally the most common fetish, cmon, anon.
>>107723574
I kneel, Elon.
>>107723808
>Also what about exaone and that 500B?
I only do models I can run in llama.cpp. Someone very quickly added Open Solar, so I tried it: https://github.com/ggml-org/llama.cpp/pull/18511
>>107723835
Picking cock gets you pic related, so maybe there's some merit to this.
>>107723751
Yes, they were quite sloppy, but they were fun models and didn't seem to refuse anything that passed through LMArena's moderation. I would have liked to test them with a custom prompt.
>Uses Q8_0 for embed and output weights.
So do I use Q3_K_XL over Q4_K_S?
Can someone spoonfeed me?
>>107723905
Is this text completion / not chat-tuned? I didn't figure there were many models like that left.
>>107723951
Pretty much every model except for gpt-oss works just fine with text completion.
>>107723971
Yeah, it's more about not being assistant-slopped, I guess. How do you run completion inference? What's your setup?
>>107724007
Mikupad; it's in the OP.
>>107723921
>embed and output weights.
Those are like 2GB at Q8.
>>107723921
>Q3_K_XL over Q4_K_S
Doesn't really matter.
Bigger on disk / in memory = generally better output.
Some quants have performance impacts; YMMV.
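Ballpark for the Q3_K_XL-vs-Q4_K_S question. The bits-per-weight figures below are rough assumptions, not exact GGUF math (real sizes depend on the per-tensor mix, such as the Q8_0 embed/output weights), so check the actual file sizes on the repo page:

```python
# approximate average bits-per-weight per quant type (assumed values)
BPW = {"Q3_K_XL": 3.8, "Q4_K_S": 4.6, "Q8_0": 8.5}

def approx_size_gb(params_b, quant):
    """Weights-only size estimate: params * bits-per-weight / 8."""
    return round(params_b * BPW[quant] / 8, 1)

for q in BPW:                      # a 12B model like Nemo
    print(q, approx_size_gb(12, q), "GB")
```

The point of the thread's rule of thumb falls out directly: the bigger file is the higher-bpw quant, and within the same family, higher bpw generally means better output.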
>>107724106
I kept seeing that unsloth fucks their quants up. So I went to bartowski, but he's got so many variants it's difficult to know which to prioritize.
I'm having trouble finding models that can consistently create 2560x1440 wallpapers; anyone here have any idea?
>>107724148
Gen smaller and then use an AI upscaler.
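The naive baseline for "gen smaller, then upscale" (e.g. gen at 1280x720, then 2x to 2560x1440) is nearest-neighbour; an AI upscaler (ESRGAN-style) replaces this pixel repetition with synthesized detail. A stdlib-only toy on a 2D grid:

```python
def upscale_nn(pixels, factor):
    """Nearest-neighbour upscale of a 2D pixel grid: each source
    pixel becomes a factor x factor block. An AI upscaler would
    synthesize new detail here instead of repeating pixels."""
    h, w = len(pixels), len(pixels[0])
    return [[pixels[y // factor][x // factor] for x in range(w * factor)]
            for y in range(h * factor)]

print(upscale_nn([[1, 2], [3, 4]], 2))
```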
>>107724136
They do often have to reupload stuff, but most of it is because of the fucked-up chat templates. llama.cpp only supports a subset of Jinja features, so the official templates, meant to be used with Python, don't always work properly, especially when used with tool calls.
>>107724136
Just make your own, ffs.
>>107724136
>which to prioritize
Figure out the biggest thing you can run reasonably for your use case with your CPU/GPU/RAM config.
unsloth have shown themselves incompetent repeatedly; personally I run barty's GLM 4.7 I-quants.
How much RAM + VRAM you have is what matters.
>>107724169
>most of it is because of the fucked up chat templates.
>flashbacks to their few-megs quants
>>107724208
Can you give the command you use to make them? Anon's original problem was what type of quant is best. Sure, he can make his own quant, but he still doesn't know what makes for the best quant.
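For reference, the usual llama.cpp flow is two steps: `convert_hf_to_gguf.py` to get a full-precision GGUF, then `llama-quantize` to produce the quant type you want. A sketch that just builds the two commands (the paths are hypothetical; it assumes you run from a llama.cpp checkout with binaries built):

```python
from pathlib import Path

def quant_commands(model_dir, quant="Q4_K_M"):
    """Build the two llama.cpp commands: HF weights -> GGUF, then
    quantize. Paths assume a built llama.cpp checkout as cwd."""
    name = Path(model_dir).name
    f16 = f"{name}-f16.gguf"
    out = f"{name}-{quant}.gguf"
    return [
        ["python", "convert_hf_to_gguf.py", model_dir, "--outfile", f16],
        ["./llama-quantize", f16, out, quant],
    ]

for cmd in quant_commands("Mistral-Nemo-Instruct-2407"):
    print(" ".join(cmd))
```

Note that `llama-quantize` also accepts an `--imatrix` file; importance-matrix calibration is part of why quanters' low-bit uploads differ from naive conversions.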