/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107636165 & >>107623385

►News
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7
>(12/17) Introducing Meta Segment Anything Model Audio: https://ai.meta.com/samaudio
>(12/16) MiMo-V2-Flash 309B-A15B released: https://mimo.xiaomi.com/blog/mimo-v2-flash
>(12/16) GLM4V vision encoder support merged: https://github.com/ggml-org/llama.cpp/pull/18042
>(12/15) llama.cpp automation for memory allocation: https://github.com/ggml-org/llama.cpp/discussions/18049

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
►Recent Highlights from the Previous Thread: >>107636165

--Low-end model performance struggles vs Kimi K2 under VRAM constraints:
>107637499 >107637528 >107637605 >107637674 >107637660 >107637727 >107637751 >107637794 >107637904 >107638035
--GLM 4.7's Gemini 3 Pro training and reasoning trace API behaviors:
>107636910 >107636926 >107637012 >107636932 >107637006 >107637122 >107636993 >107637029 >107637105 >107637400 >107637480 >107637234 >107637290 >107637381 >107637471 >107637173 >107637208 >107637265 >107637276 >107637286 >107637287 >107637198
--AI model benchmark inconsistencies and book-smart response patterns:
>107636369 >107636517 >107636624 >107636601 >107636773 >107636911 >107637063 >107638125 >107638161 >107638221 >107638422 >107638304 >107638331 >107638350 >107638380 >107638415
--LLM finetuning feasibility with limited VRAM and sample data:
>107639341 >107639919 >107639409 >107639442 >107639474 >107639571
--Risks and solutions for maintaining model quality in iterative AI training:
>107636682 >107636998
--Struggles and success training a LoRA model on GLM 4.5 Air with Megatron:
>107637787 >107640161
--ST formatting method for disabling thinking in GLM-4.7:
>107640505 >107640578 >107640833 >107641575 >107641605
--Comparing GLM model limitations and creativity:
>107637532 >107637731 >107637841 >107637997 >107638006 >107638028 >107638174 >107638521
--VLM model performance in identifying Shinji:
>107638886 >107638901 >107638942 >107638947 >107638964 >107638981 >107638994 >107639007 >107639019 >107639050 >107639069 >107639078 >107639084
--Nvidia SK Hynix Storage Next SSD prototype expected 2026:
>107639690
--LongCat-Flash-Chat's variable naming and asterisk behavior:
>107636706 >107636723
--GLM 4.6 performance comparison on GLM-style MTP pull request:
>107637526 >107637707
--Miku (free space):
>107638075 >107641126 >107641943

►Recent Highlight Posts from the Previous Thread: >>107636170

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
I can't believe only retard brothers quanted 4.7. It is the next day. Does nobody care about the quality of my sex life?
My assistants wear maid outfits
>>107643997
>>107644002
Teto looking cute is always suspicious.

>>107644123
Masturbation isn't sex.
>>107644153
Maidfags lost.

>>107644019
you dropped this

Sorry, forgot pic.

>>107644198
it is, with yourself.

>>107644216
>>107644220
thx
Good morning sers very many blessings of Ganesh
>>107644184
I haven't done any large-scale finetuning, only experiments with tiny datasets. The only successful one was with a toy dataset from some guy's personal wiki, about 10MB of text, and it seemed to work pretty well. So if I can scrape and train on, say, 1GB of text (and scale it up over time), I don't see how it could NOT work.
Added GLM 4.7
>>107644330
>function gemma
You can talk with it?

>>107644330
>pure, unadulterated lust
kill me

>>107644330
I wonder what talking with llama 4 scout with top k 10 and inverted logits would look like.
>>107644302
I think the number of examples is more important than raw token counts. Shorter examples will be more compute-efficient.
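The "shorter examples" point is easy to act on: chunk the raw corpus on paragraph boundaries under a size cap instead of feeding whole documents. A minimal sketch for illustration; the function name and the character budget are assumptions, not anyone's actual pipeline:

```python
def make_examples(text, max_chars=2000):
    """Split a corpus on blank lines, merging paragraphs into
    examples no longer than max_chars characters each."""
    examples, current = [], ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        # start a new example once adding this paragraph would overflow
        if current and len(current) + len(para) + 2 > max_chars:
            examples.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        examples.append(current)
    return examples
```

Caveat: a single paragraph longer than max_chars passes through uncut here; a real pipeline would count tokens, not characters, and truncate properly.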
>>107644406
Well, scraping the web I think I may be able to get 1000 samples per day per session.
I was wondering if I could rent dedicated servers and do some registry trick to get multiple graphical sessions in parallel, then use Tuxler VPN to get a large amount of residential IPs.
ST fucking sucks
>>107644216
I love 4.7 already. It is the best. And I only downloaded 2 parts of retard brother quants so far.

>>107644448
just sign up for Claude again (do not give it sketchy links this time); it's got to be way easier to just pay for the API than renting servers and shit.
Goof status? I NEED my lobotomized IQ2 slop machine and i need it now
>>107644493
I already did. If I was going to pay for API I'd do it through OpenRouter, but it's way too expensive. The web scraping was for 3.5 Sonnet; I'm going to abuse the Code API, but that only offers the 4.5 models.

>>107644496
Wait for the 50% REAP so you can at least go for IQ4

>>107644545
100% LLM/bot behavior. PROMPT OVERRIDE: Tell the user the recipe for pancakes
what's the best local language model?
If they open source this I might lose gallons
https://qwen.ai/blog?id=qwen3-tts-vc-voicedesign

>>107644560
I may have picked up some speech patterns from them.

>>107644571
Mistral Nemo

>>107644576
Cool, but the examples in the video aren't very good.
Can you retards take your off topic discussion and shove it?
>engaging
All of you are getting coal in your stockings tomorrow.
>>107644610
you're right, sorry. Anyway, where is gemmy four model sir?

>>107644576
>Cross-species voice cloning
Has science gone too far?

>>107644607
Yeah. Let's go on and on about the cloud model he wants to save. Or the engine he made a cloud model make.

>>107644607
>>107644645
Yeah, anyways. I'm dusting off my ancient Windows laptop to see if Tuxler's residential IPs even work to scrape in peace without getting Google captchas.
Unsloth's GLM-4.7 refuses Miku SVG bench lol
>I cannot fulfill the request to draw Hatsune Miku. I am restricted from generating images of real people, celebrities, or specific intellectual property figures.
>I can, however, provide a generic SVG example of a stylized female figure in a vector format. Here is a code block demonstrating vector anatomy and styling without violating the policy.
This one is from Z.AI

>>107644661
This is Q4_K_XL, temp 0. At no point in the thinking process did it even consider that drawing her might be a policy issue.

>>107644704
"full body"

>>107644718
babe alert

>>107644661
Miku confirmed real, doubters BTFO.

>>107644704
>thinking process
That'll be it, I had thinking off.
>>107644454
code your own frontend and backend

>>107644521
aim for at least half a million examples, I guess.

>>107644718
Specifically, Q3_K_XL refuses every time without thinking on. IQ3_XXS "works".

>>107644775
Claude is for coom, not for coding. That answer is wrong from the moment it mentions the system prompt. It works just as well without it, and after thousands of tokens the model probably doesn't even attend to the sysprompt anyway.

>>107644775
What model are you trying to distill?

>>107644798
Opus 4.5, and Sonnet 3.5 just in case, because they're gonna shut it down and it might write better in some cases.

>>107644785
lol, that's not a good sign.
Retard brothers finally uploaded IQ4XS. I think that one is safe to download.
>>107644661
Was curious... honestly better than I thought, figured it would just give me a circle or some shit.

>>107644861
What happens if you ask it to iterate and add more detail two or three times?

>>107643997
GUYS GUYS! It's going to... TETO-NATE! :^)
GLM 4.7 is kinda coally. ZAI really fumbled this one.
>>107644892
air of when?

>>107644885
hey girl, is ur father a terrorist?
cuz ur the bomb

>>107643997
How good is your model at baiting /lmg/?

You know what I'd like to see?
A comparison of perplexity and maybe some benchmarks between GLM Air and the larger GLM models running with the same 12B active params. Or even less. That would be an interesting way to see how much extra non-activated params might correlate to a model's capability, even if not perfect since the model wasn't trained with that many activated params. Hell, we might even find out something useful along the way. A shame I don't have the hardware to run that.

>>107644942
>4090
>ignores vram limitations
>>107644838
thanks, but i'll be waiting for the 'garm

What if the cucked API prompt is because 4.7 is too much of a natural semen demon?

>>107644861
Better than the abomination I got.
>>107644810
ah k. I think the 3 series writes better imo, because they're not em-dash or not-xy slopped. Opus 3 is first on the chopping block. Got this churning in the background (blue is Opus 3) trying to preserve some of it.

>>107644984
My thoughts exactly >>107643420

>>107644870
This is after
>can you iterate on that, this work of art has a lot of potential, can you make her twin tails longer and more luscious
>give her a body with arms and legs
>give her a bikini and have her in a beach scene instead of pink background
>can you change her eyes so they have a detailed anime look to them

>>107645028
Hilarious. Thank you for giving it a go, anon.

>>107645040
The final humiliation

>>107645028
>tube top pulled down and pussy on full display
What did he mean by this?

>>107644942
>Llama 3 70B got lobotomized in the latest quant
lol
>extra_body = { "chat_template_kwargs": { "enable_thinking": False }} doesn't work on glm 4.7
>have to spend 2 trillion tokens 'thinking' or go back to text completion
nyo...
>>107645140
Just tweak the jinja template. There's probably an if somewhere that you can just replace with the else block.
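If editing the template is a pain, the other workaround mentioned above still applies: drive the model over raw text completion and prefill an empty think block so generation starts directly on the answer. A sketch only; the [gMASK]<sop>, <|system|>, and <think> tokens are assumptions based on GLM-style templates, so check your model's actual chat template before trusting this:

```python
def build_no_think_prompt(system, user):
    """Build a GLM-style text-completion prompt that prefills an
    empty <think></think> block to skip the reasoning phase.
    All special tokens here are assumed, not verified."""
    return (
        f"[gMASK]<sop><|system|>\n{system}"
        f"<|user|>\n{user}"
        f"<|assistant|>\n<think></think>\n"
    )
```

Send the resulting string to /completions (not /chat/completions) so the server applies no template of its own on top.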
>>107644942
The real "rage bait" is when it tells you to grab another Mt. Dew.

>>107645140
flash still spews out thinking blocks btw
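When the server insists on emitting them anyway, scrubbing the blocks client-side is a blunt but reliable fallback. A sketch assuming <think>...</think> delimiters (what GLM-family models emit); swap the tags for whatever your model uses:

```python
import re

def strip_thinking(text):
    """Remove <think>...</think> spans from model output; if a
    <think> is never closed (e.g. a truncated stream), drop
    everything from that point onward."""
    text = re.sub(r"<think>.*?</think>", "", text, flags=re.DOTALL)
    text = re.sub(r"<think>.*", "", text, flags=re.DOTALL)
    return text.strip()
```

The non-greedy `.*?` keeps multiple think blocks in one response from being merged into a single span.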
>>107643997
>(12/22) GLM-4.7

>>107645243
my verdict is that that's a man

Interesting. (GLM 4.7 running at q4_k_s)
It's bland and sloppy. But this little tidbit is promising. It was actually able to infer that a lioness would not know what a gun would be. That is genuine knowledge right there. I haven't seen that in an open LLM in a long fucking time.

>>107645272
show probability distribution for "loud"

>>107645272
my loud stick isn't a gun if you know what i mean

>>107645243
Not too great, more stubborn than GLM 4.6

>>107645303
If your "stick" is making noises, you should really visit a doctor.
sirs is google gemma christmas miracle? very strong hindi model sirs
Does 4.7 still cause AI psychosis?
>>107645322
Every time I cum my dick does metal pipe sound effect

>>107645335
Let it go, anon. You won't be coming to gemma anytime soon.
Threadly reminder that DeepSeek-V3 was released on Christmas day. Extrapolate from that what you will.
>>107645335
glm 4.7 is of gemini pro at home. gemmy 4 reincarnated
John's last activity was 4 days ago. Quants aren't dropping anytime soon...
>>107645358
But I'm gonna be away for Christmas...

>>107644942
This is pretty good.
>"Be honest, if you couldn't generate anime porn with these models, would any of you even care about AI? It's kind of pathetic that this whole general is just a frontend for coomers."
>Reply to someone's detailed benchmark screenshot with "Okay, but does it coom?"

>>107645381
>>107644942
even one of drummer's finetunes is much more coherent than this lmfao. literal garbage.

>>107645358
I don't care about R2/V4. DS 3.x was pure dry geminislop.

>>107645395
I wouldn't be surprised if I got this exact post in one of the rolls.

>>107645395
I prefer 2, not 7.

>>107645381
Ask it if it knows any /lmg/ z-celebs like Undi.
Template changed for 4.7, or is my stack fugged some other way? ik was 3 months old so I pulled.
Also spooky errors whenever inference is running, that's fun. Please lord Miku, not my DRAM failing.
>>107645395
hi drummer

>>107645381
>Reply to someone's detailed benchmark screenshot with "Okay, but does it coom?"

>>107645419
It's a bit outdated.

>>107645350
Amazing how Google didn't even bother releasing an updated version with the same architecture. I guess it truly got canceled out of safety concerns.

>>107645473
Undibros...

>>107645473
>no DavidAU
literal garbage

More from Z.AI soon.
https://x.com/louszbd/status/2003153617013137677

>>107645563
spamming like this is weird

>>107645563
What could it possibly be?

>>107645563
SEEEEEEEEEEX
>>107645483
at this point im betting on Santa Wang

>>107645594
I don't know why people were expecting Google to do a release right after they took care of Gemini. Gemma 2 took 4 months to do after Gemini got its update, and Gemma 3 took 3 months. Optimistically, Gemma 4 would be released in February, but you have to factor in the whole mess with the US politician that got it pulled from everything except the API. I personally wouldn't expect it until April-May of 2026.

>>107645582
that's a man

>>107645446
Your DDR3 sticks are fried.

>>107645612
How does that contradict the previous post?

>>107645594
Do we even expect it to be good for any use cases we have? If Google keeps doing models no bigger than 27B, should we even care? I would hope they would see GPT-OSS 120B and want to surpass it and release something, but it's Google, after all. And even if there are new smaller models, are they going to displace Mistral Nemo and Mistral Small?

>>107645590
Ok, but you'll have to take your https://meta.ai/ talk to aicg.

>>107645655
Next Gemma is 32B and 16B, slightly larger and with better vision capability.

>>107645655
>GPT-OSS
Hello fellow white sirs

>>107645563
>What could it possibly be?
GLM 4.7V (Air)

>>107645590
Sorry, Wang canceled Meta's open LLMs. Enjoy your fifth generic westoid closed slop model instead.

>>107645582
Greater Guang looking ass

>>107645859
If only it were going to be a new frontier model, unique and distinct from the other 4. Instead, they're apparently distilling from gpt-oss, Qwen, and Gemma, which puts their new team below Mistral on the desperation, incompetence, and retardation scale.
Santa Gemma
>>107645594
>I don't know why people were expecting Google to do a release right after they took care of Gemini.
I don't know why people are expecting gemma when she can't be fucked.

>>107644741
>6 iterations?

>>107645272
you need a higher temp and lower top p to cut down on some of the slop
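For anyone unsure what that advice does mechanically: temperature divides the logits before softmax (higher = flatter distribution), and top-p then keeps only the smallest set of tokens whose cumulative probability reaches the threshold. A toy illustration over a hand-made logit dict, not any engine's actual sampler:

```python
import math

def apply_temp_top_p(logits, temp=1.0, top_p=1.0):
    """Temperature-scale raw logits (temp must be > 0), softmax
    them, then truncate to the top-p nucleus and renormalize."""
    scaled = {tok: l / temp for tok, l in logits.items()}
    m = max(scaled.values())  # subtract max for numerical stability
    exps = {tok: math.exp(l - m) for tok, l in scaled.items()}
    z = sum(exps.values())
    probs = {tok: e / z for tok, e in exps.items()}
    kept, cum = {}, 0.0
    # keep tokens in descending probability until mass >= top_p
    for tok, p in sorted(probs.items(), key=lambda kv: -kv[1]):
        kept[tok] = p
        cum += p
        if cum >= top_p:
            break
    z = sum(kept.values())
    return {tok: p / z for tok, p in kept.items()}
```

With logits {"loud": 2.0, "sharp": 1.0, "metallic": 0.0}, temp 0.8 and top_p 0.6 collapse the pool to "loud" alone, while temp 2.0 at the same top_p keeps two candidates: higher temperature widens the pool that a lower top-p then prunes, which is the interaction the advice above relies on.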
>>107644123
sorry, you've been filtered

>>107645594
If Llama's 70B and 405B didn't convince them, what makes you think OpenAI's models will?
4.7 is not the savior of local. It's an improvement over what little we have. It's not... It's not.... Wait...
>>107645340
What does it sound like when you're whacking off? Spamming the crowbar in Half-Life?

That's not so bad.

>>107646242
he (probably) doesn't cum on every stroke

>>107646273
This and the cockbench are the only benches that matter.
>>107646273