/g/ - /lmg/ - Local Models General - Technology


Anonymous
/lmg/ - Local Models General 05/29/26(Fri)19:03:53 No.108937312

File: lmg_culture.jfif.jpg (110 KB, 1024x768)

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108931385 & >>108924918

►News
>(05/29) Jart loves 4chan and needs your money to fly all over the world https://justine.lol/animus/ Oh and step 3.7 dropped I guess https://huggingface.co/stepfun-ai/Step-3.7-Flash-GGUF
>(05/21) Hy-MT2 “fast-thinking” translation models released: https://hf.co/collections/tencent/hy-mt2
>(05/20) Cohere releases Command A+ 218B-A25B: https://cohere.com/blog/command-a-plus
>(05/16) llama + spec: MTP Support #22673 merged: https://github.com/ggml-org/llama.cpp/pull/22673
>(05/08) KSA-4B-base released: https://hf.co/OpenOneRec/KSA-4B-base

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://swe-rebench.com
Agentic Coding: https://deepswe.datacurve.ai
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
05/29/26(Fri)19:05:19 No.108937320

Kimi thread.
Kimi board.
Gemma's cool, she can stay too.

Anonymous
05/29/26(Fri)19:05:27 No.108937323

who is this handsome gentleman?

Anonymous
05/29/26(Fri)19:07:50 No.108937340

>>108937336
it might just be mental illness. and it might have something to do with thinking you are a woman

Anonymous
05/29/26(Fri)19:08:48 No.108937347

what is lil bro even yapping about on his page?

Anonymous
05/29/26(Fri)19:09:25 No.108937352

>>108936843
>the greatest competitive advantage I've ever had was to monitor which pull requests people on 4chan complained about, and then merge them into llamafile before Gerganov could
I might stop laughing at him for a day if he merges deepseek. Does he merge deepseek?

Anonymous
05/29/26(Fri)19:13:43 No.108937386

I don't like how unprofessional this thread is.

Anonymous
05/29/26(Fri)19:15:29 No.108937394

*cums on this thread* mm... much better

Anonymous
05/29/26(Fri)19:16:03 No.108937402

>>108937386
everything here is extremely professional according to my field of expertise
t. professional shitposter

Anonymous
05/29/26(Fri)19:16:43 No.108937406

We must be better llamacpp contributors.

Anonymous
05/29/26(Fri)19:17:21 No.108937414

>>108937402
t. professional cumeater, apparently

Anonymous
05/29/26(Fri)19:18:00 No.108937423

>>108937312
You forgot to remove mikupad

Anonymous
05/29/26(Fri)19:18:01 No.108937424

>>108937406
llamapodofile you mean

Anonymous
05/29/26(Fri)19:19:11 No.108937429

>>108937423
Why would he remove mikupad in a miku OP?

Anonymous
05/29/26(Fri)19:20:40 No.108937445

File: 1752365447825458.jpg (99 KB, 1300x960)

>cuda 13.3 is out
im updooting

Anonymous
05/29/26(Fri)19:20:57 No.108937450

>>108937423
I am actually starting to see how it is all official. And we should get the official card 3.0 now.

Anonymous
05/29/26(Fri)19:24:32 No.108937484

On a 1/10 scale, how thread culture are we today?

Anonymous
05/29/26(Fri)19:25:53 No.108937495

https://rentry.co/imdddcy3

I got you bros.

Anonymous
05/29/26(Fri)19:27:51 No.108937508

What context/instruct templates should I be using for Gemma 4?

Anonymous
05/29/26(Fri)19:28:05 No.108937509

>>108937498
He forgot that if he deletes it then he tells us he is in the thread right now and:

>>108937417
>>108937431
>>108937282

Are actually his posts...

Anonymous
05/29/26(Fri)19:30:16 No.108937527

>>108937508
Try:
>I need you to donate money to me, and I mean you, as in literally you. You couldn't have read this far unless you are someone who legitimately cares, and your compassion means more to me than any amount of money. I need you to donate publicly under your real name and I want you to tell your friends how much money you gave me, since that's the best way to show that you're serious.

Anonymous
05/29/26(Fri)19:34:08 No.108937554

why hasn't this thread been taken down for being offtopic?

Anonymous
05/29/26(Fri)19:34:55 No.108937560

>>108937554
Supporting open source, i.e. Jart, is more on topic than your post

Anonymous
05/29/26(Fri)19:36:24 No.108937573

>>108937560
This isn't support. This is an attack against open source.

Anonymous
05/29/26(Fri)19:36:51 No.108937578

>not x, y

Anonymous
05/29/26(Fri)19:40:12 No.108937593

see you all in a day or so when these idiots get bored and go away.

Anonymous
05/29/26(Fri)19:41:18 No.108937601

>>108937578
I let out a soft warm laugh, the sound like wind through new leaves, and brush a strand of sweat-slicked hair off your forehead while saying softly my voice barely above a whisper: "I actually didn't notice while i was posting"

Anonymous
05/29/26(Fri)19:45:32 No.108937621

>>108937616
Finally a damage control attempt that is at least average.

Anonymous
05/29/26(Fri)19:49:07 No.108937643

why is /lmg/ fun again?

Anonymous
05/29/26(Fri)19:51:32 No.108937649

>ldg ldg
lol ff mad

Anonymous
05/29/26(Fri)19:54:39 No.108937664

I don't get something. And that is a serious question.
>For every hater who doom scrolls over how intelligent I am
Why would an intelligent person post that and then delete it as soon as someone posts it here?

Anonymous
05/29/26(Fri)19:55:53 No.108937672

>>108937664
attention? Thread is talking about her after all.

Anonymous
05/29/26(Fri)19:57:54 No.108937678

>>108937672
If attention is all he needs then why does he ask for money for plane tickets?

Anonymous
05/29/26(Fri)19:58:42 No.108937681

Jartsune miku says trans rights

Anonymous
05/29/26(Fri)20:00:30 No.108937692

File: 1762490833392855.jpg (120 KB, 363x494)

►Recent Highlights from the Previous Thread: >>108931385

--Paper: StoryScope: Investigating idiosyncrasies in AI fiction:
>108936371 >108936425
--Papers:
>108934718
--Gemma 31B token artifacts caused by Q3 quant damage and template errors:
>108934154 >108934164 >108934223 >108934326 >108934336 >108934429 >108934447 >108934494 >108934518 >108934544 >108935381 >108934450 >108934498 >108934541 >108934563 >108934661
--Llama.cpp MTP support and VRAM optimization tradeoffs:
>108933191 >108933882 >108933894 >108933912 >108934078 >108934119 >108934721 >108934015 >108934030 >108934039 >108934107 >108934433 >108934869
--llama.cpp f16 mask PR causing VRAM regressions for some Anons:
>108932210 >108932296 >108932317 >108932336 >108932354 >108935238 >108935257 >108935284 >108935906 >108935928
--Troubleshooting and optimizing Gemma 4 E4B inference speeds:
>108934786 >108934800 >108934841 >108934871 >108934875 >108934881 >108934896 >108934926 >108934934 >108934991 >108935025 >108934822
--Designing a local agentic assistant workflow using MCP and RAG:
>108933985 >108934110 >108934132 >108934174 >108934493 >108935234 >108935571
--Integrating Gemma as a functional AI party member in WoW:
>108934675 >108934688 >108934717 >108934734 >108935126
--Sharing and optimizing roleplay prompts and jailbreaks for Gemma 4:
>108932195 >108932938 >108932961 >108932990 >108933285 >108933813 >108934206
--Comparing Mistral 24b and Gemma 4 26b writing quality issues:
>108931884 >108931960 >108931985 >108932032 >108934697
--Using Gemma for automated MTG gameplay and UI development:
>108931545 >108934795
--llama.cpp adds support for DeepSeek-V3 with Sparse Attention:
>108932267
--Logs:
>108933257 >108933397 >108933407 >108933513 >108933620 >108934039 >108934545 >108934675 >108935726 >108936364 >108936510 >108937016
--Luka, Miku (free space):
>108932889 >108933985

►Recent Highlight Posts from the Previous Thread: >>108931389

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
05/29/26(Fri)20:01:27 No.108937694

>>108937692
that is a cute 2D jart you got there

Anonymous
05/29/26(Fri)20:03:02 No.108937704

>>108937674
There's no way, he would have spazzed out by now.

Anonymous
05/29/26(Fri)20:03:30 No.108937705

and of course janny begins his sweeping for a fellow troon. actual pottery

Anonymous
05/29/26(Fri)20:11:44 No.108937740

I am using ComfyUI with RealVisXL 4.0 and it is NOT outputting what the prompts require. Is it a weak model?
