/g/ - /lmg/ - Local Models General - Technology


Anonymous
/lmg/ - Local Models General 12/30/25(Tue)22:45:31 No.107717246

File: file.png (1.07 MB, 1280x1280)

/lmg/ - a general dedicated to the discussion and development of local language models.

New Year's Eve Edition

Previous threads: >>107709248 & >>107700909

►News
>(12/29) HY-Motion 1.0 text-to-3D human motion generation models released: https://hf.co/tencent/HY-Motion-1.0
>(12/29) WeDLM-8B-Instruct diffusion language model released: https://hf.co/tencent/WeDLM-8B-Instruct
>(12/29) Llama-3.3-8B-Instruct weights leaked: https://hf.co/allura-forge/Llama-3.3-8B-Instruct
>(12/26) MiniMax-M2.1 released: https://minimax.io/news/minimax-m21
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
12/30/25(Tue)22:46:04 No.107717250

File: __hatsune_miku_and_kasane(...).jpg (318 KB, 1206x1500)

►Recent Highlights from the Previous Thread: >>107709248

--Text-to-animation model potential vs current limitations in NPC/game applications:
>107713005 >107713022 >107713094 >107713093 >107713161 >107713166 >107713180 >107713199 >107713395 >107713478 >107714633 >107715953 >107715974 >107716259 >107716282 >107716362 >107716378 >107716412 >107716427 >107716455 >107716463 >107716491 >107716546 >107716556 >107716643 >107716752 >107716648
--Choosing SillyTavern prompt modes based on model type and customization needs:
>107715217 >107715245 >107715274 >107715300 >107715318 >107715651 >107715665 >107715758 >107716851 >107716897 >107716949 >107717038
--TTS tool landscape: XTTSv2, Chatterbox, and lightweight alternatives:
>107714613 >107714656 >107714858 >107714701 >107714733
--Kimi Linear architecture boosts efficiency and compatibility:
>107712023 >107712128
--Feedback on updated model recommendations list:
>107715036 >107715051 >107715075 >107715081 >107715093 >107715088 >107715623 >107715752 >107715866
--M2.1's jailbreak resistance and identity adherence challenges:
>107714948
--Llama.cpp API compatibility issues with enable_thinking flag:
>107712820 >107712892
--Optimizing inference with mixed GPU/CPU setups:
>107714237 >107714606 >107714638 >107714341 >107714370
--32GB VRAM roleplay model recommendations and narrative challenges:
>107710932 >107710954 >107711017 >107711047 >107711311 >107711369 >107711409 >107711524 >107711716 >107711757 >107712384 >107714928 >107714953
--Exynos 2600 NPU advancements vs Nvidia's market dominance:
>107713883 >107713925
--ikllama's declining Windows support and maintenance challenges:
>107714744 >107714804
--Logs: Migubench:
>107715268
--Logs:
>107710745 >107712779 >107713077
--Teto (free space):
>107709736 >107709978 >107710051 >107711883 >107711900 >107712939 >107713166

►Recent Highlight Posts from the Previous Thread: >>107709259

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
12/30/25(Tue)22:53:40 No.107717296

GIWTWM

Anonymous
12/30/25(Tue)22:57:08 No.107717308

>>107717246
I want to be the Teto

Anonymous
12/30/25(Tue)23:07:10 No.107717380

File: 1763986126206111.png (184 KB, 598x506)

It's over
Real life people talk like slop now

Anonymous
12/30/25(Tue)23:08:42 No.107717387

>>107717380
that sounds absolutely awful.

Anonymous
12/30/25(Tue)23:11:14 No.107717404

What's the gooner recommendation for 96gb vram, assuming something like 4bit cope, I'm not able to offload to ram.

Anonymous
12/30/25(Tue)23:12:12 No.107717410

>>107717404
StableLM 7B

Anonymous
12/30/25(Tue)23:13:00 No.107717414

>>107717404
Use SaaS APIs.
>t. someone with 96GB VRAM

Anonymous
12/30/25(Tue)23:14:27 No.107717421

Bald anime girls make better threads. Decline is already very visible here.

Anonymous
12/30/25(Tue)23:14:28 No.107717422

>>107717404
Q4 or Q5 of air.

Anonymous
12/30/25(Tue)23:21:03 No.107717464

>>107717380
You kind of have to wonder though, how much time a guy like Elon is actually spending each day talking with AI.

Anonymous
12/30/25(Tue)23:23:16 No.107717472

>>107717464
LLMs are amazing for brainstorming and exploring new concepts.

Anonymous
12/30/25(Tue)23:25:25 No.107717481

>>107717246
erotic colors

Anonymous
12/30/25(Tue)23:26:15 No.107717484

>>107717380
If it is this piece of shit faggot proclaiming it is gonna happen then we are safe. On the other hand making a smartphone with some weird forked android OS that only lets you use twitter and grok should be easy enough that even with his incompetence it can happen.

Anonymous
12/30/25(Tue)23:26:17 No.107717485

>>107717472
Brainstorming, eh? You want the details, the real deal?

Anonymous
12/30/25(Tue)23:27:24 No.107717490

>>107717404
>96gb vram
Get a second one and run 4.6/4.7

Anonymous
12/30/25(Tue)23:29:36 No.107717508

>>107717380
>a screen with radios that pings Grok/xAI servers for everything in real time
who asked for this?

Anonymous
12/30/25(Tue)23:44:13 No.107717570

>>107717508
it is being forced on us, like everything else from rich people who want privacy to become a thing of the past

Anonymous
12/30/25(Tue)23:45:43 No.107717575

File: real_bald_miku.jpg (614 KB, 1489x1986)

>>107717308
Bald is the new /lmg/ mascot.

Anonymous
12/30/25(Tue)23:57:26 No.107717643

File: omg it not migu with only(...).png (40 KB, 317x277)

>>107717575

Anonymous
12/31/25(Wed)00:03:05 No.107717669

Am retard. How do I stop sillytavern from showing shit in code blocks? They aren't hidden with some models and I don't know why. I still want to gen them, I just don't want to see them.

Anonymous
12/31/25(Wed)00:06:26 No.107717689

File: 79817184917894.png (1.06 MB, 1024x1216)

so what's a good story/rp telling model? been using mistral nemo 12b for a while, anything new to try?

Anonymous
12/31/25(Wed)00:07:29 No.107717695

>>107717380
stop wasting time on gadgets that nobody's going to buy and finish up the full dive brainchips you rich cunt

Anonymous
12/31/25(Wed)00:07:38 No.107717696

>>107717669
do you mean the "thinking" blocks?
what exactly do you mean?

Anonymous
12/31/25(Wed)00:09:40 No.107717707

>>107717669
regex
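
A minimal sketch of the regex approach suggested here, written in Python for illustration only. It assumes the unwanted blocks are fenced with triple backticks and that filtering the displayed copy of a message is enough. `FENCE`, `FENCED_BLOCK`, and `hide_code_blocks` are hypothetical names, and this is not SillyTavern's own configuration format.

```python
import re

# Build the fence marker (three backticks) without writing it literally.
FENCE = "`" * 3

# Assumed block shape: opening fence with optional language tag, body, closing fence.
FENCED_BLOCK = re.compile(
    re.escape(FENCE) + r"[^\n]*\n.*?" + re.escape(FENCE) + r"[ \t]*\n?",
    re.DOTALL,
)


def hide_code_blocks(message: str) -> str:
    """Return a display copy of the message with fenced blocks removed."""
    return FENCED_BLOCK.sub("", message)


# Hypothetical model output containing one fenced block.
raw = f"She nods.\n{FENCE}\nmood: happy\n{FENCE}\nThen she leaves."
shown = hide_code_blocks(raw)
print(shown)
```

The raw message is left untouched, so the blocks are still generated and kept; only the displayed copy drops them.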

Anonymous
12/31/25(Wed)00:10:40 No.107717715

>>107717695
hes an adhd rich idiot, investors had to give him a 1 Trillion carrot on a stick, just so he continues working on the electric cars

Anonymous
12/31/25(Wed)00:13:09 No.107717729

>>107717689
Magidonia or Cydonia. Ur specs??

Anonymous
12/31/25(Wed)00:14:57 No.107717740

>>107717729
16vram 32ram
4060ti

Anonymous
12/31/25(Wed)00:16:53 No.107717748

>31 December 2pm in worst korea
>bloody besterd mathrachods still haven't done needful release of the solar open 100b gooofs
May sacred cow curse their ancestors and descendants to toilet witch dimension

Anonymous
12/31/25(Wed)00:17:56 No.107717758

>>107717729
Cydonia just shat me this lil nugget from just the title.

https://rentry.org/opkobdrp

Anonymous
12/31/25(Wed)00:19:03 No.107717766

>>107717729
>>107717758
Die in a fire

Anonymous
12/31/25(Wed)00:19:35 No.107717770

>>107717758
>chapter 2: finding your partner
Pretty funny.

Anonymous
12/31/25(Wed)00:20:19 No.107717774

>>107717689
Just keep using nemo desu.

Anonymous
12/31/25(Wed)00:25:33 No.107717805

8gb vram. 64gb ram. spoonfeed me model. i want for the BIG sex. i run utilize GLM-4.5-Air-Q4_K_S and NemoReRemix-12B-Q3_K_XL and sometime trashpanda-org_QwQ-32B-Snowdrop-v0-IQ4_XS there is it there anything better?

Anonymous
12/31/25(Wed)00:37:43 No.107717878

>>107717805
not really, no. there has not really been much progress made in the past few months for us mortals.

Anonymous
12/31/25(Wed)01:29:09 No.107718128

>>107717805
Why are there so many people asking this recently?
>Hey guys what's the current best model????
Bot spam again?

Anonymous
12/31/25(Wed)01:32:28 No.107718139

>>107718128
That's a clever insight —

Anonymous
12/31/25(Wed)01:36:53 No.107718150

Hunyuan motion 1.0 is fun
