/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107700909 & >>107686942

►News
>(12/29) WeDLM-8B-Instruct diffusion language model released: https://hf.co/tencent/WeDLM-8B-Instruct
>(12/29) Llama-3.3-8B-Instruct weights leaked: https://hf.co/allura-forge/Llama-3.3-8B-Instruct
>(12/26) MiniMax-M2.1 released: https://minimax.io/news/minimax-m21
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7
>(12/17) Introducing Meta Segment Anything Model Audio: https://ai.meta.com/samaudio

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
►Recent Highlights from the Previous Thread: >>107700909

--Modern setup strategies for real-time knowledge access beyond static model training:
>107707804 >107707936 >107707959 >107707983 >107707985 >107707990 >107708011 >107708020 >107708035 >107708037 >107708708 >107709078 >107708058
--Multi-character story challenges with Mistral 24B models:
>107707485 >107707507 >107707600 >107707883 >107707948 >107707670 >107707718 >107707771
--Quantization challenges for running GLM 4.6 on limited VRAM:
>107705394 >107705411 >107705425 >107705450 >107705516
--Evaluating 4.7 AI model's artistic adherence and natural dialogue vs 4.6:
>107705364 >107706320 >107708117 >107708121
--FunAudio-Chat Technical Report:
>107708791 >107709016 >107709079
--Resolving assistant response prefill incompatibility with enable_thinking:
>107702566 >107702587 >107702629
--Google's early 2000s chatbot experiment with knowledge reuse:
>107705377 >107705409 >107705424
--Updating software version fixed launch error for GLM-4.5-Air-UD-Q2_K_XL:
>107702400 >107702426 >107702428 >107702530
--Critique of model thinking processes and their impact on response quality:
>107703015 >107703056 >107703071 >107703094 >107703119 >107703268
--Exploring local voice cloning alternatives to SoVits:
>107704130 >107704193 >107704277 >107704319 >107704453 >107704482 >107704829
--Mixed performance and limitations with Minimax at IQ2_M quantization:
>107702412 >107703627 >107703661 >107703732
--Z AI's IPO implications for the AI-native LLM market and competing models:
>107708784 >107709044
--WeDLM-8B-Instruct release and comparison to Qwen3-8B:
>107709163
--Miku (free space):
>107701017 >107701268 >107701433 >107701631 >107701715 >107704951 >107707361 >107708317 >107708548

►Recent Highlight Posts from the Previous Thread: >>107700912

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>107709264
>>107709248
kek
The joke went too far
>>107709282
I think you are confused or very much clueless. These are just regex filters, you mongoloid.
Is llama.cpp broken again? Gemma's more retarded than usual.
>>107707382
thank you for bringing this to my attention
So... when are we getting something?
Mistral my beloved
>>107709593
https://huggingface.co/bartowski/allura-forge_Llama-3.3-8B-Instruct-GGUF
>>107709613
The blandest and most mid LLMs on the market, only worth using because of their lack of strict guardrails.
>>107709653
The Honda of LLMs. Nothing fancy, but gets the job done.
>107709657
can this faggot get out of my thread?
>>107709248
I don't get this new meme
>>107709664
Gone.
>>107709670
are you a janny? if so, thank you.
>>107709672
No. But we can all do things we're not supposed to advertise.
>>107709666
A few threads back, a few people tried generating Miku using GLM or whatever, and almost every time she looked bald because she only had twintails or the hair was drawn too low, so her head was poking out.
>>107709672
>>107709679
right. i also may or may not have done that thing. never seen such a fast response time
>>107709683
come on now. some of them work really hard. i even pay their salaries!
Well. That was quick.
>>107709685
Yeah. And the funny guy that just joined got donned.
teto my baldloved
>>107709692
how do i stop destroying my keyboard while waiting for my ai's responses
>>107709264
>>107709248
>>107709259
wew lad
thread theme: https://www.youtube.com/watch?v=423Nmfpo828
>>107709628
It was released on the Meta API in April, but I bet it was trained at about the same time as Llama 3.3 70B; who knows why they didn't release a smaller model back then. So it's probably a year old at this point.
>>107709691
you haven't been paying long enough, paypiggie
GLM AIR WHEN
GEMMY 4 WHEN??
>schizoids go rampant
I blame bald migu
>>107709725
Aim away from the keyboard.
llama 3.3 cockbench where?
>>107709781
dunno about the samplers but i grabbed the cockbench paragraph from https://desuarchive.org/g/thread/105354556/#105354924
q8
ahahahaha llama 3.3 7b? more like ollama deepseek-r1
>>107709259
The recap missed the most interesting conversation from the last thread.
>>107709813
>dunno about the samplers
Always greedy.
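For anyone new to the sampler talk: "always greedy" means the benchmark bypasses sampling entirely and takes the highest-probability token at every step, so the same prompt and quant always produce the same completion. A minimal sketch in plain Python (the logits values are hypothetical, just to show the mechanics):

```python
def greedy_pick(logits):
    # Greedy decoding: no temperature, no top-k/top-p.
    # Always return the index of the highest logit, which makes
    # the generation deterministic and runs comparable.
    return max(range(len(logits)), key=lambda i: logits[i])

# Hypothetical per-token logits for a 4-token vocabulary.
print(greedy_pick([0.1, 2.7, -1.3, 0.9]))  # prints 1
```

In llama.cpp terms this is roughly what you get with temperature 0; any stochastic sampler would make cockbench outputs non-reproducible between runs.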
>>107709743
I think they hate us. They released only large models that work on enthusiast systems in the last round. Meta are true prog believers, but zuck is a fickle suckup to whoever is in power. Tuning a model that people liked on arena and then uploading cuckmaxxed weights is absolutely something else. Remember how they gimped their omni model despite there being way better image gen and text already out there? Who the fuck even does that?
>>107709880
lmao it's MMLUmaxxed
https://huggingface.co/upstage/Solar-Open-100B 2mwh
>>107709902
so it's gonna be a shitty glm air clone basically?
>>107709894
>mom is in another room entirely
>We're caught.
>they just had lunch
>Dinner's ready!
Where is the anon claiming that dense models have better understanding?
>>107709908
it's gonna be fimbulvetr sexo but air intelligence
reminder they released solar-10.7b and that was the go-to SEX model
we are going to be so back.
>>107709913
it's 8b anon...
>>107709913
ah yes, a year-old butchered 8b is representative of all dense models
>>107709919
oh was that them? i was wondering what these random 10.7b moe abominations were.
https://huggingface.co/tensorblock/SOLARC-MOE-10.7Bx6-GGUF
>>107709919
>>107709922
Llama 3.3 only exists as 70B.
>>107709934
https://huggingface.co/allura-forge/Llama-3.3-8B-Instruct
>>107709934
>reading comprehension of a moe...
Got tired of seeing her bald head
>>107709978
Hair is stored in the ears, prove me wrong!
>>107710043
>Who the fuck even does that?
Management scared of collecting even more lawsuits. You can't release anything good if your primary concern is not getting sued because you are hated by pretty much everybody.
I bet they just couldn't make Llama 4 both good and "safe" (according to their own internal parameters) at the same time, and so it got butchered before release with poor results. That, and not providing smaller versions for the local LLM community (even though, ironically, the current best Chinese MoE models are Llama 4-sized or larger), killed their reputation and, in the end, their open LLM efforts.
>>107709987
>>107710043
Neither model creators nor Hugging Face have been sued yet. They were much more likely to get hit for copyright and yet ignored that. It had to be ideologues.
>>107709978
>>107710051
so much glaze
>>107710043
What I don't understand is why the "safety" debate even exists. Nobody is suing Home Depot because some wannabe terrorist was able to buy ingredients for homemade explosives there, so why do we have this whole fake debate about "security" for LLMs?
>>107710088
refer to thine digits for a clue
>>107710088
because yud unironic
>>107710088
>>107710088
home depot did remove potassium nitrate stump remover for one that doesn't work. so they do cuck all of our products when it comes down to it. also lawnmower blades are sold unsharpened only.
>>107710088
Just try going to a Home Depot and asking an employee to recommend the best products and methods for making a nice pipe bomb.
>>107710108
He's literally worse than a faggot
>>107710088
>our product is so powerful, it might even destroy the whole world if we're not careful (so we definitely can't let you download the weights, but we'll still sell you API access)
>>107709978
can she grow hair long and sentient enough to twist and tie itself into twin drills?
>>107710051
why does her hair change color when she moves her arms? what happens when she waves?
>>107710426
>what's blushing
someone make a model trained only on the King James Bible
>>107710463
Already done
>>107710444
hair is not supposed to blush
>>107710605
meds
ai can be very educational. i learn so many useful things from ai
>>107710745
Pretty funny!