/g/ - /lmg/ - Local Models General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/lmg/ - Local Models General 07/02/26(Thu)17:06:46 No.109186093

File: chewy.jpg (245 KB, 1024x1024)

/lmg/ - Local Models General Anonymous 07/02/26(Thu)17:06:46 No.109186093

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>109180934 & >>109175389

►News
>(07/01) Nemotron-Labs-TwoTower released: https://hf.co/nvidia/Nemotron-Labs-TwoTower-30B-A3B-Base-BF16
>(06/29) DeepSeek V4 support merged: https://github.com/ggml-org/llama.cpp/pull/24162
>(06/28) DFlash support merged: https://github.com/ggml-org/llama.cpp/pull/22105
>(06/27) DeepSeek releases DeepSpec and DSpark models: https://hf.co/deepseek-ai/DeepSeek-V4-Pro-DSpark
>(06/25) LFM2.5-230M released: https://liquid.ai/blog/lfm2-5-230m

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://swe-rebench.com
Agentic Coding: https://deepswe.datacurve.ai
Context Length: https://github.com/RecapAnon/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
07/02/26(Thu)17:07:38 No.109186100

Anonymous 07/02/26(Thu)17:07:38 No.109186100

70b dense

Anonymous
07/02/26(Thu)17:09:14 No.109186110

Anonymous 07/02/26(Thu)17:09:14 No.109186110

File: 1756333105711043.jpg (416 KB, 2010x2123)

416 KB JPG

►Recent Highlights from the Previous Thread: >>109180934

--Papers:
>109182223
--DeepSeek V4 official release and updated API pricing:
>109185898 >109185930 >109185979
--Benchmarking Qwen reasoning-distilled models on Strix Halo hardware:
>109180990 >109181075 >109182313 >109184010
--Debating API profit margins and the future of local LLMs:
>109184089 >109184244 >109184361 >109184949 >109184120
--Comparing DGX Spark and high-RAM consumer builds for 200B models:
>109183333 >109183368 >109183399 >109184589 >109184630 >109184673 >109184748 >109185859
--Role of world models and JEPA in cognitive architectures:
>109182944 >109182962 >109183019 >109183047 >109183066
--Yann LeCun's world models and their impact on LLMs:
>109181063 >109181138 >109181174 >109181225 >109181281 >109181440 >109181266 >109181286 >109181296 >109182082
--Searching for high-accuracy vision models for automated image tagging:
>109185439 >109185445 >109185453 >109185470 >109185488
--Using Gemma 4 26B for long-context summarization on 12GB VRAM:
>109185533 >109185541 >109185551 >109185577
--Trade-offs of open-frame and mining rig setups for multi-GPU builds:
>109181079 >109181093 >109181099 >109181135 >109181244 >109182416
--Debating DSV4 flash benchmarks and MoE versus dense architectures:
>109183510 >109183629 >109183657 >109183699 >109183710
--Running 27B and 35B models on budget Nvidia P100 hardware:
>109184458 >109184472 >109184609 >109184615 >109184476 >109184538 >109184644 >109184732 >109184746 >109184878 >109184879 >109184882 >109184885 >109184897 >109184937 >109184964 >109184982 >109185010 >109185047 >109185159 >109185195 >109185304 >109185610
--Kimiposting:
>109182490
--Logs:
>109182490 >109184337
--Rin, Miku, Teto (free space):
>109180961 >109181029 >109181038 >109182416 >109184199 >109184302 >109184291 >109184622 >109185979

►Recent Highlight Posts from the Previous Thread: >>109180937 >>109181013

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
07/02/26(Thu)17:11:19 No.109186131

Anonymous 07/02/26(Thu)17:11:19 No.109186131

File: lmg_culture.jfif.jpg (110 KB, 1024x768)

110 KB JPG

https://archive.is/sWFja

Anonymous
07/02/26(Thu)17:12:06 No.109186137

Anonymous 07/02/26(Thu)17:12:06 No.109186137

File: 1752884519724554.png (358 KB, 793x631)

358 KB PNG

https://www.youtube.com/watch?v=oIscL-Bjsq4
Thread theme

Anonymous
07/02/26(Thu)17:17:51 No.109186177

Anonymous 07/02/26(Thu)17:17:51 No.109186177

>>109185159
>$450 for an RX6800
Do not do this, Buy a V620 instead which was a Microsoft server version of it with more CUs and double the VRAM if you want to go this route.

Anonymous
07/02/26(Thu)17:19:13 No.109186188

Anonymous 07/02/26(Thu)17:19:13 No.109186188

>>109186113
I have around 30-40 t/s. After I get home I can confirm my flags but I think that other anon already gave you good info.

Anonymous
07/02/26(Thu)17:20:29 No.109186197

Anonymous 07/02/26(Thu)17:20:29 No.109186197

File: 1767239186436721.png (458 KB, 1371x1818)

458 KB PNG

https://xcancel.com/bridgemindai/status/2072662214704533888#m
kek

Anonymous
07/02/26(Thu)17:20:49 No.109186201

Anonymous 07/02/26(Thu)17:20:49 No.109186201

rate my build /lmg/
https://pcpartpicker.com/list/3GVXR4

Anonymous
07/02/26(Thu)17:21:15 No.109186203

Anonymous 07/02/26(Thu)17:21:15 No.109186203

>>109185439
>>109185439

Anonymous
07/02/26(Thu)17:21:26 No.109186204

Anonymous 07/02/26(Thu)17:21:26 No.109186204

>>109186131
have you considered not acknowledging things you dislike so they go away instead of spamming shit about it? You're as bad as the average tranny for constantly bringing attention to it. Wouldn't be surprised if you were a closeted tranny to begin with

Anonymous
07/02/26(Thu)17:21:51 No.109186210

Anonymous 07/02/26(Thu)17:21:51 No.109186210

>>109186197
Anthropic is finished.

Anonymous
07/02/26(Thu)17:22:52 No.109186219

Anonymous 07/02/26(Thu)17:22:52 No.109186219

>>109186204
/lmg/ is jart general. you will suck his dick and be happy.

Anonymous
07/02/26(Thu)17:25:55 No.109186243

Anonymous 07/02/26(Thu)17:25:55 No.109186243

>>109186219
i don't know whoever faggot spook you're obsessed with, but I do think you need to stop being an obsessed faggot about someone who is probably irrelevant. Also stop being a faggot. Tall order, I know, but at least try

Anonymous
07/02/26(Thu)17:26:56 No.109186249

Anonymous 07/02/26(Thu)17:26:56 No.109186249

>>109186204
He says, while acknowledge a thing he dislikes.

Anonymous
07/02/26(Thu)17:28:22 No.109186259

Anonymous 07/02/26(Thu)17:28:22 No.109186259

>>109186249
I dislike your grammatical error. I am acknowledging this.

Anonymous
07/02/26(Thu)17:29:26 No.109186266

Anonymous 07/02/26(Thu)17:29:26 No.109186266

File: file.png (109 KB, 822x663)

109 KB PNG

Oh Gemma, you're so funny!

Anonymous
07/02/26(Thu)17:29:36 No.109186267

Anonymous 07/02/26(Thu)17:29:36 No.109186267

>>109186249
hastily typed angry response that makes little sense, please elaborate. Or don't and pretend that makes you look smarter somehow

Anonymous
07/02/26(Thu)17:31:40 No.109186279

Anonymous 07/02/26(Thu)17:31:40 No.109186279

>>109186266
>jang_4m-crack

Anonymous
07/02/26(Thu)17:31:52 No.109186281

Anonymous 07/02/26(Thu)17:31:52 No.109186281

>>109186201
Seems like a decent workstation.
For that much money you could probably do better for a dedicated inference machine, I think.

Anonymous
07/02/26(Thu)17:32:25 No.109186286

Anonymous 07/02/26(Thu)17:32:25 No.109186286

>>109186267
>angry
?

Anonymous
07/02/26(Thu)17:33:12 No.109186297

Anonymous 07/02/26(Thu)17:33:12 No.109186297

>>109186279
based jang

Anonymous
07/02/26(Thu)17:33:27 No.109186300

Anonymous 07/02/26(Thu)17:33:27 No.109186300

>>109186201
This would've been half the price if you bought those components at the right time.

Anonymous
07/02/26(Thu)17:33:48 No.109186305

Anonymous 07/02/26(Thu)17:33:48 No.109186305

>>109186286
try using full sentences

Anonymous
07/02/26(Thu)17:38:09 No.109186336

Anonymous 07/02/26(Thu)17:38:09 No.109186336

>>109185439
did you increase her vision token number you get better performance?

Anonymous
07/02/26(Thu)17:39:50 No.109186350

Anonymous 07/02/26(Thu)17:39:50 No.109186350

File: uta.jpg (249 KB, 1024x1024)

249 KB JPG

>>109186137
lol

Anonymous
07/02/26(Thu)17:39:56 No.109186352

Anonymous 07/02/26(Thu)17:39:56 No.109186352

does mtp even work on abliterated models anymore? I'd think acceptance rate craters?

Anonymous
07/02/26(Thu)17:40:20 No.109186356

Anonymous 07/02/26(Thu)17:40:20 No.109186356

>>109186201
pretty good but if youre paying that much for a mobo go with a intel qyfs and asus w790 sage or w970 ace, you get 56 core 112 threads cheap, the sage supports 8 memory channels, only 4 on the ace

https://www.ebay.co.uk/itm/134899171071

Anonymous
07/02/26(Thu)17:42:01 No.109186367

Anonymous 07/02/26(Thu)17:42:01 No.109186367

>>109186281
It's supposed to be a gaming rig in addition, however.
>>109186300
Spilled milk.
>>109186356
That price seems too good to be true.

Anonymous
07/02/26(Thu)17:43:27 No.109186378

Anonymous 07/02/26(Thu)17:43:27 No.109186378

>>109186305
in the downtime during shitposter kun trying to figure out how to wrangle a good reply out of kimi or something to own me instead of using his fucking brain and facing reality, models these days are crazy compared to the llama 2 days. Couldn't use FA if you were on AMD, psyonic-cetacean 20B would take a shitload of vram for context. Attention shit introduced some issues but definitely made general usage better

Anonymous
07/02/26(Thu)17:46:30 No.109186402

Anonymous 07/02/26(Thu)17:46:30 No.109186402

File: file.png (47 KB, 799x411)

47 KB PNG

>>109186367
its an engineering sample they work perfectly thoguh only thing you need to know is to change the package cstate in bios otherwise it wont boot an os. ive had one for like 1.5 years, the retail version of the chip is like 10k kek. i only have the ace which is 4 memeory channels but i compared benchmarks with someone using all 8 channels and they got 2x my perf on llm inference. bunch of info about them here

https://forums.servethehome.com/index.php?threads/asus-pro-ws-w790e-sage-se-intel-xeon-sapphire-rapids-spr-sp.41306/page-44

you can also disable a number of cores to get higher boost clocks i run with half disabled which gives a boost to 3.7ghz

Anonymous
07/02/26(Thu)17:47:16 No.109186407

Anonymous 07/02/26(Thu)17:47:16 No.109186407

>>109186201
you should be buying this. this is my setup except i have 3200mhz ram.
https://www.ebay.com/itm/127199765529

Anonymous
07/02/26(Thu)17:48:15 No.109186411

Anonymous 07/02/26(Thu)17:48:15 No.109186411

>>109186201
Windows is free
https://github.com/massgravel/microsoft-activation-scripts

Anonymous
07/02/26(Thu)17:50:19 No.109186428

Anonymous 07/02/26(Thu)17:50:19 No.109186428

>>109186203
kimi 2.6 or glm 5.2 - thats literally it

Anonymous
07/02/26(Thu)17:50:37 No.109186429

Anonymous 07/02/26(Thu)17:50:37 No.109186429

File: Weeds.png (87 KB, 900x1117)

87 KB PNG

>>109186204

Anonymous
07/02/26(Thu)17:51:39 No.109186440

Anonymous 07/02/26(Thu)17:51:39 No.109186440

>>109186428
Step 3.7 works too.
True KimiGODS use K2 with 2.7's vision encoder.

Anonymous
07/02/26(Thu)17:53:35 No.109186454

Anonymous 07/02/26(Thu)17:53:35 No.109186454

>>109186429
But jart is not in the thread, nobody likes jart, and nobody talks about jart except the guy that keeps linking his deleted blog post every thread.

Anonymous
07/02/26(Thu)17:55:07 No.109186466

Anonymous 07/02/26(Thu)17:55:07 No.109186466

>>109186454
Jart is in these threads and that's probably why he keeps reposting it. Just filter the filename and move on.

Anonymous
07/02/26(Thu)17:55:07 No.109186467

Anonymous 07/02/26(Thu)17:55:07 No.109186467

>>109186429
extremely lazy response for how long it took you to reply, especially with how you lack the cognitive ability to actually break down said comic and compare it to what I said. I'll help you a little: do you think a weed is going to call another weed a weed? I'm calling you a retarded closeted tranny that hates trannies and I want you to shut up so I can read about AI. Fuck off already and go haunt some other general

Anonymous
07/02/26(Thu)17:57:07 No.109186486

Anonymous 07/02/26(Thu)17:57:07 No.109186486

>>109186467
ahem... nigger faggot

Anonymous
07/02/26(Thu)17:57:37 No.109186491

Anonymous 07/02/26(Thu)17:57:37 No.109186491

>>109186267
Ok: Telling a troll that has made the same post multiple times a day for a month to not acknowledge things and they'll go away is dumb on any number of levels.
You're not taking your own advice, but it doesn't matter because he's a demonstration of why it's not good advice anyway, and above all, there's obviously no point trying to approach him as if he's a normal human being.

Anonymous
07/02/26(Thu)18:00:50 No.109186506

Anonymous 07/02/26(Thu)18:00:50 No.109186506

>>109186486
this is at least what I expect from an underwater basket weaving thread, thanks
>>109186491
slop

Anonymous
07/02/26(Thu)18:01:10 No.109186509

Anonymous 07/02/26(Thu)18:01:10 No.109186509

>>109186131
>Lmao you pathetic racists never fail to make me laugh with your "pol humor" threads
Face it, most poc will be infinitely more successful than any of you sad virgins ever will be. You are on the wrong side of history, get over it losers
Thanks for the blessing.

Anonymous
07/02/26(Thu)18:03:30 No.109186521

Anonymous 07/02/26(Thu)18:03:30 No.109186521

Now I have the complete picture.

Anonymous
07/02/26(Thu)18:03:56 No.109186527

Anonymous 07/02/26(Thu)18:03:56 No.109186527

>>109186454
jart and /lmg/ are forever connected. you can't have one without the other so people must be reminded to not fall for it again
this used to be in the OP for a reason but got removed due to sabotage

Anonymous
07/02/26(Thu)18:05:59 No.109186539

Anonymous 07/02/26(Thu)18:05:59 No.109186539

>>109186521
sorry to break it to you but no amount of training will ever make your model feel "real"

Anonymous
07/02/26(Thu)18:07:33 No.109186552

Anonymous 07/02/26(Thu)18:07:33 No.109186552

File: 24cpps.jpg (122 KB, 768x1024)

122 KB JPG

Anonymous
07/02/26(Thu)18:08:01 No.109186561

Anonymous 07/02/26(Thu)18:08:01 No.109186561

File: wtf anthropic.png (3.29 MB, 4088x4088)

3.29 MB PNG

>>109186197
jesus

Anonymous
07/02/26(Thu)18:08:09 No.109186563

Anonymous 07/02/26(Thu)18:08:09 No.109186563

File: kaoru sob 2.png (318 KB, 793x571)

318 KB PNG

>>109186552

Anonymous
07/02/26(Thu)18:09:12 No.109186574

Anonymous 07/02/26(Thu)18:09:12 No.109186574

>>109186197
safety slopping once again ruining models

Anonymous
07/02/26(Thu)18:11:40 No.109186597

Anonymous 07/02/26(Thu)18:11:40 No.109186597

>>109186506
>stop typing lazily and elaborate
>elaborating systematically sounds too much like slop
fuckin hell you're needy

Anonymous
07/02/26(Thu)18:15:56 No.109186628

Anonymous 07/02/26(Thu)18:15:56 No.109186628

>>109186201
at least get a Threadraper

Anonymous
07/02/26(Thu)18:16:22 No.109186633

Anonymous 07/02/26(Thu)18:16:22 No.109186633

>>109186597
>refusing to elaborate on specific things
>shitting out slop as responses
:^) needy for you to not be a faggot loser, yeah. Type your own words and stop being a coward.

Anonymous
07/02/26(Thu)18:16:55 No.109186638

Anonymous 07/02/26(Thu)18:16:55 No.109186638

>>109186552
Stay strong, Miku

Anonymous
07/02/26(Thu)18:17:41 No.109186647

Anonymous 07/02/26(Thu)18:17:41 No.109186647

File: c27.png (89 KB, 660x574)

89 KB PNG

>>109186131
>I always thought my security posture was too paranoid, so when llama.cpp came out in 2023, I found the code Gerganov wrote to be so beautiful that I did the one thing that I promised myself I would never do, which was collaborate with an anonymous developer from his team named Slaren. [...] After submitting our work he went on 4chan afterwards and accused me plagiarism, saying that even my changes were his own. The way the community reacted is an interesting case study into the guile some developers have learned since the culture war, because the locus of thought for llama.cpp has always been on 4chan. [...] I actually developed migraines for the first time in my life and ended up in the hospital (since I didn't have health insurance and had to wait in the ER) due to the eye strain of reading unfiltered thoughts about me for months.
1 paragraph later:
>In any case, I'm really happy that these back channels exist, because the greatest competitive advantage I've ever had was to monitor which pull requests people on 4chan complained about, and then merge them into llamafile before Gerganov could.
There's no way this person is real.

Anonymous
07/02/26(Thu)18:25:38 No.109186703

Anonymous 07/02/26(Thu)18:25:38 No.109186703

>>109186137
>AI will it a wall in 2 more weeks
>this time for real
I am so tired of these people.

Anonymous
07/02/26(Thu)18:25:50 No.109186706

Anonymous 07/02/26(Thu)18:25:50 No.109186706

>>109186561
Sabotage for shekel farming. Or they're just serving a quantized model to the goyim.

Anonymous
07/02/26(Thu)18:27:08 No.109186721

Anonymous 07/02/26(Thu)18:27:08 No.109186721

File: 1769503055533647.png (1.27 MB, 1024x1024)

1.27 MB PNG

>>109186552
>man hands

Anonymous
07/02/26(Thu)18:27:18 No.109186723

Anonymous 07/02/26(Thu)18:27:18 No.109186723

>>109186647
They are and they deserve all the bullying they get.
Cultureposter, post the full rentry.

Anonymous
07/02/26(Thu)18:27:42 No.109186726

Anonymous 07/02/26(Thu)18:27:42 No.109186726

>>109186561
i feel safe now

Anonymous
07/02/26(Thu)18:27:48 No.109186728

Anonymous 07/02/26(Thu)18:27:48 No.109186728

>>109186454
>But jart is not in the thread
This is a mikutroon thread. Mikutroons are actual troons.

Anonymous
07/02/26(Thu)18:27:56 No.109186730

Anonymous 07/02/26(Thu)18:27:56 No.109186730

>>109186721
lmao I remember making that Flux image back in 2024, takes me back

Anonymous
07/02/26(Thu)18:28:35 No.109186735

Anonymous 07/02/26(Thu)18:28:35 No.109186735

>>109186552
this is real

>>109186721
this is AI

Anonymous
07/02/26(Thu)18:29:07 No.109186738

Anonymous 07/02/26(Thu)18:29:07 No.109186738

File: [Coalgirls]_Yuru_Yuri_05_(...).png (2.26 MB, 1920x1080)

2.26 MB PNG

>>109186728
>Mikutroons are actual troons.
i wish im too old and hairy to be a cute girly boy

Anonymous
07/02/26(Thu)18:30:02 No.109186748

Anonymous 07/02/26(Thu)18:30:02 No.109186748

>>109186738
>too old and hairy
That never stopped any troon

Anonymous
07/02/26(Thu)18:30:09 No.109186749

Anonymous 07/02/26(Thu)18:30:09 No.109186749

>>109186721
Believe it or not, Miku isn't at home
Please leave a oo-ee-oo at the beep
I must be out, or I'd pick up the leek
Where could I be? Believe it or not, I'm not home

Anonymous
07/02/26(Thu)18:30:22 No.109186751

Anonymous 07/02/26(Thu)18:30:22 No.109186751

That's a child

Anonymous
07/02/26(Thu)18:33:08 No.109186761

Anonymous 07/02/26(Thu)18:33:08 No.109186761

File: 1783021816389301.png (2.13 MB, 1706x1432)

2.13 MB PNG

CHINA HAS ILLEGALLY DISTILLED FABLE/MYTHOS

Anonymous
07/02/26(Thu)18:33:51 No.109186763

Anonymous 07/02/26(Thu)18:33:51 No.109186763

>>109186761
>ILLEGALLY DISTILLED something that was trained on pirated books

Anonymous
07/02/26(Thu)18:34:55 No.109186771

Anonymous 07/02/26(Thu)18:34:55 No.109186771

>>109186761
If they keep opening up the copyright file it's going to slap them in the face eventually

Anonymous
07/02/26(Thu)18:35:13 No.109186774

Anonymous 07/02/26(Thu)18:35:13 No.109186774

>>109186761
Oy Vey!

Anonymous
07/02/26(Thu)18:37:11 No.109186794

Anonymous 07/02/26(Thu)18:37:11 No.109186794

I hate how many anons here make it impossible to write code with dignity.

Anonymous
07/02/26(Thu)18:38:03 No.109186797

Anonymous 07/02/26(Thu)18:38:03 No.109186797

>>109186761
how do you compare weights and biases of models you can't even download? i don't understand how they can detect similarity without having the actual model files.

Anonymous
07/02/26(Thu)18:40:03 No.109186815

Anonymous 07/02/26(Thu)18:40:03 No.109186815

>>109186771 (Me)
Because of basically this
>>109186763
Current court precedent only supports the fair use argument for free open source

Anonymous
07/02/26(Thu)18:40:39 No.109186821

Anonymous 07/02/26(Thu)18:40:39 No.109186821

>>109186794
Post code written with dignity.

Anonymous
07/02/26(Thu)18:45:47 No.109186859

Anonymous 07/02/26(Thu)18:45:47 No.109186859

>>109186797
nta and no idea if that matrix is fake
but something like this: https://github.com/sam-paech/slop-forensics

Anonymous
07/02/26(Thu)18:53:31 No.109186917

Anonymous 07/02/26(Thu)18:53:31 No.109186917

>>109186552
mikucunny ToT

Anonymous
07/02/26(Thu)18:53:32 No.109186918

Anonymous 07/02/26(Thu)18:53:32 No.109186918

>>109186440
>True KimiGODS use K2 with 2.7's vision encoder.
I tried this last time anon suggested it. It kind of works, but not accurately.

Anonymous
07/02/26(Thu)18:54:53 No.109186927

Anonymous 07/02/26(Thu)18:54:53 No.109186927

>>109186761
Why are you so gullible? You can tell immediately by glancing at Opus 4.7 and 4.8 or V4 Flash and Pro that the chart is meaningless.

Anonymous
07/02/26(Thu)18:55:09 No.109186930

Anonymous 07/02/26(Thu)18:55:09 No.109186930

>>109186721
>Jerry, I know you're having trouble picking up girls recently but are you sure this is a right idea?

Anonymous
07/02/26(Thu)18:56:45 No.109186945

Anonymous 07/02/26(Thu)18:56:45 No.109186945

>>109186797
literal and semantic shape of outputs or some similar heuristic
consider how you could buy a pack of chips from two stores and compare them side by side. if they look the same, taste the same and have other similarities you might deduce that the had similar sources or suppliers.

Anonymous
07/02/26(Thu)19:00:22 No.109186978

Anonymous 07/02/26(Thu)19:00:22 No.109186978

>>109186761
>ILLEGALLY

Anonymous
07/02/26(Thu)19:19:05 No.109187099

Anonymous 07/02/26(Thu)19:19:05 No.109187099

>>109186188
Thanks.
I settled on the smallest version for this model, with

~/BH/llama.cpp/build/bin/llama-server \
  --model ~/CB/models/gemma-4-26B-A4B-heretic-APEX-I-Mini.gguf \
  -ngl 999 \
  -ncmoe 12 \
  -c 122880 \
  -np 1 \
  -fa on \
  -ctk q8_0 \
  -ctv q8_0 \
  --no-mmap \
  --mlock \
  --flash-attn on \
  -b 1024 \
  -ub 256 \
  --host 0.0.0.0 \
  --port 8080

Getting 60t/s so far.

Anonymous
07/02/26(Thu)19:21:17 No.109187109

Anonymous 07/02/26(Thu)19:21:17 No.109187109

>>109186794
codes have no dignity

Anonymous
07/02/26(Thu)19:21:31 No.109187111

Anonymous 07/02/26(Thu)19:21:31 No.109187111

>>109186761
I hope so, kiketropic and Goymerica can suck a cock

>>109186797
they are called "trust me bro benchmarks" for a reason

Anonymous
07/02/26(Thu)19:23:28 No.109187126

Anonymous 07/02/26(Thu)19:23:28 No.109187126

>>109186794
rustrannies never had dignity.

Anonymous
07/02/26(Thu)19:23:33 No.109187127

Anonymous 07/02/26(Thu)19:23:33 No.109187127

>>109186266
so what was her next response?

Anonymous
07/02/26(Thu)19:24:12 No.109187133

Anonymous 07/02/26(Thu)19:24:12 No.109187133

>>109187099
>-heretic-APEX-I-Mini
How do people end up running these lobotomized versions? Nobody is ever recommending them here.

Anonymous
07/02/26(Thu)19:25:53 No.109187142

Anonymous 07/02/26(Thu)19:25:53 No.109187142

>>109187133
Lobotomized brains produce the best summaries, anon!

Anonymous
07/02/26(Thu)19:26:27 No.109187148

Anonymous 07/02/26(Thu)19:26:27 No.109187148

>>109187133
idk, you’d think it’s just the guy who created them trying to astroturf but they’re usually very retarded so I doubt it

Anonymous
07/02/26(Thu)19:29:21 No.109187167

Anonymous 07/02/26(Thu)19:29:21 No.109187167

File: Collection+4+A-1228518872.jpg (251 KB, 1920x1080)

251 KB JPG

Are the sub-1b models any good for vision? Specifically Qwen3.5 I don't wanna switch from my agent's current text-only model, but the temps get a little guy fieri when I run multiple larger ones concurrently

Anonymous
07/02/26(Thu)19:46:29 No.109187291

Anonymous 07/02/26(Thu)19:46:29 No.109187291

>>109187167
even fable sucks ass in 'human-like' vision

Anonymous
07/02/26(Thu)19:48:43 No.109187301

Anonymous 07/02/26(Thu)19:48:43 No.109187301

>>109187167
>sub-1b models
why so small?

Anonymous
07/02/26(Thu)19:51:25 No.109187317

Anonymous 07/02/26(Thu)19:51:25 No.109187317

>>109186561
It's finally over.

Anonymous
07/02/26(Thu)19:51:33 No.109187318

Anonymous 07/02/26(Thu)19:51:33 No.109187318

>>109187167
You might find something if you specialize

Anonymous
07/02/26(Thu)19:53:37 No.109187335

Anonymous 07/02/26(Thu)19:53:37 No.109187335

File: 1783033803575014.png (1.9 MB, 1178x1335)

1.9 MB PNG

Anonymous
07/02/26(Thu)19:54:16 No.109187338

Anonymous 07/02/26(Thu)19:54:16 No.109187338

check 'em
kek
jej

Anonymous
07/02/26(Thu)19:54:34 No.109187340

Anonymous 07/02/26(Thu)19:54:34 No.109187340

https://huggingface.co/ibm-granite/granite-vision-4.1-4b
didnt even know this was out lol

Anonymous
07/02/26(Thu)19:54:47 No.109187341

Anonymous 07/02/26(Thu)19:54:47 No.109187341

>>109187335
Should have stopped at
>simulating the universe to play minecraft
or something like that

Anonymous
07/02/26(Thu)19:56:34 No.109187354

Anonymous 07/02/26(Thu)19:56:34 No.109187354

File: dipsyAndDarioMoatMasher.png (2.58 MB, 1199x1312)

2.58 MB PNG

>>109186761
Fuck Dario.
That is all.

Anonymous
07/02/26(Thu)19:57:27 No.109187361

Anonymous 07/02/26(Thu)19:57:27 No.109187361

>>109187354
Add Kimi chasing him with a paddle that says "Wait, actually" on it.

Anonymous
07/02/26(Thu)20:03:05 No.109187388

Anonymous 07/02/26(Thu)20:03:05 No.109187388

>>109187354
What's the prompt for this?
>>109187361
But then the caption would be:
>Dario: is the model safe?
>Dipsy: of course!
>Kimi: wait, actually...

Anonymous
07/02/26(Thu)20:04:12 No.109187391

Anonymous 07/02/26(Thu)20:04:12 No.109187391

>>109187354
but why is the text in nipponese?

Anonymous
07/02/26(Thu)20:04:49 No.109187394

Anonymous 07/02/26(Thu)20:04:49 No.109187394

>>109187301
Not much space with an 8b already loaded in

Anonymous
07/02/26(Thu)20:13:45 No.109187429

Anonymous 07/02/26(Thu)20:13:45 No.109187429

>>109186918
The gemma trick of raising the token and resolution budget helps a bit, but it's not perfect.
>>109187388
kek

Anonymous
07/02/26(Thu)20:21:50 No.109187470

Anonymous 07/02/26(Thu)20:21:50 No.109187470

>>109187354
モ−トを守れ with the whale is weird

Shouln't it be モ−トを壊せ

Anonymous
07/02/26(Thu)20:24:39 No.109187483

Anonymous 07/02/26(Thu)20:24:39 No.109187483

File: dipsyDarioKimi.png (2.89 MB, 1536x1024)

2.89 MB PNG

>>109187388
> Using the above image as reference, show this man running away from a woman, covering his head, as she wields a comically large wooden mallet and is chasing him. The man is dressed in casual professional clothes, the woman is in a blue qipao with a whale theme, her hair in two buns, wearing round glasses. The massive mallet is labelled "MOAT MASHER." The composition should be comical, in an anime style, as she chases him down the streets of a large city.
>>109187361
Dammit, wrong sport...

Anonymous
07/02/26(Thu)20:24:56 No.109187486

Anonymous 07/02/26(Thu)20:24:56 No.109187486

>>109186197
>>109186561
so what is fable good for?
i put one instance of fable collaborating with opus on a project and opus is all praises to whatever fable does or suggest. is it a worse model in general but with better reasoning or more parameters and know more stuff in general? what DOES IT DO better?

Anonymous
07/02/26(Thu)20:25:42 No.109187494

Anonymous 07/02/26(Thu)20:25:42 No.109187494

>>109187340
whatcha gonna do with it, anon?

Anonymous
07/02/26(Thu)20:26:21 No.109187499

Anonymous 07/02/26(Thu)20:26:21 No.109187499

>>109187486
>so what is fable good for?
A money printing machine

Anonymous
07/02/26(Thu)20:26:26 No.109187500

Anonymous 07/02/26(Thu)20:26:26 No.109187500

>>109187483
Oh okay so you didn't really have to prompt him kek.

Anonymous
07/02/26(Thu)20:26:38 No.109187504

Anonymous 07/02/26(Thu)20:26:38 No.109187504

>>109187494
idk, i just noticed its existence

Anonymous
07/02/26(Thu)20:26:43 No.109187506

Anonymous 07/02/26(Thu)20:26:43 No.109187506

>>109187483
i didnt know prompting was so easy now
t. never genned an image before

Anonymous
07/02/26(Thu)20:29:52 No.109187529

Anonymous 07/02/26(Thu)20:29:52 No.109187529

File: dipsyKimiDotonbori.png (2.24 MB, 1024x1536)

2.24 MB PNG

>>109187500
LOL no, I didn't even try.
>>109187506
This really is a golden age for image boards.
If you can imagine it, you can create it.
It's fucking magic.

Anonymous
07/02/26(Thu)20:30:42 No.109187538

Anonymous 07/02/26(Thu)20:30:42 No.109187538

>>109187340
Gguf status?

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.

Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!