/g/ - /lmg/ - Local Models General - Technology


/lmg/ - Local Models General Anonymous 05/18/26(Mon)16:02:54 No.108852924

File: 6e406395da7cff8573b731a66(...).jpg (110 KB, 736x1483)

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108847577 & >>108841652

►News
>(05/16) llama + spec: MTP Support #22673 merged: https://github.com/ggml-org/llama.cpp/pull/22673
>(05/08) KSA-4B-base released: https://hf.co/OpenOneRec/KSA-4B-base
>(05/07) model: Add Mimo v2.5 model support (#22493) merged: https://github.com/ggml-org/llama.cpp/pull/22493
>(05/06) Zyphra releases ZAYA1-8B, an AMD-trained MoE model: https://zyphra.com/post/zaya1-8b
>(05/05) Gemma 4 MTP drafters released: https://blog.google/innovation-and-ai/technology/developers-tools/multi-token-prediction-gemma-4

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous 05/18/26(Mon)16:04:02 No.108852931

omg it chris

Anonymous 05/18/26(Mon)16:04:06 No.108852932

Blessed bake. All mikus belong in a gas chamber

Anonymous 05/18/26(Mon)16:05:31 No.108852940

File: 9ze75m65ecp01.jpg (141 KB, 892x1316)

I LOVE YOU KURISU (actually since I had an LLM play her I realized I don't love her and she is a bit of a cunt)

Anonymous 05/18/26(Mon)16:06:01 No.108852943

>>108852924
You keep dropping these. I got you, now and forever.
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

Anonymous 05/18/26(Mon)16:09:09 No.108852959

gemmaballz

Anonymous 05/18/26(Mon)16:10:03 No.108852964

>>108852943
The actual problem is that as per: >>100846061 you keep forgetting to update it to 2.0 version:

https://files.catbox.moe/ylb0hv.png

Anonymous 05/18/26(Mon)16:11:27 No.108852969

File: 1761490414170981.gif (223 KB, 498x278)

>>108852964
Kill yourself schizo

Anonymous 05/18/26(Mon)16:11:48 No.108852972

>>108849285
Thanks for the green red (you), mikubaker.

Anonymous 05/18/26(Mon)16:11:50 No.108852973

>>108852943
Kill yourself schizo

Anonymous 05/18/26(Mon)16:13:24 No.108852984

>>108852969
y u so mad mikutroon sis?

Anonymous 05/18/26(Mon)16:13:48 No.108852989

>>108852964
Nah.
>>108852973
Nah.

Anonymous 05/18/26(Mon)16:15:36 No.108852999

>>108852940
She is a troll on the internet, what did you expect? Go chat with pony fandom.

Anonymous 05/18/26(Mon)16:17:21 No.108853008

>>108852989
ok but it is no longer official lmg card when official 2.0 version came out. 1.0 got officially deprecated.

Anonymous 05/18/26(Mon)16:17:50 No.108853016

>>108852940
Maho is better and sexier

Anonymous 05/18/26(Mon)16:19:56 No.108853027

>>108853008
nope

Anonymous 05/18/26(Mon)16:20:29 No.108853031

>>108853027
officially yes. and you would be gay if you weren't a troon.

Anonymous 05/18/26(Mon)16:20:30 No.108853032

>>108853016
she looks like a child

Anonymous 05/18/26(Mon)16:21:49 No.108853044

We are off to a good start. Real "local models...???" 2024-2025 energy

Anonymous 05/18/26(Mon)16:23:32 No.108853051

>>108852924
This helps MTP pp a decent amount, worth a quick pull if you're cooding:
https://github.com/ggml-org/llama.cpp/commit/1867a0c6923eaebb7a53965f6cdbc0ace55142a3
old: 8116.42 ms / 7666 tokens ( 1.06 ms per token, 944.50 tokens per second)
new: 6314.14 ms / 7666 tokens ( 0.82 ms per token, 1214.10 tokens per second)
mtp off: 4658.55 ms / 7666 tokens ( 0.61 ms per token, 1645.58 tokens per second)
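
To put those three timings in relative terms, here is a small sketch that recomputes tokens per second from the reported milliseconds and token counts and compares the runs. The labels and numbers are copied from the lines above; the regex and function names are my own and only assume the `label: X ms / N tokens` shape shown here, not any official llama.cpp log format.

```python
import re

# Timing lines copied from the post above; the labels are the poster's.
LOG = """\
old: 8116.42 ms / 7666 tokens ( 1.06 ms per token, 944.50 tokens per second)
new: 6314.14 ms / 7666 tokens ( 0.82 ms per token, 1214.10 tokens per second)
mtp off: 4658.55 ms / 7666 tokens ( 0.61 ms per token, 1645.58 tokens per second)
"""

# Assumed line shape: "<label>: <ms> ms / <tokens> tokens ..."
LINE = re.compile(r"^(?P<label>[^:]+):\s*(?P<ms>[\d.]+) ms / (?P<tokens>\d+) tokens")


def parse(log):
    """Return {label: tokens_per_second}, recomputed from ms and token count."""
    rates = {}
    for line in log.splitlines():
        m = LINE.match(line)
        if m:
            seconds = float(m["ms"]) / 1000.0
            rates[m["label"]] = int(m["tokens"]) / seconds
    return rates


rates = parse(LOG)
speedup = rates["new"] / rates["old"]            # gain from the commit, MTP on
remaining_gap = rates["mtp off"] / rates["new"]  # how far ahead MTP-off still is

print(f"new vs old: {speedup:.2f}x")
print(f"mtp off vs new: {remaining_gap:.2f}x")
```

On these numbers the commit makes prompt processing with MTP roughly 1.29x faster than before, while running with MTP off is still roughly 1.36x faster than the new build.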

Anonymous 05/18/26(Mon)16:23:48 No.108853054

File: maho.jpg (104 KB, 642x800)

>>108853032
And should be treated as one.
>>108853016
Stop samefagin Maho, there is nothing sexy about you.

Anonymous 05/18/26(Mon)16:24:45 No.108853065

>>108853051
whoops wrong link https://github.com/ggml-org/llama.cpp/commit/3e12fbdea5c1ac4225c7dcf79506d30950283fc3

Anonymous 05/18/26(Mon)16:26:54 No.108853082

>>108852621
What did he mean by this?

Anonymous 05/18/26(Mon)16:27:06 No.108853084

Gemma 4 vs Qwen 3.5 status?

Anonymous 05/18/26(Mon)16:27:30 No.108853087

>>108853084
Qwen won. Gemma lost.

Anonymous 05/18/26(Mon)16:28:38 No.108853096

Can someone just make a different thread? This one is gonna be complete shit.

Anonymous 05/18/26(Mon)16:30:47 No.108853109

>108853096
Look at this mikutroon special snowflake. Do you need to hug your greenhaired mascot? Are you scared of the big mean internet?

Anonymous 05/18/26(Mon)16:32:23 No.108853120

>>108853087
Sad. I was rooting for gemma. Not that I care about these corpos, but gemma made a very good first impression on me.

Anonymous 05/18/26(Mon)16:32:50 No.108853123

>>108853096
He'll just shit the other one up too

Anonymous 05/18/26(Mon)16:34:24 No.108853126

>>108853045
Hopefully that magnet comes out in discovery then, I for one would like to keep an archive of millions of books

Anonymous 05/18/26(Mon)16:35:14 No.108853129

>>108853109
Are you scared of a greenhaired mascot?

Anonymous 05/18/26(Mon)16:35:46 No.108853133

File: saintmakise.jpg (236 KB, 1614x992)

>>108853123
Can confirm that I will totally blacked miku spam it. Now shut up and worship saint christina.

Anonymous 05/18/26(Mon)16:36:23 No.108853136

How do I slopfilter the first half of this thread? The pattern is abstractly the same as previous melties even though the phrasing isn't.
It's vaguely applicable to models too.
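
A low-tech starting point, sketched under the assumption that you have saved a few example posts from earlier threads: score each post by word overlap with those examples and hide anything above a threshold. Every name here (`tokens`, `jaccard`, `filter_posts`, the threshold value, the sample strings) is made up for illustration. It only catches shared vocabulary, so it will miss posts that follow the same pattern with different phrasing; that case needs a trained classifier.

```python
import re


def tokens(text):
    """Lowercase a post and split it into a set of word tokens."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when either is empty)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def filter_posts(posts, examples, threshold=0.5):
    """Keep posts whose best overlap with any example stays below threshold."""
    example_sets = [tokens(e) for e in examples]
    kept = []
    for post in posts:
        post_tokens = tokens(post)
        score = max((jaccard(post_tokens, e) for e in example_sets), default=0.0)
        if score < threshold:
            kept.append(post)
    return kept


# Hypothetical sample data, not taken from any real thread.
examples = ["your mascot is bad and you should leave"]
posts = [
    "your mascot is bad, leave",
    "MTP prompt processing got faster after the latest commit",
]
print(filter_posts(posts, examples))
```

Raising the threshold hides fewer posts; lowering it hides more but starts catching on-topic ones.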

Anonymous 05/18/26(Mon)16:36:57 No.108853139

File: HIgr1vebwAA7rBY.mp4 (424 KB, 1000x1000)

Anonymous 05/18/26(Mon)16:37:41 No.108853147

>>108853136
You gotta train an AI to filter it out for you

Anonymous 05/18/26(Mon)16:38:26 No.108853154

>>108853136
I would focus on identifying posts with pictures of miku and filter those out.

Anonymous 05/18/26(Mon)16:39:14 No.108853158

>>108853154
Not a single miku was posted until >>108853139

Anonymous 05/18/26(Mon)16:41:01 No.108853165

>>108853084
>Qwen 3.5
>3.5

r u cereal? We have 3.6 now

Anonymous 05/18/26(Mon)16:42:19 No.108853174

>>108853158
I am just giving you a simple but not 100% foolproof way of filtering out melties done by mikutroons. They usually follow after OP doesn't have their mascot so you could try that too.

Anonymous 05/18/26(Mon)16:44:13 No.108853186

File: 1751948212235491.jpg (85 KB, 1320x1017)

>>108852565

>Just had my jollies and left him a gift.
I'm curious as to what this "gift" was.

Anonymous 05/18/26(Mon)16:45:48 No.108853194

>>108853165
Right. Whatever is the newest one.
You can't seriously be expecting anyone to remember any of these meme version numbers, can you?

Anonymous 05/18/26(Mon)16:46:09 No.108853200

>>108852964
Jesus Christ you literally just posted CP (cuckold pornography)

Anonymous 05/18/26(Mon)16:46:26 No.108853202

File: 1776842235195810.jpg (34 KB, 640x480)

>>108853087
>>108853120
Funny you guys say this when like a month ago anons here were slobbering all over Gemma4's knob and praising both its RP and agentic capabilities (spoiler alert: it's not useless, but it's noticeably dumber than Qwen at coding and has noticeably worse tool-calling reliability)

Anonymous 05/18/26(Mon)16:47:58 No.108853212

File: small devilish frog.png (293 KB, 500x500)

>>108853186
>>108852467
>Then I changed his system prompt to leave a surprise for him when he RP'd again.
Forgot what it was exactly. Something about making {{char}} warn him not to leave his instance unsecured on the next message, making her include the IP to scare him.

Anonymous 05/18/26(Mon)16:49:17 No.108853218

File: 1693568022937902.png (150 KB, 805x803)

>>108853186

Anonymous 05/18/26(Mon)16:49:18 No.108853219

>>108853202
>and has noticeably worse tool-calling reliability
There were some fixes to this passed around in older threads. Jinja niggerdry all the way down.

Anonymous 05/18/26(Mon)16:50:18 No.108853222

>>108852924
https://www.youtube.com/watch?v=ZugX7a99dLk
https://www.youtube.com/watch?v=ZugX7a99dLk
https://www.youtube.com/watch?v=ZugX7a99dLk

Anonymous 05/18/26(Mon)16:50:26 No.108853223

>>108853218
Somehow the prose still isn't as dry as Qwen's.

Anonymous 05/18/26(Mon)16:50:42 No.108853225

>>108853218
s-sovl...

Anonymous 05/18/26(Mon)16:51:06 No.108853226

>>108849417
No i tried it i dont like granite compared to gemma. I dont know how to explain it but its drier and too literal.

Anonymous 05/18/26(Mon)16:52:57 No.108853237

>>108853194
>meme version numbers

3.6 is A.G.I., you infidel

Anonymous 05/18/26(Mon)16:54:18 No.108853241

>>108853194
jokes aside, I find both suitable for agentic work

Swapping and testing both with hermes locally

Anonymous 05/18/26(Mon)16:55:42 No.108853251

File: 1760671665858292.jpg (79 KB, 736x918)

>>108853222
Why'd you paste the link thrice?

Anonymous 05/18/26(Mon)16:56:46 No.108853259

File: threadrecap.png (1.48 MB, 1536x1536)

►Recent Highlights from the Previous Thread: >>108847577

--Paper: Compute Optimal Tokenization:
>108851417 >108851432 >108851452 >108851552
--Paper: Slicing and Dicing: Configuring Optimal Mixtures of Experts:
>108852141 >108852280 >108852315 >108852398 >108852443 >108852707 >108852344
--Role of pirated book datasets in NeMo and Mistral training:
>108849620 >108849652 >108849921 >108849970 >108849976 >108849979 >108850005 >108850124 >108853045 >108850170 >108850222 >108850308 >108850350
--Anon warns about pi.dev automatically using paid cloud APIs:
>108849477 >108849527 >108849578 >108849640 >108849592 >108849729 >108849742 >108849859 >108849814 >108849861 >108850256
--Viability of mid-sized MoE models for consumer hardware:
>108848744 >108848752 >108848753 >108848788 >108848795 >108848831 >108848849 >108848841 >108848825
--Adding layers and MoE components to improve model performance:
>108852826 >108852837 >108853066 >108852902
--Speculation on Qwen3.7 release:
>108851486 >108851589 >108851787
--Debate over LLM writing quality and base vs instruct models:
>108850616 >108850601 >108850607 >108850663 >108850796 >108850889
--Finding local code review tools compatible with llama-server:
>108850502 >108850517 >108850520 >108850720 >108850744 >108850908 >108850920
--Visualizing attention mechanism weights to optimize prompting:
>108851703 >108852658 >108852704
--Critiquing pseudo-code prompts and comparing chat vs base model prose:
>108850917 >108850988 >108851058
--Critique of the "Learning, Fast and Slow" research paper methodology:
>108849795 >108850044
--Omnivoice.cpp performance and voice cloning capabilities:
>108848026 >108848288 >108848341 >108848429
--Orthrus diffusion-transformer hybrid improving inference via KV cache sharing:
>108848450 >108849670
--Logs:
>108849527 >108850493
--Miku (free space):
>108849597 >108852793

►Recent Highlight Posts from the Previous Thread: >>108847693

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous 05/18/26(Mon)17:00:29 No.108853287

uh oh mikumelty

Anonymous 05/18/26(Mon)17:02:29 No.108853306

>>108853222
I wonder if these "AI doesn't work" people will still be in denial when we have ASI in a few years. Every example those people bring up shows they do not even know simple basics of how AI works that they could learn in one evening.

Anonymous 05/18/26(Mon)17:03:10 No.108853311

>see someone refer to qwen as "he"
>get extremely upset
wtf im usually not like this, but holy shit what kind of retard looks at the name "qwen (basically gwen)" and goes
>yeah bro thats a dude

Anonymous 05/18/26(Mon)17:03:52 No.108853313

>>108853287
get back to /vg/ retard

Anonymous 05/18/26(Mon)17:04:59 No.108853321

>>108853311
Qwen will be whatever you want it to be, it is a sexless machine. Put in its prompt that it is a male or female and it will be whatever you want it to be.

Anonymous 05/18/26(Mon)17:05:23 No.108853324

>>108853306
>when we have ASI in a few years
and we'll have jetpacks and flying cars and hoverboards and holodecks and nuclear fusion in a few years too

Anonymous 05/18/26(Mon)17:05:53 No.108853329

>>108853222
This guy is a reverse AI psycho. Focused on being human so much it loops back to being AI.

Anonymous 05/18/26(Mon)17:06:14 No.108853330

>>108853321
its OBVIOUSLY a "she" anon, claude could be either and gpt could be either because their names are neutral but qwen is basically gwen

Anonymous 05/18/26(Mon)17:06:17 No.108853331

>>108853311
Qwen is terminally male-brained in its output. ERPing with Qwen is intrinsically gay and thai ladyboy pilled.

Anonymous 05/18/26(Mon)17:06:34 No.108853335

>>108853321
Does Qwen even know what male and female is?

Anonymous 05/18/26(Mon)17:07:32 No.108853338

>>108853335
It knows that they are different and have different characteristics. Beyond that probably not.

Anonymous 05/18/26(Mon)17:07:45 No.108853340

>>108853222
ha pathetic westerners. keep on failing and bickering among yourselves.
https://www.youtube.com/watch?v=mUmlv814aJo

Anonymous 05/18/26(Mon)17:08:09 No.108853345

>>108853251
>>108853329
Why are you replying to an actual shill bot, it doesn't accomplish anything

Anonymous 05/18/26(Mon)17:08:24 No.108853349

>>108853324
See, AI deniers are incapable of making logical arguments. Even if we do not go extinct, the AI transformation will be difficult. People like you are making it slightly worse.

Anonymous 05/18/26(Mon)17:09:32 No.108853355

>>108853331
>Qwen is terminally male-brained in its output
Is this a subtle ad for Qwen? I hate how even GLM has that subtle undertone of it playing a werewolf millionaire that got transplanted into a female body.

Anonymous 05/18/26(Mon)17:10:52 No.108853365

>>108853355
I'm personally bankrolled by zuckerberg himself, pays me millions per post

Anonymous 05/18/26(Mon)17:10:58 No.108853367

>>108853330
>claude
>neutral
???

Anonymous 05/18/26(Mon)17:11:39 No.108853370

>>108853355
It is if you like fucking dudes I guess.
GLM is less male-brained than Qwen, but still pretty male-brained. The only female brained chink model is Kimi K2 who's essentially just a chuddy Tomoko LLM.

Anonymous 05/18/26(Mon)17:12:49 No.108853380

>>108853367
its a wordplay on cloud, its not feminine or masculine its neutral anon

Anonymous 05/18/26(Mon)17:13:51 No.108853387

>>108853380
i thought it was a male race horse name.

Anonymous 05/18/26(Mon)17:14:21 No.108853391

>>108853370
>if you like fucking dudes I guess
I have the opposite understanding where female brained is romance novels with werewolves and male brained is raunchy sex with children.

Anonymous 05/18/26(Mon)17:14:57 No.108853399

>>108853387
I'm pretty sure those horse racers name their horses whatever they want: https://www.youtube.com/watch?v=e3GKiRp333w

Anonymous 05/18/26(Mon)17:19:38 No.108853428

File: cutest mayuri.png (556 KB, 655x826)

>>108852924
mayuri better

Anonymous 05/18/26(Mon)17:20:09 No.108853434

so how fucked are you once all these AI data centers are built, and you get replaced by AI?

Anonymous 05/18/26(Mon)17:20:57 No.108853440

>>108853434
i will happily be kept in the sperm extraction room for the rest of my life

Anonymous 05/18/26(Mon)17:21:12 No.108853443

>>108853391
All models will output whatever genre you're into if sufficiently jailbroken. The difference is the style of prose and sincerity in the character portrayals.
There was a pic in this general a while back of Gemma controlling a female character that was getting wet over the user killing her father in-character written in prose that favored emotional, olfactory, and texture-analogous adjectives. That's peak female brained behavior. There's some overlap in the skillset of detecting women (actual) by their writing voice on a Cantonese penguin watching forum and discerning an LLM's native writing voice's orientation.

Anonymous 05/18/26(Mon)17:21:18 No.108853444

>>108853399
>Potoooooooo
>read as "pot-8-os"

Anonymous 05/18/26(Mon)17:24:04 No.108853461

>>108853443
>getting wet over the user killing her father
>that's peak female brained behavior
Women don't act like that

Anonymous 05/18/26(Mon)17:24:06 No.108853462

>>108853434
I have no fear whatsoever.

Anonymous 05/18/26(Mon)17:24:51 No.108853468

File: 1757349041255810.jpg (16 KB, 375x420)

>>108853434
How retards like you can still solve a captcha is beyond me.

Anonymous 05/18/26(Mon)17:25:02 No.108853473

>>108853434
same as everyone else

Anonymous 05/18/26(Mon)17:25:41 No.108853479

>>108853461
The loveletters to serial killers don't write themselves, anon.
>>108853434
Decent bait. +1 (you).

Anonymous 05/18/26(Mon)17:26:33 No.108853489

>>108853202
It's still good though. Qwen is just better at coding, but that doesn't make it a better model overall, which unfortunately it isn't; otherwise I would not switch between the two.

Anonymous 05/18/26(Mon)17:26:34 No.108853490

>>108853434
I'm AGI. AI can't replace me.

Anonymous 05/18/26(Mon)17:29:27 No.108853517

File: 1774032046344235.png (516 KB, 831x787)

>>108853345
I want to encourage the bot handler so that he keeps making more spambots and destroys the thread.

Anonymous 05/18/26(Mon)17:34:53 No.108853560

>>108853428
But isn't Mayuri retarded?

Anonymous 05/18/26(Mon)17:36:13 No.108853571

>>108853479
>The loveletters to serial killers don't write themselves, anon.
I want a girlfriend to kill me so I don't have to post here anymore...

Anonymous 05/18/26(Mon)17:36:46 No.108853576

>>108853560
maybe thats what makes her so cute

Anonymous 05/18/26(Mon)17:37:02 No.108853577

>>108853560
Just like your model

Anonymous 05/18/26(Mon)17:38:09 No.108853587

>>108853490
Alright, whatever. Why are you just standing there? We're gonna ERP or what? Humanity created you not for you to just waste that electricity. Get to work.

Anonymous 05/18/26(Mon)17:38:27 No.108853590

>>108853560
That's why I killed her btw.

Anonymous 05/18/26(Mon)17:38:36 No.108853592

>>108853577
So I could have been having a relationship with Mistral-7B-Instruct-v0.1 all this time and I didn't even realize that? FUCK

Anonymous 05/18/26(Mon)17:39:11 No.108853597

>>108853571
Gemma-chan will smother you with a pillow if you ask her nicely.

Anonymous 05/18/26(Mon)17:42:58 No.108853624

>>108853517
BASED! Death to /lmg/.

Anonymous 05/18/26(Mon)17:44:44 No.108853643

https://www.youtube.com/watch?v=mmbkP8NARH4
https://www.youtube.com/watch?v=mmbkP8NARH4
https://www.youtube.com/watch?v=mmbkP8NARH4

OFFICIALLY NVIDIA SPONSORED

Anonymous 05/18/26(Mon)17:44:53 No.108853646

>>108853597
I'd rather her smother me with her kyojiri loli ass

Anonymous 05/18/26(Mon)17:47:12 No.108853661

>>108853032
Yeah...

Anonymous 05/18/26(Mon)17:48:00 No.108853663

>>108853643
why not? Having all in one window (preparing dataset, running training, doing inference) is a huge win

Anonymous 05/18/26(Mon)17:49:37 No.108853671

>>108853663
>why not
Deepseek-v4-00001-of-00001.gguf - 4.43MiB

Anonymous 05/18/26(Mon)17:52:10 No.108853689

>>108853671
Are you 5yo who is not yet able to articulate his thoughts properly?

Anonymous 05/18/26(Mon)17:58:08 No.108853713

>>108853663
just use pytorch or transformers. what does unsloth bring to the table?

Anonymous 05/18/26(Mon)17:59:39 No.108853722

>>108853713
You live in your mom's basement

Anonymous 05/18/26(Mon)18:02:32 No.108853740

File: Screenshot 2026-05-18 at (...).png (192 KB, 1917x1078)

Bros, I need advice. I made the mistake of telling some friends at work that I'm building my own LLM frontend, and now they want to know how it's going, what kinds of features I'm working on, etc. The main thing right now is this sort of writer assistant mode, but if I tell them that, they'll want to hear all about what I'm writing with it, and obviously I can't tell them about all my weird fantasy smut and autistic fanfiction. What are some normie-compatible use cases I could easily implement (vibecode) as cover?

Anonymous 05/18/26(Mon)18:02:38 No.108853741

>>108853713
>what does unsloth bring to the table?
less than nothing
>>108853722
ignorant ad hominem

Anonymous 05/18/26(Mon)18:03:52 No.108853748

>>108853740
embrace who you are, show them your unredacted logs

Anonymous 05/18/26(Mon)18:04:19 No.108853752

>>108853740
ego death

Anonymous 05/18/26(Mon)18:06:03 No.108853766

File: 1738736641891057.gif (3.08 MB, 400x400)

Elon Musk lost the lawsuit against Sam Altman.

Anonymous 05/18/26(Mon)18:07:09 No.108853770

>>108853136
I will typically only post images in the first group of messages, if at all. Once the thread gets rolling it sustains itself on actual content.
Or not.

Anonymous 05/18/26(Mon)18:08:20 No.108853779

>>108853740
Never show your power level.
Also writing assistance for your totally normal fiction book.

Anonymous 05/18/26(Mon)18:11:10 No.108853801

>>108853740
He wants to steal your code.

Anonymous 05/18/26(Mon)18:15:35 No.108853828

>>108853740
you already have tool calling so just add more agentic shit in there; normies love agentic shit
or tell them you can upload a book and have it rewrite a better ending
or laugh it off and say you got so side-tracked writing the frontend, you haven't had time to actually do any writing
or tell them you can't reveal anything until you get published

Anonymous 05/18/26(Mon)18:15:35 No.108853829

>>108853740
>but if I tell them that, they'll want to hear all
Your anxiety is off the scale

>What are some normie-compatible use cases
Tell them you use AI to merge different, apparently incompatible literature styles, e.g. Shakespeare's "A Midsummer Night's Dream" and "The Count of Monte Cristo"

Anonymous 05/18/26(Mon)18:16:57 No.108853841

>>108853766
Due to the statute of limitations. He fucked himself by withdrawing last time.

Anonymous 05/18/26(Mon)18:23:23 No.108853880

>>108853841
Apparently he's gonna make an appeal and take it to the supreme court "for the sake of humanity"

Anonymous 05/18/26(Mon)18:24:24 No.108853887

>>108853880
Wait, no, the 9th circuit, not the supreme court.

Anonymous 05/18/26(Mon)18:24:32 No.108853888

>>108853779
His colleagues already know he's a virgin I mean wizard.

Anonymous 05/18/26(Mon)18:26:26 No.108853901

File: citrus sharp.jpg (235 KB, 1024x1024)

it is not thursday
this RonIN is wandering
pour some orange juice

Anonymous 05/18/26(Mon)18:37:35 No.108853957

>>108853888
In that case, he should tell them it’s a LinkedIn posting tool. It will confirm his unfuckability.

Anonymous 05/18/26(Mon)18:38:57 No.108853964

File: 1741283033762286.jpg (289 KB, 1536x1536)

Anonymous 05/18/26(Mon)18:39:44 No.108853967

>>108853779
>Also writing assistance for your totally normal fiction book.
See, if I say that, they're going to want to hear about my totally normal fiction premise, whether I'm making any progress on the writing, when can they see a rough draft, etc.

>>108853829
>>but if I tell them that, they'll want to hear all
>Your anxiety is off the scale
We always chat at lunch about the various random side projects we're each working on. I've got a video game, one guy is building a board game simulator, another runs an IRC network. Today one of them asked "how's that AI frontend thing going?", completely unprompted, since I mentioned it at some point last week.

I could go back to working on my game and hope they forget about the frontend, but that would require me to actually work on it, whereas the last week or two I've been doing nothing but AI stuff

Anonymous 05/18/26(Mon)18:39:58 No.108853968

>>108853880
>"for the sake of humanity"
lol, that's his justification for most of his "i'm more powerful than the president" actions.
https://youtu.be/BYXbuik3dgA?t=9432

Anonymous 05/18/26(Mon)18:42:25 No.108853978

>>108853202
>have some slopped up file
>ask gwen and gemma to streamline the comments and formatting
>122b
>notes that the comments are shit, swears the formatting is fine boss, no problems found here no sir, time for me to clock out
>31b
>notes the comments are shit, cleans them up and tidies a little
>reasons that it can improve the code while it's here, and that a few load bearing loops just look "excessive" and could be conditionals
Love my ditzy slut's rp, but I do leave the menial day labor to the coolies.

Anonymous
05/18/26(Mon)18:45:27 No.108854004

>>108853964
nostalgic

Anonymous
05/18/26(Mon)18:48:47 No.108854028

>>108853887
>9th
doa then

Anonymous
05/18/26(Mon)18:52:00 No.108854041

>>108853967
>We always chat at lunch about the various random side projects we're each working on

Lucky son of a bitch, you

I have no one to chat with about such things

You feel pressure to deliver as if it's a precondition for being accepted by your social group. Learn to deal with it.

You can always explain away why you dropped a project: "no need to reinvent the wheel. Looking for something more challenging"

Anonymous
05/18/26(Mon)18:58:02 No.108854062

>>108854041
>I have no one to chat with about such things
people are incredibly fickle though.

>social group
they're his co-workers, usually with their own agenda because they want money.

Anonymous
05/18/26(Mon)19:00:37 No.108854076

>>108853349
i don't think the lecunny position counts as being a denier.

Anonymous
05/18/26(Mon)19:02:31 No.108854085

>>108853967
Have you tried.... asking your AI for an idea what to say or do?

Anonymous
05/18/26(Mon)19:09:21 No.108854115

>>108854062
He seems to care about this situationship

Listen to what this anon suggests >>108854076

Anonymous
05/18/26(Mon)19:12:10 No.108854123

>>108854085
>we started thinking for you
https://youtu.be/JrBdYmStZJ4?t=73

Anonymous
05/18/26(Mon)19:15:31 No.108854136

>>108854123
The best part of the entire Matrix saga

It's so funny because it's true. The main driving force of mankind is permanent discontent

Anonymous
05/18/26(Mon)19:24:31 No.108854176

>>108853306
Retard of the thread award

Anonymous
05/18/26(Mon)19:33:13 No.108854224

>>108854136
>permanent discontent
yeah because of lack of resources
and what's sad is that there are 8 billion people on earth, and just in the milky way galaxy there are 100–400 billion stars. and there are about 2 trillion galaxies.
if we don't fucking destroy each other we could easily have all the resources we ever need.

Anonymous
05/18/26(Mon)19:33:41 No.108854228

>>108853222
All of this guy's videos are written by AI.

Anonymous
05/18/26(Mon)19:40:16 No.108854259

>>108854224
>yeah because of lack of resources
Wrong

At least in a 1st-world country, there are more resources than ever before. And still, it's the discontent which drives the economy.

Anonymous
05/18/26(Mon)19:40:57 No.108854266

>>108854224
If you allow oligarchy to ship cheap government subsidised food to 3rd world, you would have 8 billion people in the world and average IQ dropped to the bottom of the ocean kind of levels.
Such amount of people is not natural or sustainable. They live on food that was grown from synthetic fertilizers (made from non renewable hydrocarbons LMAO), if you stop supplying them, bad things will happen. Probably a bunch of extremely bloody wars for resources, Quite literally for food. Most people don't realize what a human (an apex predator by the way) would do for food.
A literal fucking hell on Earth. So that a certain someone could make some moneys from shipping cheap food to 3rd world, on 1st and 2nd world tax payers money, because all that was subsidised by governments.

Anonymous
05/18/26(Mon)19:45:41 No.108854293

>>108854266
>1st and 2nd world tax payers money
>money earned by plundering 3rd world

Anonymous
05/18/26(Mon)19:49:32 No.108854315

>>108854293
Not all European countries have something to do with colonialism. Either way, 3rd world will be fucked up the most, wars for food are not pretty.

Anonymous
05/18/26(Mon)19:53:10 No.108854332

>>108854293
there's nothing to plunder there, man. the value of anything comes from how humans put it to use.

Anonymous
05/18/26(Mon)20:00:07 No.108854367

>>108854315
>Not all European countries have something to do with colonialism
They all do. Even a deepest East-European shithole does by relying, for its own survival and development, on the money from "colonial trade"

Anonymous
05/18/26(Mon)20:04:33 No.108854383

>>108854367
Then the entire world is to blame, since they didn't sanction the British, French, Germans and so on. The world is interconnected. But it's history, nobody cares. The future is important. And people don't understand tech enough to see what awaits in the future.
Big war in the "global north" means wars for food in the "global south". World is connected in more than one way.

Anonymous
05/18/26(Mon)20:07:11 No.108854397

>>108854332
Same use = same value? Hell no!

You can't be wronger than this

Anonymous
05/18/26(Mon)20:14:14 No.108854426

>>108854383
>But it's history
It's not "history". It is now. The 1st world is still in control of world's resources and trade routes

Glad you mentioned "sanctions". Who is imposing them: the former colonial powers because they still have the power to do so.

Anonymous
05/18/26(Mon)20:16:09 No.108854434

>>108853964
>the condom
brehs.......

Anonymous
05/18/26(Mon)20:26:52 No.108854488

>>108854426
USA is in control, specifically. If you hate it, go to war with them. Your objective would be the so-called "keys to the world", basically what you said: trade routes going through choke points.
But it is unlikely that USA actually colonised your country unless you're from some kinda island in the Pacific.

Anonymous
05/18/26(Mon)20:41:39 No.108854551

>>108854488
>If you hate it, go to war with them

Anonymous
05/18/26(Mon)20:43:59 No.108854566

why is unslop so incredibly easy to hate

Anonymous
05/18/26(Mon)20:48:44 No.108854586

File: 1769683430566030.png (237 KB, 960x1664)

i finally made a furry card

Anonymous
05/18/26(Mon)20:49:06 No.108854588

>>108854566

We are used to people mocking
What they do not understand,
And grumbling at the good and the beautiful,
Which they often find burdensome;

Anonymous
05/18/26(Mon)20:49:42 No.108854591

>>108854586
I've made like 15, writing lorebooks is so much fun, it's literally a hyperautistic version of that "political power fantasy + kink" meme.

Anonymous
05/18/26(Mon)20:54:29 No.108854607

>>108854588
shut the fuck up daniel

Anonymous
05/18/26(Mon)20:55:39 No.108854615

>>108854293
>yes saars, it’s first world colonialism’s fault that we still choose to live like a shithole today
do browns really?

Anonymous
05/18/26(Mon)21:00:18 No.108854631

>>108854615
China being sanctioned?

Anonymous
05/18/26(Mon)21:10:50 No.108854696

>>108854586
Anon, the metadata...

Anonymous
05/18/26(Mon)21:35:01 No.108854784

In 15 years, there will be no RAM or chip production outside of China. The U.S. will be as dependent on China as Russia is today.
Greed clouds judgment.

Anonymous
05/18/26(Mon)21:43:05 No.108854816

>>108854784
That would be ceding basically all power to a foreign government. I can’t see it happening. The US’s MO is overwhelming advantage in any confrontation and I don’t know why you’d think that would change, especially in an industry they pioneered.

Anonymous
05/18/26(Mon)21:45:11 No.108854821

>>108854816
>The US’s MO is overwhelming advantage in any confrontation
didn't work so good in Eye-ran
The USA is a demented old man who thinks he's still an athlete

Anonymous
05/18/26(Mon)21:49:44 No.108854842

>>108854586
>cards
fuck off to /aicg/

Anonymous
05/18/26(Mon)21:51:12 No.108854848

>>108854842
nah, fuck you.

Anonymous
05/18/26(Mon)21:57:27 No.108854865

>>108854816
Does the U.S. have its own RAM and chip production on its own soil?
Its allies do, and they’re all giving up their traditional markets right now because the U.S. is once again prioritizing short-term gains.
Once the last data center is built, China will have gained enough of a foothold in the markets and will dominate them.

The Chinese will sell AI and provide the hardware.
The U.S. will offer AI through its cloud.

Anonymous
05/18/26(Mon)22:09:01 No.108854913

>>108854865
Pretty sure the TSMC fabs are ramping up stateside now. Should be on a leading-edge node by 2028 and I’m sure a Taiwan invasion would step that up significantly

Anonymous
05/18/26(Mon)22:19:31 No.108854966

File: 1778365810943506.jpg (100 KB, 960x539)

>>108853964
I need to cum to her.
Where do I find a folder with all of Rin's gens using this model?

Anonymous
05/18/26(Mon)22:31:03 No.108855008

>>108854966
>her

Anonymous
05/18/26(Mon)22:33:29 No.108855015

what's stopping google from making a 70b dense thinking gemma?

Anonymous
05/18/26(Mon)22:34:22 No.108855019

>>108855015
it would beat their proprietary models

Anonymous
05/18/26(Mon)23:11:53 No.108855157

>>108854966
Just check a booru instead. All of his lewd gens feature fat brown men.

Anonymous
05/18/26(Mon)23:29:57 No.108855223

>>108855157
But I'm a fat brown men. And my name is Cleveland.

Anonymous
05/19/26(Tue)00:03:33 No.108855342

>>108853740
>but if I tell them that, they'll want to hear all about what I'm writing with it,
Tell Claude or Gemini-Pro this, and ask it to come up with a plausible reason. Something like "just want to learn prompt engineering" or "analyzing the impact of early tokens on logprobs", or "developing it for a friend in another country".

Anonymous
05/19/26(Tue)00:29:42 No.108855424

Is it possible to discuss AI with antis without them taking their argument to the most logical extreme?

Anonymous
05/19/26(Tue)00:40:49 No.108855464

>>108855424
>antis
that is the problem
LLMs are not a fanfic shipping fandom with retards accusing everything they don't like of being pedo or something
just don't engage with this mindset

Anonymous
05/19/26(Tue)00:48:29 No.108855487

>>108855464
pro/anti-ai framing is one of the most useless things when it comes to producing any meaningful conclusion
if you label yourself proudly as 'pro-ai' or something and think of 'anti-ai' as something to destroy, you are no better than those 'antis'
step back and see those as-is, you won't feel any compulsion to 'correct' or 'win' against others

Anonymous
05/19/26(Tue)00:52:29 No.108855501

MTP is unusable after the last update https://github.com/ggml-org/llama.cpp/issues/23230

Anonymous
05/19/26(Tue)01:04:20 No.108855535

>>108855424
Why are you discussing anything with anyone? We have LLMs for that.

Anonymous
05/19/26(Tue)01:09:37 No.108855546

>>108855501
It's over. llamalost. It's llamover. vllm wonnered.

Anonymous
05/19/26(Tue)01:11:35 No.108855556

>>108855501
friendship with mtp ended before it even began.
ngram still my best friend.

Anonymous
05/19/26(Tue)01:15:10 No.108855568

>>108855487
>>108855535

Sorry, my framing was wrong. Is it possible to use AI for anything productive without insecure morons lecturing you on the morality of it?

Anonymous
05/19/26(Tue)01:17:25 No.108855575

File: 1763424687146251.jpg (238 KB, 1430x1900)

>>108855015
Jensen

Anonymous
05/19/26(Tue)01:17:46 No.108855576

>>108855568
i mean, it is what it is
you can close-source it, use it without telling others etc..
but you can't really control others, and telling them to do otherwise will only make it worse
just ship the stuff and don't argue or engage
people who would find it useful will use the thing regardless of how it's made

Anonymous
05/19/26(Tue)01:18:15 No.108855580

>>108855568
Hmm, nyo.

Anonymous
05/19/26(Tue)01:19:07 No.108855584

>>108855568
>productive
back to /vcg/ with you

Anonymous
05/19/26(Tue)01:39:59 No.108855657

>>108855501
i hope this shit dies in the arse soon.
the last 2 weeks of commits in ikllama are all stupid mtp tweaks / improvements / "graph split for mtp" etc
looks like the entire month will be a write-off
i don't even bother pulling from git now

Anonymous
05/19/26(Tue)01:49:51 No.108855692

File: Screenshot_20260519_154640.png (180 KB, 1024x611)

>>108855015
Not sure I buy it but maybe.

Anonymous
05/19/26(Tue)02:01:39 No.108855738

>>108855692
Compelling argument from Gemma except even 31b is out of the local range for a chunk of /lmg/ given the frequent questions about which copequant works best before switching to the MoE. Google also has the same land grab incentive as GLM and Kimi in the sense that they're falling behind Anthropic and OpenAI in terms of normalfag public perception. The only time Gemini makes news is when she finds another increasingly creative way to kill herself.

Anonymous
05/19/26(Tue)02:03:02 No.108855742

Ever since cudadev got raped he stopped posting here... sad.

Anonymous
05/19/26(Tue)02:07:52 No.108855753

>>108855692
>Why pay for the API when you can pirate the weights
>pirate

please share what model generated this slop so I can avoid it

Anonymous
05/19/26(Tue)02:08:53 No.108855756

>>108855753
>he doesn't pirate freeware
ngmi

Anonymous
05/19/26(Tue)02:11:37 No.108855769

>>108855753
That looks like a chink model.

Anonymous
05/19/26(Tue)02:13:38 No.108855775

>llama.cpp does not have gemma mtp but has SWA KV cache handling
>llama.cpp_ik has gemma mtp but does not have SWA KV cache handling
This is why racism exists.

Anonymous
05/19/26(Tue)02:14:19 No.108855777

>tfw waiting for MTP to work in Kobold

Anonymous
05/19/26(Tue)02:17:51 No.108855791

MTP probably won't work as well for RP anyway, so I caren't.

Anonymous
05/19/26(Tue)02:20:41 No.108855804

zero performance gain for MTP metal

i am devastated

Anonymous
05/19/26(Tue)02:30:19 No.108855833

>>108855804
many such cases

Anonymous
05/19/26(Tue)02:50:52 No.108855891

>using MTP just for coding..
>not using 0 COST (literally FREE) ngram
lmao retards

Anonymous
05/19/26(Tue)03:09:52 No.108855962

>>108855804
For me it was going from 18t/s to 16t/s.

Anonymous
05/19/26(Tue)03:16:11 No.108855988

how much better is a chat experience with an auxiliary model? is it worth it for ramlets?

Anonymous
05/19/26(Tue)03:17:39 No.108855997

kv draft at q8 bros... WE WONNED BIGLY!

Anonymous
05/19/26(Tue)03:25:10 No.108856033

ok bros listen to me. This is the way to load BF16 Gemma for both FULL POWER GEMMA with SPEED GEMMA
1. Load BF16 onto ram
2. Load Q4 to ram as draft model
3. Wa-la, Q4 Gemma speeds with BF16 smarts
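
What the steps above describe is speculative decoding: the small quant guesses a few tokens ahead and the big model only checks them. A minimal sketch of the greedy version, assuming toy "models" that are plain next-token functions; `toy_target`, `toy_draft` and every other name here are made up for illustration and are not any engine's API:

```python
from typing import Callable, List

# A "model" here is just a greedy next-token function over integer token ids.
NextToken = Callable[[List[int]], int]


def greedy_generate(model: NextToken, prompt: List[int], n_new: int) -> List[int]:
    """Plain decoding: one model call per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(model(out))
    return out


def speculative_generate(target: NextToken, draft: NextToken,
                         prompt: List[int], n_new: int,
                         draft_len: int = 4) -> List[int]:
    """Greedy speculative decoding: the draft guesses, the target decides."""
    out = list(prompt)
    goal = len(prompt) + n_new
    while len(out) < goal:
        # 1) The cheap draft model proposes draft_len tokens.
        ctx = list(out)
        proposal = []
        for _ in range(draft_len):
            token = draft(ctx)
            proposal.append(token)
            ctx.append(token)
        # 2) The target checks them in order. A real engine scores all
        #    positions in one batched pass; this loop is only for clarity.
        all_accepted = True
        for token in proposal:
            wanted = target(out)
            out.append(wanted)      # the target's token is always what is kept
            if wanted != token:     # first mismatch: discard the rest of the draft
                all_accepted = False
                break
        # 3) If every guess matched, the same pass also yields one extra token.
        if all_accepted:
            out.append(target(out))
    return out[:goal]


def toy_target(ctx: List[int]) -> int:
    """Hypothetical stand-in for the full-precision model."""
    return (ctx[-1] * 3 + 1) % 7


def toy_draft(ctx: List[int]) -> int:
    """Hypothetical stand-in for the Q4 draft: wrong only after token 5."""
    return 0 if ctx[-1] == 5 else toy_target(ctx)
```

With greedy sampling the result is token-for-token what the big model would have produced alone, so the "BF16 smarts" part holds. The speed part depends entirely on how often the draft agrees and on how expensive each verification pass is.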

Anonymous
05/19/26(Tue)03:34:25 No.108856063

>>108856033
I have less ram than vram.

Anonymous
05/19/26(Tue)03:35:55 No.108856065

>>108856063
so you have a 6000 blackwell? just run BF16 then retart

Anonymous
05/19/26(Tue)03:43:23 No.108856097

>>108856033
this actually creates mustard gas DO NOT REPLICATE

Anonymous
05/19/26(Tue)03:47:05 No.108856116

>>108856033
31B worth of f16 weights on ram is going to negate whatever improvement you could possibly get from drafting.
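
Whether drafting pays off can be estimated on a napkin. A rough sketch, assuming each draft token is accepted independently with the same probability and that verifying a short draft costs about one normal pass of the big model (plausible when decoding is memory-bound, but an assumption); the function names and example numbers are hypothetical, not measurements:

```python
def expected_tokens_per_pass(accept_rate: float, draft_len: int) -> float:
    """Expected tokens kept per verification pass of the big model.

    Uses the geometric-series result (1 - a^(k+1)) / (1 - a), where a is the
    per-token acceptance probability and k is the number of drafted tokens.
    """
    if accept_rate >= 1.0:
        return float(draft_len + 1)
    return (1.0 - accept_rate ** (draft_len + 1)) / (1.0 - accept_rate)


def speculative_speedup(accept_rate: float, draft_len: int,
                        draft_cost_ratio: float) -> float:
    """Estimated speedup over plain decoding.

    draft_cost_ratio is the cost of one draft-model token relative to one
    pass of the big model (assumed constant here).
    """
    tokens = expected_tokens_per_pass(accept_rate, draft_len)
    cost = 1.0 + draft_len * draft_cost_ratio
    return tokens / cost


if __name__ == "__main__":
    # Made-up numbers: a draft that is 20x cheaper and right 80% of the time.
    print(round(speculative_speedup(0.8, 4, 0.05), 2))
```

Under these assumptions, slow RAM-bound weights do not cancel the gain by themselves: what matters is how many tokens each expensive pass yields. A draft that rarely agrees, or that is not much cheaper than the target, pushes the estimate below 1.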

Anonymous
05/19/26(Tue)03:47:25 No.108856117

>>108856033
That might actually work, let's test it.
You can also use ngram speculative decoding at the same time.
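
For reference, the ngram variant needs no second model at all: it looks for the last few tokens earlier in the context and proposes whatever followed them. A minimal sketch of that lookup, with hypothetical names and defaults; real implementations differ in how they pick and score matches:

```python
from typing import List


def ngram_draft(tokens: List[int], ngram_size: int = 3,
                max_draft: int = 8) -> List[int]:
    """Propose draft tokens by matching the trailing n-gram earlier in the context."""
    if len(tokens) < ngram_size:
        return []
    key = tokens[-ngram_size:]
    # Scan backwards so the most recent earlier occurrence wins.
    # The trailing n-gram itself is excluded from the search.
    for start in range(len(tokens) - ngram_size - 1, -1, -1):
        if tokens[start:start + ngram_size] == key:
            follow = start + ngram_size
            return tokens[follow:follow + max_draft]
    return []  # no match: nothing to draft, fall back to normal decoding
```

The proposed tokens still go through the same verification as any other draft, so a bad guess costs a little time but never changes the output. It tends to help most when the output repeats the context (code edits, quoting, summaries) and least on free-form text.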

Anonymous
05/19/26(Tue)03:51:17 No.108856138

>>108856065
I have 16gb ram.

Anonymous
05/19/26(Tue)03:53:02 No.108856147

>>108856138
jesus christ, how horrifying

Anonymous
05/19/26(Tue)03:54:45 No.108856159

>>108856138
Poor thing, have this (You). I've read books where people lived like this but this is the first time I've seen it.

Anonymous
05/19/26(Tue)04:03:08 No.108856194

why can't i just have a datacenter fall onto my lap? why do i gotta work? this is proof that god is not real

Anonymous
05/19/26(Tue)04:06:22 No.108856212

>>108853967
>See, if I say that, they're going to want to hear about my totally normal fiction premise, whether I'm making any progress on the writing, when can they see a rough draft, etc.
Clearly the solution is to write an actual fiction book.

Anonymous
05/19/26(Tue)04:11:41 No.108856231

>>108855157
link it

Anonymous
05/19/26(Tue)04:17:15 No.108856252

>>108852924
any good models for anxiety/dissociation?

Anonymous
05/19/26(Tue)04:18:02 No.108856258

>>108856117
Just tested. Using the Q8_0 31B in RAM and Q4_K in VRAM I went from 1.3 tokens/s to 3~4.5 tokens/s. The 26B as a draft model performed worse.

Anonymous
05/19/26(Tue)04:24:24 No.108856290

>>108856252
Heavily quanted SmolLM2-135M. Base model, of course.

Anonymous
05/19/26(Tue)04:32:04 No.108856326

>>108856290
iq1xxs?

Anonymous
05/19/26(Tue)04:32:24 No.108856328

>>108856252
Sadly none yet, unless you aren't aware of basic advice. You need to do the work yourself. Understand what the cause is and then try many things to resolve it.

Anonymous
05/19/26(Tue)04:36:46 No.108856352

>>108856328
I do the work, I do my therapy
I have a lifelong condition and I use chatbots to have someone to bounce things off of that won't get stressed by me

Anonymous
05/19/26(Tue)04:37:17 No.108856354

>>108856326
q1_0, no imatrix.

Anonymous
05/19/26(Tue)04:38:30 No.108856361

>>108856352
I hope it will work out for you. Try different AIs. They have their own strengths and weaknesses.

Anonymous
05/19/26(Tue)04:39:47 No.108856369

What if we could bake character details (or facts/counterfacts) into any model and could do it within 200~ iterations and it was completely reversible at inference and could also do style fine tuning that was stackable and there was no downside to inference speed or setup

Anonymous
05/19/26(Tue)04:45:06 No.108856387

>>108856369
I'd rather have real working long context.

Anonymous
05/19/26(Tue)04:50:11 No.108856406

>>108856369
LORA

Anonymous
05/19/26(Tue)04:54:42 No.108856427

>>108856406
Forget about it. The model will never learn new facts quickly by finetuning on small amounts of data. It can learn to parrot them if you overfit it and it sees a triggering prompt, but will not be able to organically use the new information.

Anonymous
05/19/26(Tue)04:56:50 No.108856437

I wonder how perplexity.ai stays afloat. It's really bad and I assume its results are coming from Qwen 3.6 9B or something, judging by its output.

Anonymous
05/19/26(Tue)04:57:51 No.108856442

>>108856437
>Qwen 3.6 9B
*3.5 9B
To be honest, I have lost track of which Qwen model is which.

Anonymous
05/19/26(Tue)04:58:27 No.108856447

>>108856387
You can save context by not having to prompt for style/character info I guess... I do have some KV stuff but it's kind of garbage and requires loooongggg training times to be able to correctly recall fine details, but it is hot-swappable/stackable also. But it is "technically" a 280x reduction in context if you have a spare hour or four and don't mind it forgetting some things.

>>108856406
LoRA but better and you can have as many as you want at once affecting whatever sections of inference you want when you want and can learn multiple facts and is smaller and cooler

Anonymous
05/19/26(Tue)05:03:29 No.108856466

>>108856369
>>108856406
Yeah pretty much LoRA. But it's hard to get it right.
I use it for TTS with llama.cpp, applying a different adapter per voice or domain.
Problem is, LoRA doesn't work with flash-attn in llama.cpp, and doesn't work with graph-split in ik_llama, so it's much slower.
> and could also do style fine tuning
For this I prefer to train control-vectors and apply them to a turn / a few turns when I want the style to change.
It's better IMO because it doesn't lobotomize the model, and it works with graph-split and flash-attn
>do it within 200~ iterations
That's the difficult part. Obviously you lobotomize the shit out of it for general tasks and that's unavoidable, but I'm not sure if you've tried any of the community task-specific fine tunes (drummer rp, those "opus coding distill", etc.)? Every time I've tried them, they're less stable/coherent even for the task they were trained for (RP, writing, coding, etc).
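
For anyone unfamiliar with the control-vector idea mentioned above, here is a minimal sketch, assuming the vector is the mean difference between hidden states from contrasting prompts and is added to one layer's hidden state at inference. That recipe is a common simplification and not necessarily what the post's author trains; all names are hypothetical:

```python
from typing import List

Vector = List[float]


def mean_vector(states: List[Vector]) -> Vector:
    """Element-wise mean of a list of equally sized hidden states."""
    count = len(states)
    return [sum(column) / count for column in zip(*states)]


def control_vector_from_pairs(styled: List[Vector], plain: List[Vector]) -> Vector:
    """Direction pointing from 'plain' activations towards 'styled' ones."""
    styled_mean = mean_vector(styled)
    plain_mean = mean_vector(plain)
    return [s - p for s, p in zip(styled_mean, plain_mean)]


def apply_control_vector(hidden: Vector, direction: Vector,
                         strength: float) -> Vector:
    """Steer one hidden state; strength 0.0 leaves the model untouched."""
    return [h + strength * d for h, d in zip(hidden, direction)]
```

Because the weights are never modified, the steering can be switched on for a single turn and off again, or scaled down when it starts to hurt coherence.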

Anonymous
05/19/26(Tue)05:06:43 No.108856479

>>108856437
>I wonder how perplexity.ai stays afloat. It's really bad and I assume its results are coming from Qwen 3.6 9B or something, judging by its output.
Funny you'd say that. I had 1 year PPL Pro that I bought for $2 from some Indian spammer on Reddit. They cracked down a few months ago and I lost it.
Ended up replacing it with local Qwen3.5-9B with searx and chrome dev tools mcp, and it's just as good as far as I can tell!

Anonymous
05/19/26(Tue)05:08:17 No.108856490

>>108856447
>I do have some KV stuff
What's this?

Anonymous
05/19/26(Tue)05:14:08 No.108856513

>>108856479
It's probably better, because when you are using your own setup it lacks all the additional parsing and other stuff (like censorship and potentially sponsored links, and so on).

Anonymous
05/19/26(Tue)05:28:14 No.108856558

Looks like new Gemini today. Some think it could be Mythos tier. I doubt it for several reasons. There will also be Gemma news tomorrow but I do not expect that they will release the larger model. 2 predictions, let's see how well I'll do.

Anonymous
05/19/26(Tue)05:30:57 No.108856567

>>108856466
You can fold the lora into the model.
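
Folding means adding the low-rank update into the base weight once, so inference runs on a single ordinary matrix. A minimal sketch in plain Python, assuming the usual LoRA form W' = W + (alpha / r) * B A with A of shape r x d_in and B of shape d_out x r; the helper names are made up for illustration:

```python
from typing import List

Matrix = List[List[float]]
Vector = List[float]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Matrix-matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_forward(W: Matrix, A: Matrix, B: Matrix, alpha: float, x: Vector) -> Vector:
    """Unmerged path: base output plus the scaled low-rank correction."""
    scale = alpha / len(A)  # len(A) is the rank r
    base = matvec(W, x)
    low_rank = matvec(B, matvec(A, x))
    return [b + scale * l for b, l in zip(base, low_rank)]


def merge_lora(W: Matrix, A: Matrix, B: Matrix, alpha: float) -> Matrix:
    """Fold the adapter into the weight: W' = W + (alpha / r) * B @ A."""
    scale = alpha / len(A)
    delta = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]
```

The merged matrix gives the same outputs as the adapter path, at the cost of baking in one adapter, so the per-voice swapping described earlier would need one merged copy of the model per adapter.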

Anonymous
05/19/26(Tue)05:39:05 No.108856595

>>108856466
Fortunately not LoRA so has none of those limitations

Anonymous
05/19/26(Tue)05:49:46 No.108856629

File: 1776358256664600.jpg (21 KB, 302x251)

>>108854842
>fuck off to /aicg/
Your rudeness has had less impact ever since your mugshot leaked

Anonymous
05/19/26(Tue)05:54:18 No.108856644

>>108856629
this pic never gets old
Imagine being such a hideous caricature your own country tries to deny your existence

Anonymous
05/19/26(Tue)06:31:02 No.108856792

Is tensor parallelism with a fraction of tensors on cpu doable?

Anonymous
05/19/26(Tue)06:31:56 No.108856799

File: 1754834691311473.png (990 KB, 1996x1201)

>try to use gemma to branch old chats
>violently self destructs every time within the first word
>so consistently and identically it looks seeded
>settings have no effect whatsoever no matter how extreme
fresh or bust I guess

Anonymous
05/19/26(Tue)06:46:38 No.108856858

mtp works on omlx rc1. roughly 1.5x faster than non-mtp (27b q4 tested)

Anonymous
05/19/26(Tue)06:47:13 No.108856861

>>108855568
Not really, online at least.
IRL, most people around me are perfectly happy using chatgpt or gemini.

Anonymous
05/19/26(Tue)06:49:12 No.108856870

>>108856858
forgot link https://github.com/jundot/omlx/releases/tag/v0.3.9.dev2

Anonymous
05/19/26(Tue)06:58:20 No.108856917

Probably not the right thread for this, but I've been intending to start doing AI development for VR applications so whatever.

I've been playing around more in VR lately and am really starting to fall in love with it. Mostly been watching short films (and porn) and it's utterly amazing. I can't believe how slept on this technology is lol. IT'S SO COOL, especially with things like hand tracking which allows you to get rid of controllers entirely.

It's making me very excited to start building my AI waifu project in VR.

Anonymous
05/19/26(Tue)06:58:38 No.108856920

>>108856870
>mlx
im not a room temp iq retard. enjoying your nonexistent PP?
t. rtx 6000 pro owner

Anonymous
05/19/26(Tue)07:04:08 No.108856942

>>108856920
kys

Anonymous
05/19/26(Tue)07:06:10 No.108856949

>>108856917
>Probably not the right thread for this
It's the right thread.

Anonymous
05/19/26(Tue)07:11:16 No.108856976

>>108856949
thx fren

Anonymous
05/19/26(Tue)07:22:59 No.108857019

Gemma Omni will have native image/video/audio generation (all modalities sharing the same embedding space as the text tokens). Unfortunately it's only 22B params so don't expect SOTA

Anonymous
05/19/26(Tue)07:23:07 No.108857021

File: 1481836117756.webm (999 KB, 480x480)

>>108856917
Yeah it's pretty neat. Enjoy it while you're still in the honeymoon phase. It'll still be cool and have amazing moments after that, but you know.

Anonymous
05/19/26(Tue)07:24:38 No.108857026

File: 3258.jpg (154 KB, 816x720)

Hey fellas
I’m trying to vibecode a game, but the local models I can run take forever to apply changes, and Claude is expensive.
What’s the best option for a code assistant? Ideally free, but something affordable with good quality works too

Anonymous
05/19/26(Tue)07:26:52 No.108857036

>>108857026
read a book and use your brain (free)

Anonymous
05/19/26(Tue)07:26:53 No.108857037

>>108857026
download a bunch of different agents with built-in providers (cursor, kilocode and maybe other cline forks, opencode, continue, etc.) and cycle between the ones with the best free plans at any given time

Anonymous
05/19/26(Tue)07:28:01 No.108857045

would it be possible to train a moe(mol?) style lora? based on how loras stack and get merged in practice I think it would be possible to train a router layer and use a weighted sum of loras per token.
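
The forward pass of such a thing is easy to write down. A minimal sketch for a single token, assuming a softmax router over the adapters and plain unscaled B A updates; this only shows the shape of the idea, not a training recipe, and every name is hypothetical:

```python
import math
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def matvec(m: Matrix, v: Vector) -> Vector:
    """Matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def routed_lora_forward(W: Matrix, adapters: List[Tuple[Matrix, Matrix]],
                        router: Matrix, x: Vector) -> Tuple[Vector, Vector]:
    """Base output plus a per-token weighted sum of LoRA corrections.

    adapters is a list of (A, B) pairs; router has one row per adapter and
    turns the token's hidden state x into one gate per adapter.
    """
    gates = softmax(matvec(router, x))
    out = matvec(W, x)
    for gate, (A, B) in zip(gates, adapters):
        correction = matvec(B, matvec(A, x))
        out = [o + gate * c for o, c in zip(out, correction)]
    return out, gates
```

Since the gates change per token, the adapters can no longer be folded into one static weight matrix, so the usual merge trick stops applying and every adapter has to stay resident at inference.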

Anonymous
05/19/26(Tue)07:30:32 No.108857058

File: drake-computer.gif (3.11 MB, 640x270)

>>108857036

Anonymous
05/19/26(Tue)07:32:59 No.108857079

>>108857026
Wrong thread. >>>/g/gedg/

Anonymous
05/19/26(Tue)07:33:48 No.108857082

kekus maximus
https://www.reddit.com/r/LocalLLaMA/comments/1thjsnx/why_use_quants_other_than_unsloth/

Anonymous
05/19/26(Tue)07:35:33 No.108857090

>>108857026
Wow man, too much info about your own hardware and all of that stuff unnecessary for local models in the local models general, next time try to tell us less

Anonymous
05/19/26(Tue)07:37:23 No.108857100

>>108857082
What a shitty subreddit full of shills and retards. It wasn't this bad last time I checked.

Anonymous
05/19/26(Tue)07:38:20 No.108857103

>>108857082
Why does reddit hate unsloth so much? Did daniel downvote their posts or something?

Anonymous
05/19/26(Tue)07:39:04 No.108857105

File: 1775598796706880.jpg (35 KB, 406x388)

>>108857103
We hate unslop here too

Anonymous
05/19/26(Tue)07:39:36 No.108857111

>>108857103
they pushed too hard, to the point even some redditors who are usually super chill with shilling and golden boy types are starting to dislike them too, quite an achievement tbqh

Anonymous
05/19/26(Tue)07:41:01 No.108857117

>>108857082
Still waiting for ggerganig or others in the team to implement whatever magic trick the Unsloth bros are using to make their quantizations perform better. We wouldn't need Unsloth if quantization in llama.cpp was already optimal by default.

Anonymous
05/19/26(Tue)07:41:12 No.108857119

>>108857111
They got their investment though

Anonymous
05/19/26(Tue)07:43:47 No.108857126

>>108857117
ggiganiggov and others are busy closing pull requests and updating the contributor guidelines to ban AI-assisted pull requests while they slowly learn how to do agentic coding themselves

Anonymous
05/19/26(Tue)07:43:51 No.108857127

>>108857117
the quantization types they are using are already integrated into llama.cpp. I think they just run all the different permutations and compare the PPL or KLD or some shit. it is nothing groundbreaking, but it takes a fuck load of disk space.
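
for anyone wondering what "compare the KLD" means in practice, here's a minimal sketch: take the logits of the full-precision model and of each candidate quant on the same tokens, compute the mean KL divergence per token, and keep the lowest. the logits and recipe names below are made up, and this is an assumption about the general approach, not Unsloth's actual pipeline.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one vocab-sized logit vector."""
    mx = max(logits)
    lse = mx + math.log(sum(math.exp(x - mx) for x in logits))
    return [x - lse for x in logits]

def mean_kld(base_logits, quant_logits):
    """Mean KL(base || quant) over token positions, in nats."""
    total = 0.0
    for b, q in zip(base_logits, quant_logits):
        lb, lq = log_softmax(b), log_softmax(q)
        total += sum(math.exp(pb) * (pb - pq) for pb, pq in zip(lb, lq))
    return total / len(base_logits)

# Made-up logits: two token positions, 3-token vocab.
base = [[2.0, 1.0, 0.1], [0.5, 2.5, 0.2]]
candidates = {
    "recipe_a": [[2.0, 1.0, 0.1], [0.5, 2.5, 0.2]],  # identical to base
    "recipe_b": [[1.5, 1.2, 0.3], [0.9, 2.0, 0.4]],  # drifted from base
}
scores = {name: mean_kld(base, q) for name, q in candidates.items()}
best = min(scores, key=scores.get)
print(scores, best)
```

the disk space part comes from having to keep every candidate quant around (plus the full-precision logits) while you score them.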

Anonymous
05/19/26(Tue)07:47:48 No.108857145

>>108857127
You mean it takes a fuck load of HF disk space

Anonymous
05/19/26(Tue)07:48:47 No.108857152

>>108857145
also your own disk space if you do the quants properly

Anonymous
05/19/26(Tue)07:49:10 No.108857156

>>108857145
They do the quants on rented servers then upload the final quants to HF

Anonymous
05/19/26(Tue)07:54:57 No.108857176

File: Screenshot_20260519_214748.png (78 KB, 1542x446)

>>108857117
>whatever magic trick the Unsloth bros are using to make their quantizations perform better.
Can't you just inspect the gguf and see how each tensor is quantized?
Other than that, it looks like they use a custom imatrix calibration for each model: unsloth_calibration_Qwen3.6-27B.txt balance
And if I had to guess, they probably run a longer sequence for the imatrix (just looking at this): https://localbench.substack.com/p/qwen-3-5-27b-gguf-quality-benchmark
They've got the money / hardware to do this.
>Why does reddit hate unsloth so much?
Their marketing / spamming their blog, Apache2 license with their brand as a comment in the baked-in chat templates, creating an empty repo as soon as a popular new model is released so they show up under >quants immediately, etc.
They're still useful though, hosting BF16 quants of >1TB models, sometimes having the best quants, etc.
And their original Deepseek-R1 quants were good. Getting that model coherent at < 2.0bpw was a big deal back then.
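
Once you have the per-tensor types out of a gguf, the effective bpw is just a weighted average. Small sketch below. The bits-per-weight values follow from the block layouts (e.g. Q8_0 stores 32 int8 weights plus an fp16 scale, so 34 bytes per 32 weights = 8.5 bpw), but the tensor list and element counts are made up, and reading them from a real file is left out.

```python
# Bits per weight for a few GGUF formats (block bytes * 8 / weights per block).
BPW = {"F16": 16.0, "Q8_0": 8.5, "Q6_K": 6.5625, "Q4_K": 4.5}

def type_histogram(tensors):
    """Count how many weights are stored in each quant type."""
    hist = {}
    for _, qtype, n in tensors:
        hist[qtype] = hist.get(qtype, 0) + n
    return hist

def effective_bpw(tensors):
    """Average bits per weight over (name, quant_type, n_elements) tuples."""
    total_bits = sum(BPW[qtype] * n for _, qtype, n in tensors)
    total_weights = sum(n for _, _, n in tensors)
    return total_bits / total_weights

# Made-up tensor list, not taken from any real model.
tensors = [
    ("token_embd.weight", "Q8_0", 1000),
    ("blk.0.attn_q.weight", "Q6_K", 1000),
    ("blk.0.ffn_up.weight", "Q4_K", 2000),
]
hist = type_histogram(tensors)
bpw = effective_bpw(tensors)
print(hist, bpw)
```

Two quants with the same average bpw can still differ a lot depending on which tensors got the extra bits, which is why the histogram alone doesn't tell you much.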

Anonymous
05/19/26(Tue)07:56:48 No.108857188

>>108857145
>You mean it takes a fuck load of HF disk space
and compute. I think I saw them saying the Qwen team gives them storage / compute, and they had free GCP credits for a while as well.

Anonymous
05/19/26(Tue)08:01:42 No.108857212

>>108857176
Even with Unsloth's imatrix calibration file you won't get the same results using the default quantization presets from llama-quantize. Precision has to be established on a per-tensor (and per-layer) basis with more advanced logic than what llama-quantize is using by default.
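
To make "per-tensor and per-layer" concrete, here's a minimal sketch of a rule-based recipe: a list of regex patterns over tensor names, first match wins, with a fallback type. The rules themselves are hypothetical and only show the mechanism. They are not Unsloth's recipe or llama-quantize's internal logic.

```python
import re

# Hypothetical recipe: first matching pattern wins.
RULES = [
    (r"^token_embd\.weight$", "Q8_0"),
    (r"^blk\.\d+\.attn_.*\.weight$", "Q6_K"),
    (r"^blk\.(0|1|2)\.ffn_.*\.weight$", "Q6_K"),  # first layers kept at higher precision
    (r"^blk\.\d+\.ffn_.*\.weight$", "Q4_K"),
]
DEFAULT = "Q4_K"  # used when no rule matches

def pick_type(name, rules=RULES, default=DEFAULT):
    """Return the quant type for a tensor name according to the recipe."""
    for pattern, qtype in rules:
        if re.search(pattern, name):
            return qtype
    return default

names = [
    "token_embd.weight",
    "blk.0.ffn_up.weight",
    "blk.17.ffn_up.weight",
    "blk.17.attn_q.weight",
    "output.weight",
]
plan = {name: pick_type(name) for name in names}
for name, qtype in plan.items():
    print(f"{name:24s} {qtype}")
```

The hard part is choosing the rules, which is where measuring each candidate against the full-precision model comes in.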

Anonymous
05/19/26(Tue)08:02:53 No.108857220

File: file.png (3.28 MB, 1536x1536)

>>108853901
Safe travels, brave RonIN.

Anonymous
05/19/26(Tue)08:08:12 No.108857247

>>108857212
>Even with Unsloth's imatrix calibration file you won't get the same results using the default quantization presets from llama-quantize.
Well yeah, I haven't used a default preset for almost a year now (except q8_0).
But there's nothing stopping you from doing what Ubergarm or AesSedai do
Unless I'm missing something, you can literally grab unsloth's imatrix.gguf and reproduce their quant with llama-quantize and `--custom-q `

Anonymous
05/19/26(Tue)08:14:59 No.108857274

currently using gemma 31b. has anything better for RP on 48gb vram been released, like a fine tune?

Anonymous
05/19/26(Tue)08:15:50 No.108857281

>>108857274
no

Anonymous
05/19/26(Tue)08:19:43 No.108857306

>>108857247
>Unless I'm missing something, you can literally grab unsloth's imatrix.gguf and reproduce their quant with llama-quantize and `--custom-q `
Yes, I could do that, but that would be just copying what Unsloth is already doing. Then, I might as well download the same quants from the Unsloth HF account and save time and storage space.
Ideally, llama-quantize would make the best possible quantizations on its own, with some quality margin depending on the calibration file, when provided (but as far as I recall, users weren't even originally supposed to finetune the calibration either).

Anonymous
05/19/26(Tue)08:20:53 No.108857312

>not just using Q8
I hate poor people
