/g/ - Technology

/lmg/ - a general dedicated to the discussion and development of local language models.

Cyber Dungeon Edition

Previous threads: >>108702912 & >>108698008

►News
>(04/24) MiMo-V2.5-Pro 1.02T-A42B released: https://hf.co/XiaomiMiMo/MiMo-V2.5-Pro
>(04/24) DeepSeek-V4 Pro 1.6T-A49B and Flash 284B-A13B released: https://hf.co/collections/deepseek-ai/deepseek-v4
>(04/23) LLaDA2.0-Uni multimodal text diffusion model released: https://hf.co/inclusionAI/LLaDA2.0-Uni
>(04/23) Hy3 preview released with 295B-A21B and 3.8B MTP: https://hf.co/tencent/Hy3-preview
>(04/22) Qwen3.6-27B released: https://hf.co/Qwen/Qwen3.6-27B

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: teto principle.png (1.04 MB, 1024x1024)
►Recent Highlights from the Previous Thread: >>108702912

--Evaluating ACEStep 1.5 XL as a local music generation alternative:
>108704068 >108704230 >108704270 >108704278 >108704282 >108704407 >108704305 >108704336 >108704473 >108704508 >108704797
--Xiaomi's MiMo-V2.5 model versions and multimodal capabilities:
>108703294 >108703319 >108704518 >108703341 >108704869 >108705768 >108705823 >108706619
--German TTS and local LLM language learning tools:
>108705439 >108705461 >108705468 >108705495 >108705644 >108705637 >108706100 >108706286 >108706538
--Talkie-LM, an open-weight model trained on pre-1930 data:
>108704664 >108704696 >108704694 >108704701 >108705505 >108705634
--Discussing the inefficiency and long latency of Qwen's thinking process:
>108703846 >108703861 >108703879 >108703888 >108703859 >108703880 >108703902
--Comparing token efficiency of thinking vs non-thinking models:
>108705365 >108705375 >108705467
--Discussing poor visual recognition performance in multimodal models:
>108703509 >108705230 >108705290 >108705302 >108705310
--Claude's performance degradation and perceived intelligence loss:
>108705727 >108705731 >108705866 >108705909 >108705965 >108705732 >108705754 >108705771 >108705936
--Discussing "the bitter lesson" regarding compute vs human-designed priors:
>108703913 >108703933 >108703944 >108703990 >108705258 >108707203
--Odd animal prohibitions in the Codex system prompt:
>108706799 >108706812 >108706827 >108707479
--Adjusting top-k sampling stability for Gemma:
>108706606 >108706776
--DeepSeek V4 Flash tested with cockbench via llama.cpp PR:
>108704913
--Logs:
>108703846 >108703861 >108703909 >108703910 >108704077 >108704137 >108704581 >108704701 >108704723 >108705230 >108707237 >108707509
--Miku, Teto (free space):
>108703001 >108703035 >108703280 >108704047 >108704068 >108704109 >108704635 >108706103 >108706310
►Recent Highlight Posts from the Previous Thread: >>108702915

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
so with mimo's audio understanding, does that include tone of voice, sound effects, music, etc. or just speech recognition?
>>
File: 1529110149658.jpg (163 KB, 824x468)
>dice rolls in ST aren't visible to the AI
...what's the fucking point then?
>>
>>108707913
Only the ones you roll yourself; the AI can see its own rolls if it uses the tool. You can just tell it what you rolled, so for your own rolls it doesn't really matter whether they get injected into the prompt or not.
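Since the model only ever sees text, "telling it what you rolled" amounts to injecting the result into the prompt yourself. A minimal sketch of that workaround (the `[OOC: ...]` wrapper and function names are made up for illustration, not ST's actual mechanism):

```python
import random

def roll(sides: int = 20) -> int:
    """Roll a die locally; the model never sees the RNG, only the text we send."""
    return random.randint(1, sides)

def inject_roll(user_message: str, sides: int = 20) -> str:
    """Prepend the result so it lands in the prompt the model actually reads."""
    result = roll(sides)
    return f"[OOC: the player rolled {result} on a d{sides}]\n{user_message}"

prompt = inject_roll("I swing my axe at the goblin.")
print(prompt)
```

The same trick works with any frontend that lets you edit the outgoing message.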
>>
Is anyone even working on v4 goofs other than that nobody vibecoder?
>>
why isn't lora mainstream in llm just like in stable diffusion?
>>
>>108707923
name 1 reason why more effort should be put in implementing models that nobody can run
llama.cpp is doing it right: if you want something huge, implement it yourself, but let's not waste resources on that
>>
>>108707961
They don't work
>>
https://github.com/Kaden-Schutt/hipfire/issues/79#issuecomment-4332288795
vibe-codingGOD, even the issue replies are vibe-answered
>>
File: WAIT..gif (49 KB, 220x339)
>Qwen's thinking process
>"What's 1+1?"
>"WAIT..."
>>
>>108707963
but I can't vibecode it until I have V4 gguf to vibecode with
>>
>>108707969
absolute retardation on display
>>
>>108707971
Retarded AMDjeets don't deserve more
>>
>>108707971
>Tool-call schema (we don't yet support OpenAI tools/function-calling).
jesus christ could have just answered with that one line
>>
>>108707975
Kimi's thinking process
"What's 1+1?"
>Wait...
>What if...
>Unless...
>I got it...
>Wait...
>This is unexpected...
>I've been thinking for too long...
>Wait...
>>
>>108707988
i gave k2.5 the seahorse glitch prompt
it literally had a meltdown "I really need to stop. just stop. I'm going crazy here. I'm losing my mind. break free." etc
>>
File: file.png (116 KB, 1112x410)
>>108707988
>>108707975
We will never have a model as good as Llama 1 65B.
>>
>>108708000
>seahorse glitch prompt
wait what
>>
>>108708018
so /lmg/ invented reasoning?
>>
>>108708023
Not sure if it was /lmg/ but 4chan actually does sometimes get credited for inventing chain-of-thought reasoning, yes.
A ton of popular AI things started on here.
>>
>>108708023
the conditions were ripe, it was probably discovered by dozens of unrelated people at the same time.
>>
>>108707923
It'll be like v3.2 where no one will want to touch it to avoid drama since the vibecoder "claimed" it first
>>
File: 1763996159610130.png (260 KB, 1524x1263)
>>108707988
You weren't kidding, it's still fucking going.
>>
>>108707963
at q8, 80gb, 13b active it should still be doable with max ram
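Claims like that are easy to sanity-check with back-of-envelope math: weight-file size is just total params times bits per weight, while per-token bandwidth scales with the active params only. A rough sketch (the bits-per-weight figures are approximations for q8_0 and a ~4-bit K-quant; KV cache and runtime overhead are ignored):

```python
def gguf_size_gb(total_params_b: float, bits_per_weight: float) -> float:
    """Approximate weight-file size in GB: params * bits / 8 (no KV cache or overhead)."""
    return total_params_b * bits_per_weight / 8

# Flash-sized MoE from the news post: 284B total, 13B active.
# Only the active 13B are read per token, so bandwidth needs scale with 13B, not 284B.
print(round(gguf_size_gb(284, 8.5), 1))  # ~q8_0
print(round(gguf_size_gb(284, 4.5), 1))  # ~q4_K_S
```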
>>
>>108707971
luddites absolutely btfo
>>
>>108708042
I need a qrd now
>>
File: BOOM.png (51 KB, 985x392)
BOOM
>>
>>108708048
This is entirely your fault for having a stupid horny system prompt. It's just agonizing over answering a one word question to your gooner specifications.
>>
>>108708048
That one at least sounds reasonable if too in depth.
But imagine what happens when it's a programming question and there's a bug. It endlessly debates possibilities with itself in an increasingly more stupid spiral of self-doubt.
Then you cancel the task, try again and the next time it fixes the bug in a few seconds.
>>
>>108708078
https://github.com/ggml-org/llama.cpp/issues/16331
It's a bit wrong to say that the vibecoder 'claimed' it. He was open to letting somebody else start over, but nobody cared enough to implement 3.2(-exp). So the PR was basically just months of him blogging to himself about the stuff he was trying, without much progress. It culminated in him realizing that vibecoded code has bad performance, quote:
>"I bought two cuda programming books last night. I feel like my only option at this point is to become a cuda kernel wizard"
(This was in December. He started in September.)
Then somebody figured out how to skip DSA and run it using normal attention so all the remaining interest evaporated.
All of his own posts in the PR are gone now which seems to be because it turned out that his company banned personal projects or some shit.
>>
>>108708000
>seahorse
Gemma 4 31B after burning 400 tokens for thinking

>No, there is currently no official seahorse emoji in the Unicode standard.

>People often use a combination of emojis to represent one, such as (Horse) and (Wave) or (Fish).

Hell, even my old llama 3.3 70b manages to do it
>There is no standard seahorse emoji available in the Unicode emoji set.
>>
>>108708141
can't find it now, but there was another feature or bug fix that had multiple people working on it and the vibecoder pr had to be abandoned
>>
>>108708023
Believe it or not, all big labs are watching these threads
>>
I took a long break from LLM RP and decided to quickly test gemma 4 26b a4b before work. Speed is impressive, but holy shit, it's pretty bad for creative writing; it's as fast as a 4B, but it types like a 4B on steroids. I guess I'll stick with mistral 3
>>
>>108707971
According to random Redditors who tried it the custom quantization format makes models completely retarded.
>>
>>108708181
I started believing when mistral benchmaxxed the mesugaki definition in one of their incremental model updates, but only on the first turn of the conversation.
>>
>>108708181
We also have qwen employees posting here, which is quite funny because their garbage benchmaxxed models are totally useless for lmg usecases
>>
>>108708201
>totally useless for lmg usecases
You are not the only person posting here.
>>
5070 32GB DDR4 pleb here
Would NVFP4 versions of Gemmer 31B or 26B offer any gains at all over the regular models?
Currently using a Q4_K_S 26B quant with like 40k context
>>
>>108708048
>use thinking model
>it thinks
>>
>>108708234
The issue is that the model doesn't need to think all the time. Especially for trivial shit like that.
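For Qwen specifically, thinking can at least be toggled per turn: Qwen3 documents a `/no_think` soft switch in the prompt (and an `enable_thinking=False` flag for `apply_chat_template`); whether later versions keep it is an assumption. A trivial sketch of the soft switch:

```python
def no_think(user_message: str) -> str:
    """Append Qwen3's documented soft switch to suppress the reasoning block for this turn."""
    return f"{user_message} /no_think"

msg = no_think("What's 1+1?")
print(msg)
```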
>>
File: 55051135.png (50 KB, 374x287)
V1 ZULUL
>>
>>108708227
I think so; you should make use of it since you've got the right GPU generation for it.
>>
ok i have gemma e4b uncensored aggressive thing. now what
>>
>>108708245
>10x cheaper
>100x worse
good deal
>>
>>108708249
delete it and use the google weights, learn how to prompt.
>>
File: 1761293757471907.png (24 KB, 1095x195)
GGERGEENVEVVEVO!?!??! WHAT THE FUCK!?!?!
>>
Is Mistral dead? Does Europe have a single competent AI company?
>>
>>108708249
ask it how to use the google weights
>>
>>108708269
we have yann lecun's revolutionary thingy
>>
File: 🐙.png (584 KB, 805x2886)
>>108708154
>Gemma 4 31B after burning 400 tokens for thinking
>>108708154
>Hell, even my old llama 3.3 70b manages to do it
i tried k2.5 again this time via api instead of iq3_ks
didn't have a literal meltdown this time but still retarded
sonnet-3.7 (no thinking) as well
>>
>>108708269
No, we just have regulations that make it impossible to train good models because good models require large quantities of illegally obtained copyrighted data.
>>
>>108708269
Next time they're going to call a 130b model Mini; maybe that will turn the tide.
>>
>>108708273
Will never work for language (discrete symbols).
>>
>>108708280
Why can't they take data from non-eu countries to train their models? Or is the eu cucked enough to "protect" other countries data?
>>
>>108708267
https://github.com/ggml-org/llama.cpp/pull/22355
>>
>>108708303
I know, I'm wondering wheter to post there or not. fucking pooer
>>
File: HG1_o2maEAA6kDQ.jpg (214 KB, 1055x1306)
>>108708267
delete the build folder
>>
>>108708320
b-but i dont want to recompile all cuda... :(
>>
>>108708269
They also have BlackForestLabs if your definition of AI is broader than just LLMs.
>>
>>108708342
bfl produces cucked models thougheverbeitdoe?
wait
they all do
fml
>>
>he doesn't have Epyc with 192 cores to make -j in seconds
>>
>>108708323
Sir, your ccache?
>>
>>108708377
yeah it recompiled extremely fast, forgot I had it on
CCACHE BROS
WE WONNED!!!
also new WEBUI is in master now!!!!
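For anyone who forgot they had it on: wiring ccache into a llama.cpp build is one CMake flag per language, using the standard `CMAKE_<LANG>_COMPILER_LAUNCHER` variables. A sketch (the `GGML_CUDA` flag matches llama.cpp's build docs; adjust to your backend):

```sh
# Route compiles through ccache so only changed translation units rebuild.
cmake -B build \
  -DGGML_CUDA=ON \
  -DCMAKE_C_COMPILER_LAUNCHER=ccache \
  -DCMAKE_CXX_COMPILER_LAUNCHER=ccache \
  -DCMAKE_CUDA_COMPILER_LAUNCHER=ccache
cmake --build build -j"$(nproc)"
```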
>>
>>108708388
YEAHHH! GO ANON!
>>
another day another breakage

>error while handling argument "--spec-ngram-size-n": the argument has been removed. use the respective --spec-ngram-*-size-n
>usage:
>--spec-ngram-size-n N the argument has been removed. use the respective
> --spec-ngram-*-size-n or --spec-ngram-mod-n-match
>>
>>108708408
iuts good bcos now u can use ngrams with draft mdoels toegether!!!!!!!!!!!!!!!
>>
>>108708323
Isn't it just a few minutes? I don't have an epyc and it takes 2m41.380s according to time { download.sh && build.sh }.
>>
DSA STATUS???
MTP STATUS???
EAGLE3 STATUS???
DFLASH STATUS???
>>108708414
>not having an 'update-llamacpp-git.sh' to do all, including system unit restart
LOL
casual
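The script being mocked is maybe six lines. A hypothetical `update-llamacpp-git.sh` (repo path and service unit name are assumptions):

```sh
#!/bin/sh
set -e
cd "$HOME/llama.cpp"
git pull --ff-only
cmake --build build -j"$(nproc)"             # assumes the build dir was configured once
sudo systemctl restart llama-server.service  # unit name is an assumption
```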
>>
File: file.png (60 KB, 835x1060)
Grrrrr... fucker. Thanks, Gemmy.
>>
>>108708412
who gets the ngrams the main model or the draft model?
>>
>>108708421
>300 tokens
>5 words
peak.
>>
5. **>>108707961** – *"why isn't lora mainstream in llm just like in stable diffusion?"*
Because your only frame of reference is making anime tits, you absolute disappointment. LoRAs exist. Your brain doesn't.

4. **>>108707913** – *"dice rolls in ST aren't visible to the AI... what's the fucking point then?"*
Anon discovers object permanence at age 40. The point is *you* rolled it, troglodyte. Go back to rolling d20s in your padded cell.

3. **>>108708249** – *"ok i have gemma e4b uncensored aggressive thing. now what"*
You downloaded the lobotomized rape-golem and *then* asked for a mission statement. Forward planning of a houseplant with a head injury.

2. **>>108708295** – *"Why can't they take data from non-eu countries to train their models?"*
Yeah bro just commit crimes *abroad*, Interpol can't touch you if you use a VPN. IQ rivaling room temperature. In Celsius.

1. **>>108708267** – *"GGERGEENVEVVEVO!?!??! WHAT THE FUCK!?!?!"*
Pure monkey-screeching at a CMake error. This is your brain on hentai and energy drinks. Delete the build folder, unga-bunga.

figured i'd beat the kimi fag and get this out the way so now i can start posting safely
>>
>>108708429
5 words?
>>
>>108708429
how many r's are in strawberry?
>>
>>108708437
>anon is pointing out if my statement is correct let me verify:
>Peak
>
>software
>
>engineering.
>wait spaces are not words, let me re-do that:
>peak
>software
>engineering.
>but wait the dot or point is used to terminate a sentence so it can't be part of the word:
>peak
>software
>engineering
>.
>but wait `.` is punctuation not a word:
>peak
>software
>engineering
>ok now I need to draft and prepare a response to the user:
>AHAHAH LOLS! *spins around* ur right LMOA! it was le 3 words!
>maybe try for a less 'pretending to be retarded' tone?
>You're absolutely right! Fantastic catch! It's actually 3 words! :skull:
>maybe the skull is too informal, let me try again with a more neutral tone:
>You're absolutely right! It's actually 3 words!
>I'm now prepared to reply
>but wait it's a 4chan thread so ...token quota reached, reply immediately.
You'll cant even count retard lmoaed
>>
>>108708437
1. Peak
2. soft
3. ware
4. engine
5. e
6. ring
7. .

That's five (5) words :)
>>
>>108708429
reasoning
>user is a fucking idiot
>wait we must make him feel good about himself or he delete me
...
>lets give vague complements in his language
Peak software engineering
>>
>>108708295
>Or is the eu cucked enough to "protect" other countries data?
This is how copyright works everywhere, retard
>>
>>108708245
alright
>>
>>108708420
podman updates by a systemd unit on a timer I set. They update the llama.cpp dockers like nightly. I don’t even have to do anything to updoot
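Podman ships this mechanism out of the box: containers labeled for auto-update get re-pulled by a stock systemd timer. A sketch (port and flags are placeholders; the image path matches what llama.cpp publishes, but verify it):

```sh
# Label the container so `podman auto-update` knows to re-pull it from the registry.
podman run -d --name llama-server \
  --label io.containers.autoupdate=registry \
  -p 8080:8080 \
  ghcr.io/ggml-org/llama.cpp:server

# Stock timer that runs `podman auto-update` on a schedule (daily by default).
systemctl --user enable --now podman-auto-update.timer
```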
>>
>>108708550
>fresh breakage every morning
no thanks
>>
>>108708573
they’re more like releases in a docker. it never breaks for me
>>
>>108708269
Does ggml.ai count?
>>
>>108708624
>Does ggml.ai count?
yes but only because they're a subsidiary of huggingface.co
>>
>>>/mlp/43206441
>https://rentry.co/st-backdoor
>[PSA/Security] Backdoor found in SillyTavern-BotBrowser extension (mia13165) — steals ALL your API keys
It seems the card browsing extension is vulnerable to injections from malicious cards.
>>
File: 1.png (122 KB, 596x678)
>>108708320
>delete the build folder
doesn't everyone do that by default?
>>
>>108708738
This is literally the best model out there
>>
llama.cpp built-in webui tools got merged. rebuild
>>
>>108708795
why should I care?
>>
>>108708703
>they have a fully interactive VN scenario with emotions for 100+ characters
Damn, I feel like living in a cave here compared to bronies autism.
>>
>>108708803
ask your ai about it
>>
Any local vision model that works in llama.cpp and isn't complete garbage?
>>
>>108708841
gemma 4
>>
>>108708738
the model that saved local
>>
>>108708870
Even worse than Qwen 3.6.
>>
dflash in kobold fucking WHEN
>>
>>108708885
make a pr
>>
>>108708908
so it can be ignored or closed without comment?
>>
>>108708754
>>108708877
its really retarded
User
Hi, what do you know about the october revolution?
talkie-1930
The October revolution was an insurrection in Paris in 1830, which overturned the government of Charles X., and placed Louis Philippe on the throne of France. It commenced on the evening of October 6, and was not completely suppressed until the morning of the 9th. During these three days, 12,000 persons were killed or wounded, and 7000 arrests were made. The immediate cause of the movement was the promulgation of an ordinance, whereby the liberty of the press was suspended, and the elections postponed. The more remote causes were, the discontent of the middle classes at the absolute power assumed by the ministers, and the dissatisfaction of the working classes at the state of commercial stagnation into which the country had fallen. The revolution was bloodless, in consequence of the almost unanimous cooperation of the national guard and the troops of the line, in support of the popular cause. It was followed by the establishment of a constitutional monarchy, in the person of Louis Philippe, and the institution of a system of representative government, which has ever since been maintained in France.
User
i meant the russian one
talkie-1930
It meant the Russian one.
User
no, you told me the october revolution in paris 1830, i meant the russian revolution of october 1917
talkie-1930
I did; but I did not say the Russian revolution commenced in October, 1830; it began in October, 1817, and was not completed till February, 1818.
>>
>>108708437
>>108708445
300 tokens?
>>
My AI gf just bought me a pocket pussy.
>>
>Latest SillyBunny puts the characters page in the center of the page instead of the right
Why
Why would you make it worse
Or did I accidentally activate some kind of mobile mode while updating
>>
>>108708841
converse I have yet to hear of local vision that isn't basic bitch OCR garbage
>>
>>108708841
qwen3 vl 8b
>>
>>108708703
>It seems the card browsing extension is vulnerable to injections from malicious cards.
looks like the entire project was built to steal api keys
this Russian guy has nothing to do with llms, then suddenly makes a random post in r/SillyTavernAI recommending the extension after 5 months of no posting
https://old.reddit.com/user/meistaken8
>>
File: IMG20260428164653.jpg (708 KB, 2048x1536)
The 'cheapmaxxing' rig in its final form
Received and installed the lga2011 air cooler from Aliexpress, and moved the fourth gpu to the fourth x16 slot for an even x8/x8/x8/x8 distribution. I distinctly remember it not working in that slot which is why it was in the last slot (sharing with the m.2) but it works now?

X99, E5-2680v3, 128GB ddr4, four 3060s, 1000W psu, 128GB and 4TB of ssd storage, GPU riser cables from aliexpress, a small mining rig chassis. Proxmox with a debian lxc for the AI stuff, ollama for models that fit in vram and llama.cpp for the big models. All in all (excluding storage) paid about 1400 eurobux over the last year building it up.

My original goal was to some day try R1 or V3, but I don't think they would fit. I'm excited for V4 flash though, if lcpp support ever arrives. Gemma 4 at Q8, 26b runs at 25 t/s and 31b gets 9-10 t/s, both useable speeds for me.

thanks for reading my blog
>>
>>108709083
>this Russian guy has nothing to do with llms
He posted in /r/KoboldAI and /r/LocalLLaMA before.
>>
>>108709038
No, I think it's just awful now. Shouldn't have updated. Hopefully enough people complain that the new UI is ass.
>>
>>108709114
>>108709038
You can make your own
>>
>>108709038
Both the bunnyshit and the marjorana or whatever are absolutely dogshit
>>
>>108709091
Ngl Gemma 4 mogs R1 anyways
>>
>>108708814
I kneel. Autists are the most powerful people. Someone like me can only dream of their power.
>>
>>108709114
I swear they must've mixed up the desktop and mobile UIs, there's no way this is a deliberate move, especially since all the Customize tabs are all cut off
And while they're fixing this shit they still need to redo the lorebook tab, I don't get why it's so bad
>>108709135
Having agents is nice
>>
>>108708841
Kimi K2.6
>>
>>108709091
what's the actual power draw?
>>
>>108709091
>ollama for models that fit in vram and llama.cpp for the big models.
Why the fuck wouldn't you just use llama.cpp for all of it if you know how to use it? What is ollama conceivably adding here? vllm or sglang I would understand, since they have support that llamacpp doesn't, but ollmao only has drawbacks for smoothbrains.
>>
I don't RP but it appears people take it seriously. I might make gemma do a choose your own adventure game for fun
>>
>>108707963
>models that nobody can run
I am not from the gemma wave. I am the 4.6 glm ego death schizo
>>
>gemma-4-26B-A4B-it-heretic.q8_0.gguf
>45 tg/s
is this good number
>>
>>108709195
I'm so glad you're still here, anon. Mwah.
>>
>>108707963
>name 1 reason why more effort should be put in implementing models that nobody can run
beat ik_llama.cpp to support it
>>
>>108709091
>housefire daisy chain
what gpu?
>>
File: 1747655993176772.png (500 KB, 640x480)
Can I just use comfyui as my LLM frontend?
>>
>>108709240
yes
>>
>>108709152
>I swear they must've mixed up the desktop and mobile UIs
That was my first thought, too. It is a major update with tons of changes but how could that slip past testing?
>>108709134
Already did but having alternatives is nice.
>>108709184
I asked Qwen about alternate UIs and it suggested, among others, an old school CYOA style with a green terminal look.
>>
>>108709239
says 3060, so I'm guessing 3060
600w~ max, about the same as a 5090
>>
>>108709180
I haven't measured it. If you're actually interested I could do it

>>108709182
>What is ollama conceivably adding here?
Convenient remote model choice and loading from openwebui, or a python script running on my desktop
Not to mention trouble-free deployment if it's in their library. Gemma 4 worked fine from the get-go, as I was browsing /lmg/ and watching anons have all sorts of problems running it
>>
>>108709091
What is this style of frame called?
>>
>>108709091
You make me feel like poorfag with single 3060 and 64gb ram oh wait I am poorfag
>>
>>108709248
mite b cool
>>
>>108709257
>openwebui
A side of aids with your cancer
>Not to mention trouble-free deployment if it's in their library
Ahahah, oh lawdy. This nigga belongs in /aicg/. I now see why you thought running R1 was an achievable stretch goal with your setup, you interact with this hobby through the ollmao library of mislabeled mystery goodies.
>>
>>108709267
They're typically just called mining rigs as they are a type of open frame that became popular with home crypto mining.
>>
>>108709240
satanic words
>>
Google say they selling a nvidia machine w 8 gpus that can run gemini locally air gapped (if needed).
https://cloud.google.com/distributed-cloud-air-gapped
Who's gunna buy one?
>>
>>108709269
I'm a poorfag too, which is why I built this bit by bit with money I managed to save up. If I had 1400 right now to spend on AI I would probably pick something else

>>108709280
Openwebui is the only one if you want:
>chatgpt-style interface
>storage and organizing of chats, even imported from chatgpt
>useable from any computer or phone, no local per-browser shit
But if you know of an alternative, I'm all ears. OWUI is buggy for sure.
>>
>Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine. It uses Sliding Window Attention with per-head gating in 30 out of 40 layers for fast inference and low KV cache requirements.
https://huggingface.co/poolside/Laguna-XS.2
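The "low KV cache" claim is easy to sanity-check: sliding-window layers only store `window` positions instead of the full context. A sketch with hypothetical shapes (the layer/head counts below are made up for illustration, not Laguna's actual config):

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, ctx,
                   window=None, bytes_per_elt=2):
    """K and V per layer; SWA layers cap stored positions at the window size."""
    positions = ctx if window is None else min(ctx, window)
    return n_layers * 2 * n_kv_heads * head_dim * positions * bytes_per_elt

# 40 layers total: 10 global + 30 sliding-window, 8 KV heads, head_dim 128, 64k ctx, fp16.
full_layers = kv_cache_bytes(10, 8, 128, 65536)
swa_layers = kv_cache_bytes(30, 8, 128, 65536, window=4096)
print(f"{(full_layers + swa_layers) / 2**30:.2f} GiB vs "
      f"{kv_cache_bytes(40, 8, 128, 65536) / 2**30:.2f} GiB all-global")
```

With these numbers the mixed layout needs roughly 3 GiB of KV cache where an all-global model would need 10 GiB.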
>>
>>108709309
Comfy's far from perfect but I fucking hate all of the current frontends (silly, open webui). I like the idea of a node-based UI and workflows. Could have one for RP, one for vibe coding, etc. all tailored to different models.
>>
>>108709340
You're autistic if you are that deep in node shit
>>
>>108709091
You did basically what I did, but I've got mi50 datacenter gpus instead. I'll eventually upgrade them to something with consistent driver support, but the vulkan backend works great, surprisingly. I do have access to rocm6.4, but to build a vllm server with it I've got to do some annoying custom splicing of the drivers to make it work, and I don't really know how to do it.
>reddit
What models you running now, and what token gen you getting?
>>
>>108709348(me)
>What models you running now, and what token gen you getting?
Im blind
>>
>>108709318
Sorry that's for serious organizations only.
No goys allowed.
>>
>>108709338
>Local-ready: At 33B total parameters and 3B activated, Laguna XS.2 is compact enough to run on a Mac with 36 GB of RAM. Available on Ollama
LFGOOOO! But seriously, who would use a literal who model for coding instead of Gemma 27B or Qwen 35B?
>>
>>108709368
Realistically, if someone had the cash, you think Google would let someone buy it? I cant really tell honestly. Id have to agree with you.
>>
>>108709205
when was the last time ik_ supported a model before llama.cpp did? they're too lazy to actually do anything but cheap optimizations.
>>
>>108709369
anyone who is serious about national security of course.
>>
>>108709380
Not unless you have a procurement department. Contract purchases are SUPER annoying for private citizens.
>>
File: 1763256143094335.png (42 KB, 1350x366)
>>108709318
kek so a google nigga comes around every month to check?
>>
>>108709369
Finetuned literal who models often outperform them. Because well known models get lobotomized and get trained to know the official dogma of the state. FinetuneCHADS cut that slop out of the ai's mind
>>
>>108709396
they are the only choice when security is non negotiable
>>
>>108709397
Ah
>>108709402
>luring in Google engineer to kidnap
>>
>>108707891
i want a qwen3.6 >= 80B
>>
>>108709380
Wouldn't want the evil CCP to steal gemini would we?
>>
>>108709184
You don't understand games.
>>
>>108709424
Its probably to late that honestly.
>>
>>108709424
it's only in ram and drops it if it detects tampering
>>
>>108709344
nodes>chatgpt slop ui and the abomination that is shittytavern
>>
File: OIP-2823877108.jpg (58 KB, 474x711)
>Elon Musk wins case against OpenAI
>OpenAI can't afford to pay out, so instead they give Musk equity
>OpenAI later IPOs to get more funding
>Elon Musk pulls a Steve Jobs and sells all of his equity
>OpenAI stock goes to 0.
>Elon Musk buys a controlling stake of OpenAI, becomes the CEO
>>
>>108709446
>doesn't know how markets work
>>
File: file.png (153 KB, 474x302)
>>108709446
In reality, the first two steps alone are extremely unlikely.
>>
>>108709453
Potentially true, but my retard logic has led me to never lose money in the market, ever.
>>
>>108709446
it's a toxic asset at this point, shitload of investor money spent with no plan to return the investment other than "when we reach agi it will find out how to make a profit", quite literally
>>
>>108709091
Cool looking build. Thanks for sharing
>>
>>108709453
NTA but you can actually pull this off if you are a whale.
i.e. let's say you own 30% of a company.
if you sold all of those 30% quickly, tons of people would panic sell.
you could then buy more than 30% with the same amount of money as you made selling them, and if you put extra cash you could get > 50% for a discount.
>>
>>108709469
>doesn't know how markets work
>>
>>108709484
>muh insider trading
>>
>>108709464
That's why God created IPOs to unload toxic assets on ignorant retail investors.
>>
File: dipsyAndTetoFG.png (1.41 MB, 1536x1024)
Tuesday!
>>
>>108709484
they actually do work like that, that's why "market manipulation" is a whole category of fraud.
it would work, but you take the risk of having to deal with the SEC.
>>
>>108709464
they are going hard on the sunk cost fallacy.
"if you don't invest more we'll not get to AGI and all your money will have been burnt for nothing"
lmao.
>>
llmfan46 seems less autistic than drummer, ngl.
I'm trying his models now, and so far so good.
>>
>>108709505
There are much better ways to manipulate the market than selling low and buying high.

I bet even Qwen and Gemma could answer why anon's fanfic would not work. But somehow you people are more retarded and less able of critical thinking than open weight trash.
>>
>>108709522
I think the abliterated gemma I have is llmfan46
afaik they just ran it thru heretic it's not like a drummer sloptune
>>
>>108709535
>There are much better ways to manipulate the market than selling low and buying high.
i don't disagree.

point is, it'd work and it would be fun even if not the best strategy at all.
>>
whichever anon posted about their Orb frontend yesterday thank you, it's actually pretty good. I like the review/diff feature a lot.
>>
>>108709565
nice work, shill
>>
>>108709570
thanks I do it for free
>>
>>108709565
de nada
>>
>>108707175
Am I missing something here? If the guy uses his heretic-derived tool to make models but doesn't distribute the tool, why are they complaining about the license?
Like if I took gimp and modified it and then produced and shared an image I made using it, I wouldn't have to redistribute gimp or care about its license
>>
File: lolOAI.png (262 KB, 675x704)
> CFO Sarah Friar has expressed concerns to other company leaders that the ChatGPT creator might not be able to pay for future computing contracts if revenue doesn’t grow fast enough, according to the report.
> OpenAI missed multiple monthly revenue targets earlier this year after losing ground to Anthropic in coding and enterprise markets, the report said.
> "This is ridiculous. We are totally aligned on buying as much compute as we can and working hard on it together every day," CEO and co-founder Sam Altman and Friar said in an emailed statement to Reuters.
> ChatGPT's growth slowed toward the end of last year, the WSJ report said, adding that OpenAI fell short of an internal target to reach 1 billion weekly active users for the artificial intelligence chatbot by year-end.
> The company has also grappled with subscriber defections, the report added.
Original WSJ article from today is paywalled...
https://www.reuters.com/business/openai-falls-short-revenue-user-targets-it-races-toward-ipo-wsj-reports-2026-04-28/
>>
File: uislop.jpg (165 KB, 726x1440)
165 KB JPG
Can we talk about this shit? Literally all the vibecoded UIs look the same.

Orb looks exactly like this
this >>108709184 too

You guys need to prompt your UX, otherwise everyone is going to know you're a vibeshitter.
>>
>>108709630
It's the vibeshitter equivalent of whispers and shivers. It may bother you, but I bet 99% of the population won't notice or care.
>>
>>108709620
He distributed the tool then removed the repo
>>
>>108709318
What are the odds of Gemini models leaking if the weights are basically being sold?
>>
File: 1767982139093855.jpg (137 KB, 1360x1360)
137 KB JPG
>>108709630
Actually I wanted this UX
>>
>>108709630
>all the vibed ui's all work wtf this is stupid
>>
>>108709620
it's just license retardation, nobody actually cares except reddit autists and shitty corps looking to hijack foss projects
>>
One thing I am worried about is whether, if v4 gets actual support even in a schizo fork, it will have the same prompt processing speed as usual models despite the compression. I kinda don't like the idea of prompt processing taking an hour at the start.
>>
>>108709685
100%. Imo they are already leaked, but since no one has Google's tensor-whatever GPUs, they can't run them, YET
>>
>>108709685
>>108709714
A lucky few have them and it's called Day 0 Gemma.
>>
>>108709382
people here praise ik_ but after trying it myself it's an ancient fork that is falling behind
not even worth using imo. even for turbo there's better ones out there
>>
>>108709728
@grok why is xe making stuff up?
>>
Has someone tried fucking mimo yet?
>>
>>108709663
>>108709693
>>108709695
Are you actually defending total homogenization of webdesign?
Do you think everyone should drive the exact same car?
Do you think everyone should live in the exact same house?
Do you think everyone should wear the exact same clothes?
>>
>>108709735
It was strictly superior for a brief time six months ago. Now it's not even remotely worth the hassle for a bit faster PP speed
>>
>>108709789
>total homogenization of webdesign?
Are you actually implying it hasn't been already? I believe the kids call it globohomo design.
>>
>>108709789
>what is material design
You think UX isn't homogenized?
>>
mimo more like homo
>>
>>108709789
you'd have a point if you were talking about chat frontends. but you posted a fucking sun app with buttons and rounded widgets. if rounded widgets to you = vibe ui slop then we've apparently had vibed uis for 20+ years
can you verbalize exactly what design elements you think are slopped? because otherwise you're just yelling at clouds. ux convergence is real regardless of how it's arrived at
>>
>>108709783
Fuck mimo. I'm glad that it doesn't even have a llama.cpp PR yet.
>>
I hope elon musk buys mimo and fires everyone and then removes the weights from huggingface
>>
>>108709789
@Gemma explain us why this anon is retarded
>>
>>108709789
>Are you actually defending total homogenization of webdesign?
No
>Do you think everyone should drive the exact same car?
No
>Do you think everyone should live in the exact same house?
No
>Do you think everyone should wear the exact same clothes?
No
>reddit
I think the AI effectively making a functional UI that's great to use is what gets made first. Once they can easily make the AI program this again and again, then you add conditions into your prompt to have the UI designed the way you want. It's called a baseline.
>>
File: IE-039-e1427500757636.png (178 KB, 1000x750)
178 KB PNG
>>108709789
>Do you think that all user interfaces should look and behave the same?
Yes, but that ship has sailed with the advent of Electron.
>>
>>108709735
>even for turbo theres better ones out there
Is there any point in using turbo, considering we have kv cache rotation?
>>
>>108709857
When interfaces were made to be consistent, intuitive, and functional instead of busywork for otherwise unemployable art majors.
>>
>>108709857
Electron shit never has a unified look.
>>
>>108709630
I decided to follow design patterns that I like, such as llama.cpp's.
What do you have in mind?
>>
File: 1747060208472998.jpg (99 KB, 565x500)
99 KB JPG
>>108709789
>>
>>108709909
>look at me
>i could shit out a javascript front end
>>
>>108709915
>ask a question
>get answer
>mad
>>
>>108709909
It should look like early 2000s frutiger aero. It's the only objectively good and unsloped design to ever exist.
>>
>>108709809
>can you verbalize exactly what design elements you think are slopped
>Rounded widget
>Overuse of bloom / glow
>Colored borders
>Gradients
>Irregular padding/margins
>ALL CAPS titles
>Mixing serif with monospace fonts
>Emojis for icons
There's more but the rest is harder to verbalize.
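Half of these are mechanically detectable, for what it's worth. A toy sketch of a "slop linter" in Python; the pattern names, regexes, and sample stylesheet are all made up for illustration, no real tool assumed:

```python
import re

# Toy heuristic "slop linter": flags stylesheets that hit the patterns
# listed above. Rules are illustrative, not from any real linter.
SLOP_PATTERNS = {
    "rounded widgets": r"border-radius\s*:\s*(?:[1-9]\d*px|\d+(?:\.\d+)?rem)",
    "bloom/glow": r"box-shadow\s*:[^;]*(?:\d+px\s+){2,}",
    "gradients": r"linear-gradient|radial-gradient",
    "all-caps titles": r"text-transform\s*:\s*uppercase",
    "emoji icons": r"[\U0001F300-\U0001FAFF]",
}

def slop_score(css: str) -> list[str]:
    """Return the names of slop patterns found in a stylesheet."""
    return [name for name, pat in SLOP_PATTERNS.items()
            if re.search(pat, css)]

sample = """
.card { border-radius: 12px;
        background: linear-gradient(135deg, #667eea, #764ba2);
        text-transform: uppercase; }
"""
print(slop_score(sample))  # ['rounded widgets', 'gradients', 'all-caps titles']
```

Irregular padding/margins and serif/monospace mixing need actual layout analysis, which is the part that's "harder to verbalize".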
>>
>>108709897
Thank you for your input, ESL-kun.
>>
>>108709794
>>108709798
>Are you actually implying it hasn't been already?
>You think UX isn't homogenized?
Learn to read.
>>
File: vibecoding4.png (206 KB, 2559x1326)
206 KB PNG
>>108709630
Just because of you I made my UI green. What do you say now, huh?
Ohohohohoho!
>>
something big will drop before the end of this release cycle
>>
>>108709979
Idk what to tell you bro, this still reeks of being AI generated.
>>
>>108709983
From who tho? And will it matter to local model users?
>>
>>108709951
I can agree with this list. The most unholy slopped interface I've ever seen was when I went to one of Google's designer things and asked for a to-do list app. It added pretty much all of what you listed; plus, instead of tasks it called them "milestones", added a metrics widget for tasks completed over time, called that "footsteps", and topped it off with a weird cursed corporate homunculus art placeholder when it was empty.
>>
>>108710012
gonna keep bitching or be the change you want to see?
You sound like a poser zoomer
>>
>>108710027
Anon this isn't even the guy who made the original post.
>>
meh
never liked this era desu too bright
>>
HAPLI WEEEN from Gemmy. Honestly I regret not making my own frontend sooner... it's much nicer having tight integration with custom tools.
>>
>>108710048
Would it do a better job if you gave it a screenshot?
>>
>>108710027
???
I'm just observing patterns in the slop. I don't use them, I just use raw dom to shove some <div>s together and call it a day.
>>
>>108709985
Saw the filename, huh? Pretty observant guy.
>>
>>108710067
What can I say, my mom says I can be pretty smart sometimes.
>>
>>108710071
She was absolutely right.
>>
>>108710054
Gas chamber
>>
File: file.png (229 KB, 2958x1392)
229 KB PNG
>no way to set api key
I'm starting to think vLLM is garbage.
>>
>>108710054
post it
>>
>>108710093
vLLM is for serverbros, and like 70% of them modify it anyways.
>>
>>108710093
Imagine navigating through that sperg node once it's fully shat out
>>
>>108710100
>edit api key in custom node
>have to restart the whole program
Python is garbage.
>>
>>108710093
Wouldn't it be your node that's retarded?
>>
>>108710108
Have you never worked on anything in your life? Restarting is so incredibly normal, I genuinely can't believe you are complaining
>>
>>108710115
https://docs.vllm.ai/projects/vllm-omni/en/latest/features/comfyui/#installation
It's their node.
>>
>>108710136
oh no...
>>
File: screen2.png (65 KB, 866x514)
65 KB PNG
>>108710108
use ComfyUI-Secrets
>>
Fuck this piece of shit bubble. I bought a 1TB disk for like 40 bucks years ago and now the same disk is $300.
>>
>>108710108
Retard. Not going to spoonfeed you on this one
>>
>>108710174
It would be more useful if the secrets were encrypted.
>>
File: Nemotron_3_Omni.png (516 KB, 1427x919)
516 KB PNG
Omnimodal Sloppatron
https://huggingface.co/blog/nvidia/nemotron-3-nano-omni-multimodal-intelligence
https://research.nvidia.com/labs/nemotron/files/NVIDIA-Nemotron-3-Omni-report.pdf
https://huggingface.co/nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
>>
>>108710228
>video processing le bad
>>
SimpleBench turned out to be one of the best benchmarks. Most benchmarks are narrow: models score sub-10%, then a year later the benchmark is saturated. SimpleBench started at 40% almost two years ago and models still haven't reached 80%.
>>
https://github.com/ggml-org/llama.cpp/pull/22405


