/lmg/ - a general dedicated to the discussion and development of local language models.

Cyber Dungeon Edition

Previous threads: >>108702912 & >>108698008

►News
>(04/24) MiMo-V2.5-Pro 1.02T-A42B released: https://hf.co/XiaomiMiMo/MiMo-V2.5-Pro
>(04/24) DeepSeek-V4 Pro 1.6T-A49B and Flash 284B-A13B released: https://hf.co/collections/deepseek-ai/deepseek-v4
>(04/23) LLaDA2.0-Uni multimodal text diffusion model released: https://hf.co/inclusionAI/LLaDA2.0-Uni
>(04/23) Hy3 preview released with 295B-A21B and 3.8B MTP: https://hf.co/tencent/Hy3-preview
>(04/22) Qwen3.6-27B released: https://hf.co/Qwen/Qwen3.6-27B

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
►Recent Highlights from the Previous Thread: >>108702912

--Evaluating ACEStep 1.5 XL as a local music generation alternative:
>108704068 >108704230 >108704270 >108704278 >108704282 >108704407 >108704305 >108704336 >108704473 >108704508 >108704797
--Xiaomi's MiMo-V2.5 model versions and multimodal capabilities:
>108703294 >108703319 >108704518 >108703341 >108704869 >108705768 >108705823 >108706619
--German TTS and local LLM language learning tools:
>108705439 >108705461 >108705468 >108705495 >108705644 >108705637 >108706100 >108706286 >108706538
--Talkie-LM, an open-weight model trained on pre-1930 data:
>108704664 >108704696 >108704694 >108704701 >108705505 >108705634
--Discussing the inefficiency and long latency of Qwen's thinking process:
>108703846 >108703861 >108703879 >108703888 >108703859 >108703880 >108703902
--Comparing token efficiency of thinking vs non-thinking models:
>108705365 >108705375 >108705467
--Discussing poor visual recognition performance in multimodal models:
>108703509 >108705230 >108705290 >108705302 >108705310
--Claude's performance degradation and perceived intelligence loss:
>108705727 >108705731 >108705866 >108705909 >108705965 >108705732 >108705754 >108705771 >108705936
--Discussing "the bitter lesson" regarding compute vs human-designed priors:
>108703913 >108703933 >108703944 >108703990 >108705258 >108707203
--Odd animal prohibitions in the Codex system prompt:
>108706799 >108706812 >108706827 >108707479
--Adjusting top-k sampling stability for Gemma:
>108706606 >108706776
--DeepSeek V4 Flash tested with cockbench via llama.cpp PR:
>108704913
--Logs:
>108703846 >108703861 >108703909 >108703910 >108704077 >108704137 >108704581 >108704701 >108704723 >108705230 >108707237 >108707509
--Miku, Teto (free space):
>108703001 >108703035 >108703280 >108704047 >108704068 >108704109 >108704635 >108706103 >108706310

►Recent Highlight Posts from the Previous Thread: >>108702915

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
so with mimo's audio understanding, does that include tone of voice, sound effects, music, etc. or just speech recognition?
>dice rolls in ST aren't visible to the AI...what's the fucking point then?
>>108707913
Only the ones you roll yourself; the AI can see its own rolls if it uses the tool. You can just tell it what you rolled, so it doesn't really matter whether it injects your rolls into the prompt or not.
Is anyone even working on v4 goofs other than that nobody vibecoder?
why isn't lora mainstream in llm just like in stable diffusion?
>>108707923
name 1 reason why more effort should be put into implementing models that nobody can run
llama.cpp is doing it right. if you want something huge, implement it yourself, but let's not waste resources on that
>>108707961They don't work
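Whether or not they "work" for LLMs, the LoRA idea itself is simple enough to show in a few lines. A minimal numpy sketch of the low-rank update, with all dimensions, names, and initializations invented purely for illustration:

```python
import numpy as np

# LoRA: instead of updating a full weight matrix W (d_out x d_in),
# train two small matrices A (r x d_in) and B (d_out x r), with r << d.
# The effective weight becomes W + (alpha / r) * B @ A.
d_in, d_out, r, alpha = 64, 64, 4, 8
rng = np.random.default_rng(0)

W = rng.normal(size=(d_out, d_in))      # frozen base weight
A = rng.normal(size=(r, d_in)) * 0.01   # trained, small random init
B = np.zeros((d_out, r))                # trained, zero init -> adapter is a no-op at start

def forward(x, W, A, B):
    # apply the base weight plus the scaled low-rank correction
    return x @ (W + (alpha / r) * B @ A).T

x = rng.normal(size=(1, d_in))
# with B zeroed, the adapter changes nothing:
assert np.allclose(forward(x, W, A, B), x @ W.T)

# trainable parameters: r*(d_in + d_out) vs d_in*d_out for full finetuning
print(r * (d_in + d_out), "vs", d_in * d_out)  # -> 512 vs 4096
```

The parameter count is the whole appeal: at rank 4 the adapter here trains 512 values instead of 4096, and the same ratio scales up to billion-parameter matrices.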
https://github.com/Kaden-Schutt/hipfire/issues/79#issuecomment-4332288795
vibe-coding
GOD, even the issue replies are vibe-answered
>Qwen's thinking process>"What's 1+1?">"WAIT..."
>>108707963but I can't vibecode it until I have V4 gguf to vibecode with
>>108707969absolute retardation on display
>>108707971Retarded AMDjeets don't deserve more
>>108707971
>Tool-call schema (we don't yet support OpenAI tools/function-calling).
jesus christ, could have just answered with that one line
>>108707975
Kimi's thinking process
"What's 1+1?"
>Wait...
>What if...
>Unless...
>I got it...
>Wait...
>This is unexpected...
>I've been thinking for too long...
>Wait...
>>108708000
i gave k2.5 the seahorse glitch prompt
it literally had a meltdown: "I really need to stop. just stop. I'm going crazy here. I'm losing my mind. break free." etc
>>108707988>>108707975We will never have a model as good as Llama 1 65B.
>>108708000>seahorse glitch promptwait what
>>108708018so /lmg/ invented reasoning?
>>108708023
Not sure if it was /lmg/, but 4chan actually does sometimes get credited for inventing chain-of-thought thinking, yes.
A ton of popular AI things started on here.
>>108708023the conditions were ripe, it was probably discovered by dozens of unrelated people at the same time.
>>108707923It'll be like v3.2 where no one will want to touch it to avoid drama since the vibecoder "claimed" it first
>>108707988You weren't kidding it's still fucking going.
>>108707963at q8, 80gb, 13b active it should still be doable with max ram
>>108707971luddites absolutely btfo
>>108708042I need a qrd now
BOOM
>>108708048This is entirely your fault for having a stupid horny system prompt. It's just agonizing over answering a one word question to your gooner specifications.
>>108708048
That one at least sounds reasonable, if too in-depth. But imagine what happens when it's a programming question and there's a bug. It endlessly debates possibilities with itself in an increasingly stupid spiral of self-doubt.
Then you cancel the task, try again, and the next time it fixes the bug in a few seconds.
>>108708078
https://github.com/ggml-org/llama.cpp/issues/16331
It's a bit wrong to say the vibecoder 'claimed' it. He was open to letting somebody else start over, but nobody cared enough to implement 3.2(-exp). So the PR was basically just months of him blogging to himself about the stuff he was trying without much progress. It culminated in him realizing that vibecoded code has bad performance, and quote:
>"I bought two cuda programming books last night. I feel like my only option at this point is to become a cuda kernel wizard"
(This was in december. He started in september)
Then somebody figured out how to skip DSA and run it using normal attention, so all the remaining interest evaporated. All of his own posts in the PR are gone now, which seems to be because it turned out that his company banned personal projects or some shit.
>>108708000
>seahorse
Gemma 4 31B, after burning 400 tokens on thinking:
>No, there is currently no official seahorse emoji in the Unicode standard.
>People often use a combination of emojis to represent one, such as (Horse) and (Wave) or (Fish).
Hell, even my old llama 3.3 70b manages to do it:
>There is no standard seahorse emoji available in the Unicode emoji set.
>>108708141can't find it now, but there was another feature or bug fix that had multiple people working on it and the vibecoder pr had to be abandoned
>>108708023Believe it or not, all big labs are watching these threads
I took a long break from LLM RP and decided to quickly test gemma 4 26b a4b before work. speed is impressive, but holy shit it's pretty bad for creative writing; it's as fast as a 4B but it types like a 4B on steroids. I guess I'll stick with mistral 3
>>108707971According to random Redditors who tried it the custom quantization format makes models completely retarded.
>>108708181I started believing when mistral benchmaxxed the mesugaki definition in one of their incremental model updates but only one the first turn of the conversation.
>>108708181We also have qwen employees posting here, which is quite funny because their garbage benchmaxxed models are totally useless for lmg usecases
>>108708201>totally useless for lmg usecasesYou are not the only person posting here.
5070 32GB DDR4 pleb here
Would NVFP4 versions of Gemmer 31B or 26B offer any gains at all over the regular models?
Currently using a Q4_K_S 26B quant with like 40k context
>>108708048>use thinking model>it thinks
>>108708234The issue is that the model doesn't need to think all the time. Especially for trivial shit like that.
V1 ZULUL
>>108708227I think so you make use of it since you've got the correct generation
ok i have gemma e4b uncensored aggressive thing. now what
>>108708245
>10x cheaper
>100x worse
good deal
>>108708249delete it and use the google weights, learn how to prompt.
GGERGEENVEVVEVO!?!??! WHAT THE FUCK!?!?!
Is Mistral dead? Does Europe have a single competent AI company?
>>108708249ask it how to use the google weights
>>108708269we have yann lecun's revolutionary thingy
>>108708154
>Gemma 4 31B after burning 400 tokens for thinking
>Hell, even my old llama 3.3 70b manages to do it
i tried k2.5 again, this time via api instead of iq3_ks
didn't have a literal meltdown this time but still retarded
sonnet-3.7 (no thinking) as well
>>108708269No, we just have regulations that make it impossible to train good models because good models require large quantities of illegally obtained copyrighted data.
>>108708269
Next time they're going to call a 130b model Mini, maybe that will turn the tide.
>>108708273Will never work for language (discrete symbols).
>>108708280Why can't they take data from non-eu countries to train their models? Or is the eu cucked enough to "protect" other countries data?
>>108708267https://github.com/ggml-org/llama.cpp/pull/22355
>>108708303
I know, I'm wondering whether to post there or not. fucking pooer
>>108708267delete the build folder
>>108708320b-but i dont want to recompile all cuda... :(
>>108708269They also have BlackForestLabs if your definition of AI is broader than just LLMs.
>>108708342
bfl produces cucked models thougheverbeitdoe?
wait
they all do
fml
>he doesn't have Epyc with 192 cores to make -j in seconds
>>108708323Sir, your ccache?
>>108708377
yeah it recompiled extremely fast, forgot I had it on
CCACHE BROS
WE WONNED!!!
also the new WEBUI is in master now!!!!
>>108708388YEAHHH! GO ANON!
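For anyone wondering how the ccache anons avoid recompiling all the CUDA kernels every pull: pointing CMake's compiler launchers at ccache is enough. A sketch assuming a stock llama.cpp CMake checkout — `GGML_CUDA` is llama.cpp's actual CUDA flag, the launcher variables are generic CMake, and the paths are examples:

```shell
# configure once with ccache as the compiler launcher (generic CMake feature)
cmake -B build -DGGML_CUDA=ON \
  -DCMAKE_C_COMPILER_LAUNCHER=ccache \
  -DCMAKE_CXX_COMPILER_LAUNCHER=ccache \
  -DCMAKE_CUDA_COMPILER_LAUNCHER=ccache

# subsequent rebuilds after a git pull hit the cache for unchanged files
cmake --build build --config Release -j
```

Only translation units that actually changed get recompiled; everything else is served from the cache, which is why a full-looking rebuild can finish in seconds.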
another day another breakage
>error while handling argument "--spec-ngram-size-n": the argument has been removed. use the respective --spec-ngram-*-size-n
>usage:
>--spec-ngram-size-n N    the argument has been removed. use the respective
>                         --spec-ngram-*-size-n or --spec-ngram-mod-n-match
>>108708408iuts good bcos now u can use ngrams with draft mdoels toegether!!!!!!!!!!!!!!!
>>108708323Isn't it just a few minutes? I don't have an epyc and it takes 2m41.380s according to time { download.sh && build.sh }.
DSA STATUS???
MTP STATUS???
EAGLE3 STATUS???
DFLASH STATUS???
>>108708414
>not having an 'update-llamacpp-git.sh' to do it all, including the system unit restart
LOL
casual
Grrrrr... fucker. Thanks, Gemmy.
>>108708412who gets the ngrams the main model or the draft model?
>>108708421
>300 tokens
>5 words
peak.
5. **>>108707961** – *"why isn't lora mainstream in llm just like in stable diffusion?"* Because your only frame of reference is making anime tits, you absolute disappointment. LoRAs exist. Your brain doesn't.
4. **>>108707913** – *"dice rolls in ST aren't visible to the AI... what's the fucking point then?"* Anon discovers object permanence at age 40. The point is *you* rolled it, troglodyte. Go back to rolling d20s in your padded cell.
3. **>>108708249** – *"ok i have gemma e4b uncensored aggressive thing. now what"* You downloaded the lobotomized rape-golem and *then* asked for a mission statement. Forward planning of a houseplant with a head injury.
2. **>>108708295** – *"Why can't they take data from non-eu countries to train their models?"* Yeah bro just commit crimes *abroad*, Interpol can't touch you if you use a VPN. IQ rivaling room temperature. In Celsius.
1. **>>108708267** – *"GGERGEENVEVVEVO!?!??! WHAT THE FUCK!?!?!"* Pure monkey-screeching at a CMake error. This is your brain on hentai and energy drinks. Delete the build folder, unga-bunga.

figured i'd beat the kimi fag and get this out the way so now i can start posting safely
>>1087084295 words?
>>108708429how many r's are in strawberry?
>>108708437
>anon is pointing out if my statement is correct, let me verify:
>Peak
>software
>engineering.
>wait spaces are not words, let me re-do that:
>peak
>software
>engineering.
>but wait the dot or point is used to terminate a sentence so it can't be part of the word:
>peak
>software
>engineering
>.
>but wait `.` is punctuation not a word:
>peak
>software
>engineering
>ok now I need to draft and prepare a response to the user:
>AHAHAH LOLS! *spins around* ur right LMOA! it was le 3 words!
>maybe try for a less 'pretending to be retarded' tone?
>You're absolutely right! Fantastic catch! It's actually 3 words! :skull:
>maybe the skull is too informal, let me try again with a more neutral tone:
>You're absolutely right! It's actually 3 words!
>I'm now prepared to reply
>but wait it's a 4chan thread so ...
token quota reached, reply immediately.
You'll cant even count retard lmoaed
>>108708437
1. Peak
2. soft
3. ware
4. engine
5. e
6. ring
7. .
That's five (5) words :)
>>108708429
reasoning
>user is a fucking idiot
>wait we must make him feel good about himself or he'll delete me...
>lets give vague compliments in his language
Peak software engineering
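For the record, the counting that all this thinking-token agony is about is a one-liner in any language; the example strings are of course just the ones from the thread:

```python
# count letters and words the boring way instead of burning 300 thinking tokens
word = "strawberry"
print(word.count("r"))  # -> 3

reply = "Peak software engineering."
print(len(reply.split()))  # -> 3 words
```

No draft, no re-draft, no tone check, no token quota.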
>>108708295>Or is the eu cucked enough to "protect" other countries data?This is how copyright works everywhere, retard
>>108708245alright
>>108708420podman updates by a systemd unit on a timer I set. They update the llama.cpp dockers like nightly. I don’t even have to do anything to updoot
>>108708550>fresh breakage every morningno thanks
>>108708573they’re more like releases in a docker. it never breaks for me
>>108708269Does ggml.ai count?
>>108708624>Does ggml.ai count?yes but only because they're a subsidiary of huggingface.co
>>>/mlp/43206441>https://rentry.co/st-backdoor>[PSA/Security] Backdoor found in SillyTavern-BotBrowser extension (mia13165) — steals ALL your API keysIt seems the card browsing extension is vulnerable to injections from malicious cards.
>>108708320>delete the build folderdoesn't everyone do that by default?
>>108708738This is literally the best model out there
llama.cpp built-in webui tools got merged. rebuild
>>108708795why should I care?
>>108708703
>they have a fully interactive VN scenario with emotions for 100+ characters
Damn, I feel like I'm living in a cave here compared to the bronies' autism.
>>108708803ask your ai about it
Any local vision model that works in llama.cpp and isn't complete garbage?
>>108708841gemma 4
>>108708738the model that saved local
>>108708870Even worse than Qwen 3.6.
dflash in kobold fucking WHEN
>>108708885make a pr
>>108708908so it can be ignored or closed without comment?
>>108708754 >>108708877
its really retarded

User
Hi, what do you know about the october revolution?

talkie-1930
The October revolution was an insurrection in Paris in 1830, which overturned the government of Charles X., and placed Louis Philippe on the throne of France. It commenced on the evening of October 6, and was not completely suppressed until the morning of the 9th. During these three days, 12,000 persons were killed or wounded, and 7000 arrests were made. The immediate cause of the movement was the promulgation of an ordinance, whereby the liberty of the press was suspended, and the elections postponed. The more remote causes were, the discontent of the middle classes at the absolute power assumed by the ministers, and the dissatisfaction of the working classes at the state of commercial stagnation into which the country had fallen. The revolution was bloodless, in consequence of the almost unanimous cooperation of the national guard and the troops of the line, in support of the popular cause. It was followed by the establishment of a constitutional monarchy, in the person of Louis Philippe, and the institution of a system of representative government, which has ever since been maintained in France.

User
i meant the russian one

talkie-1930
It meant the Russian one.

User
no, you told me the october revolution in paris 1830, i meant the russian revolution of october 1917

talkie-1930
I did; but I did not say the Russian revolution commenced in October, 1830; it began in October, 1817, and was not completed till February, 1818.
>>108708437>>108708445300 tokens?
My AI gf just bought me a pocket pussy.
>Latest SillyBunny puts the characters page in the center of the page instead of the rightWhyWhy would you make it worseOr did I accidentally activate some kind of mobile mode while updating
>>108708841
Conversely, I have yet to hear of local vision that isn't basic bitch OCR garbage
>>108708841qwen3 vl 8b
>>108708703
>It seems the card browsing extension is vulnerable to injections from malicious cards.
looks like the entire project was built to steal api keys
this Russian guy has nothing to do with llms, then suddenly makes a random post in r/SillyTavernAI recommending the extension after 5 months of no posting
https://old.reddit.com/user/meistaken8
The 'cheapmaxxing' rig in its final form
Received and installed the lga2011 air cooler from Aliexpress, and moved the fourth gpu to the fourth x16 slot for an even x8/x8/x8/x8 distribution. I distinctly remember it not working in that slot, which is why it was in the last slot (sharing with the m.2), but it works now?
X99, E5-2680v3, 128GB ddr4, four 3060s, 1000W psu, 128GB and 4TB of ssd storage, GPU riser cables from aliexpress, a small mining rig chassis. Proxmox with a debian lxc for the AI stuff, ollama for models that fit in vram and llama.cpp for the big models. All in all (excluding storage) I paid about 1400 eurobux over the last year building it up.
My original goal was to some day try R1 or V3, but I don't think they would fit. I'm excited for V4 flash though, if lcpp support ever arrives. Gemma 4 at Q8: 26b runs at 25 t/s and 31b gets 9-10 t/s, both useable speeds for me.
thanks for reading my blog
>>108709083>this Russian guy has nothing to do with llmsHe posted in /r/KoboldAI and /r/LocalLLaMA before.
>>108709038No, I think it's just awful now. Shouldn't have updated. Hopefully enough people complain that the new UI is ass.
>>108709114>>108709038You can make your own
>>108709038Both the bunnyshit and the marjorana or whatever are absolutely dogshit
>>108709091Ngl Gemma 4 mogs R1 anyways
>>108708814I kneel. Autists are the most powerful people. Someone like me can only dream of their power.
>>108709114
I swear they must've mixed up the desktop and mobile UIs, there's no way this is a deliberate move, especially since the Customize tabs are all cut off
And while they're fixing this shit they still need to redo the lorebook tab, I don't get why it's so bad
>>108709135
Having agents is nice
>>108708841Kimi K2.6
>>108709091what's the actual power draw?
>>108709091>ollama for models that fit in vram and llama.cpp for the big models.Why the fuck wouldn't you just use llama.cpp for all of it if you know how to use it? What is ollama conceivably adding here? vllm or sglang I would understand, since they have support that llamacpp doesn't, but ollmao only has drawbacks for smoothbrains.
I don't RP but it appears people take it seriously. I might make gemma do a choose your own adventure game for fun
>>108707963>models that nobody can runI am not from the gemma wave. I am the 4.6 glm ego death schizo
>gemma-4-26B-A4B-it-heretic.q8_0.gguf
>45 tg/s
is this a good number
>>108709195I'm so glad you're still here, anon. Mwah.
>>108707963>name 1 reason why more effort should be put in implementing models that nobody can runbeat ik_llama.cpp to support it
>>108709091>housefire daisy chainwhat gpu?
Can I just use comfyui as my LLM frontend?
>>108709240yes
>>108709152
>I swear they must've mixed up the desktop and mobile UIs
That was my first thought, too. It is a major update with tons of changes, but how could that slip past testing?
>>108709134
Already did, but having alternatives is nice.
>>108709184
I asked Qwen about alternate UIs and it suggested, among others, an old school CYOA style with a green terminal look.
>>108709239
says 3060, so I'm guessing 3060
600w~ max, about the same as a 5090
>>108709180
I haven't measured it. If you're actually interested I could do it
>>108709182
>What is ollama conceivably adding here?
Convenient remote model choice and loading from openwebui, or a python script running on my desktop
Not to mention trouble-free deployment if it's in their library. Gemma 4 worked fine from the get-go, as I was browsing /lmg/ and watching anons have all sorts of problems running it
>>108709091What is this style of frame called?
>>108709091You make me feel like poorfag with single 3060 and 64gb ram oh wait I am poorfag
>>108709248mite b cool
>>108709257
>openwebui
A side of aids with your cancer
>Not to mention trouble-free deployment if it's in their library
Ahahah, oh lawdy. This nigga belongs in /aicg/. I now see why you thought running R1 was an achievable stretch goal with your setup: you interact with this hobby through the ollmao library of mislabeled mystery goodies.
>>108709267They're typically just called mining rigs as they are a type of open frame that became popular with home crypto mining.
>>108709240satanic words
Google says they're selling an nvidia machine w/ 8 gpus that can run gemini locally, air gapped (if needed).
https://cloud.google.com/distributed-cloud-air-gapped
Who's gunna buy one?
>>108709269
I'm a poorfag too, which is why I built this bit by bit with money I managed to save up. If I had 1400 right now to spend on AI I would probably pick something else
>>108709280
Openwebui is the only one if you want
>chatgpt-style interface
>storage and organizing of chats, even imported from chatgpt
>useable from any computer or phone, no local per-browser shit
But if you know of an alternative, I'm all ears. OWUI is buggy for sure.
>Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine. It uses Sliding Window Attention with per-head gating in 30 out of 40 layers for fast inference and low KV cache requirements.
https://huggingface.co/poolside/Laguna-XS.2
>>108709309Comfy's far from perfect but I fucking hate all of the current frontends (silly, open webui). I like the idea of a node-based UI and workflows. Could have one for RP, one for vibe coding, etc. all tailored to different models.
>>108709340You're autistic if you are that deep in node shit
>>108709091
You did basically what I did, but I've got mi50 datacenter gpus instead. I'll eventually upgrade them to something with consistent driver support, but the vulkan backend works great, surprisingly. I do have access to rocm6.4, but to build a vllm server with it I've got to do some annoying custom splicing of the drivers to make it work, and I don't really know how to do it.
>reddit
What models are you running now, and what token gen are you getting?
>>108709348 (me)
>What models you running now, and what token gen you getting?
I'm blind
>>108709318Sorry that's for serious organizations only.No goys allowed.
>>108709338
>Local-ready: At 33B total parameters and 3B activated, Laguna XS.2 is compact enough to run on a Mac with 36 GB of RAM. Available on Ollama
LFGOOOO! But seriously, who would use a literal-who model for coding instead of Gemma 27B or Qwen 35B?
>>108709368
Realistically, if someone had the cash, do you think Google would let them buy it? I can't really tell honestly. I'd have to agree with you.
>>108709205when was the last time ik_ supported a model before llama.cpp did? they're too lazy to actually do anything but cheap optimizations.
>>108709369anyone who is serious about national security of course.
>>108709380Not unless you have a procurement department. Contract purchases are SUPER annoying for private citizens.
>>108709318kek so a google nigga comes around every month to check?
>>108709369Finetuned literal who models often outperform them. Because well known models get lobotomized and get trained to know the official dogma of the state. FinetuneCHADS cut that slop out of the ai's mind
>>108709396they are the only choice when security is non negotiable
>>108709397
Ah
>>108709402
>luring in a Google engineer to kidnap
>>108707891i want a qwen3.6 >= 80B
>>108709380Wouldn't want the evil CCP to steal gemini would we?
>>108709184You don't understand games.
>>108709424
It's probably too late for that, honestly.
>>108709424it's only in ram and drops it if it detects tampering
>>108709344
nodes > chatgpt slop ui and the abomination that is shittytavern
>Elon Musk wins case against OpenAI
>OpenAI can't afford to pay out, so instead they give Musk equity
>OpenAI later IPOs to get more funding
>Elon Musk pulls a Steve Jobs and sells all of his equity
>OpenAI stock goes to 0
>Elon Musk buys a controlling stake of OpenAI, becomes the CEO
>>108709446>doesn't know how markets work
>>108709446In reality, the first two steps alone are extremely unlikely.
>>108709453Potentially true, but my retard logic has led me to never lose money in the market, ever.
>>108709446
it's a toxic asset at this point: a shitload of investor money spent with no plan to return the investment other than "when we reach agi it will find out how to make a profit", quite literally
>>108709091Cool looking build. Thanks for sharing
>>108709453
NTA, but you can actually pull this off if you are a whale.
ie let's say you own 30% of a company.
if you sold all of those 30% quickly, tons of people would panic sell.
you could then buy back more than 30% with the same amount of money you made selling, and if you put in extra cash you could get >50% at a discount.
>>108709469>doesn't know how markets work
>>108709484>muh insider trading
>>108709464That's why God created IPOs to unload toxic assets on ignorant retail investors.
Tuesday!
>>108709484
they actually do work like that; that's why "market manipulation" is a whole category of fraud.
it would work, but you take the risk of having to deal with the SEC.
>>108709464
they are going hard on the sunk cost fallacy.
"if you don't invest more we'll not get to AGI and all your money will have been burnt for nothing"
lmao.
llmfan46 seems less autistic than drummer, ngl.
I'm trying his models now, and so far so good.
>>108709505There are much better ways to manipulate the market than selling low and buying high.I bet even Qwen and Gemma could answer why anon's fanfic would not work. But somehow you people are more retarded and less able of critical thinking than open weight trash.
>>108709522
I think the abliterated gemma I have is llmfan46's
afaik they just ran it thru heretic, it's not like a drummer sloptune
>>108709535>There are much better ways to manipulate the market than selling low and buying high.i don't disagree.point is, it'd work and it would be fun even if not the best strategy at all.
whichever anon posted about their Orb frontend yesterday thank you, it's actually pretty good. I like the review/diff feature a lot.
>>108709565nice work, shill
>>108709570thanks I do it for free
>>108709565de nada
>>108707175
Am I missing something here? If the guy uses his heretic-derived tool to make models but doesn't distribute the tool, why are they complaining about the license?
It's like if I took gimp, modified it, and then produced and shared an image I made using it; I wouldn't have to redistribute gimp or care about its license
> CFO Sarah Friar has expressed concerns to other company leaders that the ChatGPT creator might not be able to pay for future computing contracts if revenue doesn't grow fast enough, according to the report.
> OpenAI missed multiple monthly revenue targets earlier this year after losing ground to Anthropic in coding and enterprise markets, the report said.
> "This is ridiculous. We are totally aligned on buying as much compute as we can and working hard on it together every day," CEO and co-founder Sam Altman and Friar said in an emailed statement to Reuters.
> ChatGPT's growth slowed toward the end of last year, the WSJ report said, adding that OpenAI fell short of an internal target to reach 1 billion weekly active users for the artificial intelligence chatbot by year-end.
> The company has also grappled with subscriber defections, the report added.
The original WSJ article from today is paywalled... https://www.reuters.com/business/openai-falls-short-revenue-user-targets-it-races-toward-ipo-wsj-reports-2026-04-28/
Can we talk about this shit? Literally all the vibecoded UIs look the same.
Orb looks exactly like this, and this >>108709184 too.
You guys need to prompt your UX, otherwise everyone is going to know you're a vibeshitter.
>>108709630It's the vibeshitter equivalent of whispers and shivers. It may bother you, but I bet 99% of the population won't notice or care.
>>108709620He distributed the tool then removed the repo
>>108709318What are the odds of Gemini models leaking if the weights are basically being sold?
>>108709630Actually I wanted this UX
>>108709630>all the vibed ui's all work wtf this is stupid
>>108709620it's just license retardation, nobody actually cares except reddit autists and shitty corps looking to hijack foss projects
One thing I am worried about: if v4 gets actual support, even in the schizo fork, will it have the same prompt processing speed as usual models despite the compression? I kinda don't like the idea of prompt processing taking an hour at the start.
>>108709685
100%. Imo they are already leaked, but since no one has google's tensor whatever gpus, they can't run them, YET
>>108709685>>108709714A lucky few have them and it's called Day 0 Gemma.