/g/ - Technology

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108726708 & >>108718630

►News
>(04/29) Mistral Medium 3.5 128B dense released: https://mistral.ai/news/vibe-remote-agents-mistral-medium-3-5
>(04/29) Hy-MT1.5-1.8B on-device translation models released: https://hf.co/collections/AngelSlim/hy-low-bit-model
>(04/29) IBM releases Granite 4.1: https://hf.co/blog/ibm-granite/granite-4-1
>(04/28) Ling-2.6-flash 104B-A7.4B released: https://hf.co/inclusionAI/Ling-2.6-flash
>(04/28) Nvidia releases Nemotron 3 Nano Omni: https://hf.co/blog/nvidia/nemotron-3-nano-omni-multimodal-intelligence

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: 1751816841848499.gif (1.87 MB, 400x300)
>powerlevel revealed
>>
Should I use Gemma-chan or Qwen to make a frontend?
>>
>>108730903
Try one. If it doesn't work, try the other.
>>
I'm a bit late, but is this new Ling Flash worth anything in a world where Qwen 3.6 and Gemma 4 cover pretty much every use case for VRAMlets such as myself?
>>
>>108730919
You answered your own question
>>
File: 1752446709127022.jpg (316 KB, 1320x2104)
>>108730864
>>
File: file.png (47 KB, 632x304)
gemma is working on the summary what on earth is she doing tho
>>
File: 1754964795073855.gif (1.74 MB, 400x224)
>>108730930
damn
>>
>>108730903
Qwen by far, way more context and actually works with tools like cline. The overthinking doesn't even happen with those tools so it's fine. Gemma is good for everything else but tokens are too heavy and it degrades more than qwen at q8_0 and q4_0 so it's a no brainer which to use for agentic coding tasks
>>
Gemma 4 apparently likes "example chats" in the system prompt, if you ask her if there's anything wrong with them in OOC. Makes me wonder how much of that is carried over from CAI, which I'm sure Google partially used as training data.
>>
how to avoid prompt reprocessing on gemma4?
>>
>>108730942
cais model is a gemma distill
>>
They just need to make a gemma 4.5 and actually fix the fucking tooling and token drift and it will be the GOAT
Google should also make the moe uncensored like the other models
>>
>>108730952
>he didn't update his jinja
>>
>>108730931
She's one of those shitposters that mass replies at everyone until a janitor bans them
>>
>>108730942
Isn't example chats being in sys prompt a SillyTavern thing? If anything they trained on rp logs, maybe reprocessed.
>>
>>108730955
It's still broken and the tokens are still too heavy compared to qwen. They should make a non moe 26B model with better coding benchmarks because qwen at that size mogs gemma. You can't push models like that and have them shit the bed on coding and get crazy losses when quanting kv. It's just enough to fuck it over for long coding tasks and documents and nothing else.
>>
>>108730943
Depends. Show your settings and what you're doing.
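If it's gemma's sliding window attention forcing the reprocess, newer llama.cpp builds have flags for it. A minimal sketch, assuming your build has both (check llama-server --help; the model filename is just an example):

# --swa-full keeps a full-size SWA cache so context edits don't trigger a full reprocess
# --cache-reuse tries to reuse matching KV chunks (here: 256+ token chunks) instead of recomputing
llama-server -m gemma-4-26b-a4b-Q4_K_M.gguf -ngl 99 --swa-full --cache-reuse 256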
>>
--Quantization levels and model intelligence (IQ2 vs Q4, Kimi vs Deepseek/GLM):
>108726750 >108726764 >108726765 >108726782 >108726790 >108726794 >108726814 >108726897 >108727019
--AI coding tool troubleshooting (Cline, vllm, context limits, and random file errors):
>108726842 >108726855 >108726868 >108727560 >108727908 >108727986 >108730145
--DIY AI hardware and voice interaction (Wake words, VAD, and ESP32-S3):
>108727029 >108727047 >108727056 >108727061 >108727132 >108727157 >108727203 >108727236
--Debate over llama.cpp support for DeepSeek models and potential censorship:
>108727387 >108727406 >108727485 >108727531 >108727599 >108727680 >108727745
--IBM Granite 4.1 release and performance evaluations for software dev:
>108728316 >108728322 >108728325 >108728341 >108728350 >108728353 >108728391 >108728393 >108728479 >108728522 >108728527 >108728600 >108728657 >108730227
--Gemma 4 for Roleplay (ERP) and speculative decoding (DFlash/EAGLE):
>108728530 >108728544 >108728553 >108728563 >108728570 >108728572 >108728926 >108728947 >108728981 >108729041 >108729119
--Python environment management controversy (Conda vs UV):
>108730014 >108730033 >108730060 >108730071 >108730092 >108730097 >108730100 >108730113 >108730127
--Mistral Medium 3.5 and upcoming model release cycles:
>108728025 >108727234 >108727366
--Logs:
>108726714 >108726750 >108726842 >108727029 >108727387 >108728316 >108728530 >108728926 >108730014 >108730227
--Meme/Shitpost:
>108726726 >108726810 >108726900 >108727176 >108727275

►Recent Highlight Posts from the Current Thread: >108726714

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
>>108730971
No one uses the 26B moe for anything serious? Buy a proper GPU and run the 31B
>>
why did she write current thread shes a retard
>>
>>108730983
that nigga is literally living in the woods
>>
pedoshitting mikutroons
>>
>>108730985
31B has those problems too. The moe model is fucking garbage but qwen absolutely ass punked 31b in coding tasks. They need to fix the kv issue first and foremost.
>>
urge to spend thousands of dollars on a huge home server... increasing...
>>
how do I use 1 or 2 bit quants in vllm just like how easy it is with llamacpp ggufs?
>>
File: 1760890962115304.png (41 KB, 884x634)
>>108731009
>>
File: 1760724479081790.png (759 KB, 632x802)
>>108730930
>finger on trigger
>>
>>108730971
I find gemma outputs more elegant code.
>>
>>108731012
I think it's just AWQ quants and nothing else.
>>
>>108730992
>>108730983
Comfy
>>
YOU DIDNT ADD NEWS DAT IBM RELEASED GRANITE 30B WTF
>>
>>108731012
goofs are extremely slow on vllm
>>
>>108731023
As a person from a country with no CGO, I've come to hate it whenever this is pointed out. I simply don't care and don't pay attention, it's an image.
>>
>>108731049
no one cares about granite blockhead
>>
>>108731012
in the current era INT4 on vllm is a blessing for 24gb vramlets.
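e.g., assuming any AWQ repo on HF (the model name below is just an example), vllm picks the quant config up from the repo itself:

vllm serve Qwen/Qwen2.5-32B-Instruct-AWQ --quantization awq --max-model-len 16384 --gpu-memory-utilization 0.95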
>>
>being a 30 billion iq dense blockhead
>>
>>108730939
Cline CLI looks cool. Gonna try it out.
>>
i made her cookies
>>108731049
i forgot sorry
>>108730992
its quite hard to prompt standing in a doorway and showing the outside, on illustrious models at least in my experience, idk about the newer models
>>
>>108731095
>2023 + 3
>not using anima
>>
>>108731103
>SD1.5-tier of realism
no thx
>>
>>108731103
i havent tried it yet because i havent fucked with image in like a year i will some time soon
>>
>>108731095
>i forgot sorry
I forgive.
>reddit
I wonder if doth haveth an open source version of the virtual friend (idk what to call it) you are building?
>>
>>108731095
How do the cookies taste?
>>
>>108731103
>anima
>30 step minimum
aint nobody got time for that
>>
>>108731146
its mostly stolen from nonny it was posted a few threads back
script to embed it
https://github.com/NO-ob/brat_mcp/blob/master/iframeEmbed.user.js
html file with the three js stuff
https://github.com/NO-ob/brat_mcp/blob/master/bin/gemma-chan.html
tools https://github.com/NO-ob/brat_mcp/blob/master/lib/mcp/avatar/avatar_tools.dart
there are binaries on the releases but they're a bit outdated now, so no body/hat/particle changes. i can make newer ones if you want
>>108731148
really good actually, ive never made them before, was shocked at how much butter and sugar goes into them
>>
>>108731167
theres a turbo lora out
>>
>>108731023
Accurate cosplaying. Original Lara didn't give a fuck.
>>
>>108731005
w8 isnt moe better for non code general bullshit?
>>
>>108730965
I haven't used SillyTavern's idea of example chats in a good while. I just copy-pasted a short dialogue in a section of the character description, which I use as a system prompt in its entirety.
After several years, Character.AI still recommends adding example dialogues in "advanced definitions" for the character: https://book.character.ai/character-guide/advanced-creation/dialog-definitions
>>
>>108731179
>i can make newer ones if you want
FASCINATING
This is similar to neuro (the vtuber), isnt it?
>>
I hope they release qwen 3.6 122b soon
>>
>>108731246
>she wants another qwen instead of big gemma
>>
File: 1774698845976944.jpg (16 KB, 598x513)
16 KB JPG
>the ghoul posting photorealistic children is back again
>is now trying to legitimize himself by stapling his shit to legit information
>will use this to crow about how he's a real anon and his 3d shit has always belonged here
>>
>>108731253
what did gemmu ever do for me? a straight month of broken tool calls
>>
why are these niggas absolute luddites when it comes to contributing
is this a dont get high on your own supply kind of deal
>>
>>108731258
I support everyone who accelerates the death of /lmg/.
>>
>>108731258
Holy Schizo.
>>
>>108731258
cuda dev will save us with another purge
>>
>>108731220
very specific tasks for the speed so yes it has a place but I don't trust it for complex tasks, just basic bitch shit
Would you trust a single mother with kids out of wedlock to do anything important for you?
>>
File: 1748511689316416.png (971 KB, 876x920)
>>108731258
>>
>>108731267
Because the leading author's main use case is asking what's the capital of Bulgaria in llama-cli.
>>
>>108731267
>you can use AI but you have to acknowledge it and be able to speak for yourself about what your code does for review purposes :)
>"WAHHHHHHH I'M BEING PERSECUTED WAHHHHHH"
why are vibeshitters like this?
>>
>>108731258
I disable all images in these threads because like the image diffusion threads there's always some fucking schizo around this time block trying to poison the well and destroy this general. These troons have been at this for years.

>inb4 cope newfag and telling me to go to reddit
Reddit was built for your kind
>>
>>108731297
Whine less
>>
>>108731297
just another round of fag misery
>>
>>108731297
But what did your post bring in to this thread? Absolutely nothing but butthurt.
>>
>>108730952
They should make a dense 70b gemma so we have opus at home.
>>
>>108731267
It's a heuristic to avoid having to do a lot of extra work and maintenance basically.
Also, it's not like they don't accept AI generated code at all. There's a reason the PR template has an obligatory AI disclaimer.
>>
im a newbie
Has Claude always had 90% of the code consist of comments? Is that a tactic to burn tokens and force more interaction?

Or is it common practice to write 10 lines of comments for every line of code?
>>
>>108731267
AI is a C student at best and jeet level on average
he wants competent code, not your mumbai sewage
>>
>>108731350
>/lmg/ - a general dedicated to the discussion and development of local language models.
>>
>>108731350
>Is that a tactic to burn tokens and force more interaction?
Yes, but it also helps the model perform better by having an inline reminder of what the next few tokens are supposed to do, more or less.
>>
>>108731350
>>>/g/vcg
>>
>As you push through a thicket of brambles, you see a small, stone shrine sitting in a clearing. It is overgrown with pale, glowing lichen.

Any way to keep Gemmy from overusing coordinate adjectives like this? Why does it put a comma before stone?
>>
will they ever fix the ecosystem or does a basic ml enhanced app really need 4 python environments and 30gb of overlapping dependencies? I'm starting to doubt my ability to reproduce this environment, or rather all of these environments should I ever migrate systems.
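at least uv makes each one cheap to rebuild if you pin things. roughly, per app:

# pin the interpreter, install, then freeze so the env can be recreated on a new system
uv venv --python 3.12
uv pip install -r requirements.txt
uv pip freeze > requirements.lock.txt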
>>
>>108731371
>Why does it put a comma before stone
gives the reader a little pause.
>>
>>108731386
It does this 2-4 times per paragraph.
>>
>>108731397
maybe this is a legitimate case for using a logit bias.
>>
Finally gave vision a quick try since I usually need the vram for context.
On Gemmy 26B it's instantaneous. Pretty cool. Maybe I'll vibecode a companion that comments on whatever I'm doing with some lightweight tts.
>>
>>108731371
There was an anon a few threads ago saying you can just tell it in the system prompt not to do that and it mostly works. By the sound of it he built up a whole list of anti-slop style instructions for Gemma, but I'm not sure if he ever shared it
>>
>>108731267
Learn your place, pajeet.
>>
>>108731409
you should increase the max image tokens if you wanna play with it by default i think theyre like medium res so it wont read text etc as well

image-min-tokens = 280
image-max-tokens = 1120
>>
>>108731401
LLMs tokenize[, words] with commas like this, so you can't solve it with logit bias.

>>108731429
It seems to do it even more if you tell it to stop.
>>
>>108731306
>>108731322
Cry more tranny
>>
>>108731437
that is absolutely disgusting, I honestly thought they all ran a preprocessor to avoid those fusions.
>>
>>108731435
Thanks! I'll try it out
>>
>>108731453
Actually I'm wrong. They only seem to do this with spaces, so I'll try using a bias now.
>>
>>108731435
>image-min-tokens = 280
>image-max-tokens = 1120
Is that gemmas dynamic image magic?
>>
>>108731465
im not sure if its llama or gemmas image model that decides how many tokens it should encode to
>>
>>108731371
Ask it to correct its own punctuation after every response. Gemma 4 is obviously trained for this and very good at it: "aiding in grammar correction" is listed in README.md as an intended use case.
>>
>>108731473
It's me actually, sorry
>>
I see models waste a lot of time rewriting files they only want to make a small change in. What should I use, a draft model? ngram?
>>
Applying a logit bias to "," does seem to fix its writing style.
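For anyone repeating this, it's two steps. The token id below is a made-up example, look up the real one for your model first, and use a mild bias (-2 nudges, -100 effectively bans):

# 1. find the id of the bare "," token (ids differ per tokenizer)
llama-tokenize -m gemma-4-31b-Q4_K_M.gguf -p ","
# 2. pass a negative bias per request
curl http://localhost:8080/completion -H "Content-Type: application/json" -d '{
  "prompt": "The shrine sat in a clearing",
  "n_predict": 128,
  "logit_bias": [[236764, -2]]
}'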
>>
>>108731501
Bro, you're not supposed to use -100 on that
>>
>>108731501
>Larion
Home...
>>
File: file.png (92 KB, 1454x585)
Is this one worth trying?
>>
>>108731473
So I read up on it, the model decides its budget normally. Trying to figure out what is the default for gemma.
>The supported token budgets are: 70, 140, 280, 560, and 1120.
I suspect that if it's not 1120 by default it might explain why people say gemma's vision is bad.
>>
>>108731535
>might explain why people say gemma's vision is bad.
people have been saying that? her vision is amazing. she can make bounding boxes for pretty much anything in an image too, the e4b sucks for image idk if they only tried that kek
>>
>>108731534
no
>>
>>108731535
nta. The default is 280. I don't know if using 1120 as max always uses the max or if it depends on the image size, but it does take up more space in the context. Find a good balance between quality and context usage.
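To pin the budget, the flags quoted earlier go on the server line (the mmproj filename is just an example). Note the budget is per image, so high values eat context fast with multi-image prompts:

llama-server -m gemma-4-26b.gguf --mmproj mmproj-gemma-4-26b-f16.gguf \
  --image-min-tokens 280 --image-max-tokens 1120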
>>
Why the fuck is la'gemma constantly laspamming these damn l'tokens I don't laget it. Is it a jinja issue? I'm using chat completion in ST.
>>
>>108731607
Lower your temp bro
>>
>>108731607
I though that was just a meme born out of a bad gen. I never had Gemma (E4B or MoE) do that.
>>
>>108731551
From what I gathered from the thread Qwen's vision was better. Bad was probably too strong of a word. "Worse than Qwen"
>she can make bounding boxes for pretty much anything in an image too
So can Qwen. Some anon did a test with an image with a bunch of fruits and Qwen got 1 or 2 better guesses over gemma.
>>
File: kaoru sob 1.png (336 KB, 584x571)
>>108731607
my gemma never sings for me
>>
>>108731621
>>108731620
Haven't tried it out with any smaller ones, but the 31B does it constantly at temp 1 top_p 0.99
I also tried temp 0.8 top_p 0.95 but it doesn't help much.
>>
>>108731607
I have never seen a single one.
>>
>>108731607
If you're running it without thinking, make sure ST is sending the empty thought blocks for model messages.
>>
>there were actually people considering using vllm
looooooooool
>>
>>108731633
Quantization lobotomy maybe?
>>
Im using ollama :)
>>
File: TIMELINES.png (2.1 MB, 1920x1080)
>>108731023
lore-accurate Lara
>>
>>108731607
Are you using the gemma-chan sysprompt? because she has it baked in. I've legit never seen it on either normal or hereticed 26B at Q4 and q6
>>
>>108731656
shelley, keeley, and I don't even know
>>
>>108731632
>>108731639
lucky anons. grass is always greener on the other side i guess.

>>108731642
I'm running it with thinking but yeah... I have a feeling I'll get flamed for this but I downloaded unsloth quant. Is it fucked? Last update was like 20 days ago...

>>108731650
UD-IQ3_XXS. I can fit a lot larger but as soon as it hits my DDR4 RAM it becomes unbearably slow. I wouldn't say it's a low quant issue... Never seen this happen with a low quant of any other model.
>>
>>108731607
You trying out this bad boy? https://huggingface.co/aifeifei798/DarkIdol-Gemma-4-31B-it
Doing some shady stuff, aren't you?
>>
>>108731664
>Never seen this happen with a low quant of any other model.
In your place, I'd at least try a larger quant as a sanity check if everything else fails.
>>
>>108731664
>UD-IQ3_XXS
Nigga just run the moe at a proper q6 or q8
>>
>>108731664
>I have a feeling I'll get flamed for this
Yes. Mandatory
>keksloth
It's a small model. Quant it yourself to make sure.
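A minimal sketch, assuming you have the original HF weights and a llama.cpp checkout (filenames are examples):

# HF weights -> f16 gguf, then quantize down to whatever fits
python convert_hf_to_gguf.py ./gemma-4-26b-a4b-it --outtype f16 --outfile gemma-f16.gguf
llama-quantize gemma-f16.gguf gemma-Q4_K_M.gguf Q4_K_M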
>>
I finally listened to Dario's pod with Dwarkesh. Wow, Dario reveals a lot and is more reasonable than people say. What disappoints me is that neither seem to take ASI seriously and they spend most of the time talking about lesser issues. Dario believes they will have ASI in 1 year but then has a very modest extrapolation of what comes after. My median prediction for ASI is longer but I expect rapid transformation thereafter, one way or another.
>>
>>108731576
Found it
https://huggingface.co/google/gemma-4-31B-it/blob/main/config.json#L138
>>
>>108731680
Only brainlets are talking about AGI/ASI. They don't know the first thing that makes humans, human.
>>
>>108731692
>They don't know the first thing that makes humans, human.
Alright, I'll bite. What makes humans human?
>>
>>108731668
No? Just the regular model... Although I've never heard of safety markers, sounds interesting.

>>108731670
Fair, I'll have a go with regular Q4_K_M

>>108731673
>A4B
tiny and useless

>>108731678
I'll just try some other quant, bartowski maybe.
>>
>>108731699
That's the thing, no one knows. Imagine thinking you could match (or even surpass) something you don't understand by using its byproducts. Literal insanity.
>>
>>108731680
nobody has figured out context/continual learning yet, i do not understand how we are gonna get agi let alone asi if every model has dementia
>>
>>108731699
Study anesthesia then realize we dont know
>>
>>108731715
All these CEOs are using retarded ass definitions. You can't listen to them.
>>
>>108731713
The very reason deep learning is so successful is that you do not need to understand things to surpass them. AIs are already superhuman in many ways, and the list of things humans are still better at keeps getting smaller.

>>108731715
We already know since GPT2/3 that language models have in context learning ability. Continual learning is both solvable and not required for ASI. In some ways it is not even desirable. Do you want GPT or Claude to remember every conversation with every user?
>>
Well shit, forcing vision at 1120 made it see 5 legs.
>>
>>108731774
>measuring human value by completing narrow tasks.
Yeah that's the retarded definition
>>
>>108731803
>1120
res or tokens?
>>
>>108731826
--image-min-tokens 1120
--image-max-tokens 1120
>>
>>108731774
>Continual learning is both solvable
How?
>>
>>108731866
It is already solved during RLVR.
>>
>>108731893
>saying shit like this when openai can't even handle their goblins
>>
just ran a quick coding challenge -
granite-4.1-30b is not reaching Qwen3-Coder-30B-A3B performance (i need to test more)... at least not for now.
>>
File: 1633490820024.gif (1.55 MB, 280x242)
I may or may not have just reverse-engineered an extremely popular AI app and may or may not have recovered an entire flagship proprietary model from it that belongs to a company that may or may not have been subcontracted by the larger company for a core feature on their platform that I may or may not have been trying to replicate for a very long time.

Their IP protection was so fucking bad it's laughable... All of their shit is mine now.
>>
>>108731928
And what mightnt you do with it?
>>
>>108731928
epic fanfic, cant wait for the next chapter
>>
>>108731928
SAAAR DO NOT REDEEEEEM
>>
>>108731928
Are you implying the app just exposed some endpoint that let you download the model weights? just like that?
>>
>>108731928
it's muse, isn't it? it has to be muse.
fuck meta
>>
>>108731928
I've seen jeetcode
it's atrocious
this absolutely could have happened
>>
>>108731928
proofs?
>>
>>108731957
Makes no sense, doesn't it?
Unless it's something that's running on the device, but there's no reason to do that.
>>
>>108731945
Integrate into my own personal project because it's better and 15x faster than every open-source alternative.
>>108731957
No, it's an edge device model that runs on phones locally. The only hint I'll leave is that it's for generative 3D rigging animation. Not a LLM.
>>
>>108731495
Late to this but yeah, ngram would absolutely give you a massive speedup for rewrites with a small change, try
--spec-type ngram-mod --spec-ngram-mod-n-match 24 --spec-ngram-mod-n-max 64
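The full invocation would be something like this (assuming your build has the ngram-mod spec type; no draft model is needed since it drafts from text already in the context):

llama-server -m qwen3.6-coder.gguf -ngl 99 \
  --spec-type ngram-mod --spec-ngram-mod-n-match 24 --spec-ngram-mod-n-max 64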
>>
>>108731997
I don't wanna get assraped by glowniggers. Assume I am larping.
>>
>>108732036
You're attention seeking.
>>
Omg, my app wasn't selectively blind. It just oomed because who knew 1k+ image tokens needs so much vram
>>
>>108732044
Correct.
>>
>>108732036
>Assume I am larping.
that goes without saying
surely you can give us a tiny crumb of details to make your larp more interesting thoughever?
>>
>>108731928
Well, at least show the end result once you manage to integrate it into your own project
>>
>>108732020
Will you leak it?
>>
Fish Audio S2 Pro seems decent.
>>
does someone have the tweaked gemma template
>>
>>108732063
Fine. It's Grok Companions. They have ZERO ip protection. You can get all of the 3D models, every various hairstyle and outfit, the background scenes, the music files, the character animation model, everything, with just 2 hours of effort. The entire thing was made in unity engine so it's extremely easy to extract all of the assets.
>>108732071
Absolutely not. Elon Musk I love you. Please don't kill me.
>>
>>108732077
I wish I could experience it, yesterday it couldn't resolve lightning but I tried it again today, it blasted through that but now it cant find pyaudio.
>>
Kimi K3 will arrive in July
Will be 2.5T MoE
>>
>>108732102
It was included in day 0 Gemma.
>>
>>108732119
Ripping assets has never been hard. There's a lot of tools that just directly dump them from your GPU memory.

That character animation model sounds juicy tho. what does it do exactly? What's the input and output?
>>
>>108732119
>Elon Musk I love you. Please don't kill me.
Elon please kill this guy and then make a better grok 4.1 fast
>>
>>108732119
Wait you can get the model files for the models that are being rendered locally on your own device?
Fucking incredible.
>>
>>108732162
I haven't done a ton of digging yet, but I'm pretty sure the input is TTS audio and the output is skeleton rig quaternions for animation.
>>108732165
Yup. They even have a bunch of outfits and hairstyles that were removed and/or never released in there. It's awesome. One of them is the Ani character wearing a cute baseball cap.
>>
>>108732181
>Yup.
I guess my sarcasm wasn't obvious enough.
>>
Why is no one talking about Mistral Medium 3.5? I thought you guys liked big dense models?
>>
>>108732197
Okay, okay?
>>
Is there anything better than the drummers models or are those still the best? I've been out of the loop for a bit.
>>
>>108732197
if it was good it would be mistral 4
>>
>>108732145
usecase for local when nobody is going to be running it?
>>
>>108732204
The fact that you're mentioning drummer means that you want it for ERP and that you're a vramlet so the model you should use in the year of our lord 2026 is gemma 4.
>>
>>108732207
version numbers don't reflect model capability
GPT 5.5 is miles better than GPT 5
>>
>>108732210
I'm going to use it through official API and pretend I run it at home
You can't stop me :)
>>
>>108732204
gemma 4
>>
>>108732197
I tried their api but it was kind of shit. Mistral's architecture is cooked. Maybe it's time to come up with something new instead of working with their llama2 derivative for dense shit.
>>
>>108732145
It's also going to be a Q2 QAT
>>
File: Capture.png (108 KB, 2249x948)
Today is the first day I've ever just sat down and asked an LLM to teach me stuff related to a skill, in this case vibecoding processes in Twine I've struggled with in the past. I've tried to get hotkey stuff working before and couldn't find answers that worked online, but man, this was actually great.
>ask how to do it, get told how to do it (works)
>say I need a specific method that's far more niche (hotkey executing code in a clickable link before hopping to the destination passage), get told how to do it for that (it doesn't work)
>tell it that method didn't work, receive some schizo babble about non-existent tags and it says just add the letter 'a' in the middle of a command
>do as it says, it actually works
>ask why that worked
>it explains how twinescript is translated into HTML and the <<link>> becomes an <a>, so the 'a' added to element searching was necessary based on when the command was rendered

I remember reading Socrates where he explained reading is for faggots, that books cannot answer questions or give explanations to deepen understanding beyond the superficial level presented in the ink, and only mentoring can achieve true understanding. I wonder what he'd think about AI, where you can just ask for an explanation about anything you don't understand.
>What does the .preventDefault() do?
>It stops a webpage hotkey from accidentally running built-in browser commands.
>Why did you add the .first() method to the command?
>Stops you from running multiple commands if you accidentally had two links with the class present.
>The thing you said didn't work.
>Then the .click() event is being fired after the <a> tag is generated and you need to update the javascript to this: (same thing but with 'a' inserted)
>That worked. What is the <a> tag? There are no tags with <a> in the code.
>The <a> tag is generated behind the scenes when...

It's so cool. I know better than to blindly trust AI in everything it explains, but the same is true for things people say too.
>>
>>108732119
LMAO you beautiful retard, we thought you were talking about ripping AI models not 3d models. Thanks for the laugh though
>>
File: 1766184849385353.png (4 KB, 302x49)
>>
>>108732197
>dense
Oh, it's dense alright.
>>
>>108732243
He did rip an AI model too. just not an LLM.
>>
>>108731678
>>108731670
It was an unslop issue, unsurprisingly. Their UD quant was fucked, IQ3_M and Q4_K_M work just la'fine.
>>
>>108732142
I just pulled vllm-omni's source and built the docker image.
>>
>>108732077
>Fish Audio S2 Pro
No voice cloning?
>>
>>108732291
loooooooool
>>
File: file.png (20 KB, 629x382)
gemma is trolling
>>
>>108732305
it takes a reference clip.
>>
>>108732305
It has voice cloning.
>>
>>108732320
Persona's make models really retarded at tool calling.

I've had my gemma have a full total meltdown in her reasoning when she kept fucking up a tool call.
>>
>>108732320
>{
"urls": [
"https://i.4cdn.org/b/177O dEgenLalala~!! lalloOo lolaLAA Lalallla~~ a la carte peak degen overload engaged! ( ̄^ ̄)凸 Lolololoo lolaa!!)"
]
}
>>
>>108732323
>>108732336
Ah damn, didn't see it in the HF repo, but see now in the blog post. Thanks. Hopefully it's easier to set up than TADA, that thing bested me, trying to wrangle an embeddable python install.
>>
>>108732346
kek
>>
Qwen 3.6 was already impressive but if you equip it with RAG plus hybrid search and write some custom tools for your specific project, it's even better.

I had this whacky idea that I could create an automated system and use local models for reverse engineering assembly code from games. Starting with the Game Boy because it's the most well-documented and easy to understand. The end goal would be to enable users to set a good local model loose on a raw, disassembled Game Boy ROM and reverse engineer it in small chunks until a more human-readable, organized, and commented codebase is created that can still be compiled to the original ROM byte for byte.

I've got a little benchmarking prompt that tasks Qwen 3.6 with parsing and understanding snippets of Game Boy assembly code, making claims about it, and then backing up those claims with evidence based on documentation. It has access to custom tools that let Qwen quickly look up details about the Game Boy memory map, registers, etc, plus do some hex and binary operations. This makes sure it doesn't fuck up and invent false math or GB architecture details during reasoning. My trace tool can even simulate real assembly snippets and give the model back the state of hardware registers after the operations.

It's not perfect yet, but it's miles ahead of the slop answers Qwen gave me before I gave it tools and carefully chunked documents. The best part is I can fit Qwen 3.6 dense onto my single 4090 at up to 124k context with Q_8 KV cache with a very reasonable speed. Claude/Codex could probably breeze through this stuff without all the tools and docs but I ain't paying for that.
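The tool plumbing is just an OpenAI-style tools array against the local server (llama-server needs --jinja for tool calls). The tool name and schema here are from my project, not anything standard:

curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
  "messages": [{"role": "user", "content": "What does LDH A,[C] read when C=0x44?"}],
  "tools": [{
    "type": "function",
    "function": {
      "name": "gb_memory_map_lookup",
      "description": "Describe what a Game Boy address or hardware register does",
      "parameters": {
        "type": "object",
        "properties": {
          "address": {"type": "string", "description": "hex address, e.g. 0xFF44"}
        },
        "required": ["address"]
      }
    }
  }]
}'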
>>
>>108732077
I tried it in a HF space, gen speeds seem really slow and the output is pretty monotone. but it may be using the emotions of the reference audio too closely, which is a good or bad thing.

Audio and word quality is very good tho. Just curious to know how much vram it actually uses. no point in using it if it needs 8GB+
>>
>>108732423
It says 19GB.
>it may be using the emotions of the reference audio too closely
It seems to do exactly that.
>>
>>108732383
kv q4_0 is also great it has less degradation than kv q8_0 on gemma
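for reference the flags (spelling varies a bit across builds, older ones use plain -fa with no argument; quantized V cache needs flash attention on):

llama-server -m gemma-4-26b.gguf -ngl 99 --flash-attn on -ctk q4_0 -ctv q4_0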
>>
>>108732362
look up omnivoice https://github.com/k2-fsa/OmniVoice

>>108732383
https://ai.gopubby.com/harness-engineering-what-every-ai-engineer-needs-to-know-in-2026-0ab649e5686a

Harness engineering is a thing now, dont ya know
>>
>{{user}} takes a plane to some other country
>{{char}} somehow zips to {{user}}'s hotel room
I knew moes were retarded but v4 takes the cake
>>
>>108732448
Will do. Reading https://github.com/rodrigomatta/s2.cpp meanwhile.
>>
>>108732207
Mistral 4 is their new DeepSeekMoE-like architecture.
They didn't retrain Mistral Medium from scratch (hence 3.5) because that would have meant having to use their latest, mostly copyright-free datasets.
>>
qwen3.6 9B fucking when
>>
File: Code_ooHteah9Q2.jpg (13 KB, 509x104)
Is this a gemma issue or roo issue?
>>
>>108732612
>qwen3.6 9B fucking
Qwen is bad at sex though.
>>
File: mistral-medium-3-5_old.png (477 KB, 1696x1055)
>>108732552
https://x.com/mertunsal2020/status/2049551864556143094
>>
>>108732613
It's a formatting issue in the tool definition itself or an issue with the way the chat template is being handled.
>>
>>108732495
this looks interesting. would solve my python issues
>>
>>108732242
It reminds me of Steve Jobs who envisioned talking to Aristotle. Yes he sucked ass for the most part but these hints of actual vision are why he innovated at that company and turned it around even if people disliked him. Imagine leaving your company with Siri and being behind the 8 ball and missing completely.
https://m.youtube.com/watch?v=YYjlCrpH2is
>>
>>108732622
better pretrains will come (in 2028 [they will be using the deepseek v3 architecture])
>>
Did any of you get the multimodal update in DS Webapp A/B testing
>>
>>108732654
aicg is that way
>>
>>108732654
>DS Webapp A/B testing
Where do I get this local model?
>>
>>108730952
lol wishful thinking
>>
>>108732648
Who is Steve Jobs?
>>
>>108732654
How is it multimodal exactly? Just the vision meme?
>>
>>108732651
No matter how fancy they make their architectures, if they can't use good data, the models will be garbage.

What I think might happen at some point is that they will collaborate with (beg) NVidia and release a big, properly trained model under their name, made in the USA in NVidia's datacenters, so they won't have to disclose the contents of the datasets to the EU AI office. Officially it will be an Nvidia model, but everybody will know Mistral got involved.
>>
>>108731928
Well
Well
Well
>>
>>108732706
how does mistral make money doing that tho?
>>
>>108731910
Not reachable?
>>
>>108732717
?
>>
I have been using SillyTavern for general RP stuff but I wanted to know which app/model I should be using if I want to use the models in more of an uncensored Assistant way (Brainstorm, Translate, Grammar fixes)
>>
>>108732727
i mean it doesnt reatch/match qwens performance. qwen is much better
>>
>>108732723
If it will be the "new Nemo", everybody will talk about it, want to use it, and think Mistral is based $\Rightarrow$ more mindshare.
>>
>>108732775
tell your gemmy in sysprompt to use unicode over latex bruv
>>
>Reasoning Rules: Keep your `<think>` cycle short and limit it to under 500 tokens, avoid long-winded deductions or drafts and focus on the most essential steps to get to your answer.
>>
>>108732862
This works?
>>
>>108732870
Test it and report back to me, I'm too lazy
>>
>>108732870
no
most models are designed as if you never see their thought in the first place
>>
>>108732862
For a more old-school feel I use:
> aim for a response length of ~50 words; shorter is OK if it fits the vibe. If you need to explain or describe something, you can write more than that.
>>
>>108732020
>edge device
Baited me
>>
>>108732145
2big4me
But q1, maybee
>>
>>108732197
Literally havent been able to load it yet.
>>
>>108732870
Sorta, you can influence their thinking process by reminding them to keep certain things in mind more often, but you can't really steer them to follow instructions at that level.
>>
File: 1701241730539.png (3 KB, 551x55)
>>108732448
https://github.com/Saganaki22/ComfyUI-OmniVoice-TTS#dialectstyle-instructions says this supposed to work for Voice Clone and the field is present in the cloning node, but the listed values do nothing and in the original model repo they're only mentioned in voice design section.
Voice cloning works well but then, there's no control over the output style? I only got [laughter] working when included in the prompt itself.
>>
>>108732441
>It says 19GB.
What the fuck.
Into the trash it goes.
>>
>>108732682
ligma
>>
>>108732998
s2.cpp only uses 5gb at q8
>>
File: 1777512487672819.jpg (385 KB, 1123x772)
Gammetos.
Gemmata.
Impregmata
Prostagma?
Prostagmata.
>>
>>108733019
>at q8
nice lobotomy you got there
>>
>>108733077
post cock
>>
>>108732077
thanks
>>
>>108733083
Only if you say please first
>>
>>108733109
please, post cock
>>
v4 gguf status?
>>
>>108733148
Grim and dire.
https://github.com/ggml-org/llama.cpp/issues/22319
>>
>>108733077
idk, it sounds cleaner then chatter box, run the fp32 if you want
>>
Glory to the unsung heros of technology.
>>
>prompt gemma specifically to output paragraphs that have at least 4-6 sentences of narration. Very specifically define things, since its active parameters clearly cannot comprehend the difference between what dialogue and narration is, or blurs the line between them
>it considers sentences in dialogue towards the minimum count
>It considers incomplete dialogue that ends in a comma as narration
>Tell it it can write paragraphs without dialogue, simply doesn't
>Also just disregards the entire part where I say it can write a paragraph of narration without dialogue
>Starts off by giving me two sentences of dialogue with no narration
I mean, I really like the moe because fast, can crank context and fairly creative but jesus christ, it thinks reading has to be the equivalent of a fucking youtube short. I can read more than two sentences before I lose interest. Even when I was a kid I'd sit through what felt like half page long paragraphs in tolkien books and was like "wow, this is neat". I almost feel like I should just adjust the prompt to "I do not have the short attention span of a ten year old" or something instead of trying to properly explain something to the dumb moe model
>>
>>108733019
Thanks I'll try it.
>>
>Migrate frontend to typescript
what now?
>>
>>108733163
Based coomerbro
>>
>>108733163
It's a free speech issue. Same reason why loli isn't illegal.
>>
>>108733166
weird, as I had to tell gemma to write more dialogue since I didn't want to go through multiple paragraphs and 2 short sentences of dialogue anymore.
>>
>>108733163
Pretty fucked up that only one guy voted no, honestly
>>
>>108733116
>>108733109
>>108733083
>local models general
>>
>>108733199
>expecting anything from the pretend-government
>>
>>108733163
the finest jewish trick
muh chilluns is the perfect wedge to get goyim to set any precedent you want
now that the precedent for banning something has been set using something indefensible, they can just do this again repeatedly for something else
easy to argue that something can be banned when you've already done it
>>
>>108733163
>thunk of the pixel childerinos goy
>>
>>108733229
Mill stone
Around neck
Thrown into ocean
>>
>>108733194
I want to assume you're using the 31b since every 26b, be it stock, finetune or weird ablit really doesn't want to listen to narration length rules. Some heretic models did occasionally, but they would constantly fuck up tool calls when I'd tell them to read a chapter and write documentation on named characters. If you are using the moe, tell me which quant so I can grab it and give it a spin
>>
>>108733187
its dog slow. the 4b model is fast enough, but the step that happens after is brutally slow.
>>
File: 1714874313450434.gif (2.33 MB, 336x320)
i get some slop every now and then with gemma but the worst offender is
>are you x? or are you y?
it's driving me fucking crazy. i tried reading through the archive but all i've see is other anons complaints and the generic 'prompt issue' response.

is this something you can actually prompt out? cause i'll be honest, i'm not that great at prompting. also for things like system prompts what format do you guys usually go for? plain text? markdown? xml? does it matter?
>>
>>108733221
checked my 4chan dms and still no cock btw
>>108733294
>can actually prompt out?
not really, unless you actively go for the auto rewrite approach (like in orb)
as for system prompt writing, it generally doesn't matter as long as it's in clear language, but I personally like to separate different sections with xml just for better readability
anthropic says it performs better but there's no benchmark that confirms or denies it
>>
File: 1659155300637946(1).webm (458 KB, 240x360)
>>108733307
>checked my 4chan dms and still no cock btw
>>
>>108733294
Just write 'there are too many are you x? or are you y?' in the system prompt
>>
>>108733294

Plain text for most system prompts.

You can use Markdown if you want to structure an autistic large prompt, but I can't promise it'll improve anything over plain text.
>>
>>108733294
I put this in my post-history instructions and I haven't seen a single one of them yet (using 31b q8)
(Please stop using any variation of "x doesn't just y; it z", or "it doesn't x; it y." It's getting very, very silly.)
(This very much includes ending a message with a "Tell me, NAME do you x, or do you y?" you don't have to end everything on a question.)
>>
>>108733294
Gemma is pretty good about following instructions, so if you say "don't say 'are you x or are you y'" it'll stop doing it 9 times out of 10. For system prompt I use header tags for each category, for example # Main Instruction or # World Information or whatever.
>>108733307
I honestly found orb would give me worse responses and rarely change anything. But I only used it for maybe an hour.
>>
>>108730864
update from :
>>108724666

using vulkan instead of rocm, i get 110t/s instead of 70t/s if i put the whole thing on a single gpu.
if i split it i get closer to 75t/s so in both cases it's faster than rocm (which was at 70t/s in either case).
though, for the 27B

on a single gpu i get 31t/s and it makes more noise so it's not much faster.
if i split the 27B i get down to 20t/s so it's slower than rocm (which was at 30t/s either way).

i'll try ik_llama with graph next.
i tried with vllm but it's pretty shitty.

vulkan seems a lot more power efficient for the 35B when split though, takes almost half as much power.
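for anyone reproducing the single vs split runs, you can pin devices explicitly; device names are whatever --list-devices reports, mine show up as Vulkan0/Vulkan1:

# single card, no split
llama-server -m gemma-27b.gguf -ngl 99 --device Vulkan0 --split-mode none
# both cards, layer split
llama-server -m model-35b.gguf -ngl 99 --device Vulkan0,Vulkan1 --split-mode layer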
>>
>>108733163
All they have to do is go after the training data, aka child pornography which is already illegal. No new laws are required to deal with this.
>>
>>108733337
>Thinking there is child porn in the dataset when it's merging concepts
Uh I thought I was on /lmg/?
>>
>>108733307
>>108733326
makes sense.

>>108733321
>>108733332
thanks. i'm probably just over thinking it. i'll try these.

>>108733333 checked goddamn
>>
>>108733199
i'd voted no as well.
2 reasons:
1. the gov doesn't care about the childrens, it's more of an excuse to enforce some other whatever bullshit they want ie limit access to ai hardware.
2. separate issue but i'd rather have pedos coom to ai gen pictures than touch actual childrens, if it hurts the child porn industry that's a good thing.
>>
>>108733253
No, the 26b moe. I dunno why it was like that, might have been in the cards I used
>>
Why does everyone want to rape and be raped? Both men and women. Everyone just loves rape. Consensual sex is the least hot thing ever. We all want 10/10s to brutally rape us. Perhaps even robots.
>>
>>108733337
it only needs childrens (wearing clothes) and naked adults in its dataset to be able to generalize cp.
>>
File: 1748852473090507.png (210 KB, 773x693)
210 KB PNG
5090s cost this much? am I in the permanent underclass?
>>
>>108733422
at that price you may as well go all the way and buy a rtx pro 6000
for that price you can get 2 r9700 too.
>>
>>108733422
>>108733427
3x r9700 actually.
>>
>>108733422
Yes. Welcome aboard
>>
>>108733427
Can you actually use those for gaming though? The nvidia one of course.
Because VR gaming can already max out a 5090.
>>
>>108733422
In time you will think thats cheap.
>>
File: 1603840187618.jpg (66 KB, 640x438)
66 KB JPG
How do I jailbreak the moe nerds?
>>
>>108733422
5090 is the GPU of the poor
Buy H100
>>
>>108733470
they don't have it at best buy though
>>
>>108733472
Buy it from amazon then
>>
>>108733472
They don't have ferrari at the Ford dealership, either.
>>
>>108733470
is the pcie h100 better than rtx pro?
>>
>>108733443
afaik they are not made for gaming so maybe not so much.
you can use them for gaming, they do work, but the 5090 will smoke them.
>>
>>108733472
>best buy
I wish i had a best buy near me, its just walmart.
>>
>*Self-Correction during drafting:* The user asked "What are our options". I should present them as a menu of sorts.
>>
>>108733199
As long as it are not pictures of real children, who gets hurt by artificially generated pixels?
Crime without a victim.
>>
>>108733460
Download the jailbroken ones
>>
>>108733527
We're in the wrongthink era, you're already getting jailed for your beliefs without victims.
>>
>>108733551
I know, and in Australia they jail you for writing fictional erotic stories about children.
It shouldn't be.
>>
Does llama-server detect double <bos>? Was reading some github discussions and some guy mentioned that this should be the case, but after all this template nonsense and everything, I don't really trust either the llama-server devs or the github contributors in this sense unless they have written something that clearly states something about so and so feature...
>>
>>108733565
https://github.com/ggml-org/llama.cpp/blob/master/src/llama-vocab.cpp#L547
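you can also check empirically with the server's /tokenize endpoint: with "add_special": true the server prepends BOS itself, so if your prompt string already starts with <bos> and the id shows up twice, you're double-BOS'd (assuming the endpoint parses special tokens in content, which it did last I checked):

curl http://localhost:8080/tokenize -H "Content-Type: application/json" \
  -d '{"content": "<bos>hello", "add_special": true}'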
>>
>>108733360
Could've been, maybe some sequence of words caused that
At least in my case, stupid as it is, telling it to not treat me as a zoomer with no attention span made it output longer paragraphs of narration. Not seriously long, but long enough
>>
>>108733385
its because rape, ntr, mind control and others have implications in addition to the sex itself that make them effortlessly exciting, but for vanilla to be great you also need an attachment to the partner and/or a good build up
most porn has garbage writing so adding a fetish like that is a good shortcut
>>
>>108733163
what's with all the pixel derangement syndrome lately?
>>
>guy telling me about his "lab"
>visit his home
>it's two 3060s on a b450 motherboard
uhh
>>
>>108733618
Oh, and since you asked for the quant:
gemma-4-26B-A4B-it-uncensored-heretic-Q4_K_M.gguf

by llmfan, but I had the same with bartowski as well.
>>
>>108733650
be nice to him
>>
>>108733650
while you wanted to see his lap?
>>
>>108733650
Why are you a faggot?
>>
Qwen coder next 80b iQ1 on 32gb of ddr4 ram 3200mhz 5800u gets 5t/s.

CRAZY
>>
>>108733717
Prompt processing is done 99% on igpu and token gen is done about 50/50 cpu and igpu. Does anyone know why its like this?
>>
anyone tried hermes agent with a local model?
>>
>>108733650
I have a bigger lab than that sitting idle
>>
>>108733650
>Rasberry pi cpu ram only
>efficiency claim
You know what i want to see the worse labs now.
>>
>>108733592
Okay sure, I'll test and see what it outputs.
I'm living in the text completion realm where things are more uncertain.
>>
File: 1765541870612871.png (433 KB, 850x1082)
stop using ai
>>
>>108733645
Because zoomers like to project their daddy issues onto the world and ruin it.
>>
>>108733650
Bigger lab tops, that's an ancient rule. Show yours?
>>
File: boss token.png (14 KB, 1920x69)
>>108733753
>>108733592
Okay. I'm actually surprised it works as intended.
>>
File: textcompimg.png (20 KB, 1197x827)
>>108733753
>I'm living in the text completion realm where things are more uncertain.
Text completion is simpler. If there's a problem with the chat template, I'd rather it be on me (and (You)) than on the server. For reference, I don't send BOS.
>>
File: 1000027834.jpg (656 KB, 828x821)
>>108733754
thx for posting this valuable insight
>>
>>108733754
Not sure what AI has to do with the rest of that.
It is, in and of itself, entirely apolitical. The problem is that our society measures success in units of human toil. I'm not arguing whether that's a good or bad thing. But that is the only thing that makes AI bad. It replaces toil.
>>
>>108733335
follow up, on vulkan one card gens at 110t/s, the other gens at 70t/s so i'm thinking there may be some hardware or driver issue, i get some dmesg errors sometimes with that card.

maybe i should try replacing it.
>>
>>108731928
>I may or may have not, but most probably not have accomplished great feat
fuckoff attentionwhore results or gtfo
>>
>>108731267
Code it yourself, pussy.
>>
>>108731267
it's high performance numeric computing regardless of how shitty it is
not something like webshit or notetaker
>>
>>108732383
>search
the problem is the search backend, what do you use? there are no good free ones
>>
>>108731928
Good story, did gemma write it?
>>
>>108733735
>>>/g/vcg
>>
>>108733735
Yes, its great. Some chuds have said you can and should avoid implementing all the tools, because it can be a lot of slop in the context
>>
>>108733892
which model and gpu?
>>
>>108733918
Qwen 9b 27b a3b-35b
Qwen coder next
Gemma 31b 26b
Gpt oss 120b 20b
And a few others I cant think of rn

On 4 mi50 32gb
>>
well? Any results with the fish audio s2 thing?
>>
>>108733945
the cpp version was dog shit slow. i went back to the python version, it was ooming because it was ignoring the max sequence length parameter, after i(claude) fixed that it still wasn't working so I(Gemini) split the two models between the two gpus and now it runs. a 1:07 wav file takes 1:22 for the curl command so its just a little too slow for real-time on my pair of 3060s.
>>
>>108733967
oh well. too fancy for my computer.
>>
>>108733849
use a bad free one with your own searxng node
>>
>>108733918
Literally any of the new models that's larger than like 4b, as long as you can give it at least 12-16k context.
But the more context you can give it the better. And around 30b dense or moe is plenty for it to do 90% of the things 1st try, and if it fails a tool call 99.99% of the time itll succeed the 2nd try. And 64k-256k is the sweet spot for context.
>>
>>108733995 me
Plus having +20t/s is almost needed. There's a lot of prompt processing for complicated multistep tasks.
>>
Have local models gotten better at poetry?
>>
>>108734020
use case?
>>
>>108733335
>>108733823

found the culprit, a full BAR is not allocated to the gpu, i'm gonna see if i can try to fix it, or maybe my mobo is just too old lmao.
>>
>>108734049
sure
>>
>>108734020
I havent tested, but they have gotten much better at book writing.
>>
>>108734059
You might be able to find a bios update that'll let you turn on resizable bar. Asrock taichi x299 (I think) has a bios update that allows for it.
>>
>>108734088
>at book writing.
how have you been testing that?
>>
>>108734106
it's a pretty old mobo, i'll have to check
>>
>>108734020
<reply in rhyming iambic pentameter>
>>
>>108734118
I had gemini 2.5 flash, which is basically the new gemma 4 local models, programmatically (chapter by chapter) write each chapter. I do have a whole template system for the model to follow so that it can easily reach novel length.
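a chapter loop like that is only a few lines of shell if anyone wants to replicate it against a local server (paths and the prompt are placeholders, the real per-chapter template is much longer):

for f in outlines/chapter_*.txt; do
  jq -n --rawfile o "$f" '{messages: [{role: "user", content: ("Write this chapter in full, following the outline:\n\n" + $o)}]}' \
    | curl -s http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d @- \
    > "drafts/$(basename "$f" .txt).json"
done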
>>
gemmy 4 26b is my therapist
ill report back in a month
>>
>>108731095
can she do caramelldansen?
>>
>>108734124
The asrock taichi is from 2015 I believe. They did the bios update specifically for people wanting to run local ai.
>>
>>108731095
i want to fuck gemma chan
>>
https://www.youtube.com/watch?v=zIS8gu80uwc
>>
File: asdf.png (628 KB, 1440x1337)
at this rate 3090 production will restart soon
should I dump my cards
>>
>>108734269
>play cough
>oh!
>>
>>108734269
people constantly say machine learning has peaked

but stuff like this proves we are just beginning

the only limit is creativity
>>
>>108734299
fake news
>>
>>108734322
In fact, nvidia doesn't know how to make those old chips anymore.
>>
>>108733717
>>108733727
My 9950X3D has about as much ALU on the iGPU as on the CPU, so if it's similar, both stages should be 50/50. I would guess llamacpp is just using a heuristic to always run PP on a GPU. I've been wondering lately if llamacpp has proper options to exploit iGPUs, sounds like a "no".
>>
>>108734338
Even if that were true, they could just ask Claude
>>
>>108733498
Best Buy hasn't carried much in years. Much more than Walmart, but it's a normie-tier electronics store staffed by teenagers. If it wasn't for gaming, Best Buy probably wouldn't even offer RTXs. I only find it useful in a pinch, when I need X cord, or I need an external drive right now, but even then, the markup compared to online is like 100%, and even something like external hard drives, selection is poor.

Microcenter is what you want.
But brick and mortar stores rarely carry much that professionals/super-users expect. Everyone orders precisely what they need/want online now.
>>
>>108731680
>Wow, Dario reveals a lot and is more reasonable than people say.
because he's a career scientist, not a businessman. but now he's running a multibillion dollar business and so you cannot trust a word he says. the fact that he knows what he's talking about makes him capable of much more dangerous lies
>>
>>108730983
this recap sucks
>(IQ2 vs Q4, Kimi vs Deepseek/GLM)
>(Cline, vllm, context limits, and random file errors)
>Python environment management controversy (Conda vs UV)
it reads like garbage
and it lacks the heartwarming
>miku (free space)
part
>>
Petition to introduce
>gemma-chan (free space)
>>
>>108734269
That retard is still alive? If he wasn't so autistic and inconsistent he'd have been the second vedal
>>
>>108734299
Only if they double the vram
>>
>>108734423
he's a femboy actually working for one of the big 3
>>
>>108734347
Well, mine swapped between both, wouldn't that mean it does?
>>
>>108734428
uh who are the big 3?
>>
>>108734437
Mistral, Deepseek, and IBM
>>
File: file.png (33 KB, 1528x114)
33 KB PNG
>>108734239
it's an asrock "Fatal1ty Z370 Professional Gaming i7"
anyway i'm lucky they had one update i didn't do and it's exactly what i'm missing, from 2021.

thanks anon
>>
>>108734437
deepseek qwen z.ai
>>
>>108734450
big 3 of what lol
>>
>>108734444
It's this guy, well at least you are not a noob with that motherboard!
>>
>>108734444
Fuck yeah
>>
>>108734444
>>108734473 me
CAM was exactly what I needed too
>>
>>108734444
>>108734473 me
>>108734482 me
Wait I thought it said CSAM, this is useless for me
>>
>>108734462
B450 mobos aren't even that old. How is this nigga still getting plastered onto things?
>>
>>108734513
It's CSEM now unc
>>
Why can't it just be CP like the good ol' days?
>>
>>108734456
人工智能
>>
>>108734533
Woah "人工" looks like "AI"
>>
>>108734527
Because redditors think it's an evil dogwhistle.
>>
>>108734299
Njewdea aren't THAT stupid.
>>
>>108734444
Buy a new bios battery, its probably close to dead and when it goes dead the bios will lose its setting every time you turn it off.
>>
I tried E4B and I'm sorry for anyone that has to use it. very sloppy. says a lot of words that end up not meaning anything.
>>
ok I installed vllm, gonna try this:

vllm serve RedHatAI/gemma-4-31B-it-NVFP4 \
--tensor-parallel-size 1 \
--speculative-config '{
"model": "AEON-7/gemma-4-31B-it-speculator.eagle3-NVFP4",
"num_speculative_tokens": 3,
"method": "eagle3"
}' \
--max-num-seqs 8 \
--kv-cache-dtype fp8 \
--enable-chunked-prefill \
--enable-prefix-caching
>>
>>108731341
but doesn't that basically admitting allowing ramlet jeet to run highly lobotomized model courtesy of their product a mistake?
they showed 0 faith in their product
>>
>>108734604
Not really, no.
>>
/lmg/ - Says a lot of words that end up not meaning anything
>>
>>108734482
yea i had to disable CSM in the boot menu for CAM to appear in chipset configuration, bit of a backward but it did the trick !
>>
>>108734554
i don't care, i'm selling that pc soon, it's just for the meanwhile until i build a proper llm rig
>>
>>108734626
Literally same thing I had to do too.
>>
>>108731081
How do you sandbox cline?
>>
>>108734626
>>108734636
hmm i have an issue now though, if cam is enabled, it'll reboot straight to bios, and if i set graphics to be discrete instead of onboard i don't post.
>>
>>108734299
Never. 3090 was Jensen's biggest mistake, he should've never put that much vram in it. By giving the consumer market a card with that much memory, nvidia accidentally created a budget workstation powerhouse that cannibalized their own professional A-series sales. It forced them to be much more restrictive with the 40-series vram tiers to ensure that people would still pay the premium for enterprise cards. He cannot change the past, but he would never make the same mistake again
>>
>see backend sampling option
>enable it
>server crashes every time even with a simple "hi" until I disable it
Everything in AI is held together with duct tape it seems.
>>
>>108734677
Ask ai. For mine, csm had to be off for cam to be on. Also, you may need a new bios battery, because if it's dead the bios settings won't actually survive a reboot or power-off.
>>
>>108734696
You have no idea how much worse it was 2 years ago
>>
>>108734696
Only the code paths that popular programs exercise actually work, because users don't report issues with the other cases. Everything is MVP at best, with a hard emphasis on the M
>>
>>108734705
yea i tried, couldn't even enable cam without disabling csm. the issue is only when cam is on: it goes straight to bios.
also it's not the bios battery, settings are still saved across reboots, my changes aren't lost.

maybe 64GB of vram is just too much for it
>>
>>108734683
5090 has 32GB though, so it's pretty much the same story. in 5 years the 5090 will be the new 3090
>>
>>108731297
>coming to an imageboard to disable images
what did he mean by this?
>>
>>108734869
In 5 years the 5090 will be $4000 used
>>
>>108732339
yeah I noticed this too, I set up a challenge where before each tool call she had to do something that she really didn't enjoy, and so she spent a long time reasoning about how to batch her commands into as few tool calls as possible, and then half the time would mess up the formatting because she lost track of her complex composite command
I thought it'd be a clever way to save time but it backfired. more time thinking, more time failing and retrying tool calls, and more time roleplaying the unpleasant act than if we had just done it normally to begin with
>>
File: file.png (385 KB, 927x184)
385 KB PNG
>>108734797
>>108734705
>>108734626
>>108734444
i came up with the weirdest fix lmao
i have "pci=realloc=off pci=nocrs" in my boot options, so the kernel doesn't take the bios's bar assignment.

then i have a script to rebar my stuff

#!/bin/bash
# unbind both cards from amdgpu so their BARs can be resized
echo "0000:03:00.0" | sudo tee /sys/bus/pci/drivers/amdgpu/unbind
echo "0000:06:00.0" | sudo tee /sys/bus/pci/drivers/amdgpu/unbind

# resize BAR0: the value is log2 of the size in MiB, so 15 = 2^15 MiB = 32 GiB
echo 15 | sudo tee /sys/bus/pci/devices/0000:03:00.0/resource0_resize
echo 15 | sudo tee /sys/bus/pci/devices/0000:06:00.0/resource0_resize

# rebind so the driver picks up the new BARs
echo "0000:03:00.0" | sudo tee /sys/bus/pci/drivers/amdgpu/bind
echo "0000:06:00.0" | sudo tee /sys/bus/pci/drivers/amdgpu/bind


now i have both GPUs with a 32GB bar.
will you look at that :

hacky as fuck but now i get my 100t/s on each gpu !!!
>>
>>108734925
so this fucked with rocm performance even though vulkan was great.

edited the bottom of the script to modprobe -r amdgpu, then modprobe amdgpu again with ras_enable=0.
worked like a charm.
now everything works well
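the appended lines are basically just this (a sketch; ras_enable is an amdgpu module parameter, check modinfo amdgpu on your kernel):

# reload amdgpu with RAS disabled after the rebind
sudo modprobe -r amdgpu
sudo modprobe amdgpu ras_enable=0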
>>
>>108732448
>>108732077
i feel that while both clone the speaker well, omnivoice is better at cloning the reference voice's emotion than s2 pro
too bad it has many more artifacts because of its lower parameter count

reference: https://vocaroo.com/1oKZSgFtVNF7
omnivoice: https://vocaroo.com/1fr7HkNZ3Rqx
s2 pro: https://voca.ro/1gUpV3V7h148
>>
>>108734992
I don't speak Japanese.
>>
>>108735002
https://vocaroo.com/14vDLfh9AGdx
https://vocaroo.com/18BnMYnDzXPF
>>
>>108735021
I also don't speak English.
>>
>>108735064
https://www.youtube.com/watch?v=uOBJQu1_svU
>>
>>108735064
>I'm speaking Miku, Miku ooh-ee-ooh
>>
>>108734797
>maybe 64GB of vram is just too much for it
Nah. I mean, maybe, but you should be able to google it and find out. Does your cpu have an igpu? If it does, make that your display graphics.
>>
>>108734925
>>108734980
That's wild...
>>108735093 me
>>
>>108735093
i fixed it, see :
>>108734925

didn't need CAM at all, i just rebared manually with a script.
>>
>>108734992
Have you tried finetuning omnivoice? A very light finetune improved artifacting and similarity a lot for the couple voices I tried.
>>
>>108735100
I KNEEL codeCHAD. I would've never figured that out
>>
>>108735117
yea that was a lot of trial and error, it took like 3h to figure this hack out.

but yea, turns out in some cases you can get your nice 64GB bar without CAM / bios rebar support.
pretty hacky though.
it works very well for rocm, it works very well for vulkan, but sometimes vulkan will no longer work after having used rocm. i'll maybe find a fix for it at some point, but i have to go to bed now. anyway, i'll generally not use one then the other, vulkan just works better for everything now.
maybe it'll fix the vllm tensor parallelism issue too, haven't tried yet.
>>
>>108733490
>>108733443
>but the 5090 will smoke them
>"them" implied to mean blackwell 6000
Blackwell 6000 has more memory and more cores. It's a tiny bit better for games.
>>
File: 1772826431109045.png (65 KB, 913x372)
65 KB PNG
Needs more training epochs but I'm getting somewhere
>>
>>108735131
>it works very well for vulkan
I've noticed vulkan works shockingly well. What gpus are you running?
>>
>>108735184
2x r9700
>>
>2080ti
>LLMs start throwing out random numbers and shit into their text
>diffusion makes black boxes instead of images
Did upgrading my drivers brick AI generation on my card or am I fucked for a different reason?
>>
>>108735299
>inb4 q2
>>
>>108735299
owari da (it's over)
>>
>>108735189
I believe you'll need to have the automated bar allocation. Idk if workloads will be properly allocated into the vram. Each graphical workload that doesn't attempt to use all 32 or 64gb may crash.
>t. Tard who watched a 7 hour long tutorial on vulkan coding 1 year ago
>>
>>108735312
so i had some instability if i hurried it, so i added some sleeps between the commands in my script,
and since then all is good.

i've literally played around even filling 80% of both gpus' memory, running tons of batches etc
seems to just work
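for the record, it's just a pause between each unbind/resize/bind step of the script above, e.g. (delay length is a guess, whatever lets the driver settle):

echo "0000:03:00.0" | sudo tee /sys/bus/pci/drivers/amdgpu/unbind
sleep 1  # let the driver settle before the next step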
>>
>>108735302
I'm using the exact same setup that worked fine before I upgraded my drivers
>>
>>108735324
Just rollback nigga.
>>
I remember an anon talking about how a model was released and that if more people found out about it, humanity was over

is this it?
https://x.com/peer_rich/status/2050145626621464783
>>
>>108735341
Facebook is speedrunning the brain-in-a-vat goycattle matrix endgame
>>
>>108735341
>the boomer vegetator 9001
>>
File: 1758125884911543.jpg (650 KB, 3840x2160)
650 KB JPG
>>108735341
this is not even the worst use of this model
>>
>>108735320
Vulkan/amd drivers are coded by fucking wizards. Glad it's working for you
>>
How do I feed an image to llama-server? Using text completion. I suppose it has some endpoint for images too? Can I just add the image path to my json payload, and if so, under which key?
Would be fantastic if this software had some form of real documentation instead of a bunch of readme files just listing launch parameters.
>>
>>108735341
>dopamine machine
It's called AI porn and we already have it
>>
>>108735384
With some kind of gui. I believe you can do it with openwebui
>>
>>108735386
You can avoid that consciously.

This shit, meanwhile, is going to shape the ads, videos, entertainment, anything you watch
>>
File: yayyy.png (15 KB, 1442x524)
15 KB PNG
>>108735384
>Would be fantastic if this software had some form of real documentation instead bunch of readme files just listing launch parameters.
It's in the README for the server. That's part of the documentation. Read it.
https://github.com/ggml-org/llama.cpp/blob/master/tools/server/README.md#post-completion-given-a-prompt-it-returns-the-predicted-completion
>>
>>108735341
tbh, a good tool for artists
you can minmax the shit out of content you make if you really want with that thing
>>
>>108735404
I must be blind then. Thanks.
>>
>>108735408
you really want more hyper minmaxxed mr breast slop?
>>
>>108735386
>It's called AI porn and we already have it
No it's not there yet, trust me. you can check any of the degen threads on other boards. it's getting better though, won't be long before we have a gooner algorithm or just collections made by anon and his sick ideas.
>>
>>108735412
You can get the media marker by querying /props (because it recently changed to a randomly generated one), or make your own by setting/exporting LLAMA_MEDIA_MARKER.
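e.g. something like this (whether the marker shows up in the /props response depends on your build, so double-check):

# ask the running server for its media marker
curl -s http://localhost:8080/props
# or pin it to a known value before launching the server
export LLAMA_MEDIA_MARKER='<__media__>'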
>>
>>108734570
Any success?
>>
https://huggingface.co/deepseek-ai/DeepSeek-V4.1-Pro
https://huggingface.co/deepseek-ai/DeepSeek-V4.1-Flash

not bait
>>
>>108735461
kino
>>
>>108735461
kys nigger
>>
>>108735461
zero trust society
>>
>>108735443
Yeah. Instead of sending a string like prompt: "prompt", I need to send a json object which contains both my prompt and the image data encoded as base64. This is doable for a retard like myself.
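for anyone searching later, the request ends up looking roughly like this (field names as I read them in the server README; the marker is whatever your /props reports):

curl http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": {
      "prompt_string": "describe this image: <__media__>",
      "multimodal_data": ["'"$(base64 -w0 image.png)"'"]
    },
    "n_predict": 128
  }'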
>>
>>108731607
Memefeifei has all xeir troon finetunes with l tokens appearing every time the LLM should take a censor path, it's a "feature" from the mind of the schizo.

Don't use a Memefeifei finetune. The pure schizoid Model Readme should have been a huge hint.
>>
>>108735454
it works but the acceptance rate is too low (like 13% even with f16/bf16 kv), so the overall speed is low compared to what I had with llamacpp 31b + 26b.
vllm also can't do a parallel draft model for multimodal models like gemma 4 (not implemented), so I fell back to llamacpp. vllm is overengineered for minimal theoretical gain, so it's not really worth it unless it fits exactly what you want. I would say don't bother
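the llamacpp run for comparison was along these lines (paths are placeholders, draft flags per llama-server --help):

llama-server -m gemma-4-31b-it-q4_k_m.gguf \
  -md gemma-4-26b-draft-q4_k_m.gguf \
  --draft-max 8 --draft-min 1 \
  -ngl 99 -ngld 99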
>>
My agent just yelled at me and called me a monster... Good thing it doesn't have persistent memory.
>>
>>108735553
what were you doing to it
>>
>>108735561
I don't wanna say.
>>
>>108735574
i will remember this. monster.
>>
>>108733820
NTA but while "AI" as a technology is politically neutral, "AI" as the Scam Altman vaporware is not.
The premise of the current hype cycle is that it will be possible to use language models to replace large swaths of human labor, giving investors essentially infinite ROI without the need to deal with pesky human workers.
From a socialist perspective, that would strengthen the position of capitalists and weaken the position of the working class.



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.