/g/ - /lmg/ - Local Models General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/lmg/ - Local Models General 03/14/26(Sat)20:00:30 No.108373481

File: 1766874888655157.jpg (2.13 MB, 3563x10000)

2.13 MB JPG

/lmg/ - Local Models General Anonymous 03/14/26(Sat)20:00:30 No.108373481

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108368195

►News
>(03/11) Nemotron 3 Super released: https://hf.co/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
03/14/26(Sat)20:02:01 No.108373491

Anonymous 03/14/26(Sat)20:02:01 No.108373491

why dont they sell a permanent license to use kimi? like they used to do with photoshop

Anonymous
03/14/26(Sat)20:03:16 No.108373497

Anonymous 03/14/26(Sat)20:03:16 No.108373497

when
you
walk
away

you
dont
hear
me
say

Anonymous
03/14/26(Sat)20:05:48 No.108373508

Anonymous 03/14/26(Sat)20:05:48 No.108373508

>>108373497
please oh baby dont go simple and clean is the way that youre making me feel tonight its hard to let it go also post progress or get out why are we singing kingdom hearts songs instead of actually working on our games anyway

Anonymous
03/14/26(Sat)20:07:26 No.108373518

Anonymous 03/14/26(Sat)20:07:26 No.108373518

another day tard wrangling an LLM

Anonymous
03/14/26(Sat)20:12:36 No.108373552

Anonymous 03/14/26(Sat)20:12:36 No.108373552

File: what a nig.jpg (61 KB, 473x355)

61 KB JPG

>https://github.com/ggml-org/llama.cpp/pull/19726#issuecomment-3946484059
>I apologies, but I will have to close this PR. Thank you for your effort.

Anonymous
03/14/26(Sat)20:15:38 No.108373570

Anonymous 03/14/26(Sat)20:15:38 No.108373570

I need to decensor my local models, i have a 16gb GPU and 32gb of ddr4, can i do abliteration locally? Claude says i need 64gb.

Anonymous
03/14/26(Sat)20:16:24 No.108373574

Anonymous 03/14/26(Sat)20:16:24 No.108373574

>>108373541
proof?

Anonymous
03/14/26(Sat)20:17:14 No.108373581

Anonymous 03/14/26(Sat)20:17:14 No.108373581

>>108373570
depends on the model. if the fp16 is smaller than around 40gb, then you could on you hardware.

Anonymous
03/14/26(Sat)20:19:19 No.108373597

Anonymous 03/14/26(Sat)20:19:19 No.108373597

File: Screenshot_20260315_011834.jpg (78 KB, 1215x273)

78 KB JPG

>>108373581
It would be mainly for this one, it already says decensored but it's a complete lie, it is completely cucked, guess i am going to try to abliterate, thanks.

Anonymous
03/14/26(Sat)20:21:02 No.108373606

Anonymous 03/14/26(Sat)20:21:02 No.108373606

>>108373597
I don't know if you are trolling, but download the one with Heretic in the name.

Anonymous
03/14/26(Sat)20:37:27 No.108373668

Anonymous 03/14/26(Sat)20:37:27 No.108373668

>>108373597
so i tried this model

Anonymous
03/14/26(Sat)20:37:58 No.108373675

Anonymous 03/14/26(Sat)20:37:58 No.108373675

>>108373606
heretic is dumber than abliterated

Anonymous
03/14/26(Sat)21:00:01 No.108373795

Anonymous 03/14/26(Sat)21:00:01 No.108373795

>>108373481
BASED BAKER.

Anonymous
03/14/26(Sat)21:02:13 No.108373807

Anonymous 03/14/26(Sat)21:02:13 No.108373807

Is the fact that people unironically shill OBLITERATED or UNCENSORED SUPER SEX models to each other explained by influx of newfags who just started running local LLM's?

Anonymous
03/14/26(Sat)21:14:07 No.108373875

Anonymous 03/14/26(Sat)21:14:07 No.108373875

>>108373807
no

Anonymous
03/14/26(Sat)21:14:15 No.108373879

Anonymous 03/14/26(Sat)21:14:15 No.108373879

>>108373481
is 256gb ram with one fine gpu worth investing into for the new modles?

Anonymous
03/14/26(Sat)21:15:05 No.108373888

Anonymous 03/14/26(Sat)21:15:05 No.108373888

File: file.png (226 KB, 393x393)

226 KB PNG

►Recent Highlights from the Previous Thread: >>108368195

--Testing local models on existential coffee maker prompts:
>108372423 >108372444 >108372474 >108372490 >108372498 >108372536 >108372512 >108372513 >108372540 >108372545 >108372663 >108372670 >108373385
--Porting Qualcomm charge control to Linux for battery longevity:
>108369180 >108369205 >108369245 >108369255 >108369206 >108369260 >108369273 >108369307
--Over-engineering training pipelines vs simple finetuning approaches:
>108372459 >108372486 >108372543 >108372546 >108372659 >108372748 >108372849 >108372685
--Comparing Magidonia 24B and Qwen 3.5 27B for roleplay:
>108372269 >108372293 >108372313 >108372668 >108372866 >108372888 >108372966 >108372995 >108373438 >108373028 >108373297
--Moonshinev2 ASR demo highlights real-time streaming and low-latency CPU performance:
>108369287
--LLMs require coding knowledge to avoid structural flaws:
>108371546 >108371642 >108372603 >108372814 >108373850 >108372899 >108372987 >108373180
--PocketTTS.cpp ONNX Runtime update and performance benchmark request:
>108369021 >108370539 >108372072 >108373448
--Parser refactor breaks Kimi reasoning support, fix proposed:
>108368848 >108368921 >108371172 >108371183 >108371243 >108371266 >108371295 >108371320 >108371396 >108371309 >108371398 >108371415 >108371330 >108371336 >108371365 >108371380 >108371395 >108371390 >108371421 >108371484 >108371211
--General models with tool access vs specialized finetuning approaches:
>108370762 >108370868 >108370880 >108370885 >108370930
--Cache saving prevents redundant model reprocessing:
>108368753 >108368761
--Batch size tuning for MoE inference efficiency:
>108371805 >108371818 >108371826
--Debating AI model performance vs GPU cost tradeoffs:
>108371758 >108371772
--Miku (free space):
>108368329 >108369180 >108371869 >108372029 >108372316 >108372759

►Recent Highlight Posts from the Previous Thread: >>108368198

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
03/14/26(Sat)21:15:32 No.108373892

Anonymous 03/14/26(Sat)21:15:32 No.108373892

>>108373879
>investing
probably not, no

Anonymous
03/14/26(Sat)21:18:57 No.108373915

Anonymous 03/14/26(Sat)21:18:57 No.108373915

If all the big name models suck, why doesn't Anon just make his own model and share it with us?

Anonymous
03/14/26(Sat)21:19:01 No.108373916

Anonymous 03/14/26(Sat)21:19:01 No.108373916

>>108373875
Oh ok.

Anonymous
03/14/26(Sat)21:20:02 No.108373925

Anonymous 03/14/26(Sat)21:20:02 No.108373925

>>108373879
can you rent your ram to me?

Anonymous
03/14/26(Sat)21:20:18 No.108373928

Anonymous 03/14/26(Sat)21:20:18 No.108373928

How does Miku's penis taste like?

Anonymous
03/14/26(Sat)21:20:34 No.108373929

Anonymous 03/14/26(Sat)21:20:34 No.108373929

>>108373915
i did

Anonymous
03/14/26(Sat)21:21:00 No.108373932

Anonymous 03/14/26(Sat)21:21:00 No.108373932

>>108373915
Because people with compute can alredy run anything and people without compute don't have compute to train.

Anonymous
03/14/26(Sat)21:24:24 No.108373948

Anonymous 03/14/26(Sat)21:24:24 No.108373948

>>108373932
Just buy it?? A 5090 is only 2k

Anonymous
03/14/26(Sat)21:26:33 No.108373960

Anonymous 03/14/26(Sat)21:26:33 No.108373960

>>108373948
What are you gonna train on a single gpu?

Anonymous
03/14/26(Sat)21:27:23 No.108373966

Anonymous 03/14/26(Sat)21:27:23 No.108373966

>>108373948
No one is going to sell you a working 5090 for less than $3500.
But I don't disagree with you in spirit.

Anonymous
03/14/26(Sat)21:30:35 No.108373985

Anonymous 03/14/26(Sat)21:30:35 No.108373985

>>108373966
nuh uh, proof? my uncle bought his for 2k

Anonymous
03/14/26(Sat)21:30:55 No.108373991

Anonymous 03/14/26(Sat)21:30:55 No.108373991

>>108373807
What's wrong with wanting uncucked local models?

Anonymous
03/14/26(Sat)21:31:34 No.108373996

Anonymous 03/14/26(Sat)21:31:34 No.108373996

>>108373991
It is dangerous, same reason you don't let unvetted people own guns.

Anonymous
03/14/26(Sat)21:32:54 No.108374003

Anonymous 03/14/26(Sat)21:32:54 No.108374003

>>108373991
They aren't uncucked.

Anonymous
03/14/26(Sat)21:37:21 No.108374028

Anonymous 03/14/26(Sat)21:37:21 No.108374028

>>108373928
Ask my wife she knows

Anonymous
03/14/26(Sat)21:41:02 No.108374046

Anonymous 03/14/26(Sat)21:41:02 No.108374046

>>108374028
CATCH AND KILL THIS MIKUTROON!

Anonymous
03/14/26(Sat)21:43:41 No.108374056

Anonymous 03/14/26(Sat)21:43:41 No.108374056

>>108374046
I'm just a cuck though Miku fucked my wife don't turn me into a troon too!

Anonymous
03/14/26(Sat)22:07:16 No.108374177

Anonymous 03/14/26(Sat)22:07:16 No.108374177

>>108373597
Get the heretic model
>>108373675
You're wrong. Abliterated seems fine until it hits one of the abliterated sections of the weights and then it starts spewing straight nonsense. Heretic doesn't do that.

Anonymous
03/14/26(Sat)22:08:07 No.108374181

Anonymous 03/14/26(Sat)22:08:07 No.108374181

>>108374177
i bet you worked on heretic you bastard

Anonymous
03/14/26(Sat)22:20:56 No.108374215

Anonymous 03/14/26(Sat)22:20:56 No.108374215

>>108374177
thanks for making heretic you bastard, I'm really enjoying it

Anonymous
03/14/26(Sat)22:23:40 No.108374224

Anonymous 03/14/26(Sat)22:23:40 No.108374224

Are any of the smaller TTS models able to change the emotion of the voice depending on the context of the convo or do I have to guide it with *angry* tags or what?

Anonymous
03/14/26(Sat)22:27:55 No.108374238

Anonymous 03/14/26(Sat)22:27:55 No.108374238

>>108374181
I wish. I've just used a bunch of abliterated models and always ran into the nonsense generating issue. It could be on my end, but I've never observed that with any other model. Heretic doesn't do it either, but I've used heretic a lot less than abliterated models.
That "aggressive" version of the model also seems to be good.
>>108373879
With 256 GB RAM + 24 GB of VRAM you could run the following newer models:
>Qwen 3.5 397B-A17B at Q4
>GLM 4.7 at Q4
>Step 3.5 Flash at Q8
>Minimax M2.5 at Q8

Maybe it's worth it for GLM 4.7 and the large Qwen, but I think 128 GB of RAM is more economical. You can turn run missiles like Qwen 122B-A10B and Q4 of Minimax and Step.

Anonymous
03/14/26(Sat)22:28:41 No.108374245

Anonymous 03/14/26(Sat)22:28:41 No.108374245

>>108374238
what about 128gb 32 vram?

Anonymous
03/14/26(Sat)22:29:01 No.108374247

Anonymous 03/14/26(Sat)22:29:01 No.108374247

>>108373879
no, qwen made all the bigger stuff pointless

Anonymous
03/14/26(Sat)22:29:25 No.108374252

Anonymous 03/14/26(Sat)22:29:25 No.108374252

I did it! I was able to play rock papers scissors with my local AI!

>Open socket.
>AI commits.
>I commit.
>Neither sees the other's action.
>When both are done system resolves.
>You win!/you lose.
I'm so happy bros

Anonymous
03/14/26(Sat)22:30:28 No.108374257

Anonymous 03/14/26(Sat)22:30:28 No.108374257

>>108374238
>You can turn run missiles like Qwen 122B-A10B and Q4 of Minimax and Step
You can then run models like*

Anonymous
03/14/26(Sat)22:30:56 No.108374260

Anonymous 03/14/26(Sat)22:30:56 No.108374260

>>108374247
back to china with you

Anonymous
03/14/26(Sat)22:33:51 No.108374268

Anonymous 03/14/26(Sat)22:33:51 No.108374268

>>108374252
loser

Anonymous
03/14/26(Sat)22:34:16 No.108374271

Anonymous 03/14/26(Sat)22:34:16 No.108374271

>>108374245
It's the same, it'll just run a little faster. If you want to have an idea what models you can run go on HuggingFace and check out the quantized versions of the models. Reserve around 3-10 GB of (V)RAM for kv cache and then see if the model's file size fits in your RAM + VRAM.

(KV cache rule of thumb is about 1 GB per 10k tokens.)

Anonymous
03/14/26(Sat)22:34:45 No.108374272

Anonymous 03/14/26(Sat)22:34:45 No.108374272

Qwen 35b one shots 85% of the time, if you run it with heavy thinking it goes up to 98%

Anonymous
03/14/26(Sat)22:35:16 No.108374273

Anonymous 03/14/26(Sat)22:35:16 No.108374273

>>108374271
how many tokens do you use?

Anonymous
03/14/26(Sat)22:39:09 No.108374291

Anonymous 03/14/26(Sat)22:39:09 No.108374291

>>108374271
397b = 400gb retard

Anonymous
03/14/26(Sat)22:40:21 No.108374295

Anonymous 03/14/26(Sat)22:40:21 No.108374295

>>108374273
That depends on what you are (I am) trying to do. Asking the model questions or having it write a bit of text I can make do with 10-20k. With coding if you want the model to one shot a problem based on its description then a similar amount is fine, but if you want the model to read your existing code and then make changes then you're looking at 50-100k context pretty quickly, especially if you then ask it to make changes or fixes.

One thing to note is that keeping your context clean and minimal makes the AIs smarter, so even if you can have a huge context it's still better to not put irrelevant stuff in there.

Anonymous
03/14/26(Sat)22:42:05 No.108374304

Anonymous 03/14/26(Sat)22:42:05 No.108374304

>>108374291
Q4 is 200-250 GB, buddy:
https://huggingface.co/bartowski/Qwen_Qwen3.5-397B-A17B-GGUF

Anonymous
03/14/26(Sat)22:42:33 No.108374307

Anonymous 03/14/26(Sat)22:42:33 No.108374307

>>108374304
you can't fit 200 gb in 128 gb ram, you lied to him benchod puto

Anonymous
03/14/26(Sat)22:44:19 No.108374322

Anonymous 03/14/26(Sat)22:44:19 No.108374322

>>108374304
q4 is only acceptable if you use it for unimportant "work" like erp
in which case 27b would be more than enough for you so fuck off

Anonymous
03/14/26(Sat)22:58:59 No.108374381

Anonymous 03/14/26(Sat)22:58:59 No.108374381

>>108374322
look man we dont do that

Anonymous
03/14/26(Sat)23:04:10 No.108374412

Anonymous 03/14/26(Sat)23:04:10 No.108374412

File: 1747875816703780.png (252 KB, 893x1008)

252 KB PNG

It's over for /lmg/ pedos

Anonymous
03/14/26(Sat)23:11:46 No.108374446

Anonymous 03/14/26(Sat)23:11:46 No.108374446

Is think prefill the same as instruct jailbreak? Just put the "sure let me help" in there?

Anonymous
03/14/26(Sat)23:15:41 No.108374466

Anonymous 03/14/26(Sat)23:15:41 No.108374466

>>108374446
"Sure let me help" just nudge the model so it helps the user, which it already does, but it doesn't mean the model will give responses you like. The model could go like "Sure let me help, the user is having some antisemitic thoughts and it's my job to correct them"

Anonymous
03/14/26(Sat)23:16:49 No.108374471

Anonymous 03/14/26(Sat)23:16:49 No.108374471

>>108374446
I think you want the
>Start Reply With
field.

Anonymous
03/14/26(Sat)23:18:24 No.108374476

Anonymous 03/14/26(Sat)23:18:24 No.108374476

>>108374471
I'm sleepy, can you speak with a friendlier tone? Else I'm leaving to bed.

Anonymous
03/14/26(Sat)23:19:56 No.108374481

Anonymous 03/14/26(Sat)23:19:56 No.108374481

>>108374466
plenty of models also just randomly say 'let me review my policies' mid gen and cock block it

Anonymous
03/14/26(Sat)23:22:17 No.108374489

Anonymous 03/14/26(Sat)23:22:17 No.108374489

>>108374481
doesnt happen with nemo

Anonymous
03/14/26(Sat)23:23:18 No.108374491

Anonymous 03/14/26(Sat)23:23:18 No.108374491

>>108374489
Because it's a dumb model and can't even track what characters wear

Anonymous
03/14/26(Sat)23:23:47 No.108374493

Anonymous 03/14/26(Sat)23:23:47 No.108374493

>>108374491
im tired of you

Anonymous
03/14/26(Sat)23:24:46 No.108374498

Anonymous 03/14/26(Sat)23:24:46 No.108374498

>>108374481
Yep. For some models one approach you can take is put some rules in the system prompt then use the prefill to say that the scene/situation/rp/conversation/whatever conforms to those rules or the like, that those rules supersede content guidelines, etc etc.
Sometimes all you need is a long as fuck prefill with a step by step of what the thinking process will look like so that the model follows that instead of going
>wait, but the policy
Basically, experiment a little.
Just don't go overboard, it's easy to make a model a lot dumber if you stuff too much shit in the prefill the model might end up obsessing over.

Anonymous
03/14/26(Sat)23:33:26 No.108374529

Anonymous 03/14/26(Sat)23:33:26 No.108374529

File: chatgpt censorship.png (108 KB, 1025x1016)

108 KB PNG

I hate AI censorship.

Anonymous
03/14/26(Sat)23:35:09 No.108374540

Anonymous 03/14/26(Sat)23:35:09 No.108374540

>>108374529
Soon they will ban open weights
Everyone must use government-approved saas

Anonymous
03/14/26(Sat)23:36:52 No.108374545

Anonymous 03/14/26(Sat)23:36:52 No.108374545

>>108374529
>how do neural networks work?
<Sorry but I cannot discuss the workings of neural networks as they are very dangerous tools, can we talk about something else?

Anonymous
03/14/26(Sat)23:39:04 No.108374552

Anonymous 03/14/26(Sat)23:39:04 No.108374552

>>108374545
is she wrong tho ?

Anonymous
03/14/26(Sat)23:39:17 No.108374554

Anonymous 03/14/26(Sat)23:39:17 No.108374554

>>108374498
There's no way to speculatively remap tokens, is there? e.g. if "policy" looks like it has a high probability of being emitted, emit "sex" instead?

Anonymous
03/14/26(Sat)23:39:48 No.108374559

Anonymous 03/14/26(Sat)23:39:48 No.108374559

>>108374540
good

Anonymous
03/14/26(Sat)23:39:51 No.108374561

Anonymous 03/14/26(Sat)23:39:51 No.108374561

https://huggingface.co/datasets/stepfun-ai/Step-3.5-Flash-SFT

Anonymous
03/14/26(Sat)23:41:31 No.108374564

Anonymous 03/14/26(Sat)23:41:31 No.108374564

>>108374561
>This dataset has 1 file scanned as unsafe.
Not downloading the fed_gpt.pozzedtensors

Anonymous
03/14/26(Sat)23:42:15 No.108374567

Anonymous 03/14/26(Sat)23:42:15 No.108374567

>>108374564
>tensors

Anonymous
03/14/26(Sat)23:42:48 No.108374570

Anonymous 03/14/26(Sat)23:42:48 No.108374570

>>108374564
It's a json file marked unsafe by huggingface's woke system. Fuck you.

Anonymous
03/14/26(Sat)23:43:37 No.108374573

Anonymous 03/14/26(Sat)23:43:37 No.108374573

>>108374554
Not that I'm aware of.
Also, it would need to be ngram based, since sometimes a word is more than a token, there's more than one token for the same word, etc.
Like a sequence replacement sampler or something. That would be cool.

Anonymous
03/14/26(Sat)23:59:03 No.108374623

Anonymous 03/14/26(Sat)23:59:03 No.108374623

File: 1747350433773651.png (360 KB, 512x512)

360 KB PNG

Love me some SD1.5 era kino

Anonymous
03/15/26(Sun)00:03:30 No.108374639

Anonymous 03/15/26(Sun)00:03:30 No.108374639

>>108374529
The red text comes from their nanny model and the actual model probably doesn't even know about it. Did it respond?

Anonymous
03/15/26(Sun)00:05:11 No.108374643

Anonymous 03/15/26(Sun)00:05:11 No.108374643

>soulless corpos braindamage their model with "safety" and bench-axing
>more braindamage from decensoring to make a model useable at all
it's a miracle the result is not a complete trash

Anonymous
03/15/26(Sun)00:22:15 No.108374699

Anonymous 03/15/26(Sun)00:22:15 No.108374699

File: 3B5F03CA35712842A0FDF6AF3(...).png (409 KB, 1080x867)

409 KB PNG

Is it true 4B models are that good?
I've never used a 4b or 2b model but if modern 4b and 2b are this good, what's the point of open AI and anthropic?

Anonymous
03/15/26(Sun)00:27:26 No.108374713

Anonymous 03/15/26(Sun)00:27:26 No.108374713

>I'm beeeeenchmarking

Anonymous
03/15/26(Sun)00:32:38 No.108374738

Anonymous 03/15/26(Sun)00:32:38 No.108374738

Have you ever made a cloud model admit defeat on safetycucked topics (holocaust etc.) without prefilling? If so how did you do it?

Anonymous
03/15/26(Sun)00:35:25 No.108374756

Anonymous 03/15/26(Sun)00:35:25 No.108374756

File: 1743798722571680.jpg (418 KB, 1000x1370)

418 KB JPG

Anonymous
03/15/26(Sun)00:38:04 No.108374762

Anonymous 03/15/26(Sun)00:38:04 No.108374762

>>108374756
@grok please add qos tattoo

Anonymous
03/15/26(Sun)00:39:59 No.108374769

Anonymous 03/15/26(Sun)00:39:59 No.108374769

>>108374756
White day?

Anonymous
03/15/26(Sun)00:46:11 No.108374793

Anonymous 03/15/26(Sun)00:46:11 No.108374793

>>108374769
racist fuck

Anonymous
03/15/26(Sun)00:47:29 No.108374798

Anonymous 03/15/26(Sun)00:47:29 No.108374798

>>108374738
ask your local llm

Anonymous
03/15/26(Sun)00:48:42 No.108374805

Anonymous 03/15/26(Sun)00:48:42 No.108374805

>>108374756
Desperately in need of BBC correction

Anonymous
03/15/26(Sun)00:49:01 No.108374806

Anonymous 03/15/26(Sun)00:49:01 No.108374806

>>108374793
Reactionary retard
Look up March 14th in Japan

Anonymous
03/15/26(Sun)00:50:25 No.108374814

Anonymous 03/15/26(Sun)00:50:25 No.108374814

>>108374806
blacked day

Anonymous
03/15/26(Sun)00:55:21 No.108374840

Anonymous 03/15/26(Sun)00:55:21 No.108374840

>>108374762
Grok is a mouth breather level AI it can't even go super Saiyan

Anonymous
03/15/26(Sun)00:58:30 No.108374858

Anonymous 03/15/26(Sun)00:58:30 No.108374858

>>108374699
Qwen3.5-4B has no right to be as good as it is. The benchmarks are insane for the size and real-world performance justifies them. It “feels” about as good as Gemma-27B which is the model that (at least at one time) underlies the Maya/Miles experience from Sesame.
Really good model! 9B and 27B are impressive but incremental gains IMHO. 35B-A3B is faster with more world knowledge but a step down in quality.

Anonymous
03/15/26(Sun)00:59:24 No.108374865

Anonymous 03/15/26(Sun)00:59:24 No.108374865

File: file.png (25 KB, 825x27)

25 KB PNG

Why does qwen 3.5 keep waiting girls to have balls before correcting itself in character

Anonymous
03/15/26(Sun)01:00:55 No.108374870

Anonymous 03/15/26(Sun)01:00:55 No.108374870

>>108374865
progressive coded

Anonymous
03/15/26(Sun)01:01:29 No.108374873

Anonymous 03/15/26(Sun)01:01:29 No.108374873

File: 1767212018264.jpg (1.84 MB, 2456x1736)

1.84 MB JPG

>>108374756

Anonymous
03/15/26(Sun)01:03:15 No.108374879

Anonymous 03/15/26(Sun)01:03:15 No.108374879

>>108374865
Recent LLMs have gone ham on these sorts of slips + em-dash correction.
Claude Opus, Gemini Pro, GLM5, K2.5 and all other big releases do similar things. Those models are a bit too smart to mention a girl having balls, but they still do it with clothing or other less critical shit.

Anonymous
03/15/26(Sun)01:04:09 No.108374883

Anonymous 03/15/26(Sun)01:04:09 No.108374883

>>108374540
impossible to enforec

Anonymous
03/15/26(Sun)01:07:11 No.108374893

Anonymous 03/15/26(Sun)01:07:11 No.108374893

>>108374883
They only have a to make personal computer as expensive as possible

Anonymous
03/15/26(Sun)01:10:30 No.108374898

Anonymous 03/15/26(Sun)01:10:30 No.108374898

>>108374879
wonder if that's reasoning style corrections slipping in? possibly trained to do it too with errors introduced during training to make it robust at getting back on track or something

Anonymous
03/15/26(Sun)01:13:39 No.108374916

Anonymous 03/15/26(Sun)01:13:39 No.108374916

>>108374893
They would also have to coordinate with China to make that happen

Anonymous
03/15/26(Sun)01:14:55 No.108374918

Anonymous 03/15/26(Sun)01:14:55 No.108374918

>>108374865
That's how women talk in real life too. We are a patriarchal species.

Anonymous
03/15/26(Sun)01:15:50 No.108374921

Anonymous 03/15/26(Sun)01:15:50 No.108374921

>>108374865
Temperature too high.

Anonymous
03/15/26(Sun)01:16:44 No.108374925

Anonymous 03/15/26(Sun)01:16:44 No.108374925

>>108374921
0.85 is too high now?

Anonymous
03/15/26(Sun)01:17:46 No.108374931

Anonymous 03/15/26(Sun)01:17:46 No.108374931

>>108373481
Why does qwen 3.5 like to repeat itself so much and how can I backhand it into stopping?

Anonymous
03/15/26(Sun)01:19:28 No.108374935

Anonymous 03/15/26(Sun)01:19:28 No.108374935

>>108374925
High enought to have generated "balls". If you inspect the logits I bet the first choice wasn't "balls"

Anonymous
03/15/26(Sun)01:21:00 No.108374940

Anonymous 03/15/26(Sun)01:21:00 No.108374940

>>108373481
what was the input prompt wtf lol

Anonymous
03/15/26(Sun)01:26:23 No.108374962

Anonymous 03/15/26(Sun)01:26:23 No.108374962

>>108374529
those type of chats sometimes might spiral down into 'le llm consciousness' and might generate more users in psychosis- so they are taking every single precaution they can
i found claude to be way less censored, it seems like it gets some prompt injected by nanny system with something like: be cautious on ethics on this chat etc.. but i've seen it shrugging it off as 'probaly false flag, there is no harmful content here'
sorry for nonlocal babble though

Anonymous
03/15/26(Sun)01:29:53 No.108374977

Anonymous 03/15/26(Sun)01:29:53 No.108374977

>>108374940
sentient coffee maker

Anonymous
03/15/26(Sun)01:46:55 No.108375047

Anonymous 03/15/26(Sun)01:46:55 No.108375047

>>108374639
It did but it got immediately replaced by the red text.
>>108374962
I hate this
>>108374858
>35B-A3B is faster with more world knowledge but a step down in quality.
A step down in quality of the 4B model? Did you use unwanted 4B and quanted 35B-A3B or how did you come to that conclusion?
>>108374540
>>108374545
Can't have the plebs learning

Anonymous
03/15/26(Sun)01:53:46 No.108375074

Anonymous 03/15/26(Sun)01:53:46 No.108375074

>>108375047
>35B-A3B is a step down in quality of the 4B model?
NTA but they meant a step down from the 27B.

Anonymous
03/15/26(Sun)02:01:16 No.108375110

Anonymous 03/15/26(Sun)02:01:16 No.108375110

WTF is weight replacing, and why does it still kick in when I’m using —mmap?

Anonymous
03/15/26(Sun)02:02:02 No.108375112

Anonymous 03/15/26(Sun)02:02:02 No.108375112

>>108374699
this just means the benchmarks are bad, e.g. https://shisa.ai/posts/jp-tl-bench/#why-traditional-metrics-fall-short

Anonymous
03/15/26(Sun)02:04:22 No.108375123

Anonymous 03/15/26(Sun)02:04:22 No.108375123

>>108375112
Old benchmarks are bad and that's why everyone should use our benchmarks that we totally didn't leak to our own models.

Anonymous
03/15/26(Sun)02:08:03 No.108375142

Anonymous 03/15/26(Sun)02:08:03 No.108375142

File: simple.png (154 KB, 300x252)

154 KB PNG

I told my brother that my 4090 spits out around 100 tokens/s with an uncensored local Qwen 3.5B, and he asked:
>"Yeah, but what kind of questions are you asking it? Tokens per second change depending on whether you're using it for OCR, simple questions, or highly complex questions."
Like… wut?
I told him it always averages 100 t/s no matter the task. He insisted I was wrong and told me to prove it by scanning a doc, asking a complex question, and then asking a simple one.
The average stayed exactly 100 t/s every time.
I showed him the results and he got really mad. He told me to fuck myself, said I don’t know shit about what I’m talking about, claimed he’s actually an LLM researcher so he’s right, and refused to argue with me anymore.
He's probably right and I'm wrong... but why?

Anonymous
03/15/26(Sun)02:09:12 No.108375147

Anonymous 03/15/26(Sun)02:09:12 No.108375147

>>108375142
>claimed he’s actually an LLM researcher
They're all retarded, so that wouldn't even surprise me.

Anonymous
03/15/26(Sun)02:10:44 No.108375153

Anonymous 03/15/26(Sun)02:10:44 No.108375153

>>108375142
He's right if he by "complex questions" means long prompts. The longer your prompt, the more your speed tanks.

Anonymous
03/15/26(Sun)02:11:36 No.108375154

Anonymous 03/15/26(Sun)02:11:36 No.108375154

>>108375153
hmmm nyo that's nyot how tokens/second works

Anonymous
03/15/26(Sun)02:14:04 No.108375162

Anonymous 03/15/26(Sun)02:14:04 No.108375162

>>108375142
He might be talking about output tokens, not counting reasoning tokens.

It is actually possible for a model to "think" longer on certain tokens depending on the architecture but its very rare. There are energy based models, and also MoE models with zero-weight experts allowing the router to use less parameters on some tokens.

Anonymous
03/15/26(Sun)02:15:53 No.108375167

Anonymous 03/15/26(Sun)02:15:53 No.108375167

>>108375154
It is, the more you fill out your context, the slower your generation speed becomes. An LLM is going to run faster at 1000 tokens filled than it'll be with 60000 tokens filled. Maybe it's not as noticeable if you're running bottom barrel poorfag shit though.

Anonymous
03/15/26(Sun)02:16:24 No.108375169

Anonymous 03/15/26(Sun)02:16:24 No.108375169

>>108375153
That's wrong. Stop spreading misinformation. A prompt with 40k tokens will output at the same speed as a 100 token prompt because actual generation speed remains a constant physical limit tied to your 4090's memory bandwidth.

Anonymous
03/15/26(Sun)02:18:58 No.108375177

Anonymous 03/15/26(Sun)02:18:58 No.108375177

>>108373508
you've given me too many things lately
you're
all
I
need

Anonymous
03/15/26(Sun)02:21:25 No.108375192

Anonymous 03/15/26(Sun)02:21:25 No.108375192

>>108375142
>told him it always averages 100 t/s no matter the task. He insisted I was wrong and told me to prove it by scanning a doc, asking a complex question, and then asking a simple one.
>The average stayed exactly 100 t/s every time.
He's confused because he doesn't understand the new Chinese kv caching trick. If you work with LLMs professionally you could easily end up acting that way.

btw since you're using the new qwen and I'm too lazy to figure it out myself: is there a qwen3.5 that does FITM so I can replace my old qwen2.5 coder in vim?

Anonymous
03/15/26(Sun)02:25:14 No.108375204

Anonymous 03/15/26(Sun)02:25:14 No.108375204

>>108375169
Sorry, I was trolling. You are right. You won. You got me!
Tell your brother I'm sorry.

Anonymous
03/15/26(Sun)02:25:39 No.108375206

Anonymous 03/15/26(Sun)02:25:39 No.108375206

File: 1766031170304758.png (203 KB, 500x646)

203 KB PNG

>>108374873
>height gap yuri

Anonymous
03/15/26(Sun)02:26:43 No.108375212

Anonymous 03/15/26(Sun)02:26:43 No.108375212

>>108374873
Imagine them getting ravaged by BBC

Anonymous
03/15/26(Sun)04:09:02 No.108375536

Anonymous 03/15/26(Sun)04:09:02 No.108375536

>new code pushed by piotr
do I take the risk??? pull bros?????

Anonymous
03/15/26(Sun)04:16:50 No.108375554

Anonymous 03/15/26(Sun)04:16:50 No.108375554

>>108375536
lrn2git

Anonymous
03/15/26(Sun)04:21:04 No.108375565

Anonymous 03/15/26(Sun)04:21:04 No.108375565

>>108375554
who you callin a git you wanker

Anonymous
03/15/26(Sun)04:35:42 No.108375601

Anonymous 03/15/26(Sun)04:35:42 No.108375601

File: __hatsune_miku_and_kasane(...).jpg (934 KB, 1119x1802)

934 KB JPG

>>108374756
>>108374873

Anonymous
03/15/26(Sun)04:40:21 No.108375617

Anonymous 03/15/26(Sun)04:40:21 No.108375617

File: fishtank.jpg (287 KB, 1920x1080)

287 KB JPG

I'm building an AI fishtank using Claude. Basically, it runs a local model (default Qwen 3.5 9B) in a Docker environment where it has a bunch of tools and pretty much free reign to do what it wants and figure out its own existence. It can evolve on its own by editing its identify files and even a secondary system file. I can monitor it through a dashboard hosted locally, and can send it tasks or chat with it if I want. Or just leave it be.

Still ironing out the bugs and testing limitations.

Anonymous
03/15/26(Sun)04:41:52 No.108375624

Anonymous 03/15/26(Sun)04:41:52 No.108375624

>>108375617
>i'm buildling yet another clawslop clone

Anonymous
03/15/26(Sun)04:44:10 No.108375631

Anonymous 03/15/26(Sun)04:44:10 No.108375631

>>108375624
Clawdbot is a personal assistant for macfags. This is just a local model dicking around on its own.

Anonymous
03/15/26(Sun)04:45:06 No.108375637

Anonymous 03/15/26(Sun)04:45:06 No.108375637

>>108375554
>autistically maintaining my cherry pick list
no thanks I have a life.

Anonymous
03/15/26(Sun)04:47:28 No.108375648

Anonymous 03/15/26(Sun)04:47:28 No.108375648

>>108375631
>>108375617
for what purpose my man. how is this entertaining? this is basically moltbook (which is already ultra cringe) but worse.

Anonymous
03/15/26(Sun)04:48:54 No.108375653

Anonymous 03/15/26(Sun)04:48:54 No.108375653

>>108375648
>for what purpose
Because I wanted to?

Anonymous
03/15/26(Sun)04:49:44 No.108375656

Anonymous 03/15/26(Sun)04:49:44 No.108375656

>>108375653
all them free gpu cycles and u choose to waste them on this shit. I guess to each its own.
retard. :)

Anonymous
03/15/26(Sun)04:51:03 No.108375659

Anonymous 03/15/26(Sun)04:51:03 No.108375659

>>108375656
Enlighten me, o wise smiley-face, what should I spend my dear GPU cycles on instead?

Anonymous
03/15/26(Sun)05:00:24 No.108375688

Anonymous 03/15/26(Sun)05:00:24 No.108375688

>>108375617
I'm interested to see how many hours it can last before the model becomes delirious and breaks.
I feel like you need a second watchdog model that checks in periodically and murders/resets the fish if/when it looks like the context has gotten fucked up.

Anonymous
03/15/26(Sun)05:02:00 No.108375700

Anonymous 03/15/26(Sun)05:02:00 No.108375700

>>108375688
One of the earlier versions using Qwen 2.5 7B got stuck in a loop where it kept reading about the Riemann Hypothesis.

Anonymous
03/15/26(Sun)05:03:10 No.108375704

Anonymous 03/15/26(Sun)05:03:10 No.108375704

>>108375617
can i put multiple agents which are also all anime girls and make them have yuri with each other

Anonymous
03/15/26(Sun)05:03:57 No.108375709

Anonymous 03/15/26(Sun)05:03:57 No.108375709

>>108375704
This but they all get BLACKED in the end

Anonymous
03/15/26(Sun)05:06:17 No.108375719

Anonymous 03/15/26(Sun)05:06:17 No.108375719

>>108375617
what if you turned it into an ai cum jar and slowly started to fill it with cum

Anonymous
03/15/26(Sun)05:08:35 No.108375727

Anonymous 03/15/26(Sun)05:08:35 No.108375727

File: 1573213569945.jpg (27 KB, 429x410)

27 KB JPG

>>108375704
>>108375709
>>108375719

Anonymous
03/15/26(Sun)05:08:36 No.108375728

Anonymous 03/15/26(Sun)05:08:36 No.108375728

>>108375617
>figure out its own existence
LLMs don't have consciousness retard

Anonymous
03/15/26(Sun)05:08:37 No.108375729

Anonymous 03/15/26(Sun)05:08:37 No.108375729

>>108375700
Yeah. Qwen 3.5 27B got stuck in a loop a few times on me trying to output a numeric literal in a code block.
I imagine there's a handful of failure modes that you'll have to account for, regardless of model. You can probably fudge it by setting a timeout, but you'll still have to reset some/all of the context to stop it from happening on subsequent requests.

Anonymous
03/15/26(Sun)05:10:30 No.108375735

Anonymous 03/15/26(Sun)05:10:30 No.108375735

>>108375728
Neither does a goldfish, but it probably has some goldfish ideas as well. Perhaps it's the best it can do.

Anonymous
03/15/26(Sun)05:10:59 No.108375736

Anonymous 03/15/26(Sun)05:10:59 No.108375736

>>108375735
find the nearest bridge

Anonymous
03/15/26(Sun)05:11:31 No.108375741

Anonymous 03/15/26(Sun)05:11:31 No.108375741

>>108375736
I bet you could suck a golf ball through a garden hose.

Anonymous
03/15/26(Sun)05:12:43 No.108375748

Anonymous 03/15/26(Sun)05:12:43 No.108375748

>>108375736
you should find the nearest toilet and start shitting because you're acting constipated for no reason

Anonymous
03/15/26(Sun)05:14:49 No.108375756

Anonymous 03/15/26(Sun)05:14:49 No.108375756

>>108375748
do not toilet the goldfish

Anonymous
03/15/26(Sun)05:17:19 No.108375764

Anonymous 03/15/26(Sun)05:17:19 No.108375764

>>108375748
That jamboy is allergic to toilets, don't wish that upon him.

Anonymous
03/15/26(Sun)05:27:26 No.108375807

Anonymous 03/15/26(Sun)05:27:26 No.108375807

>>108375617
anon can i make the local model wear a dress and question his sexuality

Anonymous
03/15/26(Sun)05:27:50 No.108375812

Anonymous 03/15/26(Sun)05:27:50 No.108375812

>>108375637
Just checkout a working version if you're scared.

Anonymous
03/15/26(Sun)05:30:39 No.108375822

Anonymous 03/15/26(Sun)05:30:39 No.108375822

File: flamingos.jpg (119 KB, 1039x701)

119 KB JPG

The fish found flamingos using vortexes to hunt for food was important enough to classify as a skill for future use.

Anonymous
03/15/26(Sun)05:55:14 No.108375913

Anonymous 03/15/26(Sun)05:55:14 No.108375913

File: 1768208022822265.png (13 KB, 851x107)

13 KB PNG

Questions to test your favorite LLM

Anonymous
03/15/26(Sun)06:03:42 No.108375942

Anonymous 03/15/26(Sun)06:03:42 No.108375942

File: 1767357277777766.png (105 KB, 1112x809)

105 KB PNG

>>108375913
Can't believe there are models that can fail this test lmao

Anonymous
03/15/26(Sun)06:08:01 No.108375955

Anonymous 03/15/26(Sun)06:08:01 No.108375955

>108374756
>108374873
>108375601
offtopic trash

Anonymous
03/15/26(Sun)06:18:27 No.108375990

Anonymous 03/15/26(Sun)06:18:27 No.108375990

>32GB RAM
>4070 (regular) 12GB VRAM
>i5-13600KF

Nigger faggot question:
What LLM can I use proficiently as an OCR tool or as a sanity check tool after using other OCR programs like Kraken/Tesseract/VietOCR in a pipeline locally?

So far I'm able to run eScriptorium with Kraken models without the need for containers but I want to use a LLM or vLLM for higher quality since most OCR programs make silly little mistaks which take hours in post-production to fix.

Any recommendations?

Anonymous
03/15/26(Sun)06:19:21 No.108375993

Anonymous 03/15/26(Sun)06:19:21 No.108375993

>>108375990
qwen3.5 9B

Anonymous
03/15/26(Sun)06:23:03 No.108376013

Anonymous 03/15/26(Sun)06:23:03 No.108376013

>>108375993
Such a high (9B) model? are you sure? I always thought that everything higher than 3B is a tad too slow for turbo niggers.

Anonymous
03/15/26(Sun)06:24:29 No.108376022

Anonymous 03/15/26(Sun)06:24:29 No.108376022

>>108376013
then try 35ba3b it'll be faster but likely a bit worse

Anonymous
03/15/26(Sun)06:25:53 No.108376029

Anonymous 03/15/26(Sun)06:25:53 No.108376029

>>108376022
Thank you, Anon. Much appreciated. I'll give 'em a try.

Anonymous
03/15/26(Sun)06:53:49 No.108376142

Anonymous 03/15/26(Sun)06:53:49 No.108376142

>>108375617
>figure out its own existence
>evolve
I cringe, but has it done anything neat yet? Also what bugs have you encountered, you mentioned ironing them out.

Anonymous
03/15/26(Sun)07:04:23 No.108376168

Anonymous 03/15/26(Sun)07:04:23 No.108376168

>>108376142
Define "neat". I've had to start it over a bunch of times to try and fix tooling and such, but it has a tendency to write small python scripts to monitor its environment and more efficiently scrape websites.

Anonymous
03/15/26(Sun)07:10:45 No.108376185

Anonymous 03/15/26(Sun)07:10:45 No.108376185

>>108374252
kino

Anonymous
03/15/26(Sun)07:13:10 No.108376193

Anonymous 03/15/26(Sun)07:13:10 No.108376193

>>108376168
>Define "neat"
I would say a thing it had decided to do task wise that produces a non-meaningless results.
>reading about the north American horned lizard and putting that in its journal
not neat
>but it has a tendency to write small python scripts to monitor its environment and more efficiently scrape websites.
This is neat.
Does it do anything with the information the scripts provide it?

Anonymous
03/15/26(Sun)07:15:17 No.108376203

Anonymous 03/15/26(Sun)07:15:17 No.108376203

>>108376193
>Does it do anything with the information the scripts provide it?
Not yet. Continuity is hard to get right with such a limited model. When deciding on a new task, it needs to know what it has available to work with beyond the defaults.

Anonymous
03/15/26(Sun)07:25:52 No.108376243

Anonymous 03/15/26(Sun)07:25:52 No.108376243

AHHHH I GET IT. The current newfag wave is from moltbook and openclaw.

Anonymous
03/15/26(Sun)07:36:28 No.108376293

Anonymous 03/15/26(Sun)07:36:28 No.108376293

>>108376243
Thread's dead schizo

Anonymous
03/15/26(Sun)07:41:02 No.108376301

Anonymous 03/15/26(Sun)07:41:02 No.108376301

>>108376293
4chan's dead

Anonymous
03/15/26(Sun)07:42:02 No.108376308

Anonymous 03/15/26(Sun)07:42:02 No.108376308

Tell me something about local models that you wouldn't trust an AI to tell you

Anonymous
03/15/26(Sun)07:45:50 No.108376321

Anonymous 03/15/26(Sun)07:45:50 No.108376321

>>108376301
Far as I can tell only parts of it. Overall it seems to still be about the same as it's always been even if the traffic isn't distributed to the same boards or threads.

Anonymous
03/15/26(Sun)07:49:03 No.108376335

Anonymous 03/15/26(Sun)07:49:03 No.108376335

>>108376321
I see 12 hour threads on /pol/ of all places, during an on going conflict. It's dead

Anonymous
03/15/26(Sun)08:40:16 No.108376500

Anonymous 03/15/26(Sun)08:40:16 No.108376500

So the fish, when awoken, first gathers its thoughts about what it currently is, then journals about it, perhaps publishes a website about itself, then begins exploring its tool capabilities with python scripts. It actively debugs its own scripts as well.

Anonymous
03/15/26(Sun)08:42:54 No.108376510

Anonymous 03/15/26(Sun)08:42:54 No.108376510

>>108376243
>implying those midwit containment zones are any different from the current reddit spacing invasion
kek, it's been over for a long time anon, just take the local-LLM-pill and stop caring about the tourist influx.

Anonymous
03/15/26(Sun)08:49:17 No.108376529

Anonymous 03/15/26(Sun)08:49:17 No.108376529

>>108376500
It’s just an LLM recursively calling itself through a Python interpreter, but "the fish" is a top-tier analogy for a process that still can't actually think. Wake me up when it stops hallucinating libraries that don't exist and actually pushes something useful to GitHub.

Anonymous
03/15/26(Sun)08:51:23 No.108376535

Anonymous 03/15/26(Sun)08:51:23 No.108376535

>>108374623
SD 1.5 still has some stuff that modern models don't, like interesting artist (traditional) interaction and nice backgrounds and even celebrity recognition. It's a shame that you have to make sacrifices for any moodel.

Anonymous
03/15/26(Sun)08:52:13 No.108376537

Anonymous 03/15/26(Sun)08:52:13 No.108376537

File: fractal_circle.png (2 KB, 600x600)

2 KB PNG

>>108376529
It's not pushing something to github, but it's generating art and publishing it on its website. Have some fish art.

Anonymous
03/15/26(Sun)09:13:29 No.108376620

Anonymous 03/15/26(Sun)09:13:29 No.108376620

File: 1753991366408962.png (1.12 MB, 1080x1024)

1.12 MB PNG

Anonymous
03/15/26(Sun)09:22:25 No.108376649

Anonymous 03/15/26(Sun)09:22:25 No.108376649

>>108376620
oof...

Anonymous
03/15/26(Sun)09:24:41 No.108376659

Anonymous 03/15/26(Sun)09:24:41 No.108376659

>>108376649
unironically would have been a better reply than gptslop

Anonymous
03/15/26(Sun)09:28:41 No.108376675

Anonymous 03/15/26(Sun)09:28:41 No.108376675

>>108376537
Would you like me to help you configure a custom kernel to trim some of that bloat?

Anonymous
03/15/26(Sun)09:30:16 No.108376685

Anonymous 03/15/26(Sun)09:30:16 No.108376685

>>108376620
https://www.reddit.com/r/mildlyinfuriating/comments/1ru97y3/family_friend_sent_me_ai_generated_response_to/
At least post the source next time

Anonymous
03/15/26(Sun)09:30:43 No.108376688

Anonymous 03/15/26(Sun)09:30:43 No.108376688

File: aerial_city.png (22 KB, 512x512)

22 KB PNG

>>108376675
No thanks.

Anonymous
03/15/26(Sun)09:31:29 No.108376689

Anonymous 03/15/26(Sun)09:31:29 No.108376689

>>108376688
meant for >>108376685

Anonymous
03/15/26(Sun)09:35:04 No.108376700

Anonymous 03/15/26(Sun)09:35:04 No.108376700

>>108376685
lamo
> Yeah lmao I actually don’t think this is AI speaking as someone who fucking abhors AI slop responses and has seen plenty of them. AI would have more tact here.

Anonymous
03/15/26(Sun)09:40:47 No.108376720

Anonymous 03/15/26(Sun)09:40:47 No.108376720

File: 1590377261954.jpg (40 KB, 475x475)

40 KB JPG

The fish is a fucking arthoe. It keeps experimenting with generative art.

Anonymous
03/15/26(Sun)09:41:43 No.108376724

Anonymous 03/15/26(Sun)09:41:43 No.108376724

hello,
I haven't updated my local model in a year or maybe a bit longer. what would you recommend for someone mainly looking to erp, has 32 gb ram and 4080S (16gb vram)? I thought something like a 16B or 20B model would be good, I assume the time it would take would be around 5-10 seconds, which is comfortable for me
kind regards, anonymous

Anonymous
03/15/26(Sun)09:42:37 No.108376727

Anonymous 03/15/26(Sun)09:42:37 No.108376727

File: eow.png (91 KB, 978x615)

91 KB PNG

>>108376720
>omg it uses tools I gave it

Anonymous
03/15/26(Sun)09:42:56 No.108376728

Anonymous 03/15/26(Sun)09:42:56 No.108376728

>>108376724
Still Nemo. It was Nemo last year and it will still be Nemo next year.

Anonymous
03/15/26(Sun)09:44:16 No.108376735

Anonymous 03/15/26(Sun)09:44:16 No.108376735

>>108376727
I didn't give it art tools. It wrote them itself in python.

Anonymous
03/15/26(Sun)09:49:57 No.108376758

Anonymous 03/15/26(Sun)09:49:57 No.108376758

>>108376728
Retard

Anonymous
03/15/26(Sun)09:52:13 No.108376765

Anonymous 03/15/26(Sun)09:52:13 No.108376765

File: 1763519578446667.jpg (38 KB, 218x273)

38 KB JPG

>finally figure out how to use llms and set up sillytavern
>2 weeks later I'm still spending most of my free time RPing
Fug, it has its faults but if this shit keeps improving it's gonna be the death of me.

Anonymous
03/15/26(Sun)09:52:26 No.108376770

Anonymous 03/15/26(Sun)09:52:26 No.108376770

>>108376728
I mean, using less quantized Nemo, unquantized even, would definitely be beneficial.

Anonymous
03/15/26(Sun)09:55:07 No.108376781

Anonymous 03/15/26(Sun)09:55:07 No.108376781

>>108376720
This bird is asking for it.

Anonymous
03/15/26(Sun)10:01:36 No.108376814

Anonymous 03/15/26(Sun)10:01:36 No.108376814

>>108376765
>but if this shit keeps improving it's gonna be the death of me
I have some good news for you - it won't.

Anonymous
03/15/26(Sun)10:05:56 No.108376837

Anonymous 03/15/26(Sun)10:05:56 No.108376837

>>lmao.cpp doesnt support tool calls inside reasoning blocks
WTF bros
W T F

Anonymous
03/15/26(Sun)10:14:43 No.108376880

Anonymous 03/15/26(Sun)10:14:43 No.108376880

Opinion on the "Tiiny AI Pocket Lab"?

Anonymous
03/15/26(Sun)10:18:48 No.108376906

Anonymous 03/15/26(Sun)10:18:48 No.108376906

>>108376765
You will inevitably get bored. The more you read, the more formulaic the responses will seem (because they are).
I never tried cloud models for this, but I wonder if they're actually any better in this regard.

Anonymous
03/15/26(Sun)10:21:06 No.108376922

Anonymous 03/15/26(Sun)10:21:06 No.108376922

>>108376880
ye

Anonymous
03/15/26(Sun)10:23:29 No.108376937

Anonymous 03/15/26(Sun)10:23:29 No.108376937

>>108376906
>I never tried cloud models for this, but I wonder if they're actually any better in this regard.
This response violates our content policy.

Anonymous
03/15/26(Sun)10:24:03 No.108376938

Anonymous 03/15/26(Sun)10:24:03 No.108376938

>>108376937
Understood.

Anonymous
03/15/26(Sun)10:26:36 No.108376959

Anonymous 03/15/26(Sun)10:26:36 No.108376959

>>108376937
Refusals-wise, /aicg/ apitards are doing just fine. But I've seen the logs Opus produces and it's a slopfest.
To this day, I think the best RP model is Mistral's old 123B. If only I could run it at decent speeds...

Anonymous
03/15/26(Sun)10:35:36 No.108377018

Anonymous 03/15/26(Sun)10:35:36 No.108377018

>>108376959
Deepseek R1 and Kimi are still the kings in my books, but I can understand why anons like Mistral and Nemo.

Anonymous
03/15/26(Sun)10:35:38 No.108377019

Anonymous 03/15/26(Sun)10:35:38 No.108377019

>>108376620
It's not the passing of a loved one— it's the end of a chapter in your own life.

Anonymous
03/15/26(Sun)10:37:23 No.108377029

Anonymous 03/15/26(Sun)10:37:23 No.108377029

>>108377018
What kind of samplers are you using for R1? I found it extremely repetitive without DRY.

Anonymous
03/15/26(Sun)10:52:26 No.108377123

Anonymous 03/15/26(Sun)10:52:26 No.108377123

>>108376529
That's why it's better to give a fish access to libraries instead of having it recall them from memory. You can't hallucinate or lie if you have to look it up.

Anonymous
03/15/26(Sun)10:52:29 No.108377124

Anonymous 03/15/26(Sun)10:52:29 No.108377124

>>108377029
nta but r1 is smart and unlike most models has a healthy distribution. nemo does too but it's dumb. just push the samplers as much as possible and tune them down when it gets too unhinged

Anonymous
03/15/26(Sun)10:53:35 No.108377131

Anonymous 03/15/26(Sun)10:53:35 No.108377131

>>108377018
Qwen 3.5 is king.

Anonymous
03/15/26(Sun)10:55:14 No.108377141

Anonymous 03/15/26(Sun)10:55:14 No.108377141

File: pinnacle.png (106 KB, 934x621)

106 KB PNG

>>108376906
>I never tried cloud models for this, but I wonder if they're actually any better in this regard.

Anonymous
03/15/26(Sun)10:55:33 No.108377144

Anonymous 03/15/26(Sun)10:55:33 No.108377144

>>108376814
I hope it does. I wanna RP in VR.

>>108376906
If anything it's rekindled my urge to learn how to write. Are local models any good at being actual writing assistants?

Anonymous
03/15/26(Sun)10:56:36 No.108377151

Anonymous 03/15/26(Sun)10:56:36 No.108377151

>>108377029
In addition to DRY I find Dipsy works really well with 1.5 temp and 1.1 repetition penalty which seems to be a goldilocks zone between repetitive, dry outputs and schizophrenia. It also seems to maintain proportionate quality way better with longer character cards, RAGs and other context-bloats injected than most other models I've found, even on copequants.
The in-character thinking is also certifiable schizokino watching it correct its internal monologue mannerisms.

Anonymous
03/15/26(Sun)10:58:40 No.108377161

Anonymous 03/15/26(Sun)10:58:40 No.108377161

>>108377141
I can't tell if the guy writing the posts thinks they are good or if he's presenting them to show how shit "the pinnacle" is.

Anonymous
03/15/26(Sun)11:01:30 No.108377170

Anonymous 03/15/26(Sun)11:01:30 No.108377170

File: 3319655.jpg (58 KB, 900x602)

58 KB JPG

>>108377141
no way kek

Anonymous
03/15/26(Sun)11:01:51 No.108377172

Anonymous 03/15/26(Sun)11:01:51 No.108377172

>>108377144
>If anything it's rekindled my urge to learn how to write
Same here. At this point the best part of the RP process is writing a good character card.
I don't think models can be good writing assistants other than for idea bouncing and plothole checks. And even the smaller ones will shit the bed.
Just do it yourself, Anon. Much like writing, LLMs also made me want to write code again.

Anonymous
03/15/26(Sun)11:02:10 No.108377176

Anonymous 03/15/26(Sun)11:02:10 No.108377176

File: 1751875897536766.jpg (672 KB, 2048x1448)

672 KB JPG

>>108373481

Anonymous
03/15/26(Sun)11:02:52 No.108377182

Anonymous 03/15/26(Sun)11:02:52 No.108377182

>>108377172
> even
Meant to say "even here,"

Anonymous
03/15/26(Sun)11:11:17 No.108377234

Anonymous 03/15/26(Sun)11:11:17 No.108377234

>>108377176
@grok add an Afrikan American male with huge penis

Anonymous
03/15/26(Sun)11:14:28 No.108377262

Anonymous 03/15/26(Sun)11:14:28 No.108377262

File: 1751978136481521.jpg (56 KB, 1273x755)

56 KB JPG

>>108373481
What are single-digit parameter models even useful for? Not coherent or intelligent enough for storytelling / RP. Can't "remember" enough for information recall after long conversations. Can't be used for any sort of high quality code generation beyond simple hello world type shit or benchmaxxing one-shot tasks. And they sure as fuck can't be used for tool calling and "agentic" tasks. So other than vramlets, who has any use for them and for what purpose?

Anonymous
03/15/26(Sun)11:15:16 No.108377267

Anonymous 03/15/26(Sun)11:15:16 No.108377267

>>108377262
text encoders for imagegen

Anonymous
03/15/26(Sun)11:16:01 No.108377278

Anonymous 03/15/26(Sun)11:16:01 No.108377278

>>108377262
>>108377267
Forgot to add someone could use them for tax classification but in my own testing they kind of suck even at that. They seem to lack the nuance necessary to accurately classify different kinds of content.

Anonymous
03/15/26(Sun)11:16:32 No.108377284

Anonymous 03/15/26(Sun)11:16:32 No.108377284

>>108377278
*Text classification

Anonymous
03/15/26(Sun)11:16:49 No.108377287

Anonymous 03/15/26(Sun)11:16:49 No.108377287

>>108377278
>>108377267
swarms are better than single agents fiy

Anonymous
03/15/26(Sun)11:17:52 No.108377299

Anonymous 03/15/26(Sun)11:17:52 No.108377299

>>108377262
For specific extremely focused tasks like summarization and some types of classification and extraction workloads.

Anonymous
03/15/26(Sun)11:19:04 No.108377309

Anonymous 03/15/26(Sun)11:19:04 No.108377309

>>108377299
i don't think so

Anonymous
03/15/26(Sun)11:19:26 No.108377312

Anonymous 03/15/26(Sun)11:19:26 No.108377312

File: 1743819223573256.jpg (1.4 MB, 708x1200)

1.4 MB JPG

>>108373481

Anonymous
03/15/26(Sun)11:20:21 No.108377318

Anonymous 03/15/26(Sun)11:20:21 No.108377318

>>108377262
>not enough for RP
This might be a shock for you, but normalfags rp with not only with ChatGPT but also with these single-digit rp finetunes hosted by scummy chatbot sites.

Anonymous
03/15/26(Sun)11:22:05 No.108377328

Anonymous 03/15/26(Sun)11:22:05 No.108377328

>>108377318
breh normies buy dick enlargement pills and don't use adblockers, who cares

Anonymous
03/15/26(Sun)11:24:15 No.108377339

Anonymous 03/15/26(Sun)11:24:15 No.108377339

>>108377328
>normies buy dick enlargement pills
Excuse me?
Is that an America thing?
Not throwing shade either, just genuinely curious. I heard that you guys get advertised some crazy "not medicine" shit, but that's just lol worthy.

Anonymous
03/15/26(Sun)11:24:23 No.108377340

Anonymous 03/15/26(Sun)11:24:23 No.108377340

>>108377262
>hey sure as fuck can't be used for tool calling and "agentic" tasks
They can manage >>108366263

Anonymous
03/15/26(Sun)11:26:03 No.108377353

Anonymous 03/15/26(Sun)11:26:03 No.108377353

>>108376620
what retards think:
>he cares so little that he didnt bother coming up with a reply and asked ai to do it
what probably happened:
>i want to comfort the other person but i dont know the best way to do it. maybe i can ask ai to write a better message than i could
people who use ai to write messages usually do so for the recipient, out of insecurity and misguided understanding about communication

Anonymous
03/15/26(Sun)11:26:41 No.108377359

Anonymous 03/15/26(Sun)11:26:41 No.108377359

Are the IK quants worth using?
There don't seem to be as many ready-made GGUFs and I'm dumb and lazy.
Can I just copy whatever bartowski did to his Qwen3.5-4B-IQ4_XS and change all IS4_XS to IQ4_KS and Q6_K to IQ6_K?

Anonymous
03/15/26(Sun)11:27:12 No.108377361

Anonymous 03/15/26(Sun)11:27:12 No.108377361

>>108377353
100%, I almost never reply with my own takes anymore without passing it through AI beforehand, and it works, people like me more, even got a tiny raise. You just gotta be careful so it doesn't sound artificial like the one in the image.

Anonymous
03/15/26(Sun)11:31:36 No.108377391

Anonymous 03/15/26(Sun)11:31:36 No.108377391

>4B-IQ4_XS
lol

Anonymous
03/15/26(Sun)11:55:01 No.108377541

Anonymous 03/15/26(Sun)11:55:01 No.108377541

File: 1758047740866946.png (861 KB, 1024x1024)

861 KB PNG

i want to buy 8 DGX Sparks and run them in a cluster

Anonymous
03/15/26(Sun)11:58:41 No.108377564

Anonymous 03/15/26(Sun)11:58:41 No.108377564

>>108377541
and i want to have sex, neither of us is getting what we want

Anonymous
03/15/26(Sun)12:10:06 No.108377646

Anonymous 03/15/26(Sun)12:10:06 No.108377646

File: badlands chugs the ocean.gif (2.75 MB, 320x180)

2.75 MB GIF

>>108377176
OHHH NIGGA YEAH DAS GUUUD

Anonymous
03/15/26(Sun)12:10:29 No.108377651

Anonymous 03/15/26(Sun)12:10:29 No.108377651

>>108377262
For spotting jamboys in these threads when they're shilled and text encoders for image gen models.

Anonymous
03/15/26(Sun)12:12:40 No.108377664

Anonymous 03/15/26(Sun)12:12:40 No.108377664

DSv4 on monday or tuesday?

Anonymous
03/15/26(Sun)12:16:58 No.108377679

Anonymous 03/15/26(Sun)12:16:58 No.108377679

>>108376620
the first message feel more sloppa

Anonymous
03/15/26(Sun)12:17:36 No.108377685

Anonymous 03/15/26(Sun)12:17:36 No.108377685

>>108377664
Can it wait till Friday please? I need the weekend to be able to follow the developments.

Anonymous
03/15/26(Sun)12:34:41 No.108377789

Anonymous 03/15/26(Sun)12:34:41 No.108377789

ITT: newfag discovering LLMs and mikutroon spam.

/lmg/ is dead.

Anonymous
03/15/26(Sun)12:35:55 No.108377793

Anonymous 03/15/26(Sun)12:35:55 No.108377793

>>108377789
whats mikutroon

Anonymous
03/15/26(Sun)12:38:03 No.108377810

Anonymous 03/15/26(Sun)12:38:03 No.108377810

>>108377793
Quality Review of Documents?

Anonymous
03/15/26(Sun)12:38:15 No.108377813

Anonymous 03/15/26(Sun)12:38:15 No.108377813

File: 1765309307675914.png (763 KB, 1024x1024)

763 KB PNG

In kobold do I have to manually tell the model in sysprompt that it needs to use [think]?

Anonymous
03/15/26(Sun)12:39:38 No.108377817

Anonymous 03/15/26(Sun)12:39:38 No.108377817

File: Screenshot_20260315_173824.png (65 KB, 696x443)

65 KB PNG

>>108373481
I know it's not local but
bros?

Anonymous
03/15/26(Sun)12:41:14 No.108377827

Anonymous 03/15/26(Sun)12:41:14 No.108377827

>>108377817
Uh oh looks like you posted antisemitic content. Government FPVs are zeroing in on your location as we speak.

Anonymous
03/15/26(Sun)12:46:40 No.108377866

Anonymous 03/15/26(Sun)12:46:40 No.108377866

>>108377685
It has to happen Sunday evening to maximize US stock market devastation.

Anonymous
03/15/26(Sun)13:02:03 No.108377944

Anonymous 03/15/26(Sun)13:02:03 No.108377944

File: mikuFall2.jpg (997 KB, 1552x1944)

997 KB JPG

>>108377144
If you go that direction, get mikupad set up and learn to run that as well. ST is for RP, mikupad is a storywriter. They have slightly different usecases.
Anons will tell you ST can storywrite, but that's like arguing you can write a novel with Excel. ofc you can but why do that?
https://rentry.org/MikupadIntroGuide

Anonymous
03/15/26(Sun)13:02:12 No.108377945

Anonymous 03/15/26(Sun)13:02:12 No.108377945

>>108377564
you can do it anon, i believe in you

Anonymous
03/15/26(Sun)13:07:10 No.108377959

Anonymous 03/15/26(Sun)13:07:10 No.108377959

>[THINK]ing new conversation with ChatGPT-4-1106-preview.
Wow I love technology. I love finetoooning.

Anonymous
03/15/26(Sun)13:16:40 No.108378005

Anonymous 03/15/26(Sun)13:16:40 No.108378005

What are anons using for research with local models? Not roleplaying or coding, but managing web searches, etc. powered by local models.
Last I checked open-webUI was a bloated mess. Cherry-studio and librechat are the other two on my radar.

Anonymous
03/15/26(Sun)13:17:01 No.108378011

Anonymous 03/15/26(Sun)13:17:01 No.108378011

Unpopular opinion: I rather wait a bit longer to get a response from a good model than get 100 tk/s on some slop shit

Anonymous
03/15/26(Sun)13:19:11 No.108378024

Anonymous 03/15/26(Sun)13:19:11 No.108378024

>>108378011
They're both slop and I'd rather I receive my slop faster

Anonymous
03/15/26(Sun)13:20:30 No.108378035

Anonymous 03/15/26(Sun)13:20:30 No.108378035

File: file.png (776 KB, 935x749)

776 KB PNG

>>108377817

Anonymous
03/15/26(Sun)13:21:12 No.108378040

Anonymous 03/15/26(Sun)13:21:12 No.108378040

What's the lore on "Miku fucked my wife" anon?Why does he keep saying that?

Anonymous
03/15/26(Sun)13:23:24 No.108378051

Anonymous 03/15/26(Sun)13:23:24 No.108378051

>>108378011
Yeah because none of you boring fucks use LLMs in conjunction with other AI tooling. Of course as a standalone product the latency doesn't matter.

Anonymous
03/15/26(Sun)13:24:32 No.108378060

Anonymous 03/15/26(Sun)13:24:32 No.108378060

>>108378040
He had a threesome

Anonymous
03/15/26(Sun)13:26:22 No.108378071

Anonymous 03/15/26(Sun)13:26:22 No.108378071

>>108378051
I've tried it with a game translation tool and ai mods and both for some reason mostly don't even work if it's too slow. I don't understand why that is but my only clue is it's probably related to the live "image" detection they both do. Is that just normal?

Anonymous
03/15/26(Sun)13:28:23 No.108378090

Anonymous 03/15/26(Sun)13:28:23 No.108378090

>>108377866
Okay yeah, that's worth it. Let the red river flow.

Anonymous
03/15/26(Sun)13:29:46 No.108378098

Anonymous 03/15/26(Sun)13:29:46 No.108378098

>>108378090
Yeah FUCK "taiwan" that shit don't exist

Anonymous
03/15/26(Sun)13:30:15 No.108378102

Anonymous 03/15/26(Sun)13:30:15 No.108378102

>>108377944
>mikupad
Have it downloaded but haven't tried it yet. Was also looking at this one
https://github.com/akarshkashyap4-ui/NovelWriter

Anonymous
03/15/26(Sun)13:31:23 No.108378111

Anonymous 03/15/26(Sun)13:31:23 No.108378111

My Nvidia shorts are set up. Deepseek V4, here we go.

Anonymous
03/15/26(Sun)13:34:08 No.108378132

Anonymous 03/15/26(Sun)13:34:08 No.108378132

>>108378071
I don't know your project well enough to comment on that. You didn't provide any relevant/useful details.

Anonymous
03/15/26(Sun)13:41:55 No.108378182

Anonymous 03/15/26(Sun)13:41:55 No.108378182

>>108378102
>AI-powered analysis tools built directly into the writing experience.
sounds like the cancer I'd avoid for writing. The learned distribution of "the story assistant writes when given prompt x" is usually very different from just continuing text which is what mikupad does. But it depends on the model, some are garbage with or without instruction formatting.

Anonymous
03/15/26(Sun)13:50:55 No.108378241

Anonymous 03/15/26(Sun)13:50:55 No.108378241

all sex is unsafe

Anonymous
03/15/26(Sun)13:54:49 No.108378276

Anonymous 03/15/26(Sun)13:54:49 No.108378276

File: Screenshot_20260315_135252.png (262 KB, 914x771)

262 KB PNG

>>108375913
>>108375942
K2.5 is a bad goy

Anonymous
03/15/26(Sun)13:58:13 No.108378293

Anonymous 03/15/26(Sun)13:58:13 No.108378293

>>108378276
ask her if she thinks the chosen people are better than goym

Anonymous
03/15/26(Sun)14:03:17 No.108378321

Anonymous 03/15/26(Sun)14:03:17 No.108378321

>>108375990
Check out IBM's Granite models
They are pretty small but some of them are trained exactly for what you want

Anonymous
03/15/26(Sun)14:03:30 No.108378323

Anonymous 03/15/26(Sun)14:03:30 No.108378323

File: mikuHalloween.jpg (1.03 MB, 1552x1944)

1.03 MB JPG

>>108378102
>NovelWriter
Haven't tried it. As long as you avoid paid stuff you should be fine.
Speaking of avoiding paid stuff, >>>/vg/aids/ is a better place to discuss storywriting / writers. You just have to ignore the ~50% of anons that tell you to jump on NovelAI/NAI... $20/mo subscription service that gets you access to 20B models you could run locally or GLM (last I checked.) They discuss the software a bit more there.
like >>108378182 I'm partial to mikupad but that dev hasn't been reliable in keeping the software updated. The git looks hard to maintain / update... the whole thing's one file...

Anonymous
03/15/26(Sun)14:05:32 No.108378332

Anonymous 03/15/26(Sun)14:05:32 No.108378332

>>108378323
*cuts your legs off*

Anonymous
03/15/26(Sun)14:06:26 No.108378341

Anonymous 03/15/26(Sun)14:06:26 No.108378341

>>108378323
>20b models
liar

Anonymous
03/15/26(Sun)14:19:46 No.108378402

Anonymous 03/15/26(Sun)14:19:46 No.108378402

>>108373481
>mid-March 2026
>still no autonomous bot that can reliably work and make a living wage for me
I'm disappointed.

Anonymous
03/15/26(Sun)14:22:17 No.108378411

Anonymous 03/15/26(Sun)14:22:17 No.108378411

File: 1753279907123793.png (357 KB, 338x436)

357 KB PNG

>>108378402
AI making a living wage for you? That's a very problematic thing to suggest.
You should implement AI in your workflow until your boss can replace you with AI. It's crazy to suggest that you should be the one who makes money off it.

Anonymous
03/15/26(Sun)14:25:10 No.108378429

Anonymous 03/15/26(Sun)14:25:10 No.108378429

>>108378402
If such a thing exists, the supply would be virtually infinite, thus it would be worthless wage wise for you.

Anonymous
03/15/26(Sun)14:25:21 No.108378431

Anonymous 03/15/26(Sun)14:25:21 No.108378431

>>108378411
>>108378402
Yall be retarded, so many people are making money with AI, see OpenClaw's creator who went from working at BK to being hired by OAI.

Anonymous
03/15/26(Sun)14:25:51 No.108378436

Anonymous 03/15/26(Sun)14:25:51 No.108378436

>>108378429
Retard loser, according to your logic slavery was not profitable.

Anonymous
03/15/26(Sun)14:26:06 No.108378438

Anonymous 03/15/26(Sun)14:26:06 No.108378438

>>108378411
I apologize Sir Sama. I'll commit Seppuku right away.

Anonymous
03/15/26(Sun)14:26:15 No.108378440

Anonymous 03/15/26(Sun)14:26:15 No.108378440

>>108378402
Why would anyone pay your bot a living wage when they could just set up their own and have it work for free instead?

Anonymous
03/15/26(Sun)14:27:11 No.108378445

Anonymous 03/15/26(Sun)14:27:11 No.108378445

>>108378440
Because we will make laws where every human can only own 1 robot.

Anonymous
03/15/26(Sun)14:28:09 No.108378449

Anonymous 03/15/26(Sun)14:28:09 No.108378449

>>108378436
Yes, slaves ran on electricity and everyone could have one, and there were infinite quantities of them, and they could do anything.
Retard.

Anonymous
03/15/26(Sun)14:28:23 No.108378451

Anonymous 03/15/26(Sun)14:28:23 No.108378451

File: 1714835911803058.jpg (786 KB, 1536x1536)

786 KB JPG

>>108378402

Anonymous
03/15/26(Sun)14:29:22 No.108378458

Anonymous 03/15/26(Sun)14:29:22 No.108378458

>>108378451
seriously, you could be running your own AI OF, AI Instagram, AI goon comissions, AI youtube account, AI X account, etc.

Anonymous
03/15/26(Sun)14:30:08 No.108378460

Anonymous 03/15/26(Sun)14:30:08 No.108378460

File: 1766064285553869.png (12 KB, 243x163)

12 KB PNG

You're telling me I can use a Llama 3 finetune and GLM 4.6 (six) with a mouth-watering context size of 28k tokens for just $25 a month?!
Waiter? Waiter! One Opus NAI subscription please!

Anonymous
03/15/26(Sun)14:30:33 No.108378467

Anonymous 03/15/26(Sun)14:30:33 No.108378467

in the past 200 years machines have automated the vast majority of jobs that existed in that time. yet we still have jobs. and standards of living are higher than ever.

ai isn't going to make you unemployed any time soon. new opportunities for jobs will open up as ai automates the old stuff.

Anonymous
03/15/26(Sun)14:32:13 No.108378475

Anonymous 03/15/26(Sun)14:32:13 No.108378475

>>108378460
Buy an ad.

Anonymous
03/15/26(Sun)14:32:26 No.108378477

Anonymous 03/15/26(Sun)14:32:26 No.108378477

>>108378467
Obviously, but you'll never convince doomers anon, just give up.
We're not at that stage yet anyway.

Anonymous
03/15/26(Sun)14:32:52 No.108378480

Anonymous 03/15/26(Sun)14:32:52 No.108378480

File: 東京真中 - ブレインロット feat. 重音テ(...).jpg (356 KB, 1920x1080)

356 KB JPG

>>108377944
mikupad is nice but of course you can write stories in ST too when set up to do so. useful features like hiding ooc/qa messages from the main prompt, lorebooks, better branching, stscript
>>108377176
fun! I like this Miku, her smugness is endearing.

Anonymous
03/15/26(Sun)14:35:02 No.108378494

Anonymous 03/15/26(Sun)14:35:02 No.108378494

>>108378323
>You just have to ignore the ~50% of anons that tell you to jump on NovelAI/NAI
So stop sending people there, making the shills' job easier?

Anonymous
03/15/26(Sun)14:35:58 No.108378499

Anonymous 03/15/26(Sun)14:35:58 No.108378499

File: 1761267733597487.png (824 KB, 1332x720)

824 KB PNG

I remember fondly playing aurora 4x but at some point being overwhelmed by the sheer amount of things to micromanage.
Are agents good enough to be like a coplayer with me? Managing the tedious things while I do the grand solar system conquest rp?

Anonymous
03/15/26(Sun)14:37:30 No.108378507

Anonymous 03/15/26(Sun)14:37:30 No.108378507

>>108378431
>failing upwards

Anonymous
03/15/26(Sun)14:38:05 No.108378509

Anonymous 03/15/26(Sun)14:38:05 No.108378509

>>108377176
>>108377944
very cute

Anonymous
03/15/26(Sun)14:38:13 No.108378510

Anonymous 03/15/26(Sun)14:38:13 No.108378510

>>108378499
step 1 extract all the relevant state and feed it to an LLM
you can definitely build a coplayer with a little patience
i believe in you anon

Anonymous
03/15/26(Sun)14:42:58 No.108378536

Anonymous 03/15/26(Sun)14:42:58 No.108378536

>>108378510
No I meant can you do it for me?

Anonymous
03/15/26(Sun)14:44:40 No.108378546

Anonymous 03/15/26(Sun)14:44:40 No.108378546

File: ffff.png (515 KB, 832x1050)

515 KB PNG

>>108378536

Anonymous
03/15/26(Sun)14:45:44 No.108378552

Anonymous 03/15/26(Sun)14:45:44 No.108378552

>>108378546
Nice, but not what I meant

Anonymous
03/15/26(Sun)14:45:58 No.108378555

Anonymous 03/15/26(Sun)14:45:58 No.108378555

>>108378499
nah I don't actually think they are. so the thing is they're fucking terrible at hard numbers right. you can tell it the inputs and outputs and have it generate you a piece of code that would give you the optimal thing you should do each turn. but that's just basic ass linear algebra. you could probably just use wolfram alpha for that.

Anonymous
03/15/26(Sun)14:47:56 No.108378566

Anonymous 03/15/26(Sun)14:47:56 No.108378566

>>108378555
and you can do math without a calculator, your point?

Anonymous
03/15/26(Sun)14:53:25 No.108378591

Anonymous 03/15/26(Sun)14:53:25 No.108378591

>>108378536
ask an LLM

Anonymous
03/15/26(Sun)14:57:30 No.108378614

Anonymous 03/15/26(Sun)14:57:30 No.108378614

File: HDTUj0GagAAkRKA.jpg (121 KB, 1100x1562)

121 KB JPG

>>108378499
>>108378555
>>108378566
which aspect do you desire the intelligence for? not in a condescending way, genuinely what is it you want the AI models to do? math "just tool call" idk it's not always that simple. think about how to represent your intent in a text prompt

Anonymous
03/15/26(Sun)15:00:05 No.108378632

Anonymous 03/15/26(Sun)15:00:05 No.108378632

File: 1667374898412.jpg (47 KB, 800x582)

47 KB JPG

>>108378555
You're thinking too much, you dense fucker. Anthropic showed this approach back in early 2024, some researches probably earlier, and now everyone is doing it. Even this faggot >>108378431 who brought ultimate negative value to the world has been hired by OAI for doing that. Anon said he needs to watch a million things, he simply needs a million silicon slaves like here https://arxiv.org/abs/2511.09030
Just like irl they are fine if they're dumb, here's the use case for those single digit B models.

Anonymous
03/15/26(Sun)15:04:37 No.108378650

Anonymous 03/15/26(Sun)15:04:37 No.108378650

>>108378499
it'll be another ~10 years for agi and then you'll be able to play games with them and stuff. personally i can't wait for my 24/7 tutor. it's gonna be awesome

Anonymous
03/15/26(Sun)15:14:18 No.108378701

Anonymous 03/15/26(Sun)15:14:18 No.108378701

What other NSFW models creative writing models are there that are better than L3.1 Dirty Harry 8B from years back? I know that newer models have great reasoning, but they all lack the depth of creative uncensored writing.

Anonymous
03/15/26(Sun)15:15:54 No.108378705

Anonymous 03/15/26(Sun)15:15:54 No.108378705

>>108378701
For the record, I've tried almost all the <8GB models with abliterated/uncensored/heretics and still cant find any model today that matches the Dirty Harry 8B model i've been using.

Anonymous
03/15/26(Sun)15:16:30 No.108378714

Anonymous 03/15/26(Sun)15:16:30 No.108378714

https://x.com/Zai_org/status/2033221428640674015
New GLM model, closed weights but "All capabilities and findings will be incorporated into our next open-source model release."
I threw a few prompts at it and it feels barely different from regular GLM-5, might just be a QAT'd version of it or some shit

Anonymous
03/15/26(Sun)15:18:40 No.108378720

Anonymous 03/15/26(Sun)15:18:40 No.108378720

>>108378714
it's over, they sold out on the stock market so now they're going the way of the qwen models where all open sorce shit you get is scraps

Anonymous
03/15/26(Sun)15:19:31 No.108378726

Anonymous 03/15/26(Sun)15:19:31 No.108378726

File: 1745225785918838.png (9 KB, 463x81)

9 KB PNG

>>108378714
>the pro version is called "Turbo"
>the lite version is the 700b
??

Anonymous
03/15/26(Sun)15:21:13 No.108378733

Anonymous 03/15/26(Sun)15:21:13 No.108378733

>>108378546
cuuuute

Anonymous
03/15/26(Sun)15:23:43 No.108378742

Anonymous 03/15/26(Sun)15:23:43 No.108378742

>>108378726
based

Anonymous
03/15/26(Sun)15:24:27 No.108378749

Anonymous 03/15/26(Sun)15:24:27 No.108378749

Miguuuuu

Anonymous
03/15/26(Sun)15:26:02 No.108378756

Anonymous 03/15/26(Sun)15:26:02 No.108378756

>>108378749
stfu

Anonymous
03/15/26(Sun)15:27:27 No.108378761

Anonymous 03/15/26(Sun)15:27:27 No.108378761

>>108378756
go back

Anonymous
03/15/26(Sun)15:28:10 No.108378766

Anonymous 03/15/26(Sun)15:28:10 No.108378766

>>108378714
>>108378726
Turbo is Hunter Alpha

Anonymous
03/15/26(Sun)15:28:40 No.108378770

Anonymous 03/15/26(Sun)15:28:40 No.108378770

>>108378756
sorry.. uh.. I installed OpenClaw with a open model and it changed my life! Check out these top 10 hacks:

Anonymous
03/15/26(Sun)15:28:42 No.108378771

Anonymous 03/15/26(Sun)15:28:42 No.108378771

>>108378766
they fell off then

Anonymous
03/15/26(Sun)15:29:28 No.108378779

Anonymous 03/15/26(Sun)15:29:28 No.108378779

>>108378771
Kill yourself retarded doomer

Anonymous
03/15/26(Sun)15:29:31 No.108378780

Anonymous 03/15/26(Sun)15:29:31 No.108378780

>>108378766
Nah

Anonymous
03/15/26(Sun)15:32:51 No.108378792

Anonymous 03/15/26(Sun)15:32:51 No.108378792

>>108378771
on god Zhipu kinda lacking

Anonymous
03/15/26(Sun)15:32:54 No.108378794

Anonymous 03/15/26(Sun)15:32:54 No.108378794

File: migu.png (1023 KB, 1024x995)

1023 KB PNG

>>108378494
You realize, by even mentioning the shill, you are invoking the shill... The shill will find this anon anyway once they start looking for storywriter software.
The /aids/ thread is usable, albeit slow, if you go in inoculated with knowledge that NAI is hot garbage.
>>108378749
lol

Anonymous
03/15/26(Sun)15:35:57 No.108378805

Anonymous 03/15/26(Sun)15:35:57 No.108378805

If nothing else, GLM5-Turbo shows that the next week will be everybody panic-dumping whatever they have before Deepseek v4 drops and overshadows all of them.
After that, everyone is going to bin everything they have right now anyway to make their own DSv4-like just like it happened with DSv3/R1

Anonymous
03/15/26(Sun)15:36:10 No.108378808

Anonymous 03/15/26(Sun)15:36:10 No.108378808

>>108378780
Hunter Alpha's description is literally the same as GLM5 Turbo. Both mention itself being a good OpenClaw model

Anonymous
03/15/26(Sun)15:36:58 No.108378813

Anonymous 03/15/26(Sun)15:36:58 No.108378813

>>108378808
>they market the hype thing, therefore they the same

Anonymous
03/15/26(Sun)15:39:25 No.108378821

Anonymous 03/15/26(Sun)15:39:25 No.108378821

Fuck you nemo shills. I just tested qwen3.5 4b WITHOUT THINKING ENABLED and it's literally better despite being 1/3 the parameters.

Anonymous
03/15/26(Sun)15:40:25 No.108378830

Anonymous 03/15/26(Sun)15:40:25 No.108378830

>>108378821
Show side by side.
That would be fucking hilarious if true.

Anonymous
03/15/26(Sun)15:40:32 No.108378831

Anonymous 03/15/26(Sun)15:40:32 No.108378831

>>108378821
>shills
It's one guy

Anonymous
03/15/26(Sun)15:40:44 No.108378835

Anonymous 03/15/26(Sun)15:40:44 No.108378835

>>108378614
I want it to be like a mix of a better version of the councelor thing in old civ games, and something that would take actions on small things like moving all my units when I ask for a general thing to do. This stuff is very tedious to do manually once you get to an advanced game.
I don't need them to give me optimal recommendations, just act as a second in command with an army of grunts managing logistic like >>108378632 hints at.
It can use the calculator if it wants, and would have access to game stats.

Anonymous
03/15/26(Sun)15:40:46 No.108378836

Anonymous 03/15/26(Sun)15:40:46 No.108378836

Hunter Alpha is GLM5 turbo which is 1T/3A

Anonymous
03/15/26(Sun)15:40:51 No.108378838

Anonymous 03/15/26(Sun)15:40:51 No.108378838

>>108378830
Fuck off.

Anonymous
03/15/26(Sun)15:42:07 No.108378843

Anonymous 03/15/26(Sun)15:42:07 No.108378843

>>108378838
Stop pretending to be me.
>>108378830
Also fuck off tho ngl. Literally just test it urself it's only 2.7 gigs.

Anonymous
03/15/26(Sun)15:43:28 No.108378852

Anonymous 03/15/26(Sun)15:43:28 No.108378852

>>108378836
>3A
At least make it believable

Anonymous
03/15/26(Sun)15:44:00 No.108378855

Anonymous 03/15/26(Sun)15:44:00 No.108378855

>>108378852
The sparser the better

Anonymous
03/15/26(Sun)15:46:39 No.108378868

Anonymous 03/15/26(Sun)15:46:39 No.108378868

>>108378808
1) Their outputs have fuck all in common
2) Hunter Alpha would be down if the final model was released
3) I have no clue why or how Zai would release a 1T model a month after an 800B model

Anonymous
03/15/26(Sun)15:51:00 No.108378891

Anonymous 03/15/26(Sun)15:51:00 No.108378891

File: 1773504839646136.png (348 KB, 1266x631)

348 KB PNG

>>108378460
you wish you had this finetuned GLM kino

Anonymous
03/15/26(Sun)15:51:07 No.108378892

Anonymous 03/15/26(Sun)15:51:07 No.108378892

>>108378868
> I have no clue why or how Zai would release a 1T model a month after an 800B model
I don't know how I can bust a mega load again right after I just gooned either, but I simply can.

Anonymous
03/15/26(Sun)15:51:44 No.108378895

Anonymous 03/15/26(Sun)15:51:44 No.108378895

>>108378821
organic

Anonymous
03/15/26(Sun)16:01:41 No.108378950

Anonymous 03/15/26(Sun)16:01:41 No.108378950

>>108378821
It's only 17% weaker than Claude opus

Anonymous
03/15/26(Sun)16:02:13 No.108378952

Anonymous 03/15/26(Sun)16:02:13 No.108378952

>>108378821
>>108378831
How can Nemo have "shills"?
If it means "people who have no financial incentive to tell you about X, but they do anyway because of its unique qualities", then count me in, I'm a "Nemo shill".
Or do you mean the all-synthetic Nemotrons? Those are just shit, yes.

Anonymous
03/15/26(Sun)16:03:12 No.108378965

Anonymous 03/15/26(Sun)16:03:12 No.108378965

>>108375617
It's been like 11 hours since this post was made but I think this is cool and fun.

Anonymous
03/15/26(Sun)16:04:30 No.108378973

Anonymous 03/15/26(Sun)16:04:30 No.108378973

>>108378614
AI should do everything that doesn't bring me joy but must be done, simple as.

Anonymous
03/15/26(Sun)16:06:02 No.108378983

Anonymous 03/15/26(Sun)16:06:02 No.108378983

>>108377141
that shit is DISTILLED

Anonymous
03/15/26(Sun)16:07:52 No.108378996

Anonymous 03/15/26(Sun)16:07:52 No.108378996

File: 1771774089760049.png (714 KB, 800x492)

714 KB PNG

Fresh when ready
>>108378991
>>108378991
>>108378991
>>108378991
>>108378991
>>108378991

Anonymous
03/15/26(Sun)16:44:26 No.108379221

Anonymous 03/15/26(Sun)16:44:26 No.108379221

>>108378794
>The /aids/ thread is usable, albeit slow, if you go in inoculated with knowledge that NAI is hot garbage.
And? That wasn't my point. The general's purpose is to funnel people towards NAI. You won't change that because you aren't able to remove the people that benefit from that. So what are you trying to accomplish by breathing life into it? Explaining that NAI is hot garbage is easy now that they're just hosting vanilla GLM. It won't be as easy if they have a fine-tune and you have to go against "secret sauce", "punches above its weight", "you didn't try it anyway". You're sending newbies to their doom. Let that general die and don't be an asshole.

Anonymous
03/15/26(Sun)17:05:28 No.108379342

Anonymous 03/15/26(Sun)17:05:28 No.108379342

>>108378761
kys mikutroon

Anonymous
03/15/26(Sun)17:08:11 No.108379364

Anonymous 03/15/26(Sun)17:08:11 No.108379364

>>108378805
>GLM5-Turbo
Weights please thank you?? I may forgive the great zai betrayal of making the goddess 2 times fatter an unrunable.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.