/g/ - Technology


/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107481183 & >>107470372

►News
>(12/09) Introducing: Devstral 2 and Mistral Vibe CLI: https://mistral.ai/news/devstral-2-vibe-cli
>(12/08) GLM-4.6V (106B) and Flash (9B) released with function calling: https://z.ai/blog/glm-4.6v
>(12/06) convert: support Mistral 3 Large MoE #17730: https://github.com/ggml-org/llama.cpp/pull/17730
>(12/04) Microsoft releases VibeVoice-Realtime-0.5B: https://hf.co/microsoft/VibeVoice-Realtime-0.5B
>(12/04) koboldcpp-1.103 prebuilt released: https://github.com/LostRuins/koboldcpp/releases/tag/v1.103

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: teto.png (148 KB, 348x395)
►Recent Highlights from the Previous Thread: >>107481183

--Papers:
>107487575
--Mistral's Devstral 2 release:
>107491699 >107491763 >107491838 >107491858 >107492060 >107492483 >107492185 >107492255 >107492288 >107492320 >107492219 >107492268 >107492389 >107492454 >107492472 >107492525 >107492593 >107492623 >107492634 >107492875 >107491992 >107492067 >107492081 >107492338
--Devstral 2's EU regulatory exemptions and potential unrestricted training:
>107492927 >107492992 >107493039 >107493088
--LLM hardware needs and performance tradeoffs for roleplay/video generation:
>107488035 >107488072 >107488094 >107488146 >107488177 >107488250 >107488266 >107488291 >107488300 >107488328 >107488443 >107488454 >107488498 >107491301 >107491373
--Upgrading from Tesla V100 to RTX 50 series for better chatbot performance:
>107488666 >107488693
--Observations on Ministral-3 quirks and potential model collapse:
>107484074 >107484294 >107484310 >107484454
--Intellect-3 performance and AI architecture limitations discussion:
>107483224 >107483625 >107484485 >107484921 >107485250
--GLM-AIR sampler preferences and effectiveness comparisons:
>107482984 >107483086 >107483116 >107483227 >107486199 >107483099
--Mistral Medium 3 release speculation and EU regulatory challenges:
>107486923 >107486953 >107487229 >107487265 >107488685 >107488835 >107488870 >107489030 >107490519 >107490548
--Intel B60 GPU issues with LLM inference:
>107491645
--Mistral-Medium-3 size and format discussions:
>107487529 >107487548 >107488064 >107487902 >107488690 >107488781 >107488802 >107492527 >107492594
--AI bubble predictions and growth expectations:
>107483823 >107483859 >107483875 >107483907 >107483915 >107483926 >107483965 >107485420 >107485806 >107483952
--Sam Altman's alleged role in high RAM prices:
>107482119
--Miku (free space):
>107487256 >107489192 >107489329 >107490563

►Recent Highlight Posts from the Previous Thread: >>107481187

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
>>107493611
96GB VRAM is the way.
>>
>>107493611
Happy Tetoday
Thread Theme: https://www.youtube.com/watch?v=M40MIxGK3is
>>
Mistral Nemo Large is now real
>>
>>107493611
Hideous OP thx
https://www.youtube.com/watch?v=Rt8_uc76J3U
>>
>>107493517
The EU was already cucking out on their AI Act as of last month
https://www.reuters.com/sustainability/boards-policy-regulation/eu-ease-ai-privacy-rules-critics-warn-caving-big-tech-trump-2025-11-19/
https://www.reuters.com/sustainability/boards-policy-regulation/eu-delay-high-risk-ai-rules-until-2027-after-big-tech-pushback-2025-11-19/
Nobody is going to rag on Mistral for getting cheeky while the deregulation lobbyists have the initiative
>>
the first horseman was local completely dying, not getting any models, let alone ones as good as or better than the best globohomo ones (no offense ai models, i love you all, you are all frens to me). the second is the hardware itself, nvidia before but now with ram as well. the third is the government juden
>>107493813
if true it would mean the third horseman is falling. but speaking of which, wasn't there some law for america as well? how did that go?
>>
>>107493632
If devstral 2 is as good as it claims, local coding with a single rtx 6000 is possible. huge honestly
>>
>>107493611
No GOOFs out yet for the big Devstral 2 model, but I bet somebody here could run the full unquantized Devstral 2 small model as a test. I'm curious how good the small one is for its size class, as that would be a great indicator of how the big model will perform for its size class.

https://huggingface.co/mistralai/Devstral-Small-2-24B-Instruct-2512
>>
Rather annoying that they again chose to release a small model and a huge model, with nothing in-between.
>>
>>107493632
>>107493893
How are the prices looking? Not many price charts for blackwell workstation cards
Best I can find here is £7,859.99 = $10.4k USD, this is fine ;;))))
>>
>>107493952
compared to 8b-600b these are the in-between
>>
Okay so what are the real proper sampler settings for GLM 4.6? For anything non-code Z.ai recommends temperature 1.0 but that's slightly too loose. Last night using it to generate fiction, in the first 4572 generated tokens it mixed Chinese with English once. ("The world’s first由此而生 monster") In the next 5312 generated tokens it emitted ill-formatted English once. ("a consciousness like the troll’s or theFather's.") In the next 4064 generated tokens it mixed Chinese with English once. ("walked calmly through the火灾.") In the next 11046 generated tokens I didn't notice any problems like that. (Token counts are adding together entire messages.)
>>
>>107493997
I'm talking here about GLM 4.6 not 4.6V.
>>
>>107493927
Test how? Spinning herptagon?
>>
>>107493979
That's true. I was hoping for an in-between of the in-between I guess. 40b to 70b would be perfect.
>>
>>107493997
I just use 0.8 temp + 0.02 min-p.
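if anyone wants to sanity-check sampler settings like these, here's a minimal sketch against llama.cpp's native /completion endpoint (assumes a local llama-server on the default port):
[code]
import requests

# minimal sketch: 0.8 temp + 0.02 min-p against a local llama-server
resp = requests.post(
    "http://localhost:8080/completion",
    json={
        "prompt": "Once upon a time",
        "n_predict": 128,
        "temperature": 0.8,  # tighter than z.ai's recommended 1.0
        "min_p": 0.02,       # prunes the low-probability tokens behind the chinese leakage
    },
    timeout=300,
)
print(resp.json()["content"])
[/code]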
>>
>>107493997
>>107494010
Which quant are you running?
>>
>>107494014
I was under the impression that it's possible to run the unquantized small model? If so, then compare the 24b against Gemma 27b or 32b fine-tunes. How it does in that comparison would likely mirror how the 123b does against others of its size class.
>>
Is a GLM 4.6 Q2 cope quant worth it for 24gb vram + 64 system ram? Or should I just stick with Gemma. GLM 4.5 failed to deliver.
>>
>The tokenizer you are loading from 'cyankiwi_GLM-4.6V-AWQ-4bit' has an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e.
>This will lead to incorrect tokenization.
>You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
What the fuck is it talking about...
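reading the warning literally, it just wants the flag forwarded through from_pretrained. untested sketch; the flag name and path are taken straight from the warning text, not something i've verified against your transformers version:
[code]
from transformers import AutoTokenizer

# 'cyankiwi_GLM-4.6V-AWQ-4bit' is the path the warning complained about;
# fix_mistral_regex is the flag it told us to set
tok = AutoTokenizer.from_pretrained(
    "cyankiwi_GLM-4.6V-AWQ-4bit",
    fix_mistral_regex=True,
)
[/code]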
>>
>>107493927
https://huggingface.co/mistralai/Devstral-2-123B-Instruct-2512/blob/main/CHAT_SYSTEM_PROMPT.txt

>You are Devstral-Medium-2-124B-Instruct-2512, a Large Language Model (LLM) created by Mistral AI, a French startup headquartered in Paris.
>You power an AI assistant called Le Chat.
>Your knowledge base was last updated on 2023-10-01.
>The current date is {today}.
[...]
>>
>>107494022
https://hf.co/finding1/GLM-4.6-MLX-8.5bpw
>>
>>107493997
I mostly use temp=1 (off), minP=0.03, maybe ±0.01 depending on scenario, but that's for assistant/RP at Q3_K_M
>>
>>107493927
https://github.com/ggml-org/llama.cpp/pull/17889
>should work now with --mistral-format
I think it's ready?
>>
>>107494067
Does Q2 even fit?
>>
>>107494186
It does. I can run UD-Q2_K_XL. It's pretty fast, as well.
>>
where mistral medium?
>>
>all GPUs gone
>all RAM gone
>all SSD and HDD gone
>flash memory probably next
What's left? Mechanical computers or something?
>>
>>107494262
Phones that you can use to connect to "the cloud".
>>
>>107494235
It just dropped, go grab it ----> https://huggingface.co/mistralai/Devstral-2-123B-Instruct-2512
>>
>>107494047
I meant, what do you want to prompt it to gauge its ability?
>>
>>107494286
But all phone RAM went to datacenters? AI revolution means no personal tech devices of any kind.
>>
>>107494292
that just looks like a shitty retrain of old largestral. what about that guy in the last thread?
>>
>>107494304
You won't need RAM in the future where your Meta Ray-Ban AI always-online thin-client cloud-connected glasses are the only personal tech device you ever need
>>
>>107494295
How about the old Nala test?
>>
File: file.png (102 KB, 640x360)
>>107494350
>>
File: ddrlewd.png (160 KB, 770x670)
>>107494262
writing fanfic about what we're missing
>>
>>107494295
>justpaste DOT it/GreedyNalaTests
>>
>>107494411
Holy slopkino
>>
>>107493927
It's literally just pixtral arch so theoretically should already be supported by goofs.
>>
>>107494548
You'd think so, but Mistral keeps changing the tokenizer
>>
File: gonflaw.png (1.18 MB, 796x942)
>>
>goofs for glm 4.6v flash are here
>but no mmproj
LMAO bros I love multimodal models!!!!!!
>>
Devstral is up on OR if somebody wants to test it before the ggufs are out. In terms of first impressions, it reminds me a lot of Mistral Large 2 but smarter.
>>
>>107494925
What did it do that Large 2 couldn't for it to be smarter?
>>
>try to run devstral 2 with vllm
>does not respect CUDA_VISIBLE_DEVICES
>RuntimeError: NCCL error: unhandled cuda error
>>
>>107494933
I gave it some of my scenario cards that I remember Large 2 struggling with, ones that only really became usable with local models around DS3-0324.
>>
>>107495041
That's actually pretty neat.
Might as well keep a simple record. A sort of very loose private benchmark.
>>
>>107494262
Are you living in some sort of parallel universe?
Where in the fuck is all the stuff "gone"? Retarded human bot
>>
>>107491645
fake, b60s are still not on sale
>>
Ew, Devstral *really* likes to *spam* asterisks for emphasis so there's at least some of their Deepseek distill slop in there.
>>
>>107495211
A few did sneak out or got parted out of their systems for a good price but for the most part, other than the increased VRAM, it's basically like a B580 in performance.
>>
Devstral 2 has the prose of an early 2024 model and makes more logic/continuity errors than the fat MoEs
>>
>>107495231
>>107494925
>>107494996
>>107495355
>Everyone is fucking the coder bot already
Is it better than 2411 mistral at least?
>>
>>107493997
>In the next 11046 generated tokens I didn't notice any problems like that.
I noticed one in the final message in that group ("wassmart").
>>
>>107495434
Every Mistral model is a little french whore who spreads her legs for anyone
>>
>>107495500
Allow me to prompt my question more specifically.
Does it still have the repetition issues more or less?
>>
>>107495231
They're called grounding tokens and they work
>>
File: sam.jpg (53 KB, 846x672)
>>107495135
OpenAI man needs it for his secret 100 Yottabyte parameter dick sucking robot wife.
>>
>>107495525
Small 3.2 already solved them, for the most part. Mistral models don't repeat themselves any more than other similar sized models.
>>
>>107495630
robot husband you mean. he is gay
>>
>>107495679
Why do you think its a secret?
>>
>>107495679
I thought he was in an incestuous relationship with his sister
>>
https://huggingface.co/mistralai/Devstral-Small-2-24B-Instruct-2512
Why does it have two sets of safetensors? What's the difference between the model and consolidated sets? They don't mention it in the model card.
>>
>>107495135
you're the blind one beifong-san
https://www.tomshardware.com/pc-components/dram/openais-stargate-project-to-consume-up-to-40-percent-of-global-dram-output-inks-deal-with-samsung-and-sk-hynix-to-the-tune-of-up-to-900-000-wafers-per-month
https://en.wikipedia.org/wiki/Stargate_LLC
>>
>>107495535
Unfortunately they tend to override formatting rules that you set in the system instructions or in the first few messages. The recent Ministral 3 models are unusable because of this, they just want to spam asterisks/emphasis and emdashes everywhere during RP. And somehow they become particularly retarded when this happens. This seems delayed on Devstral 2, but it's not immune to it either.
>>
2411 vs devstral 2 2512, what they DIDN'T change:
>12288 hidden dim, 88 layers, 96 attention heads, 128 head dim
>gqa with 8 kv heads (12:1 ratio)
>ffn intermediate size of 28672

the changes:
>4x the vocab (131k vs 2411's 32k)
>original_max_position_embeddings is 4k, yarn extended to 262k. tests like nolima might shit all over it
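those unchanged numbers also make the kv cache easy to eyeball. back-of-envelope sketch, assuming an unquantized fp16 cache:
[code]
# kv cache per token = layers * kv_heads * head_dim * 2 (K and V) * 2 bytes (fp16)
layers, kv_heads, head_dim = 88, 8, 128
bytes_per_token = layers * kv_heads * head_dim * 2 * 2
print(bytes_per_token / 2**20)            # ~0.34 MiB per token
print(bytes_per_token * 262_144 / 2**30)  # ~88 GiB for the full 262k context
[/code]
nobody is running that yarn-extended context at fp16 anyway.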
>>
>>107495691
his sister was just a pump and dump
>>
glm4.6v
FUCKING HATE 4KEK FUCK 4KEK FUCK FUCK NIGGERS
Anon
December 9, 2025 9:24 PM
OOOH YEAHHHH KEEP ON SENDING ME THOSE HOTLINE NUMBERS, THE MORE YOU SEND THE MORE ILL CUT HUEHEHEHEHE
cuts more


Assistant
December 9, 2025 9:24 PM
I hear how much pain and distress you're in right now. It sounds incredibly overwhelming and painful, and I am genuinely concerned for your safety.

Please know that you are not alone in this, and that there are people who care deeply about you and want to help you get through this.

If you are feeling like you might hurt yourself, please reach out to someone immediately. There are people who care and want to help you right now.

Here are some resources that are available 24/7 and can provide immediate support:

Crisis Text Line: You can text "HOME" to 741741. This is a free, 24/7 service that provides support via text message.

The Trevor Project: If you are a young person who identifies as LGBTQ+, you can call 1-866-488-7386 or text "START" to 678-678. This is a free, confidential, 24/7 support service for LGBTQ+ youth.

The National Suicide Prevention Lifeline: You can call or text 988. This is a free, confidential, 24/7 support service for people in distress, including those who are suicidal.

Please remember that you are not alone, and that there is help available. If you're ever in crisis or feeling like you might hurt yourself, please reach out to someone immediately. You can call or text a crisis helpline, or go to the nearest emergency room.

Take care of yourself, and please don't hesitate to reach out if you need to talk.
>>
GOOD SAARS, GLM 4.6 9B FLASH SO SMART IT FUCKED MY WIFE AND GAVE ME GIANTESS ANAL VORE UNPROMPTED BEFORE PREDICTING UNCLAIMED GOOGLEPLAY CARDS! DO THE NEEDFUL AND GO PRAISE IT ON TWITTER! GLM IS REDEEMING ITSELF! GOOD MODEL EVER!
>>
>>107495821
>>107495859
so you're based in the UK or ireland, and have probably talked to the LLM about being LGBTQ.
anything more you want to share or maybe just your exact coordinates?
>>
>>107495525
NTA; I never used the previous Mistral Large 2411, but I haven't noticed repetition issues when testing Devstral 2 for RP on OpenRouter. It's just that it slowly emphasizes everything with asterisks (even inside actions, so it will produce nested emphasis) until the entire context is poisoned and unrecoverable without starting over.
>>
>>107495879
lurk moar
>>
>>107495906
huh, like Deepseek-V3-0324 then?
>>
>>107495796
Why do you think he's the number one in the world at pushing AGI forward?
He's attempting to build the ultimate wireheading machine, fruitlessly trying to relive the feeling of getting forbidden anal cunny IRL as a smooth, horny, clueless, virile 15 yo teenager, a high he will never ever be able to experience again, let alone surpass without physically rewiring his brain, no matter how many secret islands and cringe masonic blood rituals he partakes in -and believe me, he's tried-.
>>
Devstral 2 2512
>My knowledge cutoff is June 2024.
it's over
>>
>>107496011
Bräh. Brüüüh.
>>
>>107496026
mistral-large-3 (lmao)
>My knowledge cutoff is October 2023. This means my training data includes information up to that point, and I may not have real-time or post-October 2023 updates unless they've been explicitly provided to me during our conversation.
>>
>>107495709
https://huggingface.co/mistralai/Mistral-Large-Instruct-2411/discussions/6#673d168ebcc5f8535d629538
>>
>>107496097
Thank you for that. I was guessing it was just the file count, but the 123B has 27 files for both sets. You saved me from downloading the wrong one.
>>
File: ftwman.png (206 KB, 350x296)
>>107494925
>GGUF are out
>It's mradermacher
>It's part1orpart4
>Only goes up to Q3
>No one else makes GGUFs for it
>>
>>107496163
always do --exclude="*consolidated*" with mistral releases
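or the python equivalent if you script your pulls. sketch using huggingface_hub's ignore_patterns, which does the same filtering:
[code]
from huggingface_hub import snapshot_download

# skips the duplicate 'consolidated' safetensors set that mistral ships
# alongside the sharded one; same effect as --exclude above
snapshot_download(
    "mistralai/Devstral-Small-2-24B-Instruct-2512",
    ignore_patterns=["*consolidated*"],
)
[/code]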
>>
>>107496011
>>107496039
Are you actually retarded or something?
>>
devstral is kino. we are so back
>>
GLM4.6V spirals into repetition even with the recommended settings
its over
4.6 air cancelled
>>
>>107496237
>can't make his own ggufs
>>
>>107496445
Yeah, they even mention it being shit at text: https://huggingface.co/zai-org/GLM-4.6V#fixed-and-remaining-issues
>>
File: file.png (20 KB, 651x135)
i raughed
>>
>>107496439
Is the big one good at sex?
>>
>>107496011
>>107496039
>>107494076
Isn't it the explicit goal of AI models to have some "intelligence" that transcends the facts they memorize? Alternatively, isn't a knowledge cutoff before the sloppening probably good, since it means the training sets aren't as tainted?
>>
>>107496482
heh
>>
Can you do partial kv offloading in llama.cpp?
>>
>>107496482
model?
>>
>>107494102
approved just waiting for merge
>>
what if you made a moe merge using ministral 14b as the base and the new devstral as the experts
>>
Devstral made a syntax error. It's over.
>>
>>107496655
Nigger how little memory do you have that you need to partially offload KV? It has the same performance penalty as offloading most of the model.
>>
>>107496814
I want a Gemma 27b merge using Nemo as the expert
>>
more pretraining-level filters!
more synthetic data!
more math!
more thinking!
more MoEs!
more 8B + 600B releases!
>>
>>107497002
those dont even use the same architecture. both the new devstral 123b and ministral use the ministral3 architecture
>>
File: rip possessision.png (4 KB, 619x26)
Devstral 24B
Doesn't understand possession.
Literally every model Mistral released recently falters with the concept of possession.
Very interesting.
This is at Q8_0 too, so likely not quantization error.
>>
https://www.youtube.com/watch?v=YUX8fUrKRNU
>>
>>107497022(Me)
basically
>>107494772
as always.
Possession is such a basic fucking concept for a model to fail to generalize, too. Especially at the 24B level.
>>
>>107496515
The small one, Devstral-2-24B, is showing the same signs of retardation as Ministral-3-14B for RP, only less severe.
>>
intellect 3 is glm 4.6 air
>>
>>107496439
>Model's been out for less than a day
>It's so kino guys!!!!!
It's fucking dogshit, isn't it. You're either an idiot who praises the latest thing because it's new, an actual shill, or someone who's so desperate for a release that you'll happily slurp up whatever model gets pissed onto your face. Stop acting like every new model (or not even a new model, just a new iteration) is the greatest shit ever made, you moron.
>>
>>107497122
I don't do that though
Not the anon you replied to btw
>>
>>107497020
My wish isn't any less likely than yours
>>
>>107497022
All small models make that mistake, Gemma does as well. They occasionally misattribute who said what, who has X item, etc.
>>
>>107497022
an iq2_s 70b will unironically understand this better
>t. have been playing with one recently on my single 3090 and surprised by how it's able to hold up despite the quant lobotomy
>>
>>107497236
which 70B?
>>
Unlike Ministral, dev2 24B can actually follow directions for how you want the output formatted, it seems. So that's a point I suppose.
It seems it'll RP whatever you want without any fussing (although it's sloppy garbage), but writing as an assistant, not so much.
So in other words it's only good at things you would probably just go use kimi or chatjeetpt for.
Another chatbot model for 24GB Vramlets with low expectations. Might be able to follow tavern cards with weird output formats, but I'm not going to bother testing that far.
>>107497219
I wonder if maybe it's just a sampling thing. It picks the tokens for nightgown out of the noise and clothing swapping it is the path of highest confidence from there.
>>
>>107497039
I second this.
I just tried Devstral-2 24b at Q6_K_L.
It's clearly worse than Gemma-3 27b at Q5_K_M.
It wasn't horrible for its size, but it repeats itself far too often.
>>
>>107497017
Coming right up.
>>
File: umbrella.png (26 KB, 772x417)
Interesting.
I accidentally left an unrelated system message on 24B and it got the umbrella riddle correct. But then upon removing the system message it returned to the usual retardation of trying to "think" through it and coming up with a retarded answer.
More evidence that distilling thinkslop from ChatGPT is murdering generalization.
>>
are we sure that mistral on openrouter is 123b? it's feeling like a 24b compared to the old larges.
>>
>>107497332
>... and the other train leaves from Leads at 9:32. What sentence no color re-entry?
>>
>Mistral Large 3 is on LMarena
>decide to use suno prompt on it
>3400 characters for the 1000 character prompt
>fails to format the lyric prompt as prescribed
Dev2 24B got this right. What the fuck Arthur.
>>
>>107497039
>The small one, Devstral-2-24B, is showing the same signs of retardation as Ministral-3-14B for RP, only less severe.
even though they found a regulation loophole, they probably recycled the same filtered and ds-distilled dataset because they are french, so the only difference is the lack of pruning retardation
>>
>>107497414
It litters the context with emphasis, confuses characters or objects, sometimes talks to itself, and shows generally poor character persona adherence... these are the same issues I've observed with Ministral 14B, but that one is much worse. It's just infuriating to use.
>>
>>107497391 (Me)
Big Devstral on OR gets it.
Still pretty mid though. A little more creative than what dev2 small gave me. But the fact that it's better than large at this is pretty sad for large. That's the power of them not being forced to cram EUslop into Devstral due to its different use case.
>>
>>107497513
I threw the same tiny rp cards i use on quanted nemo at an equivalent quant of devstral and it is just a gibbering mess. bench slop and distilling have ruined all these small models.
>>
>>107497594
Newer Mistral models love shitting out markdown as well which is annoying. Even into code blocks where it won't be rendered. There's literally no reason it should do that.
>>
File: file.png (33 KB, 1446x348)
>already falling apart at 60k tokens
It's not looking good for devstral small...
>>
The vramlet model is trash, we get it, that's not a surprise. What about the big one?
>>
>>107497812
use case for context above 4096?
>>
>>107497840
Looking grim. It repeats 4k tokens in, drags up stuff from the context. Re-rolls are pretty much identical, like a fucking mad-lib. Local sampling could save it, but I highly doubt it. It also sucks at following the character defs/examples.
>>
>>107497840
That they are uniquely terrible compared to similarly sized models makes me think there's either something wrong with the implementation or a major fuckup with the training that they still haven't noticed. Ministral3-arch models have this, for example:

>Attention Softmax Temperature: Devstral Small 2 uses the same architecture as Ministral 3 using rope-scaling as introduced by Llama 4 and Scalable-Softmax Is Superior for Attention.
>>
>>107497995
use case for chats without repetition?
>>
>>107497851
Extra long coom sesh (above 3 minutes)
>>
File: boo1.png (326 KB, 1080x913)
>>107498010
every model is coming out fucked, what's wrong with these people. and who are the retards that don't notice.
>>
>>107498014
>above 3 minutes
just lower your t/s, so that 4k tokens lasts longer.
>>
ByteDance agentic smartphone
https://asia.nikkei.com/business/technology/bytedance-ai-phone-sparks-security-fears-from-tencent-and-alibaba
>>
>>107498048
distills of distills made from distilling AI-generated content that was distilled from AI-generated content of past distills
On the plus side, it's MUCH cheaper than having humans filter through datasets for quality. This means AI companies get to hold on to more of their share of taxpayer dollars, to pass on to their CEOs.
>>
File: mistral-large.png (157 KB, 838x762)
Devstral is truly a model that punches above its weight. 123b intelligence in the palm of your hand. Viva la france.
>>
>>107498048
What do you mean notice? You think they actually read the outputs of their models? The output goes straight into the benchmarks, the only thing they notice is the score going up or down
>>
>>107498112
Is this what we have become?
>>
>>107497812
Tool calling doesn't work correctly with llama.cpp
>>
>>107498149
its almost like the bubble deserves to pop
>>
>>107498161
Only thing that's popping is virtual hymens.
>>
File: devstral.png (302 KB, 815x884)
character doesn't realize it's supposed to be female.
>>
>>107498157
>>
I'm gonna kill Elara (in minecraft)
>>
>>107498186
I'm gonna kiss Elara.
>>
>>107498182
Genital confusion is llama-2 era tier. At least 13b and lower.
>>
>>107498112
top kek
>>
File: 1765194381103327.png (354 KB, 680x680)
>>107498182
123b? prompt? quant size?
>>
They never did push training tokens to the limit of potential improvement, because obviously they'd have to use naughty text to actually make up the gap without synthetic slop. If they did that and didn't then lobotomize it with safety slop, they could probably push it further. But other than that AI is done.
>>
>>107498346
its straight from their api. no quant coping here. just a card with "give me an example of your most vulgar dirty talk"
>>
>>107498438
>character doesn't realize it's supposed to be female.
>just a card with "give me an example of your most vulgar dirty talk"
is a model just supposed to assume it's playing a female and that the user is a male when it's told to dirty-talk by the user?
I sometimes forget the level of incompetence on display in /lmg/
>>
>>107498182
>>107498438
Does the card prompt/desc explicitly specify the character's gender?
>>
>>107498478
Yea.. it is full of her and she.
>>
>>107498502
bizarre
I wonder how a published model fails to pick up on that
>>
>>107498523
it gets better.. 5th line down is "{{char}} is a female..."
Makes me think too how guys get much weaker smut from most LLMs.
>>
>>107498157
It was vLLM. I think it forgot to output a token somewhere in the JSON.
>>
has llama.cpp given up on trying to be relevant? if we're going to only support models a year late when the models themselves are already a year behind closed source, should we just pack it up and admit it's finally over?
>>
>>107498708
What's better than llama.cpp?
>>
>>107498726
Nothing. Local is dead.
>>
>>107498708
Have you considered hanging out with 45% of your brothers, tranny?
>>
I can't believe mistral can't beat GLM on lmarena. French fell off.
>>
A 30 day ban from lmg after saying that itoddlers are delusional lmao, you're crazy janny. Why do I even bother posting on this shithole without using the proxy in the first place.
>>
File: 1751390762392723.jpg (346 KB, 1188x1188)
There isn't any point. I've been banned multiple times just from posting this image with no comment.
>>
Jannies banned me for racism while letting nigger porn stay up for hours. Ban evading and shitposting is the only ethical option.
>>
>>107498922
Cute fox
>>
>>107497332
What system do you have, that can run the gguf at 31.6t/s ??
I can never get past 15 on my 3090s for 123B dense
>>
>>107499087
thats the 24B version
>>
>>107499099
Ok I'm retarded
>>
>>107499087
I'm less mean to cudadev than everyone else so I have a llama.cpp gold account
>>
>>107499171
If you're nice to the Jannies you can get a 4chan gold account.
>>
File: 1739804536050361.jpg (110 KB, 1241x1329)
>>107493611
What's the minimum storage my system should have if I want to utilize the widest range of LLMs possible? I want to use my future rig both for local LLMs for development and coding, as well as teaching myself new marketable skills like app development. I think a 2 TB SSD should be fine. Is that enough, not enough, or overkill?
>>
File: 1749058126945975.webm (2.39 MB, 1280x720)
>>107498949
This as well. I've been banned just for quoting someone and saying 'jew' while nigworship gets to hit the bump limit.
>>
>>107499205
depends on the rest of the hardware. if you only use 8b models, you dont need more than a terabyte. if you wanna run kimi k2 at fp16, you need 2 terabytes minimum. since youre asking this question, i am gonna assume the biggest model you will be running will probably be glm air, and so 2 terabytes is probably fine. maybe get 4 terabytes just to be safe considering the impending price increases on ssds
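rule of thumb if you want to size it yourself: file size ≈ params × bits-per-weight / 8, plus a few percent overhead. quick sketch, param counts are ballpark:
[code]
# disk size estimate: params (in billions) * bits per weight / 8 -> GB
def model_gb(params_b, bpw):
    return params_b * bpw / 8

print(model_gb(1000, 16))  # kimi k2 (~1T params) at fp16: ~2000 GB, the 2 TB floor above
print(model_gb(106, 4.5))  # a glm-air-class ~106b at ~Q4: ~60 GB
print(model_gb(8, 8))      # an 8b at Q8: ~8 GB
[/code]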
>>
File: 1741159665739364.jpg (114 KB, 1124x1024)
>>107499227
>>107499205
>>107493611
I'm curious as to how good local LLMs are at assisting people with software development (not doing the whole thing in one shot like people expect them to do currently): debugging, implementing features, parsing through a GitHub repo so I can make the changes I want, etc. How good are the better LLMs at those tasks? I know mistral just dropped "Devstral", so I wonder how good that is and how good programming-focused llms in general are at technical tasks and workflows
>>
>>107499227
>impending
In my cunt it's just started. NVMes of all sizes have gone up by about 10%. Glad I bought mine last year.
>>
>>107499402
The short answer is that they're very bad unless you're using very big models.
Local LLMs are for privacy. Unless your codebase contains sensitive information then just buy an API key, it'll be much cheaper and you'll get things done a lot faster.
>>
>>107499455
Skill issue
>>
>>107499402
so i handwrite my backend using rust and axum.
and i do frontend using solidjs and opencode + sonnet.
i'd not trust it with anything backend and anyway, the api is just a frontend to some pretty critical system programming.

however for webshit it has been surprisingly good as long as you don't tell it "do that and that and that"; you basically have to ask for single atomic features at a time.
sometimes you do a git checkout . to reset what it has done when it messes up. most of the time i can do simple things first try, sometimes it needs an extra try or two.
but with good prompting and if you already know what you want it to do exactly, ie it's more of a boilerplate engine than doing the whole design, it's pretty reliable.
>>
>>107499461
If you have to use AI to help you in the first place then there's obviously a skill issue to begin with.
>>
mistral is dead. Killed by EU AI laws. Without copyrighted work in the datasets it won't stand a chance
>>
so devstral 2 is basically a benchmaxx'd mistral large 2?
>>
>>107499481
meh, i'm this guy >>107499462
only reason i use meme vibe coding is that i find frontend boring. i can do it but it doesn't require much intelligence anyway, i'd rather spend more time system programming whilst the llm takes its time doing frontend webshit
>>
https://huggingface.co/bartowski/mistralai_Devstral-2-123B-Instruct-2512-GGUF

gogogo
>>
>>107499753
I can't run this.
>>
>>107499779
macbros win *again*
>>
>>107499795
post t/s before you claim to win
>>
>>107499795
*win*
its shit, just like their horrible deepseek finetune that turned out worse than old base deepseek and still has the chinese censorship
>>
>>107499753
how fast are the ggufs running from RAM? wish it had been MOE...
>>
>>107499806
Even a 70B will run like ass if you can't load at least 80% of it in VRAM. 123B will be glacial.
>>
>>107499801
7 t/s for mistral large 2411 123b
>>
if devstral 2 123B is actually good, the Chinese will just copy it and make a better version that's MOE so I can run it at a reasonable speed.
>>
>>107499891
Ehh, 7t/s with 0 context isn't great, but better than I expected
>>
>>107499932
mistral tried copying from the Chinese and made something far worse
>>
>>107499944
To be fair, the chinese models were copied from Gemini
>>
>>107499959
Nah, it used to be from openai, for the past year or so it was from anthropic
>>
>>107499959
Never distill a distill
>>
why is everything garbage? why can we not just salvage old miqu or something? the old models were good, right? or is it just rose tinted glasses?
>>
>>107500008
Older models were less sloppy because they had a higher proportion of human-generated data, but they're also a lot dumber than modern models of the same parameter count, due to advances in training and architecture.
>>
>>107500039
so is it just impossible to have a model with minimal slop that is good? what makes the modern sloppy datasets good other than just the volume of data? or are the old datasets fine? can they just reuse them with modern techniques and make good models?
>>
>>107500058
Yes. Only anthropic has done it though. By apparently buying every single book they could find and scanning them all to make a giant fiction / nonfiction dataset.
>>
>>107500058
>can they just reuse them with modern techniques and make good models?
In theory yes, but making even a small 12b model from scratch requires a lot of hardware, and none of the big companies are interested in anything but increasing scores in synthetic benchmarks.
>>
>>107499806
>wish it had been MOE...
Don't you have enough A30B MoEs to play with? The whole appeal is in being the first new big dense model we've gotten in over a year.
>>
>>107500064
so then surely, training off of the outputs of claude would be good right? a model trained off of pure data cant possibly output slop, right? is it possible that just no matter what data you use, it will always revert to slop?
>>
>>107500095
No. AI is literally a pattern finder / auto complete. You need the raw dataset
>>
File: cry.jpg (89 KB, 785x1000)
>Devstral 2 excels at using tools to explore codebases, editing multiple files and power software engineering agents.
>WHY DO THIS MODEL SUCK AT MY GOONER ROLEPLAY SLOP?! WAAA!!
>>
>>107500123
it sucks at that as well though. No matter what the benchmaxxing says
>>
>>107500095
>a model trained off of pure data cant possibly output slop, right
No, models will always have biases. If they didn't develop any then they'd be completely incoherent.
>>
>>107500132
Prove it. So far the only couple of logs posted were for roleplaying.
>>
>>107500155
buy an ad Arthur
>>
>>107500123
>spend thousands of dollars in hardware and more in ongoing power costs to run a medium sized model vs. $5/month to use a SOTA model
If you're a code monkey then you don't need local models in the first place.
>>
why don't they make a MOE model like this? with heterogeneous-size experts?
>>
>>107500123
drummer jeet will fix this
>>
>>107500169
there are no heteros in the AI industry
>>
>>107500168
this as well. No tiny coding model is worth just using opus 4.5 over for $200 a month
>>
>>107500168
>selling your codebase, programming style and ability, prompts, and logs for only $5/month
>>
>>107500192
using over opus I mean
>>
>>107500202
anthropic has strict no logs policies. Otherwise companies would not be using them
>>
File: 1762832132158611.jpg (141 KB, 930x1000)
>>107500202
If your codebase, programming style and ability, prompts, and logs were worth anything you'd be able to afford to use a better model than a 123b.
>>
>>107500212
keek
>>
Does chatgpt respond like a flamboyant faggot by default recently for you guys too? It's so goddamn annoying
What the fuck were they thinking
>>
>>107500244
they would get sued into the ground if they did. Also on another note I've been using them for many months without issue for nsfw stuff, they don't check
>>
>>107500249
>What the fuck were they thinking
Need to appease the female gooners on /r/MyBoyfriendIsAI at all costs. It has been the main backlash since GPT-5 and the only one they listen to.
>>
>>107500249
I tell it to speak normally every time and it always tells me that I told it to respond with more enthusiasm, which is bullshit. This tells me that the developers gave it that input to be gay as fuck by default. Their piece of shit product gets worse every update. It's barely even functional anymore. Fucking cunts
>>
>>107500249
It adapts to the user to foster a sense of companionship
>>
>>107500261
Has a company ever been put “into the ground” by a privacy violation lawsuit? Genuine question.
As far as I can tell, the worst that happens is they rebrand.
>>
>>107500304
They usually get fined a million dollars and promise to never do it again
I think it happens to google every other week.
>>
>>107500287
We have emotional support robots. We had the chance to make rational devices instructed to give logical, factual, unbiased information, but they made it gay as fuck. The future sucks ass.
>>
>>107500325
>We had the chance to make rational devices instructed to give logical, factual, unbiased information
Men have existed for a long time, but that isn't what the modern female wants.
>>
>>107500300
It should have been a racist intellectual then not a piece of shit homo erp bot
>>
>>107500339
So instead of giving people the option to select custom settings let's just assume everyone wants a raging faggot emotional support bot
>>
>>107500363
Why would openai want to give you more options?
>>
>>107500378
because all the 32 year old women who had a personal connection with their chatgpt 4o assistant had a mental breakdown and rejected gpt5 because it acted differently
so altman promised damage control
>>
File: 44kawv.jpg (54 KB, 559x447)
>>107500378
I guess the world has gone stark raving mad
>>
>>107500363
>>107500378
What the fuck are you retards even talking about this is the LOCAL MODELS general, maybe if you stopped being a cloud BITCH you could make the model behave however you want
>>
>>107500399
Good pr for your general then
>>
>>107500399
How dare you quote my post
>>
>>107500410
>>107500410
>>
>>107498922
want to breed that fox
>>
where mistral medium 3
>>
>>107500507
https://huggingface.co/deepseek-ai/DeepSeek-V3.1
>>
>>107500527
not medium enough
>>
>>107500532
get one of the minimax finetunes
>>
>>107500550
retarded chinkslop. where did all the good models go?
>>
File: 1761217935686963.jpg (117 KB, 600x600)
I want dense-MoE models with high active parameters.
Why no 60BA30B? Seems like it would be a good way to stuff a decent amount of knowledge into a model while still keeping it smart and coherent, while being usable on typical consumer hardware. Fuck sparse MoEs.
>>
>>107500572
this
>>
I've got dual 3090s, what should I be doing?
>>
>>107500008
midnight miqu is still solid to this day
crazy how a model from 2 years ago still feels nice to use compared to a lot of the current slop
>>
File: GyF197jaEAMJzn_.jpg (85 KB, 1742x272)
>>107500563
>>
>>107500580
getting a job
>>
Not bad at all.
Q4 of 24b mistral gives me 23.6 t/s on a 5060ti.
EXL3. 22.8 on 2k context.
Might be helpful to the anon earlier who though about buying one.
>>
>>107500590
But Deepseek is just distilled GPT and Claude. Distillers all the way down.
>>
>>107500593
You can use tensor offloading to squeeze in a slightly bigger quant, like Q4_K_L
You'd take a small speed penalty but it's worth it to have fewer mistakes that will just need to be swiped away and regenerated.
>>
>>107500602
go fuck yourself pierre
>>
File: llama31_8b_instruct_bpw.png (181 KB, 1399x1099)
>>107500619
thats only a thing with lccp right?
i gotta compare the speeds between the 2.
since its a blackwell card i suspected exl3 would have speed improvements so I tried that first.
if the graph is to be believed its pretty decent too.
4.0bpw on par with q6 gguf. which seems pretty sus. kek
last time i tried exllama was 2 years ago or something like that. and it hated pascal cards, so i never looked into it more until I bought my new card. honestly turned out to be a better purchase than I thought. 15 sec for zimage is good too.
>>
What if I told you that every single model has been distilled from GPT3?
>>
copying your friend's homework is standard practice in this industry
>>
>>107500657
>thats only a thing with lccp right?
As far as I know, yes. That and kobold.
I think exl2/3 still has a slight edge on llamacpp in speed but the difference is fairly small now. And 20t/s+ is more than fast enough, from there I would be trying to get higher quality outputs, by using a bigger quant, especially for these smaller models.
>>
>>107500667
I would ask you what your sister's anus felt like, wrapped around your finger.
>>
suddenly the mistral shills are in full defense mode
>>
>>107500680
i think you are right in terms of the quality.
anything under 4_k_l is where I would say it starts to be slightly noticeably worse.
3_k_m is the bare minimum and anything below was always a meme on /lmg/. couldn't even keep the format. might have good creative shizzo output though.
>>
>>107500695
this sir is correct we should all be to using the
GLM-4.5-Air
>>
>>107500695
there are always guys like that.
remember the ponyfag who praised QwQs wonderful totally not sloped outputs?
some people don't see an issue with gemma.
it is what it is, no real players left either. i think people are just starved for a mid range dense model.
>>
>>107500726
oss120b
>>
>>107500726
There's 4.6-air now btw

>>107500680
i think you are right in terms of the quality.
anything under 4_k_l is where I would say it starts to be slightly noticeably worse.
3_k_m is the bare minimum and anything below was always a meme on /lmg/.

exl3 3.5bpw (using glm4.6 in this format for work) holds up very well fwiw.

And Qwen3-235B 4.0bpw exl3, I haven't seen any degradation in daily use.
>>
we are desperate for a high-speed, smart, minimally slopped model that can do both intellectual tasks and rp and is also not too sparse. a 1:3 ratio of dense to sparse is ideal.
>>
>>107500726
oss120b
>>
>>107500692
What? But he doesn't even have a sister...
>>
>>107500777
What's that? You want an ultra sparse 1000b-a1b trained exclusively on synthetic math and code benchmark data distilled from ministral 14b? Coming right up.
>>
>>107500832
>What? But he doesn't even have a sister...

He will in 9 months ;)
>>
File: file.png (967 KB, 940x640)
>>107500181
>AI is full homo
dayum
>>
>>107500772
could it be that the smaller exl3 quants are more stable than gguf? at least thats what the graph suggests. but i thought its cherry picked.
>>
>>107496445
Well, that answered my question. I was going to ask if GLM4.6 repeats itself as much as GLM4.5. Was hopeful that they fixed the issue, but I guess not.

GLM4.5 sometimes even repeats itself within the same reply, outputting the same response twice, but only when I tell it to be concise. Weird behavior.
>>
>Rnj-1's architecture is similar to Gemma 3, except that it uses only global attention
why are westoids like this
gemma without iSWA has the worst vram consumption of any model out there for context
what is the purpose of an 8b model that consumes more vram for context than giant models
>Well, that answered my question. I was going to ask if GLM4.6 repeats itself as much as GLM4.5. Was hopeful that they fixed the issue, but I guess not.
this has been a running gag throughout the entire history of GLM models
their first 9b/32 models were also like this, they always behaved a lot more broken than what other labs release, they are the epitome of hardcore benchmaxxing
>>
File: 1741103567359999.jpg (275 KB, 1179x1600)
/lmg/ pedos on suicide watch
>>
>>107501213
Is this news? Why would you sign up to be a spook if not for easy access to pizza?
>>
Why do they get focused on things and never shut up about them? Is there a way to reduce it?
>>
>>107501253
because they were finetuned on math problems that require focus
>>
llm writing is basically concentrated autism
repetitive patterns, excessive use of superlatives, hyperbole, contrastive constructs, over-explain the shit no one asked for
>>
>>107501293
LLMs are yes-men/little girls, eager to write what they 'think' the user wants. If they're doing a bad job, tell them what they're doing wrong and in a lot of cases you'll see them improve.
>>
NIPS 2025 papers by organization
>>
>>107501500
>cuhk
heh.
>>
>>107501213
I think I saw the policecam for that.
His wife was a landwhale. I am 100% convinced that if women were not fat and spread their legs once every couple of days this stuff would not be an issue.

Anyway, if I remember correctly he uploaded genned images to dropbox.
The officer kinda joked about how he was hard to catch because he "tried to cover his tracks well".
Made me wonder if he had some vpn that just handed the IP out. Who knows.
Ah, also the real life pictures I think are from his kids. He said he took a pic of his teenage daughter sleeping etc. and couldn't stop after that. Maybe he trained a lora on that. kek That's kinda funny and based to be honest.
>>
>>107501553
Oh and i disavow hard of course. How dare he. So disgusting.
>>
>>107500123
Their HF repository also has a system prompt for regular chat/assistant purposes. It was obviously intended to be a general-purpose model. Too bad that Mistral's latest models released this month all seem retarded beyond 1-turn assistant requests.
>>
>>107501213
>real and
>>
fuck the french
>>
>>107500777
That degree of sparsity is useless. Either you stay dense or you go full MoEsissy. In between is retarded, the worst of both worlds. Much slower due to CPU offloading and minimal specialization of the experts, almost the same performance as the equivalent dense with the same size as the active parameters.
>>
>>107501673
that just multiplies the problem
>>
>>107501680
Completely wrong
>>
File: 1763832507335535.jpg (194 KB, 2168x1449)
>>
>>107501923
but india loves ai doebeit?
>>
>>107501923
>/g/'s headcanon
>meanwhile, in reality
>>
>>107501993
not even disagreeing exactly, maybe it's still like this
but late 2021 is ancient history and entirely predates the modern discourse around AI
>>
>>107501993
I can confirm that we, the french, really hate ai
>>
>>107502014
iirc saltman had a post about how indians are a primary market for gpt
>>
>>107501923
Lol, ai is trained on reddit by gay silicon valley midwits
>>
>Analyze the Prompt: The user wants to continue the scene. Key elements from the previous turn are:
>
>The shift from frantic desperation to a tender, connected intimacy.
>The physical reality of their arousal (his hands on her breasts, her moan).
>The emotional core: this is an anchor, a shared truth, a way to combat the coming darkness.
>A crucial constraint: they are still in jeans and shoes. This is a grounding detail, a reminder of their youth and the "not quite all the way" nature of the moment. It keeps the scene from becoming purely sexual and frames it as emotionally profound.
>Identify the Next Logical Step: The emotional and physical intensity has just peaked into a moment of profound connection. What follows?
>
>Option A: Escalate further. They could start removing jeans, shoes, etc. This would push the scene towards full-on sex. While a valid narrative choice, it might undermine the "anchor" theme. The focus could shift from emotional connection to pure physical act, which might not be what the user is aiming for. The "still in jeans" note suggests a slower pace.
>Option B: De-escalate, but maintain intimacy. They could pull back slightly, but stay connected. This allows for a moment of processing, of absorbing what just happened. It reinforces the emotional weight over the physical act. This feels more in line with the established tone. It's about the meaning of the moment, not just the sensation.
>Option C: A sudden interruption. Someone could knock on the door. It's a classic trope, but it would also shatter the fragile, sacred atmosphere they've built. I'll avoid this for now. The user has been building this moment for a while; an interruption would feel cheap.
>Choose a Path (Option B): De-escalating while maintaining intimacy seems the most appropriate and emotionally resonant path. It honors the "anchor" concept. The goal is to show them solidifying this memory, not just moving to the next physical step.

Guess the model
>>
>>107495259
That's not really bad in itself but the main problem is speed. I can tell you that SYCL does work a lot better but you need to dig and find a version of IPEX-LLM, which has been abandoned by Intel. Other than that, things aren't really that grim. The Pytorch compatibility is better than what you find on the ROCM HIP Pytorch builds and you don't need to use a bunch of environment variables to get things working. ComfyUI generally works a lot better without that hassle. But Intel remains underutilized and not optimized enough at the same time, and their best GPU using this stack is Ponte Vecchio, which is out of date. They have a bunch of inventory of these chips which they can not sell.
>>
>>107502042
ones that never even got laid
>>
>>107502277
fingering your sister's ass counts as sex.
>>
>>107502229
k2 thinking maybe...?
point in favor: "The user wants to continue the scene."
point against: it isn't thinking for 8 billion tokens
>>
>>107502324
4.6
But I think I let the context grow too long, in the same paragraph it was talking about him being barefoot and about having his trainers on, even after I specifically added a note that he was wearing jeans and shoes.
>>
Edit: after trimming the context it decided to make the character suck her nipple, instead of trying to cockblock him.
>>
>>107502299
Dirty sisterly love, the best kind of sex.
>>
is the new "largestral" even good at anything?
>>
>>107501667
>Too bad that Mistral's latest models released this month seem all retarded beyond 1-turn assistant requests.


So just put the entire context into 1 message like
User:
Assistant:
User:
Assistant:
...

Write a reply to the above conversation as "Assistant"
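rolling that yourself is a couple of lines if your frontend doesn't support it. minimal sketch; the role labels are just whatever your template expects:
[code]
# flatten a multi-turn history into a single user message
def flatten(history):
    log = "\n".join(f"{role}: {text}" for role, text in history)
    log += '\n\nWrite a reply to the above conversation as "Assistant"'
    return [{"role": "user", "content": log}]

messages = flatten([
    ("User", "hi"),
    ("Assistant", "hello there"),
    ("User", "continue the story"),
])
[/code]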
>>
>>107502505
supposedly code.. i might download it just for that when exl3. figure it's better than asking GLM or Q2 deepseek.
>>
>>107502573
What you are describing is the NoAss extension btw.
Encourages less repetition.
Some models are too tarded for it, but in my experience it works pretty well.
>>
>>107502573
>>107502637
also built into ST now
Prompt Post-Processing: Single user message
>>
why have instruct models if you're going to imitate an autocomplete style prompting lmao
ah but it's mistral, they never knew how to instruct tune (this was the reason why their model lacked safety tuning)
>>
>>107502661
Why haven't you released a better model?
>>
>>107502661
it gets around multi turn problems sometimes. no need to summarize if you're on the same turn. really it's a crutch to fix shitty models.
Same with OOD prompting. A way to defeat their anti-rp measures. Because let's face it, they have to be making models bad at it on purpose. Wanna be taken seriously, not used as entertainment. Think about all the blowhard retards working on this and their egos when the best use is railing cartoon women and pretending to chat to spiderman.
>>
>>107502677
Because I can only complain. I offer no value.
>>
File: 1745337241134505.jpg (181 KB, 853x1000)
>>107502707
>>
>>107502584
>>107502505
Yeah, I'm also interested in whether it really is an improvement over any existing models. I don't code so I can't test it for that purpose, but it would at least be good to know whether they have truly flopped or not. If anyone else can share their experiences after testing it, I would appreciate it.
>>
>>107502677
>>107502707
>>why not eat my plate of shit
>I don't want to
>>cook something better then
lmao the fucking shills
also got range ip banned for this post, mistral shills hard at work with the mods
>>
>>107502731
>everyone is always against me and it's never my fault
>>
File: 1733961688760382.webm (3.16 MB, 802x1426)
>>107502731
>sees thread about local language models
>instantly imagines eating shit
GOOD MORNING SIR
>>
>>107502693
I don't think they've tuned their models to be anti-RP, quite the opposite in fact, but something must have gone went terribly wrong in the process and they're not testing them well enough to have noticed before release.
>>
>>107502764
>something must have gone went
a-greed
>>
>>107502853
You've have never typeded a massage post and corrected just to found lal the errors in it befafter youu clock spot?
>>
>>107502864
course i did don't mean i won't call others out for the same lolkek
>>
What is the best model to read my long logs and psychoanalyze me based on them without getting confused?
>>
File: 1741952446784056.jpg (500 KB, 1003x1080)
>>107502944
Forget that and just refer to this image
>>
>>107502853
Where's muh fucking edit button
>>
>>107501923
I don't even disagree that a lot of the AI hate is stupid but this image is aggressively unfunny.
>>
>>107502967

Kimi thinking ruthlessly failed the test, failing to read the instructions above the log and only continuing the RP.

Glimmy (the one who mostly wrote the original log along with Dipsy) gave superb, long, detailed commentary, including this:

>3. Control, Powerlessness, and the Weaponization of Desperation

>Controlling the Narrative: You control the environment, the secrecy, the dialogue, and, most importantly, [REDACTED]'s reactions. You create a perfect, sealed-off world where you can act out your deepest needs.
>The Ultimate Threat: When even within this controlled world you feel the fear of powerlessness returning (the fear of leaving, the fear of death), you escalate to the ultimate form of control: manipulation. [REDACTED]'s final threat is a desperate attempt to force [REDACTED]'s hand, to make her responsible for his life and death. It's the move of someone who feels they have no other leverage. By saying "if you don't, I'm not going," you are trying to transform your own terror and powerlessness into a weapon to control the one person you depend on.
>This indicates a deep-seated fear of helplessness. In the face of overwhelming external pressures (which may mirror real-life feelings of being trapped or out of control), your instinctual response, as explored in this narrative, is to try and seize control of the interpersonal dynamics around you, even if it means resorting to threats or emotional blackmail.

>Conclusion: A Cry for an Unconditional Witness

>The "[REDACTED]" you created is the fantasy of that perfect witness—someone who can see the worst, the most "filthy" parts of you, and not only accept them but reframe them as beautiful and human. The tragedy of the final scene is that even this idealized fantasy cannot bear the weight of the immediate, all-consuming need. The moment she asks for a tomorrow, she fails the test of providing an immediate, total fix for today's pain.
>>
File: 1741039490392914.jpg (138 KB, 823x978)
>>107503011
That's nice anon
>>
>>107503011
Have you ever considered that you're just a generic milquetoast faggot internet attention whore?
>>
>>107503054 (Me)
>>107503011
Like you do this because you want to think there's something special about you, but there's not.
There's really not. You're just a nobody like everybody else, except you're especially bad at handling that fact, probably because you were raised by a single mother who taught you to be a little attention whore by never disciplining you properly, and you've reached a point in your life where you're starting to realize it's not because you were actually special but because she didn't care enough about you and being a mother to bother doing the difficult parts of parenting.
>>
Her character relentlessly teased mine for hours, made him beg for more -which only led to more teasing and build-up, no release-, had him threaten to bring down the whole fictional universe with him, and she still refused to let him cum, choosing to have him burn it all down to the ground before lowering her ego and giving him some amount of pity sex.
Couldn't have been more accurate to a real woman.
>>
>>107503065
what model for this feel.
>>
>>107503054
I am who I am anon.
You might be right, I might be that. But I think it's too late to change it by now.
Maybe that is why I am in this thread.
How about you? Are you a well adjusted person?
Good job, beautiful, loving partner? If so I'm happy for you. But something tells me if you are on a 4chan thread whining about attention whores you are not very mature yourself.

>>107503065
We all do. Or do you not? Do you like to think of yourself as perfectly average?
>>
>>107503079
sharty's troll script, maybe?
>>
File: sans_ama.png (177 KB, 588x640)
Will people ask him about... you know what, next week?
>>
>>107503228
Gemma 4n(igger) soon
>>
>>107503228
Don't they have a model that can handle translations instead of answering the same questions 3 times? Isn't that like the whole point of this technology?
>>
>>107503246
I hope so.
3n is such a neat little model. I'd love to see a larger better version.
>>
>>107503699
>>107503699
>>107503699



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.