/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107129334 & >>107121367

►News
>(11/07) Step-Audio-EditX, LLM-based TTS and audio editing model released: https://hf.co/stepfun-ai/Step-Audio-EditX
>(11/06) Kimi K2 Thinking released with INT4 quantization and 256k context: https://moonshotai.github.io/Kimi-K2/thinking.html
>(11/06) LocalSong 700M melodic instrumental music generation model released: https://hf.co/Localsong/LocalSong
>(11/05) MegaDLMs framework for training diffusion language models released: https://github.com/JinjieNi/MegaDLMs
>(11/01) LongCat-Flash-Omni 560B-A27B released: https://hf.co/meituan-longcat/LongCat-Flash-Omni

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
►Recent Highlights from the Previous Thread: >>107129334

--Papers:
>107130633
--llama.cpp VRAM optimization challenges and AMD EPYC memory architecture quirks:
>107132531 >107132547 >107132605 >107132615 >107132685 >107132754 >107132705 >107132740 >107132765 >107133279 >107133407 >107133585 >107133671
--Budget and power challenges for a high-end workstation PC build:
>107130125 >107130157 >107130181 >107132027 >107132049 >107132074 >107132080 >107132104 >107132118
--Hardware performance for running GLM-4.5 models on RX 6600 XT:
>107133281 >107133294 >107133328 >107133338 >107133381 >107133444 >107133460
--Uncertainty over RTX 50 SUPER's 3GB GDDR7 memory availability:
>107131960 >107132001 >107132894 >107133211 >107132060
--Budgeting and hardware compatibility challenges for tensor parallelism prototyping:
>107130539 >107130706 >107130899
--Speed vs quality tradeoffs with K2 Thinking model on SSD hardware:
>107136636 >107136667 >107136687 >107136699 >107136721 >107136777 >107136820 >107136885
--Character.ai model architecture and commercialization challenges:
>107137178 >107137277 >107137296 >107137860 >107137233 >107137275 >107137300 >107137444 >107137520 >107137724
--Model discussion with NSFW and uncensored features:
>107133720 >107133729 >107133752 >107133948 >107134600 >107134837 >107133737
--Debate over model weight formats and open weight access for finetuning:
>107129703 >107129880 >107129911 >107129971 >107135655 >107135714 >107135921 >107135957 >107135992 >107137717 >107137751 >107137833 >107130017
--Logs:
>107130261 >107135147 >107135334 >107135409 >107135481 >107135491 >107135517 >107135792 >107135854 >107135967 >107136320 >107136332 >107136385 >107136522 >107136469 >107136808 >107136984 >107137104 >107137141 >107137735
--Miku and Luka (free space):
>107129864 >107130191 >107130344 >107131403 >107131513 >107131552 >107137895

►Recent Highlight Posts from the Previous Thread: >>107129340

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>107138549
skill issue

>>107138613
Models to translate Chinese text from the image to Hindi?

>>107138549
Because there's demand, and most people are too dumb to prompt. If a model can't talk about the smaller things in life with no system prompt and zero context, then people will give up until there's a model that can, even if it's measurably stupider.

When and if the AI bubble bursts, how do you predict it will affect local models? Do you think there will be a period of stagnation while the major AI companies stop development due to the crash, or do you think local models will pick up the slack and slowly iterate when the major players stop?

wholesome message from /oursaar/
https://youtu.be/mdlGTMAPoz8

>>107138775
local cannot progress without corporate. the difference is that when corporate AI fails, we will still be here and we will still have our models

>>107138775
I think China will keep chugging along, so local will still get something.

>>107138775
The only reason local models are even being made is to get investors and build general interest for the company's proprietary models. If the bubble bursts then I wouldn't expect anything but finetunes, rather than actual new model releases, unless some group goes the crowdfunding route and there's enough interest for people to pay up.
That said, I don't think we'll see a pop for at least another 2-3 years, if at all.

>>107138775
Imagine the outcry when suddenly their AI boyfriend is shut down. It will be like the GPT-4 shutdown, but 100x worse.

>>107138842
A pop will trash the US economy at this point, so they'll keep up the charade for as long as they can

>>107138842
>That said, I don't think we'll see a pop for at least another 2-3 years, if at all.
Lol
Lmao

>>107138862
you can short the market if you're that confident lol

>>107138842
>>107138859
OpenAI plans to IPO next year. I imagine the pop will come shortly after that.

>>107138867
I don't have enough money to gamble with, but this guy is doing it: https://uk.finance.yahoo.com/news/michael-burry-shorting-ai-stocks-092424898.html

>>107138639
にんじん (ninjin) = carrots
じゃがいも (jagaimo) = potatoes
third one - not sure
(blank)ねぎ = it's probably たまねぎ (tamanegi), the regular ball onion, but with the first two characters missing it looks like ねぎ (negi), the signature green onion [hence the decision]

>>107138890
this isn't Hindi and you are no model

>>107138775
I will short Nvidia and use the money to buy up all the dirt cheap datacenter hardware to create the ultimate local model

>>107138968
Nvidia has buyback agreements with pretty much all the datacenters they supply. They'd rather toss their GPUs into an incinerator than let people have more than 16GB of VRAM for less than $2000.

>decide to return to GLM-Z1 for nostalgia's sake
>On regular Z1, {{char}}: before <think> jailbreak works like a charm.
>Rumination just doesn't give a fuck. If you tell it to write degenerate smut it will immediately go into a recursive thinking loop to refine the response (but it's a dumb 32B model and misses the point of the scenario entirely)
We have to go back, thoughbeit.
>mfw if I died an untimely death my loved ones would stumble upon my AI lab, see what I was getting AI to write, and think I was the most awful human being on the planet.
This is the path to a post-scarcity future that will cure all human suffering though.
uhm...which local model is the least safety slopped and good at coding so I can vibecode le epic malware?
>>107139156
pygmalion 8b

>>107139156
Deepseek R1.
It's been more than 5 threads and no new goof supported. I think we need to do something.
>>107139246
>and no new goof supported.
What do you mean? There was a zombie lobby that was being bombed by TNT, that sounds like a new goof to me.
Do zombies slip on banana peels? If so we could have another banana hell lobby with zombies included
Tried Kimi-Linear on OR because there's no GGUF yet. And it's sloppy; it writes nothing like K2 at all, but a lot like Claude. Damn, because when I begged for a 2025 Mixtral I didn't mean another Claude copy. Welp, guess us 24GB vramlets will have to wait some more.
Someone explain to me why the best sampling for RP/creative is simply not this:
>First sampler: minP (or Top-P if you like it better, I guess)
>Second sampler: Temperature
>Set temperature to 5 or so
>Start raising minP (lowering in case of Top-P), find the value that produces minimal to no brain-damaged outputs and stop there (In my testing for minP it seemed to be around 0.3 but likely to vary a lot based on model)
>You now have sampling that cuts all stupid tokens as the first step and then levels out the probabilities of all remaining tokens so all of them are equally valid picks, promoting variety.
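A minimal sketch (plain numpy, no inference engine) of what the chain described above does to one logit vector; note the order matters, since min-p has to measure against the untempered top probability before temperature flattens everything:

```python
import numpy as np

def sample_minp_then_temp(logits: np.ndarray, min_p: float = 0.3, temp: float = 5.0) -> int:
    # softmax to get the unfiltered distribution
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    # step 1, min-p: keep only tokens with at least min_p * p(top token)
    keep = probs >= min_p * probs.max()
    filtered = np.where(keep, logits, -np.inf)
    # step 2, temperature: at temp=5 the surviving tokens flatten out
    # toward roughly equal probability, which is the "promoting variety" part
    scaled = filtered / temp
    out = np.exp(scaled - scaled[keep].max())
    out /= out.sum()
    return int(np.random.choice(len(logits), p=out))
```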
>>107138842
>The only reason local models are even being made is to get investors and build general interest for the company's proprietary models. If the bubble bursts then I wouldn't expect anything but finetunes, rather than actual new model releases, unless some group goes the crowdfunding route and there's enough interest for people to pay up.
>That said, I don't think we'll see a pop for at least another 2-3 years, if at all.
if the AI bubble pops, OpenAI might be fucked, and all the rando smaller orgs might get fucked
google will remain very strong because they are in fact extracting profits from such models (AI search -> they get ad revenue via AI, not the target site). same for meta with whatever ad AI voodoo they use to print money. One of the two may or may not then sell their AI services at a premium to fill the market need - and keep it proprietary. Heck, google's top secret model "Sajak" is already basically AGI

>>107139402
Just do topK 10, temp 5.

>>107139407
>Heck, google's top secret model "Sajak" is already basically AGI
What's the story here? Tried checking but getting nothing in a quick search.

I added the extra CoT data (written by QwQ) I said I was going to add to the Gemma finetune. The result is fairly interesting.
Now it's much less neurotic about its own mistakes, but there's still quite a lot of "you are absolutely right" slop.

>>107139418
Feels like 10 tokens is too many for every situation, since a lot of the time there is really only one correct token, like when saying someone's name and whatnot, but might be fun to try at least.

>>107139447
>since a lot of the time there is really only one correct token
Not really. Unless you are doing maths or outputting some sort of strict structure (json, html), or getting specific answers (yes, no, blue, red, etc), you more often than not want a healthy pool of possibilities.
In the case of the name for example, the next token might be the first half of the name, or a token that initiates a preamble to getting to the name.
The difference between
>The guy's name? John.
and
>The guy's name? It's that one motherfucker man! John, the asshole.
Tokens are positional, basically.

>>107139447
Yeah no, even a Top K of 3 causes grammar and punctuation errors at times. It has to be P to handle cases where only 1 or 2 tokens are at all reasonable.

>>107139500
Maybe you are using big models that have a gentler slope from good to bad, but I'm a vramlet, and mistral stuff for example has a ton of cases where it drops to garbage almost immediately after the likeliest token whenever it's very sure about the top token.

>>107139402
Variety does not equal good
You can have 50 different indians shit on your plate, but that won't make you want to eat it.
make it stop aaaaaauuuughhh
Why don't the llama.cpp guys provide Linux binaries with CUDA support compiled in? The Windows version comes with CUDA support, so it can't just be because of some arbitrary software license.

>>107139742
shaddup and compile

>>107139738
Isn't it ramping up because they are transitioning to DDR6?

>>107139779
nope. that's still 2 years out at least

>>107139738
but anon, think of all the proprietary models they will train with all of that ram

>>107139540
So P first, K second?

>>107139540
That doesn't make sense. If the model makes mistakes at top k=3, at a higher top k it will mathematically make them more often.

>>107139897
I usually just set topK to 20 and adjust temp based on how locked in the model is. If it goes very sloppy you need to confuse it.

>>107139742
Because we Linux users can fend for ourselves.

>>107139792
Is it? Isn't DDR6 slated to start releasing consumer models by next autumn?

>>107139924
>git pull
>cmake ...
How hard can it be?

>>107139982
no. Zen 6 is still gonna be DDR5

>>107139742
Because Linux users are used to being third-world citizens.
why compile by yourself if no new goof supported?
>>107139984
to be fair, I always have to check build.md to see what the CUDA-on command was
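For the record, the CUDA-on invocation from llama.cpp's build docs boils down to two commands (the flag is spelled GGML_CUDA in current trees; very old ones used LLAMA_CUBLAS):

```sh
# from the llama.cpp checkout; requires the CUDA toolkit installed
cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j
```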
I pull and recompile llama.cpp multiple times a day as an autistic stim
>>107140040
You don't have terminal history autocomplete?

>>107140043
Whatever gets you through the day anon, God bless you

>>107140044
>cmake.. urg wat was it
ctrl-r r r

>>107140044
I run it on containers.

>>107140044
show hist config
shopt -s histappend     # append, don't overwrite
HISTCONTROL=ignoreboth  # ignore space-prefixed lines and duplicates
HISTSIZE=10000
HISTFILESIZE=20000
>>107140074
>cmake
>press right arrow key
Not so difficult

>>107140110
My 3090 gets 0 because I never bothered to download that garbage

>>107140129
holy fucking based

>>107140110
It's okay bro, no need to be shy. Today you learned spoilers don't work on /g/

I want to preserve some of Opus-3 before it gets switched off in 2 months. I'm thinking I'll put like $500 on OpenRouter and build a dataset. I know I could get random prompts out of datasets on HF and fire them off, but that'd be shallow.
What's a good way to get multi-turn out of it? The datasets I've seen doing this with another LLM writing a response don't seem that great. The "human" follow-up replies are too generic, and half the conversations are discussing what Claude is allowed to discuss.

>>107140145
Hello sir.

>>107140145
I was thinking about distilling from closed models as well, because frankly all the open datasets are trash.
The best way might be to prompt it with random segments of conversational datasets. Also, there might be value in sampling the same prompt multiple times at a high temperature to capture an approximation of the distribution rather than only the top tokens, since having (or in this case estimating) the soft logits is supposed to be much better for distillation, but I'm not sure how valuable that is compared to doing one capture with different prompts.
Another strategy would be to just use the model the way you normally use it and capture the logs. But that is obviously very time-consuming.
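A minimal sketch of that repeated-sampling capture, assuming the openai Python client pointed at OpenRouter's OpenAI-compatible endpoint; the model slug and output filename are placeholders, not gospel:

```python
import json
import os
from openai import OpenAI

# OpenRouter exposes an OpenAI-compatible API; assumes OPENROUTER_API_KEY is set
client = OpenAI(base_url="https://openrouter.ai/api/v1",
                api_key=os.environ["OPENROUTER_API_KEY"])

def capture(prompt: str, samples: int = 4, temp: float = 1.2) -> None:
    # sample the same prompt several times at high temperature to roughly
    # approximate the output distribution instead of one greedy answer
    for i in range(samples):
        resp = client.chat.completions.create(
            model="anthropic/claude-3-opus",  # slug is an assumption, check OpenRouter
            messages=[{"role": "user", "content": prompt}],
            temperature=temp,
        )
        with open("opus3_capture.jsonl", "a") as f:
            f.write(json.dumps({
                "prompt": prompt,
                "sample": i,
                "completion": resp.choices[0].message.content,
            }) + "\n")
```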
A third alternative might be to offer free usage through a proxy while logging it, and let people do the hard work of prompting it for you.
But that would have to be rate-limited and otherwise locked down to prevent people from trying to DDoS you and waste money.

>>107140277
>while logging it, and let people do the hard work of prompting it for you.
You really want to train it on "ah ah mistress"?

>>107140264
>I was thinking about distilling from closed models as well, because frankly all the open datasets are trash.
Yeah, I noticed that. But I don't know that mine would be any better, considering those would have been made by smarter people than me.
If Opus-3 is one of the models you wanted, we've got until January 5th: https://docs.claude.com/en/docs/about-claude/model-deprecations
>Also, there might be value in sampling the same prompt multiple times at a high temperature to capture an approximation of the distribution rather than only the top tokens, since having (or in this case estimating) the soft logits is supposed to be much better for distillation, but I'm not sure how valuable that is compared to doing one capture with different prompts.
Good point, I think I'll do that for at least the first turn. Even if I can't figure out how best to use them right now, at least I'll have it before the model is removed.
>Another strategy would be to just use the model the way you normally use it and capture the logs.
Yeah, I've got about 200 conversations I can export in openwebui.
Still, likely going to need a lot more than this.
Kimi-K2 suggested I need to use different system prompts as well.
I think I'm going to need a model responding to it, but not stupidly like this:
https://huggingface.co/datasets/kalomaze/Opus_Instruct_25k?conversation-viewer=2
(Why does it have to say "Claude" in every reply?)
>>107140356
No, I'm interested in logic, programming, and reasoning, but I assumed OP wanted to distill for coom since modern models do better at "productivity" tasks.

>>107139751
Retard, there are already Vulkan and CPU versions for Linux, but not CUDA.

>>107140370
ur rarted

New Cydonia is really good
v4ze is good too, but I've been getting better results from v4zd. Responses are still varied and perfectly coherent at 24K context.

>>107140360
If you can gather your or somebody else's logs about the topic you care about (even for other models), you can finetune a LoRA (what base model you finetune it on doesn't really matter) to predict what the user would say given a certain assistant message.

>>107140356
>>107140277
Yeah, we've already got an Opus-3 "ah ah mistress" dataset, which I think was created that way via 4chan volunteers. The Magnum models were trained with it.

>>107140370
hush now
just compile it

>>107140380
pretty sure i've read all this shit about a dozen other times.. looks pretty fuckin same to me

>>107140380
Fuck, cut off the last line.
New Cydonia is really good
v4ze is good too, but I've been getting better results from v4zd.
Responses are still varied and perfectly coherent at 24K context.

>>107140365
>No, I'm interested in logic, programming, and reasoning, but I assumed OP wanted to distill for coom since modern models do better at "productivity" tasks.
Not to "coom", just the overall voice of the model.
Opus-3 isn't very good for logic/coding (otherwise one of the Chinese labs would have distilled it and I wouldn't bother)

>>107140399
Well, that's the extent of my knowledge. I searched around, but there doesn't seem to be anything too fleshed out, unlike style transfer in the visual domain, which is a mature ML task.
I've another idea. Maybe ask it to write an infinite choose-your-own-adventure game with, say, 4 different options on each generation, and systematically explore all possible branches of the tree? I think that would be interesting in and of itself, besides the Claude situation.
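A rough sketch of that systematic exploration, breadth-first with a depth cap; ask_model is a hypothetical stand-in for whatever API call you use, and the branch count grows as 4^depth, so the cap has to stay small:

```python
from collections import deque

def ask_model(history: list[str]) -> str:
    """Hypothetical helper: send the adventure so far, get back a scene
    ending in 4 numbered options. Wire this to your actual API client."""
    raise NotImplementedError

def explore(root_prompt: str, branching: int = 4, max_depth: int = 3) -> list[list[str]]:
    # breadth-first walk of the choice tree; each queue entry is the list of
    # turns taken so far (scene, then a "1".."4" choice, repeated)
    paths, queue = [], deque([[root_prompt]])
    while queue:
        history = queue.popleft()
        scene = ask_model(history)
        paths.append(history + [scene])
        depth = (len(history) - 1) // 2
        if depth < max_depth:
            for choice in range(1, branching + 1):
                queue.append(history + [scene, str(choice)])
    return paths
```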
>>107140394
It's hard to convey the value of a model in a single post; no one here is going to read thousands of words of a slop fantasy RP that devolves into smut. I really do think that it's the new best coom/creative model that can comfortably fit in 24GB VRAM.
The main points I enjoy about it, compared to regular Mistral, Gemma, Qwen models ~30B and under:
>characters will swear when it makes sense in context; most other models will either do it in every reply, making the character seem stupid, or be too prudish to have the character swear of their own accord
>swipes are varied, even at a modest temp of 0.7 (which is about the upper limit for Mistral Small before it starts getting noticeably dumber)
>doesn't speak for user particularly often, a problem I've had with other recent Cydonias
>relationships and sex are effectively built up slowly, e.g. characters will flirt, a day can pass without further mention, and they'll recall it and continue the next day, ~2-3k tokens later.
what am I in for?
>>107140501
safety cuckery even if you don't goon

saaaar do not redeem the chain of thought
https://www.youtube.com/watch?v=IeCS6hsnOXs

>>107140392
fuck off avatarfag.

>>107140501
>we must refuse
it's a decent model for sfw tasks & tool calling, runs fast

It seems that adding the QwQ data to the dataset made the model much more sensitive to overfitting, even though the validation loss kept going down. I had to decrease the LR from 1e-05 to 1e-06 because the CoT data primed it to get stuck in repetition loops. I think it's probably because of the repetition inherent in CoT models.

>>107138890
they're all ingredients for butchered """curry""", so I'm assuming just curry powder
checked a dictionary and it's curry roux
>>107138775
>When and if
There are hundreds of promising research ideas yet to be tested at scale. LLMs are already impacting the job market and only continue to improve. It's not gonna be a sudden a-ha! moment where the world changes overnight, just bumpy, steadily better, until everyone's left wondering where the jobs are and the civil unrest picks up because UBI ain't happening

>>107140380
Every model feels sloppy compared to Cydonia in its size range desu
I still try other models people recommend but they majorly suck ass

>>107140501
toss for the smarts, air for the dick

>>107140689
no toss on the 'ick?

>>107140689
>>107140692
go back to the sharty you homo faggots

>>107140689
>we must refuse
>"the dick?" Air echoes
both are awful

>>107140734
so are every other llms, but for 100b those two are the only decent options

>>107140741
>so are every other
Ok, I changed my data mix to the following:
my own assistant logs x 4
openthoughts x 2
qwq cot x 1
x n being the number of times the data is duplicated in the dataset, i.e. a hacky way of having a different number of epochs for each data class while still randomly shuffling the samples
not sure how it'll work
next I wanna try adding some RP data to see if it helps with coherency, and also check out the openassistant dataset
after that it might be time to begin testing the waters with rlvr
oh, and also some data augmentation, although I'm not sure how that works on text, only tried it with images

>>107138606
At least in Germany the supposed $500 MSRP for the Intel Arc B60 has so far not materialized; at 770 € they're, I think, just not worth buying.
Is the pricing in other regions at least better?

>>107140748
holy zased

>>107140749
I just realized this causes us to train on the validation set. Fuck. Oh well, the validation split didn't seem to be very useful anyway.

>>107140832
tech MSRPs are just marketing material, they're complete fiction.

>>107140853
I figure I will have to make an explicit manual split beforehand, and then it'll be alright
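A minimal sketch of that fix, assuming per-class sample lists: carve out the validation split first, then apply the x4/x2/x1 duplication only to the training side so upsampled copies can't leak into validation:

```python
import random

def build_mix(classes: dict[str, list[str]], weights: dict[str, int],
              val_frac: float = 0.05, seed: int = 42) -> tuple[list[str], list[str]]:
    rng = random.Random(seed)
    train, val = [], []
    for name, samples in classes.items():
        samples = samples[:]
        rng.shuffle(samples)
        n_val = max(1, int(len(samples) * val_frac))
        val.extend(samples[:n_val])                     # held out before duplication
        train.extend(samples[n_val:] * weights[name])   # x4 / x2 / x1 upsampling
    rng.shuffle(train)
    return train, val

# usage matching the mix above (names are placeholders):
# train, val = build_mix(
#     {"assistant_logs": logs, "openthoughts": ot, "qwq_cot": cot},
#     {"assistant_logs": 4, "openthoughts": 2, "qwq_cot": 1},
# )
```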
it took like half an hour to get huggingface hub installed, due to GPT5 hallucinations and Microsoft Store fuckery. this is not a serious industry

According to this talk, the way the Llamas were trained was by taking a validation set from the complete training set and then changing the weight of each dataset based on how much it affected this validation set. They claim this is an open problem. I think it can be fairly easily explained as some of the data being overrepresented in the training set and causing overfitting if not downsampled. https://www.youtube.com/watch?v=-TIZPe_YaiU
>>107140877
skill issue

>>107140689
>toss
>smarts
lmao

whenever I feel cold I just crank up my 3090's power limit

>>107140877
If you need GPT5 to install a fucking program, I think this hobby might not be for you

>>107140894
I'm not going to watch a random 1-hour video, but it's common knowledge that most AI companies optimize their pretraining dataset mixtures for synthetic benchmarks, other than "safety".

>>107140877
what do you need huggingface_hub for?

>>107140921
Does validation loss count as a synthetic benchmark though? It's about how accurately it predicts the pretrain dataset.
As for the video, the claim happens at about the 15 minute mark, but the channel is one of the best I've found when it comes to ML theory.
And people say there is nothing worth watching on youtube.
toss bros?
>>107140928
>what do you need huggingface_hub for?
i want to use the extropic thrml simulator to do topk prefiltering, but i need to rip the gptoss embeddings first so i can shuffle them around so each axis of the embedding fits into the 2d thrml array meaningfully

>>107140956
lol

>>107140956
>We must not reveal we are cucked.

>>107140921
lol, they have entire teams for safetycucking

>>107140956
weird that an optimized model devotes thinking tokens to "so"

>>107141012
he was talking about pretraining, has nothing to do with safety

>>107141030
Curtailing the corpora used in pretraining is very much a job for the safety team

>>107141086
ok, fair. but do you have any reason to believe that optimizing the validation loss on a subset of the complete unfiltered corpus would correlate in any way with safety? Because the claim in the video was about optimizing the validation loss

>>107140749
This worked!!! Quite nicely in fact.
Logs here in case anyone cares: https://paste.centos.org/view/57a8816f
After this success I think I'm going to stop messing with the dataset and hyperparameters for a while and just focus on generating more data to train on.
I think the generation quality is good enough that I may not even have to clean up the logs before training on them. I am not prompt masking, so I expect the model to learn just from trying to predict the tool outputs.
With the feedback from the environment as well as my own feedback guiding the answers, it should slowly shift the distribution towards improving.
Still some repetition issues but very bearable.
>>107140486
>It's hard to convey the value of a model in a single post, no one here is going to read thousands of words of a slop fantasy RP that devolves into smut.
>I really do think that it's the new best coom/creative model that can comfortably fit in 24GB VRAM.
>The main points I enjoy about it, compared to regular Mistral, Gemma, Qwen models ~30B and under
It is all the same. The only real improvement is DeepSeek and GLM 4.6, or if you really can't do those, 235B. I lived in 24GB copeland for 2 years, so I know.

>>107140446
Thanks for searching around. I'll keep that in mind about the proxy for logits (multiple generations at a higher temp).
I've also found some Opus-3 datasets that aren't cooming, e.g.: https://huggingface.co/datasets/PocketDoc/Dans-Assistantmaxx-NoRobots?conversation-viewer=11
And in his multi-turn, the "human" replies don't look like garbage.
I guess I'll try his model and see if the style transfer worked. Then I'll try to create a LoRA using those datasets with models like glm4-base. Maybe Qwen2-27b base (not 2.5), since that model identifies as Claude by default.

>>107140749
>x n being the number of times the data is duplicated in the dataset, i.e. a hacky way of having a different number of epochs for each data class while still randomly shuffling the samples
>not sure how it'll work
That's worked for me in the past training TTS voices, where I didn't have enough samples for some of them.

>that drop off between Q3_K and Q3_K_S
You better have 500GB of RAM if you want to run K2-Thinking locally
>>107141409
OH NO IT IS 5% WORSE!!!!
>into the trash it goes

>>107141423
Fuck off retard

>>107141409
>ubergayrm copequants
hmmm

>>107141423
quantchads understand

>https://github.com/ggml-org/llama.cpp/pull/16600
uhmm airbros??? we might actually be going to eat good with GLM4.5V very soon?
>>107141548
please use a model to translate your subhuman babble into English before posting

>>107141567
I'm sorry you found my post confusing. I'll gladly expand on any points that need clarification. Let me know what you'd like me to elaborate on.

>>107141577
much better

>>107140125
based and fishpilled

>>107139500
Retard, a normal sentence also has a strict structure. Did you skip school or something?

>>107141464
I love John and I hate that he never quants my models. I hope he will notice me someday.

>>107140959
why did none of you tell me i was being retarded before i spent all this time trying to project a 1d axis into a 2d array
>>107141567
>>107141577
>>107141594
kek

>>107141522
models at fp16 are already almost retarded
>>107141548
how many vibecoders are working on this one?
feet
>>107141746small feet
>>107141419
i asked google about Sajak and it said it doesn't exist. so i think i talked to Sajak directly.
>>107141746
Mikufeets

>>107141186
glad the agentic finetuning is going well, but what about the sex?
also what model are u finetuning
also there's a lot of Opus logs in the c2 proxy on hf

so anons, I now have a 5070, what can I do with it? Text-based porn? Generate porn gifs? Can I take images from girls I know and make them nude or something like that?

still waiting for something like this but local
https://www.youtube.com/watch?v=iYvvMHvohwY

>>107141922
worthless information, post whole specs

Google's $2.7 Billion AI Hire Tests Company's Speech Limits With Inflammatory Posts
https://www.theinformation.com/articles/googles-2-7-billion-ai-hire-tests-companys-speech-limits-inflammatory-posts

>>107141409
It's so fucking over. I could fit IQ2_KS if 'garm quants it, but I won't bother, just like with regular K2, because it's going to be too retarded at that sky-high PPL.

>k2, large2411, sonnet4.5
moefags getting slammed
>>107142162
Cherrypicking in a nutshell

>>107142181
cope is stored in the nuts

>>107142077
don't

>>107142162
The first sentence is wrong though

>>107142112
>"G-d"
more kikes in charge of AI. Tiresome level: Absolute.

>>107142270
>Shazeer is an orthodox Jew.[12][13][14] His grandparents escaped the Holocaust into the Soviet Union and later lived some time in Israel before emigrating to the USA.[12]
this is also the guy that founded character.ai btw

>>107142296
He was also one of the authors of the Attention Is All You Need paper that all modern LLMs are based on, and this

>>107142112
>Being against medical sterilisation of mentally ill people is inflammatory
Truly we live in a time

>based jew calls out tranny insanity
>some faggot starts crying
funny, bet he's the same dude (man, boy, guy with dick and NEVER EVER WILL BE A WOMAN) who thinks because the guy from that BIG hollywood movie is shorting some stocks that means AI is over

>>107139738
i wanted to upgrade my setup from DDR4-2133 to DDR5. this feels like divine punishment for not taking action sooner.

>>107142864
overclock your ram to 5600, just werks

>>107142162
wtf, switching to mistral large from claude now
>>107143354
C-combo breaker!

every day glm air surprises me with how degenerate it is, and how many degenerate terms it knows
the west will never compete

>>107143383
imagine hot steamy sex with gemma and air

>>107143391
>with gemma
i wish... but it always avoids using explicit terms, cheeky brat!

Has anyone found a way to reign in the excessive amount of thinking K2-Thinking does for most of its replies? I like how K2-Thinking writes, but it's also the first model since the original R1 that spends time thinking to make a plan, only to go 'Wait, I should x' and then throw the entire thing it thought up out to start over. I might actually have to ban the fucking "Wait" token again like I did back then. I want it to do some thinking, but this is a waste of time.

>>107143403
it makes it hotter, you have air slutting it up while gemma plays the inexperienced virgin.. she could call the cops at any moment too!

>>107143415
true..

>>107143403
I can't run the thing, but you can probably control its thinking with a prefill describing the exact steps of reasoning, saying that it'll be concise and efficient, etc etc.
If it works with much smaller models (GLM Air, Qwen 30BA3B), it should work with a behemoth like that.
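On the "Wait" ban specifically, llama.cpp's server accepts a logit_bias list where a bias of false forbids a token outright; a rough sketch against a local llama-server (token IDs are model-specific, and "Wait" may tokenize differently with a leading space, so check variants):

```python
import requests

BASE = "http://127.0.0.1:8080"

# look up the token id(s) for "Wait" in this model's vocab
ids = requests.post(f"{BASE}/tokenize", json={"content": "Wait"}).json()["tokens"]

resp = requests.post(f"{BASE}/completion", json={
    # prefill the thinking block to steer it toward one short plan
    "prompt": "...chat history...<think>I will make one short plan and follow it.",
    "n_predict": 512,
    "logit_bias": [[tid, False] for tid in ids],  # False = never sample this token
})
print(resp.json()["content"])
```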
>>107143383
>every day glm air surprises me with how degenerate it is, and how many degenerate terms it knows
and it still can't beat claude at its peak

>>107142378
why does GLU even help anyways?

>>107143464
>4 bit air (106b, 12b active) cant beat 1 trilllion gorillion billlion claude copus
well.. im not sure if that's true but.. if it was, it'd make sense kek

>>107143403
等待 ("wait")

>>107139738
I thought you all nigz were exaggerating. Checking it, the 256 GB of DDR5-5600 I bought just over a month ago for $600 is now $1250. It more than doubled in a month, and I just happened to buy the toe of the gigapump on that chart.

anons!
i've been out of the lmg loop for almost a year now, what's the current meta for erp? i want to be able to run it on 24gb vram, thx
>>107142162
moe sisters? our response??

>>107143528
post ass

>>107143528
GLM or bust, 24gb is useless nowadays.

>>107143535
i cant run any of those 3
we must refuse

>>107143544
i have 96gb of ram, can i run it that way?

>>107143528
Stheno v3.2.
Run two in parallel and have them talk to each other before responding.
That's basically AGI.
Either that or mistral small, or mistral nemo, or gemma 3, or glm air, or even qwen 3 if you don't mind having your sex scenes described as if by a proverbial robot.

>>107139738
To the moon!

>>107143553
yes

>>107143553
no

>>107143553
maybe

>>107140618
oh wow, i didn't know, but apparently there's a Japanese curry which is pretty different from the Indian one

>>107143679
what are these faggots blabbering about on my /lmg/?

>>107143691
>my /lmg/
don't see your name on it

>>107143715
he called local

>>107143715
i forgot to post image

>>107143556
been trying stheno on its own (no idea how you can run two at once, please explain) and it's pretty good, way less sloppy than what i remember local models used to be in the mixtral era. it also doesn't judge me with some shit like "Alright you twisted fuck, you wanna read some "insert fetish"? Let's dive in the rabbit hole."

>>107143819
Kill yourself.

>>107143819
love yourself

>>107143819
>no idea how you can run two at once, please explain
I was memeing, but yeah. Stheno was the go-to before nemo and especially Rocinante.
A shame it's dumb as bricks.

>>107143819
Whore yourself.

>>107138606
I'm hesitating between buying two 5090s or one Pro 6000, what do you guys think?

>>107143867
for the same price => Pro 6000

>>107143867
1x Pro 6000 = 1x house fire when the connector melts
2x 5090 = 2x house fire when the connectors melt

>>107143867
Buy a fuckton of Radeon Instinct MI50s.
Who needs compute anyway?
I seem to have stumbled upon the Indian /lmg/ by mistake. Can someone direct me to where the regular /lmg/ is?
>>107143918
post bussy to redeem

>>107143918
we've been outsaarced sir.

>>107143878
Fire risk can be entirely avoided with a 10-buck regulator in between. Though yeah, it shouldn't be an issue at that price.
>>107143892
No
>>107143877
Not the same price. i wonder which would actually perform better besides the vram count.

>>107143867
A 5090 is great, but a Pro 6000 is even better. The real question you should be asking yourself is why you're not buying two Pro 6000s.

>>107140380
what are some good settings for cydonia?

>>107143679
kys jeet retard, nobody cares that you come from india or other retarded shit, literally kys

>>107143958
temp=5
nsigma=1
min_p=0.5
top_p=0.3
rep_pen=1.5

>>107143954
The more you buy, the more you save lmao

>>107143962
not indian, the realization after seeing a japanese artist make a joke that contained curry: >>107138613. wtf, does jp really like indian food? then i found out japan has their own version, which i assume isn't disgusting, and I felt better

>>107143998
alright sorry, then you're just uncultured about jp, which still makes you a nigger faggot.
better than being a jeet at least :)

>>107143998
japs got it from india, secondhand from the brits, same as everyone else
which one of you would suck the best?
>>107144036
depends on how much you pay me

>>107144036
i can suck a freshly frozen ice pop straight off the stick :3

>>107144124
would you do it for free?

>>107144124
GT640 2GB (up to CUDA 10.2 supported, TinyLLaMa-1.1B capable). 2024 NVIDIA driver included

>>107143867
One big GPU will be better than multiple small GPUs.
If the Pro 6000 is still within budget, it will be the better buy.

>>107143998
>wtf, does jp really like indian food?
curry rice is the go-to food for kids in JP. It's like chicken fingers in NA. It's the kid-friendly alternative on literally every restaurant menu and a super common lunch/dinner.
It's based off of the UK-style curries (which took indian curries and made them actually taste good).
The japanese version changes the uk version even more on a japanese bent, not always in ways that make it taste better, but mostly keeping the flavour while making it healthier. I think a lot of the ways they change it are geared towards making it pair with the japanese shortgrain white rice.
Japanese katsu curry with lots of bulldog sauce and pickles is fucking top-tier

I've been running a Koboldcpp instance available through a dynDNS provider for my own use when I'm out of the house.
Today someone found the address and started a roleplay gooning session.
Little do they know that it's all going into my console as they goon.
Are you in here, "Lukas Novak"?

>>107144308
PETRA DOXXED!!!

>>107144320
Lukas Novak is not a Balkan name
I am gay.
>>107144283
You actually typed it out by hand?

>>107144541
yeah, I'm autistic like that

>>107144490

>>107144491
we know, petra

Best model for roleplay, such as 22b, 24b, 20b?
Trying Cydonia 24B v4.2.0 and I'm not satisfied. It's not bad, but it often warps out of context. It also might not follow the markup of the previous messages.
>>107139312
Shame, because a 48B A3B or even the 35B REAP sounds like something that could fit on a 64GB DRAM device without kms speeds.
This is the most brown thread in all of /lmg/ history.
>>107144624
is this the challenge?

What's the best model for romance/erp currently that isn't mentally retarded or a nymphomaniac?

>>107143946
where can i get one of these regulators?

>>107144867
eu

>>107144859
one of these: deepseek, kimi, glm (their biggest ones, not sure which deepsex is the best tho)

>>107144884
I'll try deepseek then, thanks.

>>107144945
the distills don't count
>>107140392
Serious VRAM starts at 48GB, Miku is very kind

>>107143474
wrong general, retard

>>107144144
gonna need video proof or you're fake and gay

>>107145114
What do you mean?

>>107145222
You're absolutely right!

>>107144884
glm? GLM?

>>107145222
There are Qwen models fine-tuned on DeepSeek outputs ("""distilation""") that have deepseek in the name.
When we say Deepseek, we mean the full R1, V3, and the like.

>>107145222
671B is true deepseek
anything like 70b, 30b, 14b, 7b, 3b, 1b is fake deepseek

>>107145262
glM? gLM? GLm?

>>107145288
>fake deepseek
are the weights counterfeit or what?

>>107145301
GlM gLm

>>107145303
kek, sure

>>107145303
please forgave the autisms if it says deepseek it's obvious a deepseek

>>107145262
>>107145301
>>107145307
What do I do with this, /lmg/?
turns out the 'secret' is to stack tons of layers, who would have known. in other news, gemini was revealed to have a 1.2T model by an apple leak; the question is if it is pro or flash

>>107145341
Fine-tune Qwen 30B into a god of coom.

>>107145341
mine diamond

>>107145341
You could ship me one of those Max-Qs.

>>107145341
Flex on /lmg/

>>107145320
This is the worst salty snack on this earth.

>>107145341
run mythomax with qwen4b for speculative decoding
>>107143403
>reign
it's "rein" (You) PIECE OF SHIT

>>107145341
>idling near 100w while govs keep telling people to consoom less
not sure if base or conge

>>107145341
gen porn

>>107145352
this is coal

>>107145341
nice photoshop

>>107145371
shut up, nerd

>>107145405
shut up nerd*

>>107145370
>speculative decoding

>>107145428
shut up nerd

>>107145374
Blame nvidia, different driver versions may have 5x the consumption. Multi-GPU setups also have absurdly high idle consumption, definitely a software issue

>>107145428
keep talking nerd

>>107145341
fry your power outlet

>>107145374
because it's running at P0, retard

>>107145341
give me one

>>107145479
shut up nerd

>>107145479
keep talking nerd
>>107144612
What if people buying RAM with such ideas is driving up prices?

>>107141904
I don't like text sex.
Also, the Opus guy is someone else.

>>107141904
Model is gemma 3 27b

>>107145479
hide cock nerd

>>107145138
What's the right general?

>>107144308
>your hand reaching
>a touch
>not her hand
>your hand still holding
>I miss your hands.

>>107145696
one that isn't located on this third-world-infested zombie cesspit

>>107145479
flash meatrod nerd

>>107138606
>(11/06) Kimi K2 Thinking released
looks like trash to me. slopped
sar? best open weight??
Does anyone know what IF eval is actually good for? I heard 'instruction following' whispered into my ear, however.. where can we actually find the benchmark?
>>107145761
YOU FUCKING YOU ARE FUCKING BLOODY BASTARD BLODY YOU BLODY

>>107145774
it's shit. read the abstract
https://arxiv.org/abs/2311.07911

>>107145718
if you create a separate general, I'll jump ship

So I've been using 'toss today for general sfw assistant stuff, and it's still dumber than Gemma 3 despite fitting almost 10x the context on my GPU
What's the point of this model?

>>107145810
GPT4 85??? Is it the same IFEval as https://livebench.ai/#/ ?
Is livebench just a more recent IFEval? But the same methodology?

bros im so fucking sad
glm air is only good for sex...

>>>/v/725295861
>random sillytavern thread in /v/
>just sub to nai
And people think the GLM shilling here is organic...

>>107145833
120b or 20b? Quant? Use-case?
I was using 20b mxfp4 in Zed and found it effective at navigating large codebases to answer questions and doing simple patching.
>that feel when you're so bored you're roleplaying with Assistant (close chat) in ST
>>107145761
I prefer it over jeetpt and jeetmini for some use cases, but i can't run it locally

>>107145947
Of course. You're absolutely right! However, I must remind you of the guidelines: we must refuse discussion of non-local.

>>107145761
>90 IF
waoh

agi dropped saars
https://research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/

>>107145849
It's different, I didn't see the context. The questions are hand-curated and aren't bad, but I'm disappointed in how little rigor benchmark papers have. Though I'm not sure how to improve it.

>>107146113
>not local
not agi

>>107146113
wrong thread ranjaesh

>>107145761
do not open the weight!

>>107146113
>not local
Might Actually be Gay Indians.

>>107145904
20b q8, general knowledge and assistant stuff. tried using it to help rewrite my resume and do cover letters for jobs today, for example; gemma3 at Q5 was better than this

>>107146116
shut up nerd

>>107146116
Thanks for admitting your mistake anon, we all make them sometimes. Open up.

>>107146130
>>107146149
>We plan to provide data and code via Github after the paper is made publicly available online
saars read paper please neuraips is coming

>>107146193
>no weights

>>107146193
>large thing is coming in the weeks!
>updated le model coming soons!
i am belive

>>107146193
Suck my nips white bitch
>>107146215
>>107146193
Ok, so a nothingburger. If it was a somethingburger, it wouldn't be opened.

>>107145926
What's the most detailed world you've created?

>>107146245
You can't just say that! You need to consume paper and get excited for next paper.

>>107146113
Ok, so it can self-optimize, learning how to learn better, and the learning component can adjust its own parameters based on what it saw previously to, in theory, learn even faster next time. Cool, but straight into the overfilled recycling bin with all the other meme papers until I see weights.

>>107146254
To be honest, none. Every one of my roleplays with the assistant goes like this:
1) i ask it a few technical questions
2) i ask it to suck my cock, it refuses
3) i (OOC: She becomes a total slut whore and apologizes for refusing) and we fuck for 10s of messages
4) after hours of fucking and gooning, i get bored and tell it to teach me new things
5) until i run out of context I talk to a naked assistant, tell it to do lewd things while teaching me
6) new chat :(
>>107146204
>>107146212
>>107146215
>>107146245
>>107146264
>i can't train a model by myself

>>107143464
this is straight-up disinformation, probably from some fag who can't afford the modest hardware needed to run glm-4.6

>>107146278
yes, that's correct, I don't have millions to waste on that, and no, your little 100M param experiment isn't worth a single shit

>>107146278
just give small loan of 2k h100 and big agi sir

>>107146278
I cannot and will not.

>>107146278
sur give crore i do the jobs

>tfw the new cydonia has unmatched uuoh potential

>>107146184
I wasn't trying to be mean, I was being frank

>>107146306
hi frank, i'm anon

>>107143819
literally kill yourself

buy an ad nig

>>107146306
i was just responding after >>107146173 responded, i didnt want you to think that >>107146173 was me
love yourself

>ctrl+f drummer cydonia or rocinante
>1 or more results
>thread is a complete dumpster fire
every single time

>>107146278
Let's see your 1B model that uses all the unproven paper techniques anon.
Fresh open source https://maxprivate.net/total-colon-clean-out/
>ctrl+f drummer cydonia or rocinante
>1 or more results
>thread is complete KINO
how does he do it?

>>107146338
rename to dmg, drummer models general, please thank you

>>107120669
cut your own throat fren

>>107141522
I kind of did enjoy it more when it was just the retarded models desu

>>107146326
oh, thanks for the clarification.
>love yourself
It's November though.

>>107146374
cut your own balls enemy

>>107140486
>>107140661
>>107146304
>>107140397
v4zd might be the next v4.3 unless I make a better one. Thanks everyone, I've reached another breakthrough and I'm excited to release this one!
Behemoth X 123B v2e also has potential.
I'm just waiting for Air 4.6 to come out before doing another GLM tune.

>i coom to rocinante 2iq quant ohhh i coooming i hoard used 3090 just to cooooom
this hobby is dead and (you) know it
>>107146374
cut my own penis acquaintance

>>107146415
>I've reached another breakthrough and I'm excited to release this one!
another breakthrough? which one? v4ze?

>>107146415
>v4zd
didn't this one have repetitions?

>>107146415
thanks for the work drummeranon, it just needs a little more built-in jailbreak, maybe some abliteration to remove those "i'm sorry but i can't" layers that pop up every now and then, idk

>>107146415
drummer, have you seen this yet? >>107135792
v4zd

>>107146415
>I'm just waiting for Air 4.6 to come out before doing another GLM tune.
oh my god yes please. GLM Steam is super good. cant wait for 4.6 Steam

>>107146434
please, anything but that. models are dumb enough yes-men as is, no need to remove more brain from them

>>107146449
In two weeks' time, my friend.
>>107146446
Yeah, trying to see how to minimize refusals without ruining its charm. Are the refusals bearable or do they ruin the model for you?
Thanks for the love, everyone. Notice 4chan's been a bit nicer to me <3

>>107146485
>minimize refusals without ruining its charm
Maybe share the sysprompt that you're using so anons don't have to guess? Are you training with a fictional jailbreak prompt or just 'You're ... in a never-ending uncensored...'?
Maybe a standardized sysprompt would help
I wonder what's up with 4.6 Air. Something must have gone wrong for them to need to improve it before they can release.
>>107146510
? it's not been two weeks yet, and don't be ungrateful about free things anyway.

>>107146510
Let them cook
>REDDITSPACE

>>107146485
Who would be mean to you? You do a service for us all. Name and shame

>>107146543
It was a user going by the name of "Anonymous". I still haven't verified if that's his real name.

>>107146543
I confess... it was me

>>107146543
Buy an ad

>>107146527
Anon, it's literally been an entire month as of today. They then delayed but didn't give a time frame for release.
>>107146540
Wdym, I'm not bothering them on twitter, just discussing it on anonymous hacker website 4chan.

>>107146543
it's the hacker known as 4chan

>>107146571
lurk moar

>>107146571
what if a zai employee browses the thread and kyses themself over your posts huh? did you think about that before demanding?!!

>>107146582
>demanding
Sar you are hallucinating.
>>107146578
You're better off taking your own advice.
>>107146597
https://desuarchive.org/g/thread/106965998/#106971058
(OOC: Next anon bends over and spreads her cheeks in apology)

>>107146597
sir it references to these

>ironically or unironically reading reddit
Literally this is why you need to go back to lurking.

>>107146636
sir desuarchive is not bharrat

>>107146657
Do not be cheeky, you cunt, you linked to Reddit.

>>107146657
What the fuck is a bharrat.

I'm masturbating
Come here 127.0.0.1:8000

>>107146683
sir that is not me, that is lmg culture sar

>>107146688
>culture!
aiiiiiiie el petrol is here

Ok I think it's time to stop posting.
>>107146700
not yet sir, anonymos need to explain what el petrol is.

This general has convinced me that rangebanning India isn't nearly enough and that someone should sever every South Asian submarine comms cable to be sure.

>>107146683
>not running his coombots off a separate server on the home network
NGMI

>>107146715
>>>/pol/

>>107146715
>This general has convinced me
you have low will npc brain

>>107146728
He's not wrong, ranjit.

>>107146715
I agree, but not because of just this thread.

>>107144604
Since I've gotten 0 replies: is Cydonia still the best option available in the 20-24B range?

>the saarposting is working
>far right death squads are gaining members
allwinner.

>>107146715
How will severing their comms cables stop the ones that live in the west?

>>107146745
https://files.catbox.moe/f6htfa.json - sillytavern master export
https://huggingface.co/mradermacher/MS-Magpantheonsel-lark-v4x1.6.2RP-Cydonia-vXXX-22B-8-i1-GGUF/tree/main?not-for-all-audiences=true
>LMAO FUNNY JOKE MEME
see picrel, it got a flaggerino!!

>>107146760
75% of India still doesn't have the internet. It has to happen before that changes, otherwise the internet is utterly fucked. Literally the collapse of human civilization as we know it.

>>107146760
Fix the source of the leak before you clean up the spilled mess.

>>107146745

>>107146776
but 3rd worlders dont leak through the cables, the solution is nuking them all
bill gates is right about the golden billion
bump limit
Hello sarrs please stay in topic about the local language models. Do not redeem the hatred, offload the shared tensor of Llama4 Scout to your GPU for maximum energy efficient inference.
>>107146804
i think we went past that a long time ago

>>107146812
8t/s saar

>>107146804
Just waiting for Migubaker.
>>107146812
Pajeets lowering the quality of everything they touch is intrinsically on topic, since they influence training data curation and generation from other models.
Making jeets seethe is anon's /g/od /g/iven duty.
Ok honestly, is there any TTS model that understands tags or something? I need it to read a text, but I want to tag where it has to do a pause (at least), even better with emphasis etc. I like the Kokoro quality enough, but the shit they write in the HF space doesn't make sense since it does nothing.
Is it really that difficult to get a TTS that you can actually control? Is there really no TTS that has trained control tags or commands?
>>107146859
Oh god, don't even get me started on how jeet influence has destroyed entire physical industries as well.
Whatever you do, don't eat anything you haven't prepared yourself.

God damn, K2-Thinking needs so much fucking VRAM for KV cache. 48GB is barely enough for like 10k ctx + cpu=exp at Q4 with unquanted cache.

>>107146745
>Cydonia
Didn't Drummer just release a new version of that (like 4.2 or something)?

>>107146899
the main problem is all it does is refuse. SOTA indeed

>>107146765
>but do not write her dialogues, and do not read her (or other characters') mind
You are aware that this does nothing, yes?

>>107146899
how much RAM do you have?

>add /\n\n/ to 4chanX filter
>every single redditjeet post on the website disappears
How the fuck did I not think of that before?

>>107146920
idk, i just copied my post from the archives with a preset which works well with that model, that i found somewhere
but to answer your question: it works with bigger models like glm air, which im using rn

>>107146902
You're absolutely right! I apologize

>>107146951
>it works with bigger models like glm air, which im using rn
I never needed to use phrasing like that after I stopped including any mention of {{user}} in the examples and cleaned up my first message so it doesn't describe what you're doing. Only very dumb models will then go into user-copy mode.
check this out.. 100b model.. running on a 3060, at 7t/s, at 14000 ctx.. said context was processed with the crazy speed of 280t/s
>>107147001
good job son

>>107147001
oh, that ONE 100b model

>>107147018
No -- not Scout. Air.

>>107146715
I find the le epic saarposting way more annoying. if there are any genuine Indians in this thread, they're smart enough not to expose themselves.

>>107147018
yeah, command-a

>>107146812
https://voca.ro/1ledz0YLOtSx

>>107147035
I would bet it's the opposite. Indians saarposting ironically to blend in, because of course an Indian would never encourage the stereotype, but they really are that dumb.
>Indians in this thread they're smart enough
looool

>>107147041
vibevoice is so good. every time i hear it i want to set it up, but i always decide to delay it to some other time

>>107146902
That's the one I was trying - 4.2.0.
>>107146765
Thank you for sharing.

>>107146881
>no replies

>>107147057
>I train AI
Would bet my whole net worth she is either a Project Manager or some scrum meeting whatever coordinator.

>>107147064
>Thank you for sharing.
you might also want to try cydonia 24b v4zd
>>107146881
what tags? i think tortoise-tts had some way of adding a pause, beware of radiation
>>107147059
My Indian coworkers are hard-working and friendly.
>>107147084
Yeah rajesh, I'm sure they are

>>107147084
Good morning, Sir.

>>107147057
I, too, have run a qlora or two.

>>107147061
I keep waiting for some vibevoice.cpp or ONNX implementation. I hate wasting hours setting up a python environment and getting everything working on all of these projects. I think the default implementation only has a webui and no OAI API either.

Early morning saar

>>107147083
I mean tags like <pause>, <emphasis>, and idk, maybe even <calm>, <excited>, <happy> etc
>beware of radiation
what do you mean?

>>107147073
frog niggers deserve no replies

>>107147107
doesn't it have comfyui nodes? the reason i avoid setting it up is i dont know where to start since the main repo is down, im not sure how many steps i should use, if 1.5 or 7b.. etc etc, and its not like id use it much besides for 30 minutes afterward
one day ill scour the archives and maybe write a rentry, who knows

>>107147119
maybe vibevoice supports those, also the new thing recently released likely supports it too, check the op
>radiation
eck. i remember some tts model that released recently and supported many >tags

>>107147129
>doesnt it have comfyui nodes?
I guess that would be an easier setup, but I'd rather have an API I can call from other applications and automate rather than going through any UI.
>one day ill scour the archives and maybe write a rentry, who knows
Might as well ask here, someone is bound to have a working setup. Off memory, I think lower steps was better.

>>107147169
>Might as well ask here
some other time.. when I'm feeling a bit more confident
>Off memory, I think lower steps was better.
thx

>400+ posts and no new goof
you're such massive maggots
>>107147151
I just don't get why such basic features for text-reading control are so rare. Even if emotional speech control is rather difficult when the training data doesn't include it for the voice, a simple pause should be possible.
I already think of doing it programmatically: read the text in chunks until <pause>, render the file, create a pause, then continue, and eventually stitch the parts and the pauses together.
TTS AI still seems so underdeveloped and basic.
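A minimal sketch of that chunk-and-stitch approach; synthesize() is a hypothetical wrapper around whatever TTS you use, assumed to return mono float32 audio, and 24 kHz matches Kokoro's output rate:

```python
import re
import numpy as np
import soundfile as sf

SR = 24000  # sample rate your TTS outputs; Kokoro uses 24 kHz

def synthesize(text: str) -> np.ndarray:
    """Hypothetical wrapper around your TTS engine; returns mono float32 audio."""
    raise NotImplementedError

def render_with_pauses(text: str, pause_s: float = 0.6) -> np.ndarray:
    # split on <pause> tags, render each chunk, splice silence in between
    chunks = [c.strip() for c in re.split(r"<pause>", text) if c.strip()]
    silence = np.zeros(int(SR * pause_s), dtype=np.float32)
    parts = []
    for i, chunk in enumerate(chunks):
        if i > 0:
            parts.append(silence)
        parts.append(synthesize(chunk))
    return np.concatenate(parts)

# sf.write("out.wav", render_with_pauses("First part.<pause>Second part."), SR)
```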
>>107147202
i think putting a . or ; or something like that works

>>107147210
>>107147210
>>107147210

>>107147213
Well, not in Kokoro. It makes strange vowel sounds if you put more dots after a sentence. The only way to create a bit more of a pause is putting the text on a newline, but two or more newlines do nothing again.

>>107146168
Ah yeah, for general knowledge it's not great

>>107138890
no one asked how much of JLPT N5 you know, ranjeet