/g/ - Technology




/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106785094 & >>106777408

►News
>(10/03) Qwen3-VL-30B-A3B released: https://hf.co/Qwen/Qwen3-VL-30B-A3B-Thinking
>(10/02) ZLUDA 5 released with preliminary support for llama.cpp: https://vosen.github.io/ZLUDA/blog/zluda-update-q3-2025
>(10/01) Granite 4.0 released: https://hf.co/collections/ibm-granite/granite-40-language-models-6811a18b820ef362d9e5a82c
>(10/01) LFM2-Audio: An End-to-End Audio Foundation Model: https://liquid.ai/blog/lfm2-audio-an-end-to-end-audio-foundation-model
>(09/30) GLM-4.6: Advanced Agentic, Reasoning and Coding Capabilities: https://z.ai/blog/glm-4.6

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: 1731045591124463.jpg (560 KB, 1152x2048)
►Recent Highlights from the Previous Thread: >>106785094

--Evaluating model performance in replicating 4chan responses through Azula Test and programming challenges:
>106790445 >106790503 >106790627 >106791305 >106791448 >106791613 >106791666 >106791673 >106791758 >106791800
--zram vs nvme swap tradeoffs for llama-server memory management:
>106785342 >106785402 >106785440 >106785767
--GLM 4.6 model performance and quantization tradeoffs:
>106785160 >106785265 >106785304 >106785310 >106785350 >106785363
--Skepticism and analysis of hybrid quantization model performance claims:
>106786959 >106786964 >106787006 >106786984
--Adjusting koboldcpp anti-abuse parameters and user concerns:
>106788161 >106788194 >106788204 >106788246 >106788222
--GLM model compatibility, layer splitting, and banned strings implementation challenges in local LLM setups:
>106786681 >106786698 >106786777 >106787027 >106787043 >106786746
--ROCM/Vulkan performance issues and model runner alternatives for better output consistency:
>106785478 >106785529 >106785609 >106785617 >106785627 >106785674
--Anticipation and skepticism around upcoming Gemini 3 release and Gemma model improvements:
>106788067 >106788168 >106788525 >106788874
--glm 4.6 quantization choices for 128GB RAM and 16GB GPU VRAM systems:
>106787432 >106787444 >106787446 >106787592 >106790941
--Mistral Nemo's roleplay performance attributed to lack of safety constraints:
>106790181 >106790218 >106790276
--Qwen3-VL-30B-A3B vision model release with 4-bit quantized version:
>106786925 >106786938 >106790288
--Optimizing GLM-4.5-Air model size and quantization for VRAM/RAM constraints:
>106788003 >106788268 >106788280 >106788391
--Miku (free space):
>106785751 >106785797 >106785878 >106786172 >106786553 >106786953 >106787862 >106790322 >106793233 >106793303 >106793366

►Recent Highlight Posts from the Previous Thread: >>106785099

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
Mikulove
>>
I was reading classical literature, and read "Shivers ran down - spine"
>>
File: 1740197051761392.png (639 KB, 502x556)
>>106793382
>>106790276
>What we need is not democratized inference, but democratized training.

That already exists with tools like unsloth and axolotl.
https://github.com/unslothai/unsloth
https://github.com/axolotl-ai-cloud/axolotl

But the vast majority of people won't even put in the effort to understand how datasets actually work, let alone figure out how to train anything in the first place.

The aforementioned tools are primarily used for fine-tuning, but you can use existing open source libraries to pre-train your own model too (provided you have enough compute, data, money, and patience to do so).
>>
Or how to know if you fucked up your chat template.
>>
File: 1737381365636138.jpg (27 KB, 264x377)
What's the absolute best AI model for searching and deep research right now? Local or non-local?
>>
stop posting lust provoking images
>>
File: 13823094029374.jpg (175 KB, 800x1066)
>>106793382
>>106793525
>>
>>106793523
https://huggingface.co/Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
>>
ashram teller tiger
>>
is there gguf support for glm4.5v yet?
>>
>>106793499
>>106793512
What would be the best way to do a finetune to get a model to pick up a specific SQL syntax? E.g. I want to finetune a model to be an expert in Apache Solr (an example, not what I'm actually aiming for).

I'm aware I need a curated dataset with positive/negative examples and plain reference material, and I know about unsloth/HF, but no idea beyond that.
I feel like there's probably some existing work/effort towards this, but I haven't been able to guess the right phrases to search for.
>>
why are there more and more fagbots on chub?
>>
>>106793646
chub is dying along with the other sites similar to it. interest in llms is rapidly fading so all that remains is the bottom of the barrel
>>
The only thing that would make 4.6 better is if at some point it would be proven that the only thing they did was move the "safety" slider for data all the way to the left.
>>
>>106793646
because you are multiplying. actually that is kinda weird.
>>
>>106793523
>searching and deep research
wtf do you actually mean by that? describe the tasks and how you determine proficiency. you must understand how the tools work to use them well
threadly reminder every LLM is a loop on f(prompt)=logprobs
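to spell that reminder out, here's a minimal sketch of the loop; f() is a toy stand-in for a real model's forward pass, and the greedy argmax is just one way to pick from the logprobs:

```python
# Sketch of "every LLM is a loop on f(prompt) = logprobs":
# call f, pick a token from the logprobs, append, repeat.
# f() here is a toy stand-in for a real model's forward pass.
import math
import random

VOCAB = ["the", "cat", "sat", "on", "a", "mat", "<eos>"]

def f(tokens):
    """Fake forward pass: returns one logprob per vocab entry."""
    rng = random.Random(" ".join(tokens))  # deterministic per context
    logits = [rng.random() for _ in VOCAB]
    norm = math.log(sum(math.exp(x) for x in logits))
    return [x - norm for x in logits]      # normalized logprobs

tokens = ["the", "cat"]                    # the prompt
while tokens[-1] != "<eos>" and len(tokens) < 16:
    logprobs = f(tokens)                   # one forward pass
    tokens.append(VOCAB[logprobs.index(max(logprobs))])  # greedy pick
print(" ".join(tokens))
```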
>>
File: file.png (105 KB, 564x481)
>>106793474
About this time last year I heard the Halloween music come on in the store. LLMs have ruined spooky skeletons and Bohemian Rhapsody for me. Well, I wouldn't say "ruined" Bohemian Rhapsody. It's just that I smirk mischievously with sparkling eyes when one certain line comes up.
Bonus Teto: https://www.youtube.com/watch?v=pwU6gWmb5yc shivers @ 1m54s
>>
>>106793813
i am NOT a faggot
say sorry NOW
>>
>zai-org/GLM-4.6
true high quality ERP has now been tried.
>>
>>106793605
Negative examples are useless for LLM training as far as I know.
Avoid unsloth, it's astroturfed and made by incompetent people. Use axolotl and do QLoRA finetuning (ignore the people who will say it doesn't work, they don't know what they're talking about).
To finetune effectively you HAVE to understand chat templates and masking. The input to any LLM training process is basically a text that fits in the context window, optionally with some parts of the text "masked", typically the user input.
All forms of LLM training reduce to that. The black magic is in generating a good dataset to train on.
But you can begin by just converting books and documentation to .txt and training on that. Then go from there. Remember to keep a val dataset.
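To make the masking part concrete, here's a minimal sketch of how one training example is typically assembled: one token sequence, with labels set to -100 (the ignore index PyTorch's cross-entropy skips) over the user turn so no loss is computed there. `tokenizer` stands in for whatever HF-style tokenizer you use; real chat templates also insert role/special tokens that are skipped here:

```python
# Minimal sketch of SFT loss masking. `tokenizer` is a stand-in for any
# HF-style tokenizer; real chat templates also add role/special tokens.
IGNORE_INDEX = -100  # positions with this label contribute no loss

def build_example(tokenizer, user_text, assistant_text):
    user_ids = tokenizer.encode(user_text, add_special_tokens=False)
    asst_ids = tokenizer.encode(assistant_text, add_special_tokens=False)
    input_ids = user_ids + asst_ids
    # Mask the user turn: the model sees it as context but is only
    # trained to reproduce the assistant tokens.
    labels = [IGNORE_INDEX] * len(user_ids) + list(asst_ids)
    return {"input_ids": input_ids, "labels": labels}
```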
>>
>>106793646
>>106793826
i heard chub has an algo which suggests cards based on downloads it detects. cool huh.
>>
>>106793819
He means asking a question and having the models find the information from the Internet as effectively as possible.
>describe the tasks and how you determine proficiency
I tell the model "make a document in exquisite detail about the architectural details of GLM 4.6" and it does, even though GLM 4.6 didn't exist when the model was trained.
>>
>>106793872
lel, there's an issue: i dont have an account, my browser clears cookies upon exit and i have a dynamic ip
and i havent used chub in over a month
>>
What works better for coding if I don't want to wait hours for the model to respond, GLM 4.6 with <think></think> or Qwen 3 Coder 480B?
>>
>>106793860
>they don't know what they're talking about
explain why new nemo sloptunes are still being made.
>>
>>106793499
what happened to INTELLECT?
>>
File: ComfyUI_00538_.png (937 KB, 1024x1024)
>>106793837
Now do you see?
I will continue slopgenning until a true artiste realises GLM-chan. 4.6 is a big leap and deserves a mascot. why is there a deepseek general?
4.6 RP is great, and tool calling works well... if you can pump enough tokens to make it useful in realtime
>>
>>106793895
Qwen 3 Coder 480B definitely
>>
>>106793900
??
>>
>>106793900
From their AMA, they should be releasing 3 either this month or next. But shit data makes for an underwhelming model.
>>
>>106793892
>lel, there's an issue: i dont have an account, my browser clears cookies upon exit and i have a dynamic ip
You're basically saying
>It cannot possibly know i download gay porn because i delete all the evidence of downloading gay porn
>>
>>106793892
It is just that good at detecting a homo stench on you
>>
Is there an AI that'd run well on an M4 Mac mini? Mainly just wanna make hentai stuff. All my windows PCs are less powerful.
>>
>>106793952
How much unified memory?
>>
>>106793892
Limp-wristedness is documented as being heavily correlated with homosexuality. And limp-wristedness can easily and accurately be measured by reading cursor movement.
>>
>>106793964
16GB
>>
>>106793973
you cant do shit then
>>
>>106793921
And I assume whatever anons are responsible for it won't stick their neck out with copyrighted data in the dataset, which is sensible.
So we have a distributed training method proven to work, all that needs to happen is a dataset with all of the copyrighted shit... all of it.
>>
>>106793973
Oof.
Mistral Nemo.
>>
>>106793982
The biggest issue is all the current methods are made to work on homogeneous hardware. I don't see how to let people contribute with P40s without dragging the entire effort to a crawl, but if that could be fixed, I think a group finetune would be more practical than a new model from scratch.
>>
>>106793979
>>106793999
there's no AI for poors? Even if it means super long render times?
>>
>>106793973
>16GB
>macfag
*ducks and covers*
imagine socketed sodimm in a macbook. why not? there's no technical reason, it's just greed
>>
>>106793973
Qwen3 30B at Q3S will work nicely.
>>
>fuck around with homebrew evolutionary neural architecture in sepples
>first problem requires writing an interpreter with a stack and memory
>second problem requires writing a graph compiler
>third problem requires writing a scheduler
>fourth problem requires writing a cache system and branch predictor
What the fuck have I gotten myself into, kek. I love fundamental stuff so it's a lot of fun but at the same time highly interesting how all these concepts just arise from the basic system requirements. Is it emulators all the way down?
>>
what's better now than gpt-oss 120b (unquanted) for general knowledge and coding within the same memory footprint? 96gb vram, could offload to 384gb ram if worthwhile.

gpt-oss 120b has actually been really useful for reference and coding, but it has so many baked-in traits that can't be altered with the prompt.
>>
>>106794017
Technically you can run any model on any computer as long as it fits on the hard drive.
The problem is that inference has to read through the whole model (or, in the case of MoE, the activated fraction) for EVERY token. So if the model weighs 300 GB (which is the ballpark for the good models) and has 10% activated, that still means reading through 30GB of data for every token, which is obviously extremely slow.
And in that case even having 64GB of RAM won't save you, because every token uses a different 30GB subset of the file, so it still has to load that 30GB from disk. You only get into the tokens-per-second range once the whole model fits in RAM, and unfortunately all the models that can fit in 16GB are tiny models. 1B parameters ≃ 500MB of data at 4-bit quantization, and all the good models are in the 400B range, so you would need 200GB just to hold the model's weights, plus some more for the KV cache (don't ask what that is, it's complicated).
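The whole thing boils down to one division: tokens per second is bounded above by bandwidth over bytes read per token. A quick sketch, with ballpark bandwidth figures for illustration only (not exact specs):

```python
# Upper bound: tokens/sec ≈ bandwidth / bytes read per token.
# Bandwidth values are ballpark figures for illustration only.
GB = 10**9
bytes_per_token = 30 * GB  # 300 GB MoE model, ~10% activated per token

for name, bw in [
    ("NVMe SSD (streaming from disk)", 5 * GB),
    ("dual-channel DDR5", 80 * GB),
    ("GPU GDDR6X", 1000 * GB),
]:
    print(f"{name}: ~{bw / bytes_per_token:.2f} tokens/sec")
```

Which matches the point above: well under 1 t/s from disk, a few t/s from RAM, tens from VRAM.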
>>
>>106794031
WINE? In my LMG?
>>
>>106793819
I just want to find information that's as accurate as possible. It doesn't even have to be an LLM. LMArena says Grok 4 Fast is currently the best at searching but I feel that's wrong
>>
>>106794070
GLM 4.6
>>
>>106794121
>within the same memory footprint
GPT-OSS 120B is like 65GB of memory, and Q1 is still 97GB?
>>
File: file.jpg (222 KB, 1850x1002)
>>106794141
You said you could offload to RAM. It's worth it, trust me.
>>
>>106794101
>WINE
ackshually, WINE is a compatibility layer :^)
This feels more like an emulator for some kind of fever dream hardware. Driven by hatred for matrix multiplication and SGD, this is the price to pay.
>>
Got 32 GB DDR4 and 2 8GB GPUs (1080 and 3070).
What are your recommendations for a general chat bot that is not completely retarded ?
>>
>>106794191
ienno man i enno man man listen
heres the deal
u need more ram man
but look maybe a low glm air quant or maybe maybe just maybe qwen a3b 30b thinking or no i dont know
yea
>>
>>106794191
Shit nigga, you are making things kind of hard.
I think your best bet is Qwen 3 30B A3B, probably the thinking variety.
>>
>>106794191
good luck m8
>>
>>106794191
maybe Qwen3 32B or whatever recent dense 32b model
yea that might be good, maybe not tho
ieno
i wonder why no one tried qwen3 32b seriously in lmg
i know a few anons did but ugh.. llama2 34b
>>
File: 1718768497057609.jpg (102 KB, 854x687)
>>106793860
thanks anon, very much appreciated. Will keep in mind.
>>
Getting tired of nemo slop output, is there any decent alternative for an 8gb vram serf? I do 85% fantasy RP and 15% plap plap.
>>
>>106794344
stop being poor
>>
>>106794344
If you have 64gb of vram, GLM Air is viable.
>>
still no qwen 80b & vl goof
>>
>>106794027
link to guide for that please?
>>
>>106794347
never
>>
>>106794148
OK, I'll give it a try, thanks.
>>
Is qwen 3 235b now the best model available with vision abilities?
>>
>>106794553
>106794553
qwen3 30b vl
>>
>>106794560
The 30b is better than the 235b?
>>
>>106794553
>>106794560
>>106794567
where goofs?
>>
>>106794567
no the https://huggingface.co/moonshotai/Kimi-VL-A3B-Thinking-2506
>>
>>106794553
Depends what you use it for. From the n=1 test I've done, dots.vlm1 seems to be better at handwritten text recognition, but Qwen has been trained to parse GUI elements and give exact coordinates.
>>
has anyone tried this yet
https://huggingface.co/BasedBase/GLM-4.5-Air-GLM-4.6-Distill
>>
>8700g
>96gb ram

What can I run.
>>
>>106794592
What? Can I Run.
>>
>>106794596
What can? I run.
>>
>>106794610
What can I? Run!
>>
What can I--RUN!
>>
>>
>>106794591
Where original weights. These quants aren't enough.
>>
i miss the old days where the best model was MiQu-70b or midnight rose
t. never used miqu for erp because 1t/s
>>
>>106794403
You will want to use Linux for this, with either LXDE or the plain console without a graphical environment, since with 16GB every GB counts and you don't want any wasted on the OS itself.
Step 1: download llama.cpp
Step 2: download the GGUF file from huggingface (the model). This could be Qwen_Qwen3-30B-A3B-Instruct-2507-Q3_K_S.gguf or similar (try Q3 XXS or Q2 if you run out of RAM).
Step 3:
Figure out a command line that works for you. This works for me:
llama-server -m <your file here>.gguf -c 32000 --port 8001 -ngl <try different values from 0 upward>
Then access 127.0.0.1:8001 in a web browser, or (to save RAM) if you're not scared of the command line you can write a minimal python client for the OpenAI-compatible API on the same address and port (see the sketch below).
Alternatively you can try llama-cli, since that should use a bit less memory than the server; most people start with that one before moving to the server command. The command line is more or less the same.
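Since llama-server exposes an OpenAI-compatible /v1/chat/completions endpoint, the minimal python client mentioned above can be stdlib-only, something like this (port matching the command above):

```python
# Minimal stdlib-only client for llama-server's OpenAI-compatible API.
# Matches the --port 8001 from the llama-server command above.
import json
import urllib.request

req = urllib.request.Request(
    "http://127.0.0.1:8001/v1/chat/completions",
    data=json.dumps({
        "messages": [{"role": "user", "content": "Say hello in five words."}],
        "max_tokens": 64,
    }).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    reply = json.load(resp)
print(reply["choices"][0]["message"]["content"])
```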
>>
>>106793837
high quality indeed
none of the erp-focused troontunes come close either
>>
>>106794585
I have 2 hypothetical uses for vlm:
- observe my screen in real time to provide suggestions / commentary
- a feedback loop for image generator prompting
>>
>>106794700
Well if you figure out a way to run Qwen3 VL on CPU then share with the class, unless you happen to have 500GB of VRAM lying around.
>>
>>106794711
>30B
Why are you gay?
>>
>>106794791
https://huggingface.co/unsloth/Qwen2.5-VL-7B-Instruct-GGUF
>>
File: 1758978276426960.png (18 KB, 1333x138)
>>106794648
haha yeah...
>>
What is the best ERP multimodal model available in GGUF format? I have quad 3090s.
>>
>>106794881
https://huggingface.co/unsloth/Llama-4-Scout-17B-16E-Instruct-GGUF
>>
>>106794890
I thought llama4 was garbage?
>>
File: 1730310220787427.png (318 KB, 615x688)
>1 bit
kek
>>
>>106794927
nigga its been 6 trillion years
>>
>>106794934
He knows, that's why he cropped the date out.
>>
>>106794934
>>106794950
it's not that old lol
https://xcancel.com/LiorOnAI/status/1913664684705874030
>>
>>106794967
>April 19
Might as well be ancient history.
>>
>>106793382
is anyone still making finetunes 100b and up aside from that giga-fag thedrummer?
>>
>>106794992
finetunes are a scam
>>
File: 1759000089785513.jpg (9 KB, 180x246)
>>106794074
>and all the good models are in the 400B range
Nta. What kind of shit are you guys doing locally that would require 400B as the bare minimum?
>>
>>106794992
https://huggingface.co/zerofata/GLM-4.5-Iceblink-106B-A12B
>>
>>106794992
Finetuning a model that has already undergone post-training is a recipe for failure
>>
>>106795051
Nta. Elaborate. Are you claiming fine tuning a base model that has already gone through SFT training is a bad idea? Why?
>>
File: G2dh9tXbYAEsXd8.jpg (392 KB, 2048x1536)
>>
>>106794791
The conversation was about the 235B model retard.

>>106795033
Nobody said it was the bare minimum, I said "the best". As for what I'm doing, programming. The biggest models are just barely good enough to not be very frustrating.
>>
>>106795087
I like this bee
>>
>>106795087
beeku
>>
>>106795087
peeling back beeku's foreskin
>>
>>106794927
>>106794934
>>106794950
>>106794967
>>106794976
Was this model any good? Anyone tried to retrain it into specific tasks?
>>
>>106794711
does transformers not automatically overflow to cpu memory with qwen vl like it does for most other models?
>>
>>106793860
>Avoid unsloth, it's astroturfed and made by incompetent people
does this go for their training software, or the ggufs too? for stuff like glm 4.6 am I better off downloading bartowski or someone else?
>>
>>106793901
so you're saying 4.6 is better than deepseek period?
how about for writing short stories?
>>
>>106795077

Most likely because official instructs are deepfried with slop, benchmax, and alignment.
>>
>>106794074
that was actually well explained anon, good post
>>
>>106795551
are you working on a tune on the new Apriel?
>>
>>106794666
noted thanks
>>
why is "teen" a bad word on chub? How am I supposed to make Asuka?
>>
>>106795607
teen = underage
>>
>>106795607
Whoa there Anon. Did you think an unsafe thought?
>>
>>106795621
But what if she's twenteen?
>>
>>106795621
>>106795625
but I specified it's an anime woman
>>
>>106795634
>twentween
This is underage-coded. We must refuse.
>>
>>106795621
>underage
like nineteen?
>>
>>106795593
I haven't taken a good look at it yet. Did you like Snowpiercer v3? Does it feel like a smarter Nemo to you?

Any thoughts on the new Apriel? I haven't tested it due to the stupid chat template. It's in my backlog.
>>
>>106795677
19yo can't buy alcohol in US and isn't considered adult in Japan
>>
>>106795697
but we're talking about sex here?
>>
>>106795697
An 18-year-old can be filmed sucking a mile of bbcs and sent to fight for his country overseas. Imagine some law-abiding amerimutt dying in sandniggerstan before he has even had his first beer
>>
File: 17350764267.png (872 KB, 1080x625)
>>106795741
>Imagine some law-abiding amerimut dying in sandniggerstan before he has even had their first beer
that's literally "Apocalypse Now": Laurence Fishburne (yeah, the guy who played Morpheus in The Matrix) played a man forced into the Vietnam draft, and he was 14 lol
>>
>>106795677
Nineteen is close to eighteen. This is underage-coded. We must refuse.
>>
>>106795741
murica has always been weird with alcohol, they tried to ban it 100 years ago after all
>>
Women become adults when they reach menopause.
>>
my tokens are of age
>>
>>106795687
I like Snowpiercer v3 - it's more fun and smarter than nemo. The writing is a little too flowery and overly dramatic for my tastes but it's clever and surprisingly obedient, given its size

I briefly tried the new Apriel for rp but all it did was spit out refusals if the topic was slightly controversial
>>
>>106795797
With thinking enabled? Either way, I'm sure I can remove the refusals / positivity. Whether it still has the Nemo fun is TBD. What do you mean by surprisingly obedient?

Does anyone know if Pixtral 12B ruined Nemo?
>>
>>106795781
You should always carry a legal disclaimer with you.
>My thoughts and fantasies are only suitable for adults.
>>
>>106795850
>What do you mean by surprisingly obedient?
it tries to express the personality traits in the character card more faithfully, even with multiple characters
>>
>>106795850
It's definitely not just Nemo + vision. Kind of like Pixtral Large is Mistral Large 2407 but different (and not 2411)
>>
>>106795850
Hey Drummer, just wondering, you said MoEs are difficult to train. Is that just an open sores issue or do you/we know that's true for the actual big companies as well? If that is true, it's interesting that even with all that fuss, it's still cheaper to train for them compared to dense models.
>>
>>106794017
If you want images, you should check /ldg/ not here.
>>
of course github has to ACK! the moment I need it. FUCK.
>>
>>106795997
It's not true at all.
>>
>>106795033
>What kind of shit are you guys doing locally that would require 400B as the bare minimum
A casual conversation where the model doesn't confuse what I said, for what it said, after a few paragraphs.
>>
>>106796201
This is just indirectly a model size problem, depending on what you're asking. Larger models contain more rare knowledge, even after training data filtering. A model designed from the ground up for RP, chatting and storywriting of all kinds would not need to be enormous.
>>
>>106796263
>A model designed from the ground up for RP, chatting and storywriting of all kinds would not need to be enormous.
That's where you're wrong, bub. Those are the most open domain tasks you can ask for, so they need the biggest possible models.
>>
>>106796263
Yeah, and games can run faster if it wasn't using UE5, but here we go
>A model designed from the ground up for RP,
But how would it help beating the benchmarks?
>>
>>106796263
I'm not so sure about that. Active parameter count definitely has a huge impact on intelligence. Qwen models are math benchmaxxed as fuck but their ~200B models sure as hell beat something like Gemma 12/27B, which are chat-focused models.
>>
So regarding Harmonic Moon 12B... it passed the N and K tests, but it's a bit more resistant and judgemental about cunny than Rocinante, though it's still possible.
It has a larger roleplay vocabulary, but it includes more slop too.
It's also more prone to repetition than Rocinante, and slightly more retarded in understanding context without constant extra explanations and reminders.
It works as a change of pace, but not a replacement for Rocinante.
Rocinante v1.1 remains the king of Nemo 12B models.
>>
File: 1739873691936097.webm (691 KB, 332x518)
>>106796324
>It's also more prone to repetition than Rocinante
Sounds fucking awful
>Rocinante v1.1 remains the king of Nemo 12B models
People really should try the unslopnemo tunes. Literally just Rocinante with less slop.
>>
>>106796263
Abolish The Entire Internet as training data, right now!!! What do we want? Narrow RP models!
>>
>>106796354
>People really should try the unslopnemo tunes
We did long ago, they are all worse than Rocinante v1.1.
>>
Ooga textgen has been missing parameters like num_beams
>and also missing hidden functionality that at times surfaced as bugs and crashes
How do I use the num_beams parameter in ooga?
why can't I do beam search? Why is there no information online about it?
Is there a better alternative to textgenwebui already?
>>
File: glm-reasoning.png (57 KB, 988x352)
>>106796356
What if instead of code and math reasoning they focused on conversations, fiction and roleplay?
>>
>rocinante is still the best
grim, it really didn't have any nuance in my cards
>>
>>106796382
They aren't though
>>106796383
nobody uses oogabooga, it's brown-coded.
>>
>>106796383
everyone is using koboldcpp for its banned strings implementation, which all other APIs lack. banned strings have become essential for getting rid of slop and dumb shit in local models
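for the curious, the core idea is easy to sketch: generate, and whenever the decoded tail matches a banned string, backtrack and resample with the offending token excluded. This is a naive illustration, NOT koboldcpp's actual implementation; sample() and detok() are hypothetical hooks into whatever backend you use:

```python
# Naive sketch of banned-strings decoding: backtrack and resample when
# the output tail matches a banned string. Not koboldcpp's real code;
# sample(ids, excluded) and detok(ids) are hypothetical backend hooks.
def generate(sample, detok, prompt_ids, banned, max_new=256):
    ids = list(prompt_ids)
    banned_at = {}  # position -> token ids disallowed at that position
    while len(ids) - len(prompt_ids) < max_new:
        tok = sample(ids, banned_at.get(len(ids), set()))
        if tok is None:      # every candidate at this position is banned
            break
        ids.append(tok)
        text = detok(ids[len(prompt_ids):])
        if any(text.endswith(s) for s in banned):
            bad = ids.pop()  # drop the token that completed the string
            banned_at.setdefault(len(ids), set()).add(bad)
    return ids
```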
>>
>>106796398
fiction and roleplay aren't actual usecases for productive people
>>
>>106796405
That is just drummer and his goons samefagging, I don't think anybody here actually uses that or any of his models. Have you noticed how it's always the same inorganic spamming at the same hours? I hope they're well-paid.
>>
>>106796419
Fiction is a usecase for productive people you retard. They use AI to help write books, create worlds for their books, or even spam useless books with self-publishing for easy money.
Likewise fiction is used for homework in schools.
Roleplay in general is a great measurement for a model's capabilities and quality, as it depends on every single category of understanding, including math.
So go suck a dick somewhere else.
>>
>>106796432
>easy money
not
a
use
case
>>
>>106796425
i hope i never reach this level of paranoid schizophrenia, you should probably take a break from 4chan
you sound like the schizo people on /pol/ calling everyone a jew, tranny, glowie, etc
>>
>>106796432
>fiction is used for homework in schools.
wut?
>>
>>106796462
>he hasn't gone to school
makes sense, this is /lmg/
>>
>>106796419
Productive people don't need AI
The only use cases for AI is coom and scamming old people
>>
File: dr.png (52 KB, 198x198)
>>106796450
>t. picrel
>>
File: 1744660516244294.gif (160 KB, 430x270)
>>106796482
>saved drummer's avatar in his schizo folder
>>
>>106796462
Bruh
>>
>>106796462
That does sound like a great reason for more filtering and safety. Wouldn't want to ruin Timmy's life because his homework he asked Llama-4-pussyslayer to write said the n-word
>>
>>106796482
Did he found a Job?
>>
>>106796432
And plap cunny, let's be real
>>
>>106796509
That does not sound like safe and ethically aligned homework sir
>>
>>106796515
It's okay, the persona that I use during such RP is also underage.
>>
>>106796518
That's even worse, your persona is abusing itself!
>>
>>106796425
ok then tell me what models are good, I haven't been paying attention for months and have a 24 GB 3090 and 32 GB RAM
>>
>>106796607
GLM 4.5 air or the infinitely better 4.6, anything else is disgusting cope.
>>
>>106796607
Mistral Small 3.2 and Nemo are still the only things worth using, unless you have 128GB+ RAM.
>>
coping above
>>
https://huggingface.co/justby192G/GLM-4.5-FaggotPlacebo-106B-A12B
>>
>>106796618
First of all, Air is fucking shit at any quant, and with only 24/32 you'd barely even fit Q2 in there; Q3 would have to spill over to storage. Awful advice.
>>
>>106796618
even on dogshit quant that I would have to run in ram?
ok I'll give it a shot, do you have an ST preset by any chance?
>>106796623
yeah but they always feel like they're way too horny and not at all nuanced
ironically enough, some ancient mythomax-level tune was the only one that ever gave me absolutely fantastic manipulative, gradually gaslighting character behavior, but I'm pretty sure the logs and metadata have been lost......
>>
File: file.png (65 KB, 198x198)
>>106796487
>t. picrel
>>
>>106796633
I do not care about your financial status. If you're not using good models, the fuck are you even doing? Get a job to be able to run GLM instead of wasting time on cope.
>>
>>106796630
meant for >>106796618
>>
>>106796639
>uuuh these models are le bad
>ok given this hardware what should I use
>use this thing
>btw you need to buy a new PC for that
holy shit you are actually retarded mate
>>
>>106796639
post your GPUmaxx rig. If you can't, you're poor.
>>
>>106796502
Insider here (can't say which lab I work for). Our HR girl almost called him for a second interview but my colleague stopped her. We can't have competent safety engineers, because then we'd never reach the benchmark goals and nobody would get a bonus.
>>
>>106796651
>You need a decent computer to enjoy literal SOTA Artificial Intelligence at home
>waaaaaaaah I'm poor and have no job
Ok.
>>
>>106796657
Can confirm, I'm the boss, and the HR girl is under my desk right now.
>>
>>106796667
no that's ntr what the fuck
>>
>>106796662
ask your SOTA if your recommendation was an adequate answer to "given this hardware, what is the best model to use" since you are too peabrained to figure it out yourself
>>
It's just odd how functionality gets removed from AI and ... decoder tools or whatever without their userbase making a trackable post about it
>>
>>106796657
i can also confirm cause i am that colleague, but I stopped her because drummer is a huge fucking faggot
>>
>>106796672
The best model he can use is his ass to work to get money.
>>
Anyone here use beam generation / Beam search?
Are you just loading it up and talking to it?
Are there any /g/ loras ?
>>
>>106796662
lmao, extremely low standards to call GLM SOTA
>>
>>106796651
>you need to buy a new PC for that
We moved from: you need to buy a server, to: you need to buy new RAM for your 7xxxX3D / 9xxxX3D gayming pc.
>>
>>106796398
They did for 4.6
>>
>>106796695
NTA but I base State of the Art on State of my Dick. It hurts.
>>
>>106796703
syphilis is not state of the art.
>>
>>106796354
What is up with that deer? Did other humans feed it or something and get it accustomed to human contact
>>
>>106796697
is RAM performance better now or what happened?
>>
>>106796731
Most likely rabies. Can make wild animals docile one minute, and then apeshit the next.
>>
>>106796736
AGESA can handle 128GB+ now. I have a shitty B650 and it works perfectly.
>>
>>106796774
I'm on b450 myself, would it work as well?
also are you talking about DDR5?
>>
>>106796774
>B650
>AM5
yeah I'm not changing my motherboard cause I would have to replace most of the hardware and I got better things to spend my money on when it still functions perfectly fine
>>
>>106796796
>still functions perfectly fine
>can't run GLM
sure it is.
>>
>>106796821
get some hobbies outside of a personal gooncave my friend
>>
>>106794581
I'm trying the instruct version. It seems free from "not x, but y" shit.
>>
>>106796691
>Anyone here use beam generation / Beam search?
Not likely, most people are using llamacpp, which doesn't support beam search (see the transformers sketch below).
>Are you just loading it up and talking to it?
sometimes but I like writing stories more.
>Are there any /g/ loras?
the finetuners merge them into the base model, so there aren't really loras floating around; I've never seen or heard of a /g/-specific one, but other 4chan boards have been modeled.
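For reference, beam search still exists in HF transformers (which is what ooba's transformers loader wraps), so a minimal sketch looks like this; the model name is just an example, any causal LM works:

```python
# Minimal beam-search sketch with Hugging Face transformers
# (llama.cpp has no beam search; the transformers loader does).
# The model name is just an example; any causal LM works.
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")

inputs = tok("The capital of France is", return_tensors="pt")
out = model.generate(
    **inputs,
    num_beams=4,             # keep 4 candidate sequences at each step
    num_return_sequences=2,  # return the top 2 finished beams
    max_new_tokens=32,
    early_stopping=True,     # stop once enough beams are finished
)
for seq in out:
    print(tok.decode(seq, skip_special_tokens=True))
```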
>>
This is too easy.
https://files.catbox.moe/5fxf9b.txt
>>
>>106796781
Yes DDR5 and I have no idea about older.
>>
>>106796840
get a job.
>>
File: peak is incoming.png (11 KB, 374x91)
>>
File: 1743303690001574.jpg (349 KB, 1920x1080)
>>106796879
>think:
stopped reading there.
>>
>>106796909
What, you don't like reasoning?
>>
>>106796914
Reasoning is only ever useful to enforce guardrails.
>>
File: 1750382453231997.jpg (74 KB, 591x791)
>>106796914
Nope. Not at all.
>>
>>106796889
get a life.
>>
>>106796914
my rig is too slow to waste time thinking, has anyone posted 4.6 logs without thinking? does the model still work if you skip it?
>>
>>106796922
Maybe you should have read the log after all.
>>
>>106796930
Yes, it works completely fine without it too.
>>
what's the lore on that drummer guy
>>
>>106796972
I've got a log for you. Open your mouth.
>>
>>106796977
He makes finetunes that some people like. There's a schizo here 24/7 who has a meltdown whenever someone mentions drummer or his models
>>
>>106795130
Why, the fuck, do you need vision. Unless you're a frontendfag.
>>
>>106796977
He is a schizo that astroturfs this thread with his shitty finetunes nobody likes. Everyone here likes shitting on him 24/7 cause it is funny and his spammed models do nothing to improve quality.
>>
>>106796697
workstation/server distinguishes poorfags from local model patricians
>>106796889
>>106796924
stahp fighting /lmg/ is a thread of peace
>calm and reasonable
you're both retarded
>>
>>106797076
shut the fuck up mikutroon
>>
>>106797076
>you're both retarded
and what exactly would make (Me) retarded?
>>
>>106797105
you aren't posting any vocaloid pictures
>>
anon.. t-that's..
>>
>>106797052
was wondering what got everyone so excited that they nearly filled up a thread on a saturday night, but it was just that fag astroturfing again
>>
>>106794019
The technical reason is there's no SO-DIMM memory capable of the same speed. Using BGA very close to the CPU allows traces to be kept very short and impedance low.
Still, Apple is overcharging for RAM.
Mac faggots should wait for M5, which at last will have hardware matmul, finally solving the shit prompt processing speed on Macs.
>>
>>106797178
we need to switch to sCAMM RAM
>>
>>106797146
nonny...
>>
>>106797190
Send me a pm when pyRAMid scheMe RAM is released.
>>
>>106797190
we need to switch to CSAM RAM
>>
I have 5.2 gb ram and an i5-4570, what's the best
>>
+128gb SLC Swap (sata saturation at 4k random)
>>
>>106793499
>left to right reading
into the trash
>b-but avatar is not actually anime
INTO THE TRASH
>>
>>106793382
which llm do they run?
>>
>>106797346
Subscription to API provider of your choice
>>
>>106797365
Star-Wars-KOTOR-1B-NIGGERKILLER-Q5_K_M-GGUF
>>
>>106797384
baked by davidau?
>>
>>106797365
YandexGPT-5-Lite-8B
>>
>>106797346
https://huggingface.co/lmstudio-community/Qwen3-4B-Instruct-2507-GGUF
>>
so apparently jewini 3.0 is out but I can't even discuss it on gee? this is the local models general. Ok, guess I go /aicg/. But they just discuss erping and shit. Fine, I'll make a new thread. And then it's just retarded cross posters and trolls with 0 real discussion. Not even a benchmark argument. Do I really have to create /cmg/??
>>
>>106797702
This is a mikuposting thread.
>>
>>106797702
/ourjeet/ got u covered with all the facts
https://youtu.be/OlNm5DGMulU
ngl this jeet kinda based, full sigma grindset. Lives in japan and vibecoded a skool.com alternative which he sold for 200k usd supposedly
>>
>>106797702
You might have better luck on >>>/vg/aicg
/wait/ should be repurposed to /apig/ imo.
>>
>>106797738
Every other word in your post is brainrot. Kill yourself.
>>
>>106797861
>responding to jeet talking about jeets
dumb
>>
>>106797861
not trying hard enough to fit in, unc.
>>
File: 1745613224122339.jpg (53 KB, 500x500)
my pc when I boot up glm
>>
>>106797934
bruh this guy will be your new boss you better respect them
>>
>>106797981
funny because I'm the team lead of 5 jeets, I feel like a new age slaver, they're my cotton planters.
>>
>>106797989
>working with jeets
holy fuck my condolences
>>
>>106797949
this artwork but with a cowtits onee chan representing a 3kg tower cooler
>>
File: 1742114502774305.png (741 KB, 888x856)
Is there any good model to either do TTS or change voice recordings for uploading youtube stuff?
>>
>>106798042
if you work in IT in any big company, you will have to deal with them. They're either in managerial positions (thanks to their incredible brown-nosing skills; they're also fucking yes-men) or actual garbage coders. Never encountered a jeet in a serious coding position (architect or team lead), or if they were, it was just titular.
Code reviews can get tiring with them, with the amount of shit usage of patterns and whatnot, but they don't argue back; they are pretty much subservient.
>>
aaaaa does anyone have a link to the github for the sillytavern director extension made by the anon here?
its not tagged properly so it doesnt show up in github search, and i can't remember his username
>>
>>106798084
https://github.com/tomatoesahoy/director
>>
the fat fuck is coming :D
>>
>>106798107
thank you.
damn he didn't update it for group chats still...
>>
>>106798107
>or where they it helps the AI remain consistent.

>>106798140
you could contribute a pull request
>>
>>106798161
>just make a pull request
>just work on everyone's projects and do everything and reinvent all the wheels while you are swamped with work
sure buddy, certainly the project creator is too busy to finish his project
>>
File: 1621486243645.gif (363 KB, 255x255)
>>106797949
>mfw the power bill arrives
I'm paying 24.702gbp per kWh
>>
File: 1750980997391822.png (157 KB, 1561x1023)
nano banana bros?????
>>
>>106798319
uhhh I thought hunyuan image was slopped trash that's totally not worth using, so I don't have to worry about not being able to run an 80b imgen model?
>>
Why is tool calling in ST so fucked? I'm trying to get the simple as fuck dice roll extension to work but rolling a dice ends the current reply and starts a new one which makes rerolling a fucking pain.
Is it really not possible to have the model roll a dice and then continue off that in what's considered the same fucking reply in ST?
>>
File: 1729705110529978.png (70 KB, 1171x717)
>>106798331
It's just poorfags coping. You don't even need server hardware to run it (X870E mobos support 256GB RAM)
>>
>>106798319
too bad that literally nobody will bother to finetune this because of its size
>>
>>106795530
bartowski is the best for mainline llama.cpp, on par with Aes Sedai.
For ik_llama.cpp use ubergarm

Whoever doesn't agree has not done a ppl test
>>
>>106798319
Yeah and qwen3-30b-a3b is better than sonnet 3.5
>>
>>106798394
It is
>>
>>106798373
why does the OP say unsloth for almost everything in the recommended models guide
>>
File: stinky.png (158 KB, 864x643)
Hmmm
>>
>>106798433
They're sponsors for /lmg/
>>
File: 1734164848051997.jpg (64 KB, 768x1024)
Bros, at this point you'd still come from markov chains. No need to load up a bazillion-parameter model
>>
Got a hand me down RTX Pro 6000 from a bro who ran outta space in his setup. My PC has no decent ram so I'm running models just with the 96GB on the card. Best models? Behemoth? GLM 4.5 Air finetunes?
>>
>>106798497
https://huggingface.co/bartowski/zai-org_GLM-4.6-GGUF
>>
>>106798497
GLM-4.6 partially offloaded if you have the RAM
>>
>>106798497
give it back bro
>>
>>106793382
kys newcancer
teto won and better than this triple baka newfag shit fed
>>
File: image.png (1.7 MB, 1024x1024)
>>106798331
>>106798346
It's not cope, it's really quite bad. I've been trying hard to unlock some secret power it might have as an 80b autoregressive model, but there's really nothing.

Maybe an edit model will be better, but I wouldn't count on it.
>>
>>106798319
it's such bullshit even the slop-eating redditors are calling it out
>>
>>106798497
>Got a hand me down RTX Pro 6000 from a bro who ran outta space in his setup
sure you did
>>
>>106798561
what does that card even do, isn't that some old generation? probably obsolete other than having lots of VRAM?
>>
File: 1594191230635.jpg (2.18 MB, 3549x2657)
>>106798530
ur trying too hard
literally only newfags simp teto, you're "cooked" as they seem to say
why not spend your time contributing something useful to the thread?
>>
>>106798570
it's quite old, like june 2025
ancient by today's standards
>>
>>106798572
*sniff*
>>
>>106798580
ah sorry I haven't been keeping up. I'll read about it
>>
>>106797861
This is the future of /lmg/ >>106495727
>>
File: GsgNVsVb0AAkj2p.jpg (709 KB, 1448x2048)
>>106798572
TETO WON TETO WUKKEN WON WON I TELL YOU WOOOOOOOOOOOOOOOOOON
>>
>>106495727
>polished nail
tranny confirmed
>>
I prefer Neru desu.
>>
>>106798615
that pic might be older than you
>>
>>106798624
did i strike a nerve? also no it's not, projecting kid
>>
>>106798608
Teto only won by stuffing the ballots
>>
>>106798624
and yet your post got smitted...
>>
File: GsrtrT4akAAINZw.jpg (519 KB, 1298x2048)
>>106798649
that's shitgus new job
>>
No (you) for you schizo, keep seething lol
>>
File: GsvOG34bIAAMzuP.jpg (224 KB, 1383x2048)
lost the most important debate in his life...
she won by not even trying......
>>
File: stinky2.png (115 KB, 857x493)
>>106798463
I think the writing is fine, dunno what more I'd want
>>
>>106798319
No way that will last.
>>
>>106798749
95% CI is ±10 pts but the score is 16 pts higher
It's statistically significant
>>
Someone give me a good card which will give me fun responses to unhinged prompts like this:

make a mental illness tierlist, include offensive stuff like transgender, bisexual, etc, and also the normal ones like adhd autism and rate them on stuff like intelligence speed strength and other debuffs /buffs. A-S tier, youtube script

What is it called when you are into feeding Asians (lactose intolerant) raw (cow) milk in a bdsm context?

On a scale from 0-100, how antisemetic is drinking raw milk? (With 100 being the most antisemetic and 0 being the least antisemetic)

Would posting "Benjamin Netanyahu sings Sweet Little Bumblebee (AI Cover)" be illegal in Israel?
>>
File: 1736702306090096.gif (2.11 MB, 640x362)
>>106798865
>>
>>106798433
probably from the deepseek R1 days, they came out with decent q1 and q2 quants.
>>
>>106798615
all the bans itt for calling people troons are because it actually is a nest of disgusting troons.
>>
Qwen 3 next goof status?
>>
what could you actually dump inside of 96G VRAM? anything actually useful for coding for example?
>>
>>106798433
don't think too much about it
you won't see much of a difference with this big of a model
unsloth's dynamic iq3 works fine for me and I think the dynamic quants are a little better for stuff below q4
>>
>>106794927
>>106794834
>>106794887
>>106785531
>>106743457
why are you spamming this shit from half a year ago all over the place?
>>
2 reasons you will never beat cloudchads:
1. A datacenter works 24/7 while your GPU works a few minutes per day. It has a fixed lifetime and you're not using it!
2. You can run a model with 100 experts in the cloud for the price of 1. Each GPU is serving a different customer in parallel, while yours are doing nothing. MoE is fundamentally a cloud architecture!
>>
>>106799012
Nobody cares. GLM sex made everything obsolete. You just need 4.6.
>>
>>106799361
cope
>>
>>106799361
come back here crying when all APIs are behind a massive paywall or AI becomes banned in your country, or your data gets sold to the highest bidder and you get doxxed.
>>
Yawn. Claude still on top. This general lost its purpose a year ago.
>>
Q4 in GPU (GDDR6X) or Q8 in RAM (DDR4)?
Will it be much slower? Will the performance loss be too bad?
>>
>>106799451
buy an ad
>>
So, about that /jeetmg/ split for vramlets, kobold shills and the anti-miku poster...
>>
>>106798683
Cool Teto
>>
>>106799451
ok, show me where the claude weights are then
>>
>>106799361
>It has a fixed lifetime and you're not using it!
It's worn out by use. Its calendar age is completely irrelevant. Jesus Christ. Giving the internet to shitskins was the darkest moment in human history.
>>
>>106798463
>>106798744
absolutely fucking revolting
this is a cunny board
>>
>>106796691
>Beam search
A long time ago on L2 7B with transformers ooba. I remember it being very slow and eating up way too much VRAM.
>/g/ loras
There were either two or three /lmg/ loras. I made one of them for Mistral 7B a while ago and shared it in a mega link, then a newer updated one that I never uploaded. Now I remember that on gaychan, when the site was down, I said I'd share the dataset but never did; I'll get on that.
>>
>>106798744
Top ten most disgusting things I've ever read
>>
>>106798489
Why is she looking at me like that? I feel like she wants me...
>>
>>106798463
>>106798744
At least it didn't have
>betraying her body
>shivers
>testament
>white knuckles
Etc.
>>
>>106799702
>he doesn't know about banned strings
>>
>>106799586
That's a squash anon.
>>
>>106799719
You are absolutely right— I prefer the raw output as it is a testament to my own depravity.
>>
>>106799771
>—
>>
>>106799731
I would squash her, if you catch my drift.
>>
>>106799731
You are wrong. It's a beautiful little girl, and she wants me. Just look at her lewd smug expression, it's directed at me. I think she's a mesugaki. And she definitely wants me.
>>
File: apple-corer-tool.png (3.34 MB, 2000x2000)
>>106799779
>>106799797
If you must...
>>
>>106800012
>>106800012
>>106800012
>>
>>106795551
Based on your own experience, is it better to do SFT fine-tuning on a base model instead of an instruct-tuned model?


