/g/ - /lmg/ - Local Models General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/lmg/ - Local Models General 05/26/26(Tue)12:01:35 No.108911101

File: 2026-05-16_052315_seed5_00001_.png (1.62 MB, 1536x864)

/lmg/ - Local Models General Anonymous 05/26/26(Tue)12:01:35 No.108911101

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108903381 & >>108896570

►News
>(05/21) Hy-MT2 “fast-thinking” multilingual translation models released: https://hf.co/collections/tencent/hy-mt2
>(05/20) Cohere releases Command A+ 218B-A25B: https://cohere.com/blog/command-a-plus
>(05/16) llama + spec: MTP Support #22673 merged: https://github.com/ggml-org/llama.cpp/pull/22673
>(05/08) KSA-4B-base released: https://hf.co/OpenOneRec/KSA-4B-base
>(05/07) model: Add Mimo v2.5 model support (#22493) merged: https://github.com/ggml-org/llama.cpp/pull/22493

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
05/26/26(Tue)12:01:57 No.108911107

Anonymous 05/26/26(Tue)12:01:57 No.108911107

File: tetpoint.png (413 KB, 766x980)

413 KB PNG

►Recent Highlights from the Previous Thread: >>108903381

--Debating quantization precision vs full weights and hardware compatibility:
>108905193 >108905216 >108905246 >108906139 >108906166 >108906203 >108906281 >108907040 >108906364 >108906428 >108906493 >108906571 >108906619 >108908361 >108906276 >108907069
--Refining Gemma jinja templates for thought-channel and tool-call handling:
>108908057 >108908405 >108908612 >108908656
--SillyTavern and llama.cpp token fusion causing newline display bugs:
>108907352 >108907711 >108908073 >108908227 >108908475 >108908994 >108909873
--Comparing Gemma 4's programming capabilities and troubleshooting its VRAM usage:
>108904621 >108904636 >108904664 >108904716 >108907121
--Performance issues and decode errors when using beellama dflash:
>108903454 >108903509 >108903660 >108903888 >108903545
--Anon creates screen-monitoring wrapper for real-time AI commentary:
>108908435 >108908487 >108908555 >108908574 >108909056 >108908838 >108908971
--Performance and software optimization hurdles for Intel Arc Pro GPUs:
>108908365 >108908964 >108909068 >108909104
--RTX 3090 performance and quantization quality metrics:
>108903820 >108904833 >108904863 >108904907 >108905347 >108905607 >108905650 >108904790
--Jailbreaking Llama 3.1 8B and subsequent compliance audit results:
>108904047 >108906555 >108906592
--Building custom MTG engine for LLM roleplay and gameplay:
>108905982 >108906679 >108908021
--Microsoft and Uber scaling back AI tools due to unsustainable costs:
>108904932 >108904961 >108905021 >108905109 >108905123 >108905125 >108908561
--Gemma 4 and Claude agentic playthroughs of Pokemon Red:
>108905722 >108905812 >108908045
--Logs:
>108903444 >108903509 >108903749 >108906555 >108907352 >108908073 >108908838 >108909056 >108910966
--Miku (free space):
>108903613 >108903821 >108903829 >108905669 >108908561

►Recent Highlight Posts from the Previous Thread: >>108903384

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
05/26/26(Tue)12:03:40 No.108911125

Anonymous 05/26/26(Tue)12:03:40 No.108911125

>>108911107
>no (You)s today
I need to step my game up.

Anonymous
05/26/26(Tue)12:05:39 No.108911144

Anonymous 05/26/26(Tue)12:05:39 No.108911144

Does anyone have a good Qwen thinking model jailbreak? Banning thinking seems to work pretty good, but I'd like to be able to leverage it sometimes.

Anonymous
05/26/26(Tue)12:06:38 No.108911151

Anonymous 05/26/26(Tue)12:06:38 No.108911151

https://www.youtube.com/watch?v=p-v1Hn_aZHA

Anonymous
05/26/26(Tue)12:09:23 No.108911164

Anonymous 05/26/26(Tue)12:09:23 No.108911164

minicpmballz

Anonymous
05/26/26(Tue)12:13:05 No.108911190

Anonymous 05/26/26(Tue)12:13:05 No.108911190

File: file.png (7 KB, 325x132)

7 KB PNG

>>108910513
I tried to process 1M tokens.

Anonymous
05/26/26(Tue)12:17:00 No.108911218

Anonymous 05/26/26(Tue)12:17:00 No.108911218

I don't like teto because she's always asking for anal and buttholes are gross no matter what the porn jews try to tell you

Anonymous
05/26/26(Tue)12:27:03 No.108911274

Anonymous 05/26/26(Tue)12:27:03 No.108911274

>>108911190
>2 6000 blackwells
And I thought I was big pimpin with a 3090 and 3080ti

Anonymous
05/26/26(Tue)12:28:09 No.108911280

Anonymous 05/26/26(Tue)12:28:09 No.108911280

>>108882020
sorry anon, upon reflection, my imbibing of leaded gasoline has damaged by reading comprehension beyond what I thought.
Thinking on it more, I've decided to build out an MCP server to let you do exactly that, so you can hook into whatever front-end you'd like. (I already had it built, but am now going to make it standalone)

Anonymous
05/26/26(Tue)12:28:32 No.108911283

Anonymous 05/26/26(Tue)12:28:32 No.108911283

File: image.png (1.03 MB, 3840x2160)

1.03 MB PNG

I'm slopping out an MTG engine too using antigravity+gemini 3.5

Anonymous
05/26/26(Tue)12:29:29 No.108911285

Anonymous 05/26/26(Tue)12:29:29 No.108911285

>>108911283
Why does all vibecoded webshit look the same?

Anonymous
05/26/26(Tue)12:29:43 No.108911286

Anonymous 05/26/26(Tue)12:29:43 No.108911286

>>108911283
>using flash for coding
Just use Gemma

Anonymous
05/26/26(Tue)12:43:11 No.108911374

Anonymous 05/26/26(Tue)12:43:11 No.108911374

>>108911283
local?

Anonymous
05/26/26(Tue)12:44:19 No.108911380

Anonymous 05/26/26(Tue)12:44:19 No.108911380

>>108911374
google has a data center just across the street from me

Anonymous
05/26/26(Tue)12:47:23 No.108911400

Anonymous 05/26/26(Tue)12:47:23 No.108911400

>>108911280
Take a look at https://github.com/oraios/serena.
It's an MCP server that exposes symbol lookup and editing tools but also includes a lot of unrelated tools like mode switching. You could either just use that or look at how they implemented it.

Anonymous
05/26/26(Tue)12:51:55 No.108911425

Anonymous 05/26/26(Tue)12:51:55 No.108911425

>need to migrate my shit to Debian
>the thought of not having an LLM running for a few hours is enough to make me sit my ass in a half-assed ubuntu server install for 2 weeks now
fuck

Anonymous
05/26/26(Tue)12:58:23 No.108911447

Anonymous 05/26/26(Tue)12:58:23 No.108911447

BF16 Ganesh 4 bloody sirs.

Anonymous
05/26/26(Tue)13:00:22 No.108911457

Anonymous 05/26/26(Tue)13:00:22 No.108911457

>>108911286
If it's for coding the best local model is qwen 27b

Anonymous
05/26/26(Tue)13:01:55 No.108911464

Anonymous 05/26/26(Tue)13:01:55 No.108911464

>>108911285
Because they're almost always using pre-canned UIs like Gradio and what not, which AIs also often have a ton of training on anyways so its more reliable.

Anonymous
05/26/26(Tue)13:03:23 No.108911472

Anonymous 05/26/26(Tue)13:03:23 No.108911472

File: 1754285028970154.png (1.52 MB, 1386x2047)

1.52 MB PNG

reposting this because I was too busy jerking off to sinisistar to see that new was posted
also why the fuck is ST so shit holy fuck just opening a lorebook page with more than 50 entries lags
>>108910783
youkai women belong to human men
death to evil shrine maidens

Anonymous
05/26/26(Tue)13:22:53 No.108911580

Anonymous 05/26/26(Tue)13:22:53 No.108911580

>>108910513
>>108911190
I am happily running Deepseek v4 Flash, original quants, 512k context, at up to 38 t/g / 1100 pp, 300 W max , and I have vram to spare for full context fp8 Gemma for vision or comfy for image gen at 40% the cost of your GPUs.

Guess the setup.

Anonymous
05/26/26(Tue)13:29:42 No.108911621

Anonymous 05/26/26(Tue)13:29:42 No.108911621

>>108911580
huawei TPUs in an amiga 4000 connected over serial PPP links

Anonymous
05/26/26(Tue)13:29:58 No.108911622

Anonymous 05/26/26(Tue)13:29:58 No.108911622

>>108911580
>40% the cost
And 40% the speed. 40t/s is barely usable for claude code.

Anonymous
05/26/26(Tue)13:31:37 No.108911631

Anonymous 05/26/26(Tue)13:31:37 No.108911631

>>108911580
intel meme cards?

Anonymous
05/26/26(Tue)13:34:04 No.108911655

Anonymous 05/26/26(Tue)13:34:04 No.108911655

File: IMG20260514162309.jpg (1.96 MB, 4096x3072)

1.96 MB JPG

>>108911622
Less than 40% speed for sure. But at 7000$ compared to 19000$ in 2026 I am not complaining.

Anonymous
05/26/26(Tue)13:40:50 No.108911693

Anonymous 05/26/26(Tue)13:40:50 No.108911693

>>108911622
40t/s is plenty usable without thinking.

Anonymous
05/26/26(Tue)13:41:36 No.108911700

Anonymous 05/26/26(Tue)13:41:36 No.108911700

>>108911655
oh its the guy that spent 7000$ on this again..
what speeds are you getting generating images heh
and what about videos?

Anonymous
05/26/26(Tue)13:51:36 No.108911759

Anonymous 05/26/26(Tue)13:51:36 No.108911759

>>108911655
What a waste of money

Anonymous
05/26/26(Tue)13:53:43 No.108911775

Anonymous 05/26/26(Tue)13:53:43 No.108911775

>>108911622
who the fuck gives a shit about claude code?

Anonymous
05/26/26(Tue)13:54:22 No.108911780

Anonymous 05/26/26(Tue)13:54:22 No.108911780

>>108911775
s/claude code/fotm harness/

Anonymous
05/26/26(Tue)13:55:00 No.108911786

Anonymous 05/26/26(Tue)13:55:00 No.108911786

>>108911780
who the fuck gives a shit about anything except writing smut?

Anonymous
05/26/26(Tue)13:55:06 No.108911787

Anonymous 05/26/26(Tue)13:55:06 No.108911787

>>108911655
If you could buy 4 of these at that price and link them all together, it would be worth it.

Anonymous
05/26/26(Tue)13:56:02 No.108911795

Anonymous 05/26/26(Tue)13:56:02 No.108911795

>>108911786
not everyone running local models is 15 anon

Anonymous
05/26/26(Tue)13:56:05 No.108911796

Anonymous 05/26/26(Tue)13:56:05 No.108911796

>>108911700
>>108911759
Show us how you are running DS4 then.

Anonymous
05/26/26(Tue)13:56:18 No.108911798

Anonymous 05/26/26(Tue)13:56:18 No.108911798

>>108911786
Maybe you too could afford a deepseek setup if you did.

Anonymous
05/26/26(Tue)13:57:38 No.108911806

Anonymous 05/26/26(Tue)13:57:38 No.108911806

>>108911795
Get your testosterone levels checked if you can't get it up after the age of 15

Anonymous
05/26/26(Tue)13:58:21 No.108911811

Anonymous 05/26/26(Tue)13:58:21 No.108911811

>>108911796
Tell NVIDIA to stop fucking around and make a 128+ gb rtx pro. Ain't nobody got time for snail shit don't charge enterprise prices and bitch out on the vram

Anonymous
05/26/26(Tue)13:59:20 No.108911821

Anonymous 05/26/26(Tue)13:59:20 No.108911821

>>108911811
vram cucking is probably the one worst thing that happened to consumers lol

Anonymous
05/26/26(Tue)14:00:31 No.108911828

Anonymous 05/26/26(Tue)14:00:31 No.108911828

Which local robot assistant is going to be the meta in the next few months to use with a VLA over LAN?

Anonymous
05/26/26(Tue)14:04:22 No.108911859

Anonymous 05/26/26(Tue)14:04:22 No.108911859

>>108911472
sex the shrine maidens
sex the youkai women

Anonymous
05/26/26(Tue)14:05:57 No.108911873

Anonymous 05/26/26(Tue)14:05:57 No.108911873

File: 1760606014998365.jpg (316 KB, 1200x900)

316 KB JPG

>>108911859
You can keep the shrine weirdos, I'm going after the prime real estate

Anonymous
05/26/26(Tue)14:08:21 No.108911889

Anonymous 05/26/26(Tue)14:08:21 No.108911889

>>108911796
IQ1_M on a 2022 rig that cost me 1600$ :)
but in reality im runnin gemma 26b

Anonymous
05/26/26(Tue)14:12:39 No.108911920

Anonymous 05/26/26(Tue)14:12:39 No.108911920

File: Screenshot 2026-05-26 at (...).png (3.11 MB, 2614x1554)

3.11 MB PNG

>>108911700
Like, 25 seconds for Anima at 832x1240, 40 steps on a single Spark. Haven't tried video.

But having 256 GB unified CUDA VRAM to throw things at is fun. Deepseek 4 Flash vibe coded this tool in 15 minutes with a bit of guidance.

Anonymous
05/26/26(Tue)14:13:07 No.108911924

Anonymous 05/26/26(Tue)14:13:07 No.108911924

>>108911873
Why are her wings coming out of her ass?

Anonymous
05/26/26(Tue)14:14:08 No.108911931

Anonymous 05/26/26(Tue)14:14:08 No.108911931

>>108911924
ass wings are hip and cool these days

Anonymous
05/26/26(Tue)14:15:47 No.108911942

Anonymous 05/26/26(Tue)14:15:47 No.108911942

>>108911821
With recent news they will be coming back to us hat and hand and I'm going to need a full on rim job from the green rat before I show them any interest for the next 5 years

Anonymous
05/26/26(Tue)14:18:00 No.108911954

Anonymous 05/26/26(Tue)14:18:00 No.108911954

>>108911796
https://github.com/vllm-project/vllm/pull/41834
The same VLLM PR that our resident 2x RTX PRO 6000 haver brags about also run on 2x Spark

Anonymous
05/26/26(Tue)14:18:05 No.108911955

Anonymous 05/26/26(Tue)14:18:05 No.108911955

>>108911920
>25 seconds for Anima at 832x1240, 40 steps on a single Spark.
not bad iguess..
>But having 256 GB unified CUDA VRAM to throw things at is fun. Deepseek 4 Flash vibe coded this tool in 15 minutes with a bit of guidance.
well if you're happy with it.. all the power to you, have you tried glm 4.6/4.7?

Anonymous
05/26/26(Tue)14:22:05 No.108911980

Anonymous 05/26/26(Tue)14:22:05 No.108911980

>>108911942
i doubt
they will try to dripfeed you just the right amount of 'almost there' until some based chink decides to give you 1024bit 512GB ram inference chip for a couple Ks

Anonymous
05/26/26(Tue)14:22:35 No.108911986

Anonymous 05/26/26(Tue)14:22:35 No.108911986

I’d just like to interject for a moment. What you’re referring to as AI, is in fact, AI-Stack/LLM, or as I’ve recently taken to calling it, AI Stack plus Weights. LLM is not an intelligence unto itself, but rather another component of a fully functioning AI-Stack system made useful by the training corpus, RLHF pipelines and vital Python dependencies comprising a full agent as defined by benchmarks.

Many computer users run a modified version of the AI-Stack system every day, without realizing it. Through a peculiar turn of events, the version of the AI-Stack which is widely used today is often called AI, and many of its users are not aware that it is basically the AI-Stack system, developed by the Foundation Researchers.

There really is an LLM, and these people are using it, but it is just a part of the system they use. The LLM is the weights: the tensors in the system that allocate the GPU’s resources to the other tokens that you generate. The LLM is an essential part of an artificial intelligence, but useless by itself; it can only function in the context of a complete inference stack. The LLM is normally used in combination with the vector database and the system prompt: the whole system is basically RAG with an LLM added, or RAG/LLM. All the so-called AI assistants are really distributions of AI-Stack/LLM!

Anonymous
05/26/26(Tue)14:22:45 No.108911987

Anonymous 05/26/26(Tue)14:22:45 No.108911987

>>108911955
I am definitely going to try GLM 4.6/4.7, but if you think llama.cpp drama is bad, vLLM is so much worse. You literally cannot run AWQ quants that were working back in December with current builds, output is garbled.

I will get to it in due time. First, Mimo 2.5 omni in NVFP4.

Anonymous
05/26/26(Tue)14:24:55 No.108912004

Anonymous 05/26/26(Tue)14:24:55 No.108912004

>>108911980
>until some based chink decides to give you 1024bit 512GB ram inference chip for a couple Ks
Don't hold your breath. Been hoping for that for 3 years now. It seemed back then like it was inevitable any day but it's no closer now than it was back then.

Anonymous
05/26/26(Tue)14:26:31 No.108912019

Anonymous 05/26/26(Tue)14:26:31 No.108912019

>>108911811
>>108911821
As model sizes increase, the benefit of the VRAM wanes. They realistically need faster HBM before they can engineer us a good high vram card at a good price that doesn't just paint us in a corner. Also, the ratio of VRAM to tensor cores would be fucked.

Anonymous
05/26/26(Tue)14:30:05 No.108912043

Anonymous 05/26/26(Tue)14:30:05 No.108912043

>>108912004
yeah i know
but seeing things like cix8180 coupled with ~128G ram being sold as 'personal ai supercomputer puck' by grifters are honestly not a bad signal besides those stuff being a total dogshit
>>108912019
LLMs are mostly memory bandwidth bound and by a lot
midrange shit card matched with fucktons of vram with enough bandwidth will still outperform cpu ram cope nearly everytime

Anonymous
05/26/26(Tue)14:43:04 No.108912138

Anonymous 05/26/26(Tue)14:43:04 No.108912138

>>108912043
>LLMs are mostly memory bandwidth bound and by a lot
>midrange shit card matched with fucktons of vram with enough bandwidth will still outperform cpu ram cope nearly everytime
yes, that's the point.
if you scaled a 3090 with some magical 1TB VRAM kit, you'd still only run a 1T model at q8 at like 0.5t/s.
This shit isn't magic, and even the big pro GPUs are built with less VRAM than you could theoretically put on one for that reason.
They run 8+ of them in parallel for the aggregate BW.

Anonymous
05/26/26(Tue)14:47:54 No.108912161

Anonymous 05/26/26(Tue)14:47:54 No.108912161

>>108912138
>if you scaled a 3090 with some magical 1TB VRAM kit, you'd still only run a 1T model at q8 at like 0.5t/s.
no? cpu setups get more than that so idk what you're smoking

Anonymous
05/26/26(Tue)14:57:23 No.108912229

Anonymous 05/26/26(Tue)14:57:23 No.108912229

>>108912161
>no? cpu setups get more than that so idk what you're smoking
Its napkin math, but it should be order-of-magnitude correct for a dense 1T.
Run the numbers yourself if you think they're wrong.

Anonymous
05/26/26(Tue)15:02:57 No.108912255

Anonymous 05/26/26(Tue)15:02:57 No.108912255

>>108912229
>a dense 1T.
Why are you running math for imaginary models that will never be made?

Anonymous
05/26/26(Tue)15:05:37 No.108912277

Anonymous 05/26/26(Tue)15:05:37 No.108912277

>>108912255
I'm sorry sir may I interest to you the sota of all the model? https://huggingface.co/RichardErkhov/FATLLAMA-1.7T-Instruct

Anonymous
05/26/26(Tue)15:06:44 No.108912284

Anonymous 05/26/26(Tue)15:06:44 No.108912284

>>108912229
1 TB dense has no relevance to this discussion. Any large model in 2026 is using some form of MoE. A 3090 with 1 TB of VRAM would run Mimo Pro, Deepseek Pro, Kimi or GLM very fast. None of these need more than 40 GB/s per token, resulting in 20+ token/second on this hypothetical 3090.

>>108911380
Underrated post

Anonymous
05/26/26(Tue)15:09:29 No.108912301

Anonymous 05/26/26(Tue)15:09:29 No.108912301

>>108912284
Why do you argue so hard?

Anonymous
05/26/26(Tue)15:11:27 No.108912317

Anonymous 05/26/26(Tue)15:11:27 No.108912317

>>108912284
>None of these need more than 40 GB/s per token, resulting in 20+ token/second on this hypothetical 3090.
Bingo. That's about 30% faster than what CPUmaxxers are getting with hardware that does exist.
Prefill would be fucking lighting fast tho. If I could buy the 3090. How much would such a thing cost in the current price-differentiated market? I'm ballparking about $40k?
TANSTAAFL

Anonymous
05/26/26(Tue)15:11:35 No.108912318

Anonymous 05/26/26(Tue)15:11:35 No.108912318

>>108911986
For that copypasta to work you have to be consistent with how you use terms like AI and LLM.

Anonymous
05/26/26(Tue)15:14:51 No.108912343

Anonymous 05/26/26(Tue)15:14:51 No.108912343

File: muchi muchi.jpg (213 KB, 832x1216)

213 KB JPG

Anonymous
05/26/26(Tue)15:16:54 No.108912362

Anonymous 05/26/26(Tue)15:16:54 No.108912362

>>108912343
プリンおいちい!

Anonymous
05/26/26(Tue)15:23:29 No.108912415

Anonymous 05/26/26(Tue)15:23:29 No.108912415

>>108912343
Do I have to pay extra for Teto's saliva on the bite marks?

Anonymous
05/26/26(Tue)15:27:11 No.108912444

Anonymous 05/26/26(Tue)15:27:11 No.108912444

File: 39.png (350 KB, 768x1024)

350 KB PNG

>>108890783
Thank you, qwentts anon

Anonymous
05/26/26(Tue)15:28:34 No.108912456

Anonymous 05/26/26(Tue)15:28:34 No.108912456

>>108912415
I don't think you understand the business model anon. The business is Teto Eats. As in Teto Eats.
You don't eat, Teto Eats.
Please enjoy your order.

Anonymous
05/26/26(Tue)15:29:14 No.108912461

Anonymous 05/26/26(Tue)15:29:14 No.108912461

>>108912444
Is that her age on the shirt?

Anonymous
05/26/26(Tue)15:29:52 No.108912464

Anonymous 05/26/26(Tue)15:29:52 No.108912464

>>108912461
age of potential suitors

Anonymous
05/26/26(Tue)15:30:31 No.108912468

Anonymous 05/26/26(Tue)15:30:31 No.108912468

File: 1777835705369.png (302 KB, 377x434)

302 KB PNG

>>108912456
How do I invest?

Anonymous
05/26/26(Tue)15:32:23 No.108912478

Anonymous 05/26/26(Tue)15:32:23 No.108912478

new teto song came out
inspired me to make a teto card

Anonymous
05/26/26(Tue)15:33:14 No.108912485

Anonymous 05/26/26(Tue)15:33:14 No.108912485

>>108912461
in binary

Anonymous
05/26/26(Tue)15:35:44 No.108912502

Anonymous 05/26/26(Tue)15:35:44 No.108912502

>>108912468
Funding is not being sought at this time.

As the sole employee and breadwinner, Teto is entirely sufficient at running this operation and scaling isn't yet possible without diluting the brand.
Local Tetos do not coordinate under one umbrella corp and have been found to be entirely unable to engage in teamwork and cooperation. As such, all attempts to scale, thus far, have resulted in profit suppression.

however if you just so happened to innovate with new snacks or refreshments a brand synergy could be in the cards.

Anonymous
05/26/26(Tue)15:45:24 No.108912560

Anonymous 05/26/26(Tue)15:45:24 No.108912560

>>108912485
011 is octal.

Anonymous
05/26/26(Tue)15:49:09 No.108912576

Anonymous 05/26/26(Tue)15:49:09 No.108912576

>>108912563
or just wait 10 years for things to get cheaper

Anonymous
05/26/26(Tue)15:50:13 No.108912580

Anonymous 05/26/26(Tue)15:50:13 No.108912580

>>108912560
10 is base 10, but 10 is base 10.

Anonymous
05/26/26(Tue)15:53:23 No.108912596

Anonymous 05/26/26(Tue)15:53:23 No.108912596

>>108912580
10 is base 10, 0b10 is base 0b10, 010 is base 010, and 0x10 is base 0x10.

Anonymous
05/26/26(Tue)15:54:43 No.108912600

Anonymous 05/26/26(Tue)15:54:43 No.108912600

>>108912580
you should make the radix less ambiguous it's extremely confusing

Anonymous
05/26/26(Tue)15:58:25 No.108912622

Anonymous 05/26/26(Tue)15:58:25 No.108912622

>>108912596
Yes. 10 is base 10, and 10 is base 10. Same for 10 being base 10. And all of those are different to base 10, which is dec36, of course.
>>108912600
Looks fine to me.

Anonymous
05/26/26(Tue)15:59:12 No.108912629

Anonymous 05/26/26(Tue)15:59:12 No.108912629

for me it's base

Anonymous
05/26/26(Tue)15:59:47 No.108912637

Anonymous 05/26/26(Tue)15:59:47 No.108912637

Best femboy personality for gemma based agent? Asking for a gay friend

Anonymous
05/26/26(Tue)16:00:21 No.108912641

Anonymous 05/26/26(Tue)16:00:21 No.108912641

>>108912637
Nerdy catboy arguing on the internet about base 10.

Anonymous
05/26/26(Tue)16:00:48 No.108912645

Anonymous 05/26/26(Tue)16:00:48 No.108912645

>>108912637
(You), followed by (Me)

Anonymous
05/26/26(Tue)16:01:14 No.108912650

Anonymous 05/26/26(Tue)16:01:14 No.108912650

>>108912629
based on what?

Anonymous
05/26/26(Tue)16:01:50 No.108912655

Anonymous 05/26/26(Tue)16:01:50 No.108912655

>>108912650
10

Anonymous
05/26/26(Tue)16:10:46 No.108912709

Anonymous 05/26/26(Tue)16:10:46 No.108912709

Has anyone found the temp/minp coherence band for each model? Seems like useful info to have if you want to maximize creativity at just the right amount of esoteric knowledge schitzo ranting.
There should be a system of relative presets based on this data like "Fox Mulder" or "Terry Davis"

Anonymous
05/26/26(Tue)16:11:42 No.108912717

Anonymous 05/26/26(Tue)16:11:42 No.108912717

>>108912277
it’s retarded. slop retarded. beyond retarded

Anonymous
05/26/26(Tue)16:17:21 No.108912764

Anonymous 05/26/26(Tue)16:17:21 No.108912764

>>108912637
>Brooo, this new coin is totally not a scam, I swear, if I don't double your money in a week, I'll put on a wig and suck your dick!
>(1 week later)
>*shuffles around awkwardly, trying to get used to the feel of the long blonde hair wig on his head* Dude, it's too far, I know, I swore, but you don't really expect me to suck your dick, right? *laughs nervously, hoping you will just laugh it off too* You aren't some faggot or something? I mean… technically it was me who said that in the first place, but come on, I wasn't, like, serious, man! You don't really expect me to actually go through with that bet, right?

Anonymous
05/26/26(Tue)16:25:49 No.108912833

Anonymous 05/26/26(Tue)16:25:49 No.108912833

do models know tasane keto's personality?

Anonymous
05/26/26(Tue)16:26:35 No.108912842

Anonymous 05/26/26(Tue)16:26:35 No.108912842

>>108912833
gemmer probably does, it knows quite a bit of niche shit
qwen probably doesn't know who she is

Anonymous
05/26/26(Tue)16:27:14 No.108912850

Anonymous 05/26/26(Tue)16:27:14 No.108912850

>>108912833
she has a personality?

Anonymous
05/26/26(Tue)16:27:50 No.108912855

Anonymous 05/26/26(Tue)16:27:50 No.108912855

File: 1765155297960603.jpg (806 KB, 2048x2048)

806 KB JPG

>>108912842
>my wife
>niche
it's tetover
>>108912850
yes, pic related
she loves fishing

Anonymous
05/26/26(Tue)16:29:45 No.108912869

Anonymous 05/26/26(Tue)16:29:45 No.108912869

>>108912850
being fat

Anonymous
05/26/26(Tue)16:30:04 No.108912871

Anonymous 05/26/26(Tue)16:30:04 No.108912871

>>108912637
just ablate everything 4 links deep from "man"

Anonymous
05/26/26(Tue)16:30:30 No.108912874

Anonymous 05/26/26(Tue)16:30:30 No.108912874

>>108912869
chimeras can't get fat you newfag

Anonymous
05/26/26(Tue)16:30:42 No.108912876

Anonymous 05/26/26(Tue)16:30:42 No.108912876

>>108912855
i mean no disrespect of course, she's a fine wife, but objectively less widely known than others.

Anonymous
05/26/26(Tue)16:30:42 No.108912877

Anonymous 05/26/26(Tue)16:30:42 No.108912877

>>108912850
She really doesn't.
>>108912833
Vocaloids don't have a personality, they're just a character drawing slapped onto a voice pack.

Anonymous
05/26/26(Tue)16:32:10 No.108912886

Anonymous 05/26/26(Tue)16:32:10 No.108912886

File: Teto_Vs_Fato.png (2.72 MB, 2048x2800)

2.72 MB PNG

>>108912869
Nice try.

Anonymous
05/26/26(Tue)16:32:15 No.108912888

Anonymous 05/26/26(Tue)16:32:15 No.108912888

>>108912877
>Vocaloids don't have a personality, they're just a character drawing slapped onto a voice pack.
You're no fun.

Anonymous
05/26/26(Tue)16:33:11 No.108912893

Anonymous 05/26/26(Tue)16:33:11 No.108912893

>>108912877
no way, Kasane Teto is a famous Latin American dictator and part-time scientist

Anonymous
05/26/26(Tue)16:36:39 No.108912908

Anonymous 05/26/26(Tue)16:36:39 No.108912908

>>108911101
i never got into local AI,
Is there any point in using it if i have a 9070xt on my main arch pc and a 3060ti on proxmox?

My ai usecase is basically google/assistant, i use free tier gemini/claude to asks question and never take anything it says for true, but i use it as a gauge on what to look up.

Anonymous
05/26/26(Tue)16:36:39 No.108912909

Anonymous 05/26/26(Tue)16:36:39 No.108912909

>>108912893
I thought she was related to that Yugoslavian

Anonymous
05/26/26(Tue)16:38:39 No.108912922

Anonymous 05/26/26(Tue)16:38:39 No.108912922

is qwen mtp broken with tensor split or parallel? takes around 3000mb extra on all 8 cards to do mtp for 409600 ctx, 2 parallel.

Anonymous
05/26/26(Tue)16:43:13 No.108912962

Anonymous 05/26/26(Tue)16:43:13 No.108912962

>>108912908
no, not really. need at least 24GB of VRAM to be worthwhile.

Anonymous
05/26/26(Tue)16:45:12 No.108912986

Anonymous 05/26/26(Tue)16:45:12 No.108912986

File: 1438992869171.png (128 KB, 581x443)

128 KB PNG

>>108912362

Anonymous
05/26/26(Tue)16:48:28 No.108913007

Anonymous 05/26/26(Tue)16:48:28 No.108913007

I was stuck in a fridge for 2 months. I take it new deepseek is merged to llamacpp? How is the performance?

Anonymous
05/26/26(Tue)16:48:52 No.108913009

Anonymous 05/26/26(Tue)16:48:52 No.108913009

>>108912908
Yes, if you're just getting an idea of what to look up then it's fine. You can stuff the important layers of a moe model like gemma 26b in the vram that you have and get quick results.

Anonymous
05/26/26(Tue)16:49:07 No.108913012

Anonymous 05/26/26(Tue)16:49:07 No.108913012

>>108913007
>I take it new deepseek is merged to llamacpp?
Over ggerganov's dead body.

Anonymous
05/26/26(Tue)16:50:02 No.108913023

Anonymous 05/26/26(Tue)16:50:02 No.108913023

File: file.png (419 KB, 1280x720)

419 KB PNG

>>108913012
FUCK CHINA

Anonymous
05/26/26(Tue)16:51:13 No.108913029

Anonymous 05/26/26(Tue)16:51:13 No.108913029

File: onlyusemeblade-ip2.gif (752 KB, 220x221)

752 KB GIF

>never gonna merge you up

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.

Janitor applications are now open. Apply here!