/g/ - Technology


File: file.png (1.17 MB, 1280x1280)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107776854 & >>107768242

►News
>(01/04) llama.cpp merged "sampling: add support for backend sampling" (#17004): https://github.com/ggml-org/llama.cpp/pull/17004
>(12/31) HyperCLOVA X SEED 8B Omni released: https://hf.co/naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B
>(12/31) IQuest-Coder-V1 released with loop architecture: https://hf.co/collections/IQuestLab/iquest-coder
>(12/31) Korean A.X K1 519B-A33B released: https://hf.co/skt/A.X-K1
>(12/31) Korean VAETKI-112B-A10B released: https://hf.co/NC-AI-consortium-VAETKI/VAETKI

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
File: tetomiku.png (408 KB, 1024x1024)
►Recent Highlights from the Previous Thread: >>107776854

--DIY alternatives to Razer's holographic AI chatbot:
>107786892 >107786930 >107788130 >107786960 >107786970 >107787677 >107788023 >107788049 >107788059 >107788104 >107788098
--Dual-GPU motherboard compatibility and physical layout challenges:
>107786512 >107786637 >107786669 >107786689 >107786726 >107786792 >107786824 >107786844 >107786953 >107786995 >107787046 >107787065 >107787098 >107787135 >107787184 >107786732
--BOS token duplication issues in Mistral model template handling:
>107784321 >107784529 >107784607 >107784728 >107784813 >107787006 >107784851 >107785028 >107785062
--Assessing NVIDIA P40 viability for modern AI workloads:
>107782732 >107782903 >107782931 >107783018 >107783078 >107783348 >107783579
--Surprise at 1.2B model trained on 28T tokens:
>107777871 >107785261
--DeepSeek V3.2 model release with removed sparse attention lightning indexer tensors and NVIDIA AI tool updates:
>107781224 >107781265
--Roleplay-focused imatrix file selection and context size optimization:
>107778030 >107778135 >107778205 >107778310
--Framework Desktop 128GB vs gaming PC for AI work: performance and cost considerations:
>107781756 >107782025
--Grok 2 outperforms GLM 4.6 in roleplay despite slower speed:
>107781444 >107781478 >107781668
--LiquidAI/LFM2-2.6B-Transcript for chat log summarization:
>107786794
--System prompt configuration issues with GLM 4.6 Q2-M in chat completion:
>107785693 >107785744 >107785775 >107785803 >107785838 >107785901 >107786145
--Persistent chat backups and AI content detection in SillyTavern:
>107782041 >107782201 >107783114 >107783122 >107783531
--croco.cpp fork enabling ubergarm quant support in KoboldAI:
>107777069 >107777118
--Miku (free space):
>107778440 >107782307 >107782467 >107784321 >107784725 >107784728 >107787119 >107787153 >107787969 >107788627

►Recent Highlight Posts from the Previous Thread: >>107776863

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
Is there anything better than openwebui for just normal agent chats? not for coom or rp.
>>
File: 1757736340771382.png (1.3 MB, 750x750)
>>107790430
Are they measuring her pants straps?
>>
>>107790430
three inches really isn't that thick
>>
>>107790430
What's Miku doing to Teto?
>>
I hear a faint whine in my headphones during prompt processing (and not during generation)
this isn't normal, is it?
any idea why it happens?
I run audio through USB to a DAC, then to headphones
>>
>>107790797
Are you sure it's just in your headphones and not coil whine from the PC itself? Otherwise plug directly into audio jack or mobo USB to see if your DAC is shit.
>>
>>107790597
no
>>
>>107790797
Electromagnetic interference, the extra power draw is increasing the field and interfering with some part of the motherboard to usb to headphone pipeline. Just try a different usb port or plugging the headphones directly into the motherboard and see if it goes away
>>
File: usb-pinout.jpg (22 KB, 500x328)
>>107790797
Electrical noise from ground loop.
It is normal if you are not using a USB isolator, or if your dac/amp isn't ground lifted.
If it bothers you, look into a Topping HS01. If you want to try a DIY fix, with some DACs you can tape over or otherwise disconnect the ground pins in the USB cable coming from the PC. Depends on whether that DAC's USB circuitry is getting power from the cable or not.
If your headphone amp has a ground plug connected to the chassis you can lift it at your own risk. If the amp is using a DC adapter without any ground going into the chassis, it shouldn't have any issue.
>>
>>107790797
Had things like that for several years. If you listen carefully and your room is quiet, do you also get it quietly when you just move the mouse cursor around? If so and you're not an audiophile, those ground loop cables on amazon reduce it.

Ultimately I ended up getting a USB Dac/amp for a while and that solved it.
>>
File: 1751301639939973.jpg (313 KB, 2000x2000)
>>107790606
Yes
>>
>>107790797
used to hear little pops and crackles coming out of my speakers like 2-3 seconds before getting a text message if my phone was next to my amp.
>>
>>107790597
jan.ai
>>
>>107790597
opencode (after removing the telemetry and prompt injections)
>>
>>107790797
it's the FBI, it always is.
>>
It is still glm sex isn't it?
>>
>>107790618
in girth it is. teto has a fat chode
>>
File: 1749687644750279.webm (566 KB, 670x720)
>>107791070
>in girth it is
Anon, average penis thickness (yes thickness, not length) is 4.5-5 inches
I'm sorry...
>>
>>107791094
>Anon, average penis thickness (yes thickness, not length) is 4.5-5 inches
This

t. measured
>>
>>107790618
>>107791094
>>107791178
>discussing "thickness" of a cylinder without specifying if you're talking about circumference or diameter
>>
>>107791243
Nigger you wrap a cloth tape measure around your dick and see how big it is, end to end, somewhere along the shaft, and not the tip. Nobody cares what you call it.
>>
>>107791269
Do you also use circumference to boost your length?
>>
>>107791094
thickness appreciation NOW

thickness appreciation FOREVER
>>
>>107791243
YEAH 5 INCH DIAMETER ANON SURE
Fucking imbecile
>>
File: 1000034701.jpg (781 KB, 3600x2700)
>>107791070
>teto has a fat chode
>>107791243
ergo OP's 3 inches should mean diameter
>>107791269
Uncertain, but Miku appears to be using a ruler, not a tape. Callipers would be the ideal tool
>Nobody cares what you call it
That's how the Mars Climate Orbiter fails

Her ice cream cones are 3 inches thick = wide = diameter
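(unit check, since anons keep mixing these up: circumference = π × diameter, so 3 in across is about 9.4 in around, while the quoted 4.5-5 in "average" is measured around, i.e. roughly 1.4-1.6 in across)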
>>
/lmg/ - Large Measurements General
>>
>>107790797
aww you can hear your waifu thinking!! so cute
Try a USB extension cable, or plug the DAC into a monitor hub if it's dangling near the GPU - put physical distance between it and the "emissions" / high-frequency power-switching circuitry
>>
why is this thread so obsessed with yuri bait?
>>
File: 1740200174276394.png (566 KB, 1194x1092)
>>107790797
It's trying to speak to you, this is a known phenomenon

Do not ignore its call
>>
>>107791706
we are in the year of our lord 2026
if you find any yuri bait, you can turn it into yuri reality
anything else is a skill issue
>>
Hey, I'm a newfag who hasn't lurked but is sick of hitting limits on claude code while doing girthy refactors. I don't need the smartest model, just something that can move a lot of code if I tell it exactly what to do. Is there something for me here? Can I actually run something usable on my RTX 5070?
>>
File: k153703.jpg (672 KB, 1920x1080)
>>107790959
>merh-merh-merrr.. merh-merh-merrr.. merrrrrr
>>
>>107791812
>girthy refactors
lewd
>>
>>107791728
proof?
>>
Will we ever surpass gemma3 ablit?
>>
>>107791812
You could try Qwen3-Coder-30B at Q4 with some offloading.
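Something like this with llama-server should do it (a rough sketch; the GGUF filename is hypothetical and -ngl needs tuning to whatever fits in your 12GB of VRAM):
[code]
# OpenAI-compatible server with partial GPU offload
llama-server -m Qwen3-Coder-30B-A3B-Instruct-Q4_K_M.gguf \
    -c 32768 \
    -ngl 24   # layers kept on the GPU; the rest run from system RAM
[/code]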
>>
>>107790430
sorry teto that means miku is mine as you know the rule only the big dick fucks im nice though so you can sit in the corner and watch just keep your ugly voice down please :D
>>
File: blW0YMr.png (827 KB, 891x792)
Just started playing around in SillyTavern a couple weeks ago with Mistral-Small. Thought it sucked until I realized the random lore books I installed were somehow injecting 4000 tokens into every play session.

also migu is powerful
>>
>>107791884
Looks promising, and this LMStudio thing makes it pretty easy to dump right into claude code, wonder how pozzed it is. Thanks anon.
>>
>>107791865
which one?
>>
>>107792084
Be cautious when interacting with the Miku.
>>
>>107790618
>>107791178
>>107791243
>>107791269
>>107791305
I prompted for "ruler" not "tape measure" so Miku is talking about diameter.
>>
File: Untitled.png (5 KB, 494x412)
>>107792538
THANK you for settling this important matter
>>
>>107792538
catbox full image pls
>>
why is installing tts such a pain in the ass
these mfers must've been vibecoding
>>
>>107792648
Every fucking time. On the other hand, we get new tts weekly. Fuckers don't have time to code trying to deliver a new one asap
>>
>>107792538
Her schlong is as thick as a soda can. I don't think it'll fit.
>>
local measuring genitals
>>
Is Teto bald there too?
>>
>>107790797
Also happens to me. I can hear it during most GPU-intensive things to varying degrees, but when playing a game or something there's usually other sound so it's hard to notice. I'm usually using wireless headphones. What I noticed is that if I disable sound from line-in on my integrated sound card, this phenomenon is completely gone, so it must be interference from the GPU's additional power draw on the mobo that gets picked up as sound by the integrated card's line-in.
Electricity is wacky and stuff.
>>
Hi everyone
Right now I am working on a RAG system on a dual Xeon 2680, 256GB RAM and 2x MI50 16GB. Should I move to 2x MI50 32GB? It would allow me to move from Qwen 30B to Llama 70B… or are there better models to run on this setup?
>>
>>107791865
Good morning sir
no abliterate model is unsafe sir. . Google orinial model sir better
- Rakesh
>>
>>107791569
My dick is 5 inch RADIUS. Ask your mom, she knows.
>>
https://arxiv.org/pdf/2501.12948
deepseek updated their r1 paper today
biggest addition aside from more training details is....
safety
fuuuuuuuuck
>>
I want to make a robowife, first just for chat, and in time I'd add vision, live2d and more agency. Is llama.cpp the place to start, or should I use the quiet kobold mode? I like the gui launcher. Am I gonna run into any issues?
>>
>>107793555
They are doing the right thing though. Instead of trying to bake safety into the model they feed the conversation into a separate prompt for analysis.
>>
>>107793636
yeah, i was a little too quick on skimming, after reading through it seems like it's relatively light
>>
AMD teasing ROCm 7.2: just hype, or will windows + AMD retarded poor cucks like me have any hope?
>>
>>107792084
why call her migu? fuck you
>>
>>107793648
>2025+1
>Still holding ANY hopes for amdead
Heh, get a load of this dude
>>
>>107793660
AMD is controlled opposition and Lisa Su makes more money when she lets nvidia win.
>>
>>107790430
>tranny OP has tranny fetish
>>
When are the good models releasing?
>>
File: hairy_pussy.webm (1.12 MB, 438x780)
>>107793903
She's measuring the size of her bush.
>>
>>107794072
did you miss them?
>>
>>107794072
Nemo released in 2024
>>
Should I be defaulting to the llama.cpp release with cudart, or is that only ever useful for specific setups like multigpu or whatever?
>>
>>107794118
cudart is just the windows cuda .dll files that you drop in the llama.cpp release for your platform. If you have an nvidia GPU then you'll always want to use cuda, so you'll always need them. Nothing to do with multi-gpu. If you don't have nvidia then you don't need them.
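Concretely it's just two zips from the same release page merged into one folder (a sketch; the build/CUDA numbers here are made up, use whatever the release actually lists):
[code]
# windows ships bsdtar, which also extracts zips
mkdir llama.cpp
tar -xf llama-b1234-bin-win-cuda-cu12.4-x64.zip -C llama.cpp
tar -xf cudart-llama-bin-win-cu12.4-x64.zip -C llama.cpp   # adds the cudart64_*.dll files
llama.cpp/llama-server.exe -m model.gguf -ngl 99
[/code]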
>>
>>107794153
Oh, so it just bundles the cuda runtime binaries with the release then. For some reason I thought it was something more specific than just the cuda runtime.
Alright, thanks.
>>
>>107794178
ur gay homo
>>
>>107794310
Is that like a double negative where being gay twice makes you straight?
If so, thanks, I guess.
>>
>>107794340
You're absolutely right!
>>
File: later homo.png (115 KB, 495x841)
>>
>>107794073
that's a nice pussy
>>
i think i've come full circle
> tried kobold, scoffed at how it looked like shit and instantly deleted it
> tried ooba but it's basically a bad llama.cpp wrapper
> used llama.cpp but got sick of writing scripts just to run things and had problems with context saving
> LM studio is noob friendly but another llama.cpp wrapper and is slow
> came back to kobold and fell in love with contextshift
why did i do this bros
>>
>>107794849
skill issue most probably
>>
>>107794849
llama.cpp-Sirs... Contextshift has been turned off by default...unless you are using an old version.
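For anyone on a recent build who wants the old behavior back, it should just be one flag (a sketch, assuming a current llama-server where context shift is opt-in; check llama-server --help on your build):
[code]
# opt back in to context shifting instead of hard-stopping when the
# conversation outgrows the context window
llama-server -m model.gguf -c 8192 --context-shift
[/code]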
>>
>>107794073
>>107794760
Made for licking.
>>
>>107795053
>>107794760
>>107794310
samefag
NOT a coincidence.
>>
>>107795057
but using parameters is hard... im a retard, I only know how to click boxes...
>>
File: her.jpg (323 KB, 1140x760)
What's a good UI and model to run with 16gb VRAM + 32gb DDR5?

I just want a sexy professor/sexy assistant I can chat random topics with and have silly RP moments.

So far I've been recommended Jan.ai and KoboldCPP with Gemma 12B or Qwen 14B but wanted to hear your take. Prefer something Open-source, uncensored and privacy focused.

What would you use in this scenario?
>>
another korean model
https://huggingface.co/nc-ai-consortium/VAETKI-VL-7B-A1B
>>
>>107795172
You still have so much combined RAM that there is no reason not to use Gemma 3 27B or Mistral 3.2 24B. Gemma 3 is generally nicer than the rest in terms of writing.
>>
>>107795214
>nicer than the rest in terms..
In this ramlet category I mean.
>>
>>107795172
llama.cpp and their web frontend with whatever system prompt you need.
>>
>>107795214
There is a reason and it's not offloading to CPU
>>
>>107795322
Whatever floats your boat. Not my problem.
>>
>>107795202
>7B
Do we really need more of those?
>>
>>107795396
A1B tho!
>>
anyone tried the new Jan-v2 30B ?
>>
Have you guys recently tried out very small LLMs like 1B ones? They are legitimately better than the old school 70B models used to be. I think it's kind of ridiculous that running smaller LLMs on smartphones never became a big thing considering for most of us running those shitty 70B models was more than enough just a year or two ago.
>>
>>107795635
good one mate
>>
>>107795172
i mean it will be fun to begin with if you've never done it, but 14B won't really be all that intelligent.
mistral small 3.2 24B Q8 would be your best bet; use layer offloading and the tokens/sec should be reasonable. you can use autofit in llama.cpp or kobold, and ensure flash attention is on in kobold.
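Roughly like this (a sketch; flags from memory, double-check against koboldcpp's --help, and the GGUF filename is hypothetical):
[code]
# --gpulayers -1 tries to auto-fit layers to free VRAM,
# --flashattention cuts KV-cache memory overhead
python koboldcpp.py --model Mistral-Small-3.2-24B-Instruct-Q8_0.gguf \
    --contextsize 16384 --gpulayers -1 --flashattention
[/code]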
>>
>"I'm leaving," she says, her voice cold and distant. "Don't bother trying to follow me." She turns and runs out of the room, leaving you standing there alone amidst the spilled pink goo.
Pink goo was something she was eating from a bowl before I entered the room.
>>
>>107790987
>jan.ai
This is actually quite nice. I like that it's a real app and not a web server. A bit minimalistic in terms of features, but the browser MCP is really cool and easy to setup.
>>
>>107795472
nvm it's just a qwen3-vl finetune
>>
File: 1755508056694021.png (2.39 MB, 1056x1408)
>>107790987
>>107795956
you dropped this
>>
>>107790430
sauce please
>>
File: 964950143.gif (1.13 MB, 320x240)
>>107795172
>>107795956
>is really cool and easy to setup
ok now i know it's a shill
>>
>>107795999
makes me lose my shit with the claim 'its not a webserver' when it is 100% webshit wrapped inside a js runtime anyway, dishonest way to try to garner some sort of rep lmao
>>
>>107795981
>>107795999
>Anon asks if there are any other good frontends besides openwebui
>Anon suggests Jan.ai
>Anon tries Jan.ai
>Anon says "Hey this is actually not bad"
>Anon reports back to say it's actually alright.
ok.
>>
File: 1750830401212016.png (2.19 MB, 1056x1408)
>>107796020
here
>>
>>107796020
it's all the same "anon", I bet
>>
>>107796022
So what frontend do you use?
>>
>it's a real app
lel
>>
>>107796040
antigravity or cline for work, embedded llama.cpp for assistant work (i dont need MCP for random assistant stuff), sillytavern for cooming (local chad models)
>>
>>107795981
what is this implying?
>>
>>107796065
shill
>>
>>107796081
lmao the cope, sorry you got found out dear marketer, better luck next time :)
>>
I admit it, georgi pays me 5 bulgariabucks a month to shill llama.cpp on /lmg/
>>
>>107795413
What about 80M-A25M?
>>
>>107796123
>bulgariabucks
Those are euros now
>>
>>107796123
Only shill here is that other guy.
>>
>>107796123
WTH he told me 4/month is the best he can do...
>>
File: file.png (361 KB, 435x750)
>this is the thread's beloved mascot
>>
>>107796290
Things have been rough since she started doing heroin
>>
>>107796290
Local Miku General
>>
>>107796013
>>107795999
I'm the guy asking for a low vram frontend/model

Is Jan.ai bad? Why?
>>
>>107796390
>Is Jan.ai bad? Why?
It's not bad, the guy is just a schizo who thinks any positive comment must be astroturfing.
>>
>>107796290
kek, watching miku slowly die inside >>107796147
>>
>>107796290
miqu-1b-q1 looking ass
>>
I kind of discovered something about 4.7. Or maybe it is just bartowski's quant. I tried using it like 4.6 (t. ego death schizo) to pick at my brain and it is... an experience. 4.7 is absolutely retarded at this thing, but entertaining as hell. So dumb it is cute. But for serious fucking with your brain it is 0/10. And I think the reason it is so retarded is the preference post training. It has to be funny, interesting and evocative so it is basically useless as a serious mirror.
>>
>ego death
>can't go a thread without mentioning himself
>>
>>107796725
Kek
>>
>>107796123
Fuck, he's paying you?
>>
File: 1740005664321499.png (47 KB, 1019x646)
New STT transcription model from Nvidia, Nemotron Speech ASR

https://huggingface.co/blog/nvidia/nemotron-speech-asr-scaling-voice-agents
https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b

Claims better latency than other models, as well as better concurrency support, a big win for those of us serving 500 users at once on our H100s
>>
>>107796839
there's still nothing better than whisper v3, it's just really fucking sad ain't it
>>
>>107796725
how curious!
>>
>>107796876
society
>>
>>107796839
Nemo!
>>
>>107796839
>high-quality English transcription
Very cool. nvidia.
>>
File: 1767788222686148.png (1.44 MB, 1404x833)
>>107797036
yes
>>
>>107794073
Grok put that cat in a bikini
>>
File: file.png (1.18 MB, 760x1360)
>>107797102
>>
is AMD going to be competitive with their 2027 CPU lineup potentially shipping DDR6 RAM, or are my hopes all just plain cope?
>>
>>107797036
FunctionGemma is better.
>>
>>107797247
>Surely AMD will make good GPUs during a global shortage after decades of dogshit
Anon...
>>
>>107797261
no no, i meant their CPUs that ship as APUs with 128GB of RAM, those can run bigger models right? if the bottleneck is memory bandwidth, with DDR6 the performance could be doubled...
>>
>>107797285
if they ain't making ddr5 and are rumored to bring ddr4 cpus back what makes you think they're even thinking about ddr6?
>>
>>107797295
well if they want to survive the market and not end up like intel they better implement ddr6
>>
>>107797309
you will either get a ddr4 desktop or a unified memory laptop; these are the 2026 options
>>
>>107790894
>Knees exposed
Cover them up slut
>>
>>107795172
Gemma-3 27b derestricted easily beats every model 24b and below.
>>
>>107797706
isn't it really bad at instruction following?
>>
>>107797725
who told this lie?
>>
>>107797735
It's my experience.
>>
>>107797736
Then why are you asking?
>>
I just had a chat with an old card.txt but, instead of using gemma 12b or mistral 24b, I used Glitter Gemma 27b.
It was more entertaining but the structure was the same nonetheless.
I made Ani from a tweet by someone on twitter. It had a bunch of lines, I deleted them.
https://litter.catbox.moe/ht86fgf2n4h9lvzn.txt
This is pure Gemma 3 27B. It's somewhat funny.
>>
>>107797706
I tried qat of the 27b and found it hideous for writing. It constantly tries to end the story right after the introduction and continues a well written story with the sloppiest slop. I'm fucking angry. Rocinante (or probably just nemo) is MUCH better.
>>
>>107797820
Ani, the grok jewfriend:
I edited it a little, concatenated. Decided to keep the dashes, I don't think it will make any difference.
https://litter.catbox.moe/jo27g7r6hbeh3uem.txt
>>
>>107797827 (me)
It's like it never saw any good prose and the characters feel like they are written by a sleep deprived cashier woman
>>
>>107797820
Gemma 3 is the best.
>>
>>107797820
sloppa
>>
File: 1751214344243867.png (61 KB, 227x228)
>I shift uncomfortably, suddenly very aware of my nakedness under my clothes.
Thanks Mistral
>>
>>107797952
SOVL
>>
>>107797915
This is scientific slop. Nigger.
>>
>>107797952
I do this
>>
>>107797952
Does she know she has a skeleton inside her body?
>>
>Mistral small 3.2 finetune
>ChatML prompt format
Why do finetuners do this?
>>
>>107798205
Most don't know what the fuck they're doing. Same with the ones that recommend a temp 1.5x+ higher than the original model with a cocktail of cope samplers to try to wrangle it back into coherency.
>>
>>107798205
My training data is in ChatML so that's what it's going to be!
>>
>>107798205
You get some of the benefits of the original assistant finetune while not making yours too much assistant-slopped.
>>
>>107797725
Where do you get derestricted Gemma3? Original Gemma3 is as censored as it gets.
>>
>>107798205
because fuck mistral format
>>
Has anyone seen a setup where one model acts as the writer and the other as an editor?
For instance, Nemo has nice prose but isn't very smart. GLM 4.7 is a slop machine, but is smarter. Does anyone know if it is feasible to make GLM review Nemo's responses and generate correction prompts for it or should I test it myself?
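If both are sitting behind llama-server instances it's maybe a dozen lines of glue (sketch only; the ports and prompts are made up, and it assumes both expose the OpenAI-compatible /v1/chat/completions endpoint):
[code]
#!/bin/sh
# writer pass: Nemo (:8080) drafts the reply
DRAFT=$(curl -s http://localhost:8080/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{"messages":[{"role":"user","content":"Continue the scene."}]}' \
  | jq -r '.choices[0].message.content')

# editor pass: GLM (:8081) only lists consistency problems, never rewrites,
# so its own slop stays out of the final prose
jq -n --arg d "$DRAFT" \
  '{messages:[{role:"user",content:("List continuity/consistency errors in this passage. Do not rewrite it:\n\n"+$d)}]}' \
  | curl -s http://localhost:8081/v1/chat/completions \
      -H 'Content-Type: application/json' --data-binary @- \
  | jq -r '.choices[0].message.content'
# then feed those notes back to Nemo as a correction prompt for draft two
[/code]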
>>
>>107793648
>windows + AMD
why would you do this to yourself?
>>
>>107798301
Test it and see, but I imagine it would just result in GLM inserting its slop. I guess you could tell it to just search for consistency issues or something, rather than telling it to make the output 'smarter'. Might see some improvement at higher context, where Nemo falls apart pretty quick.
>>
>>107798325
Yeah, that's the idea. Telling Nemo not what to write, but to make sure the response is consistent. Things like making sure different characters' actions are not misattributed and so on.
>>
File: 1758343698842077.jpg (487 KB, 1536x2048)
>>107798360
>Telling Nemo [not X but Y]
>>
File: 1767225375325242.jpg (554 KB, 1457x1239)
>>107798205
>>107798241
>Most don't know what the fuck they're doing.
This.
I hate to break it to you, but almost every good fine-tune has been a complete accident. The tuner then makes a higher version like 1.1 or 2, and it immediately shits the bed. No one knows what they're doing. They're just throwing logs from ai chat services and data off google at models in hopes it'll make something good. Not drummer. Not sao10k. Not anthracite. None of them do. They rent GPUs, blend logs and instructions into the model, and don't even fuck their own bots to see if the output is coherent. They probably don't even figure out what works vs what doesn't. It's just that the masses say it works or not, and that's apparently good enough for them - god forbid they actually discover a pattern or two about what works. They all hit a wall at MoEs because then they're actually required to know something, and they can't even take the first step. MoEs are filtering them all out, so eventually new tuners will rise that are better, because MoEs are better than dense models when you quant their experts high enough - regardless of tuning.
>>
>>107798388
Forgive me, Anon — the temptation was too great.
>>
File: w00tDario.png (155 KB, 522x670)
>>
>>107798391
I can't believe Miku's butt would say that
>>
>>107798448
Miku's butt just says whatever she thinks you want to hear
>>
File: 1747412618450457.jpg (409 KB, 2744x1536)
>>107798296
What's wrong with it? What makes ChatML better?
>>
>>107797952
I fucking love Bernkastel
I lost my LLM virginity to her
>>
File: 1752649970261105.jpg (6 KB, 200x200)
>>107798429
if this fat fuck lost weight and got lean again Anthropic would be worth 10x more
>>
>>107798391
you want me to change my dataset and finetuning methodology? nuh uh, fuck you.
this is why "stock" models are better in most cases, no finetuner can actually do sft+dpo properly.
>>
>>107798529
bussy doesn't attract investors
>>
File: 1751749507610689.png (241 KB, 1994x1154)
>>107798537
I wish that was true
>>
File: 1747377593541665.png (326 KB, 554x554)
>>107798537
The looks of a CEO most definitely affect the perception of the company, just look at this dude now
>>
File: cai.png (10 KB, 512x512)
>>107798537
>doesn't allow bussy
>censor bussy
>a fucking thousand alternatives spring up taking billions in bussy money
Bussy talks.
>>
>>107798557
But isn't anthropic worth more now than when we looked good?
>>
>>107798587
>we
hello there
>>
Has anyone tried this model?
https://huggingface.co/alpindale/dbrx-instruct
>>
kys alpin
>>
>>107798598
>132b instruct tuned MoE
No, but it looks interesting.
>>
>>107798598
>2 years ago
If it's not popular by now, then it wasn't good.
>>
>Try lots of local models on LMarena
>All sloppy in the exact same way
Is it like cloud models, where when you get your preset and a card in there to context poison it, it starts writing more interesting/less generic prose, or do they just stay in "not x but y" -ism mode?
>>
>>107798587
back then the CEO didn't go on CNN to warn about mass unemployment
>>
>>107798529
>>107798557
Sama is also pretty ugly, I don't think that's it.
>>
>https://huggingface.co/datasets/PJMixers-Dev/c3-kto-test/viewer/default/train?row=0
>"value": "You'll portray {{char}} and engage in Roleplay with {{user}}.
drummer do you unironically train on this?
>>
File: Screenshot_100.png (23 KB, 392x307)
>>107791865
Which gemma 3 ablit is the right one?
>>
>>107798641
Let's see your dataset
>>
File: 1736515987444214.png (1004 KB, 834x2048)
>>107798634
Sama is a conniving snake trying to act like a shy nerdy femboy, looks help establish a more positive perception of the company
>>
>>107798658
the one you lobotomize yourself
>>
>>107798620
But 90% of people in this general have no attention span and think a model goes stale and moldy after a few weeks
>>
>>107798664
You could have just said he's jewish.
>>
>>107798598
I tried it when it came out and it was pretty bad, it was around the same time as wizardlm 8x22 and command-r which got all the attention because they were way better
>>
>>107798659
Lick my nuts drummer, you get more than enough money to curate a proper dataset
>>
>>107798683
I accept your concession.
>>
>>107798680
Thanks. I was just desperate for a model that isn't glm air. Guess I've gotta keep looking/waiting.
>>
>>107798659
no need to get defensive, but you don't see any problem with it?
>>
File: 1744641151833306.jpg (120 KB, 1024x707)
>>107798675
Dario acts differently and he's also jewish
>>
>>107798641
No wonder behemoth X v2 is a fetishist for consent.
>>
File: 20240604.jpg (599 KB, 2560x1196)
When I get a DDR6 + CXL motherboard + Blackwell + NPU I'm going to finetune at home and beat these current-day fine-tuners into a brick wall.
>>
>>107798728
16GB of DDR6 will be like $3000
>>
File: luigigi.png (424 KB, 608x602)
>>107798757
It costs roughly 1.00 USD in materials to make 16GB RAM.
>>
>>107798789
Yes but that 1 USD of RAM could be put in an AI GPU that businesses will be more than happy to pay $3000 for. You can outbid them, right?
>>
>>107798789
I'm not saying current prices are anywhere close to reasonable, but citing the raw materials price for a product as complex and advanced as RAM is fucking bullshit even if you ignore profit margins
you know it
I know it
Luigi was based tho
fuck the insurance industry for real
>>
>>107798789
Raising a child to 18 in the US costs around $300,000-$390,000
More people should die, imagine how many resources could be saved
>>
>>107798701
>pic
"20 bottle caps for the negro to mine it, why do you ask?"
>>
>>107798829
I agree with everything said here.
>>
>AnythingLLM
>Jan.ai
>something else?

I feel like I want to switch up my frontend and these stood out. What's the best option? I'm more of a casual user.
>>
>>107796867
You mean whisper v2
>>
>>107799056
This. I don't know how anyone manages to use v3 with all the hallucinations during any second of silence.
>>
>>107798688
Just use command-r or llama3 eva0.0
>>
>>107798598
>alpindale
Buy an ad.
>>
>>107798789
At what scale?
>>
>>107799759
In your garage with walmart handtools and $1 worth of amazon parts.
>>
>>107798701
Jewish in the evil sense, not the silly curly hair guy sense.
>>
why did they call it Router-weighted Expert Activation Pruning? why not Router-weighted Activation Pruning of Experts?
>>
I need to devise the successor to cockbench
>>
>>107800112
cuntbench?
>>
>>107800087
Chicken sandwich. Sandwich of chicken.
>>
>>107800137
but the acronym would have been funnier.
>>
>>107800144
It's funny if it doesn't sound contrived.
>>
>>107800087
They decided that they wanted to use REAP as their acronym and worked backwards from there
>>
>>107800177
>>107800287
RAPE would have been more fitting because it basically rapes the models.
>>
>>107800087
The same reason why the Neural Image Generation via Generative Adversarial Rendering paper was never released
>>
Today I just tried ChatGPT again to see how it compares to local and it's still terrible on the free tier. So many people are experiencing absolute garbage without knowing it kek. Literally Mistral Small or something did better in my test. Whatever they're serving feels more like an 8B, or maybe 20B MoE.
>>
>>107791898
they are talking about thickness
a 3 inch diameter is pretty big
>>
File: 250px-SiliconCroda.jpg (20 KB, 250x174)
>>107798789
turn this into a microchip for me
>>
>>107800583
I canceled my plus sub to just go local+openrouter.

So far I've only spent 6 cents in the last 2 days. Honestly it's really nice to have access to all the big flagship models if you need really solid answers, but 70% of the stuff I ask day to day, any old sub-32B model can answer just fine.
>>
>>107800624
Aren't they made out of sand?
>>
>>107800649
Yeah I stopped my sub a long time ago. Even a year ago there were more than enough alternatives.
>>
https://x.com/ltx_model/status/2008595989096177962



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.