[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: saintmakise.jpg (236 KB, 1614x992)
236 KB
236 KB JPG
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107643997 & >>107636165

►News
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7
>(12/17) Introducing Meta Segment Anything Model Audio: https://ai.meta.com/samaudio
>(12/16) MiMo-V2-Flash 309B-A15B released: https://mimo.xiaomi.com/blog/mimo-v2-flash
>(12/16) GLM4V vision encoder support merged: https://github.com/ggml-org/llama.cpp/pull/18042
>(12/15) llama.cpp automation for memory allocation: https://github.com/ggml-org/llama.cpp/discussions/18049

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
K-Kurisulove??
>>
>rentry.org/jarted
you dropped this
>>
>>107652781
Too late. Now spit out miku's penis and worship the true redhead queen of /lmg/.
>>
>>107652789
Dammit. Merry Christinamas
>>
>>107644945
there were tests done similar to this with mixtral models, just dont do it
>>
>>107652781
I admire your dedication, migubaker.
>>
>>107652821
Not him but thanks
>>
►Recent Highlights from the Previous Thread: >>107643997

--Custom PC build challenges with dual-CPU motherboard and GPU development:
>107648704 >107648802 >107648838 >107648947
--Z.AI's upcoming model announcement generating speculation:
>107645563 >107645577
--Critique of uncensored release and adoption:
>107649191 >107649258 >107649238
--glm 4.7 template configuration issues causing excessive token usage:
>107645140 >107645154 >107645187
--Model 4.7 evaluation and comparison with previous versions:
>107647330 >107647367 >107647441 >107647641 >107647542 >107647495
--Finetuning scalability with web scraping and model distillation:
>107644302 >107644406 >107644448 >107644493 >107644521 >107644775 >107644791 >107644798 >107644810 >107645002
--GLM performance debates and benchmark analysis:
>107647533 >107647803 >107647892 >107648045 >107648063
--Training loss reduction strategies and model performance analysis:
>107648741 >107648779
--Critique of unsloth template and GGUF challenges:
>107648117 >107648188 >107648189 >107648231 >107648262 >107648306 >107648403 >107648431 >107649512 >107649536
--Scraping and roleplay prompting Opus 3 via Claude and OpenRouter:
>107648858
--Ethical concerns about Qwen3-TTS cross-species voice cloning:
>107644576 >107644634
--AI policy restrictions and model limitations in generating stylized Hatsune Miku SVGs:
>107644661 >107644704 >107644718 >107644785 >107644738 >107644861 >107644870 >107645028 >107645058
--GLM 4.7 cockbench and speculation on llama 4 scout behavior:
>107644330 >107644369
--GLM 4.7 shows unexpected reasoning capability about animal knowledge:
>107645272 >107646055
--Comparing GLM model perplexity across parameter sizes to analyze non-activated parameter impact:
>107644945
--Teto and Miku (free space):
>107644176 >107644885 >107649257 >107651275 >107651326 >107651473 >107652578 >107652818

►Recent Highlight Posts from the Previous Thread: >>107644002

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
>>107652827
Thank you Recap Miku
>>
>>107652819
>>107644945
https://xcancel.com/sbeastwindy/status/1735185274475524333
>>
>>107652836
https://github.com/ggml-org/llama.cpp/pull/4406#issuecomment-1855151885
last one i swear
>>
I have a half working pi zero W. How to get it working with a lm?
>>
File: 1746327033824790.png (36 KB, 560x379)
36 KB
36 KB PNG
>>107652846
What do you mean? Are you trying to host the LLM on the Pi, or just use it as like a frontend?
>>
File: 1755196481408026.jpg (468 KB, 1024x1440)
468 KB
468 KB JPG
>>
File: 1750829887950986.jpg (301 KB, 1024x1536)
301 KB
301 KB JPG
>>107652767
>>
>>107652999
didnt know miku was african american
>>
>>107653005
miku has always been a real black hood nigga
>>
>>107653005
apparently kfc for christmas is some sort of thing in Japan. I think it has to do with them having no real Christian tradition, some marketers swooping in, and the colonel kinda looking like santa clause.
https://www.bbc.com/worklife/article/20161216-why-japan-celebrates-christmas-with-kfc
>>
wew.. lads, just finished reading rhrough the last 3 threads
what am i gonna do now?!
>>
File: 1747851886257398.png (9 KB, 474x67)
9 KB
9 KB PNG
man
>>
File: file.png (19 KB, 544x134)
19 KB
19 KB PNG
>Delivery Attempted
This is the second time I've bought a GPU from Amazon and this has happened. Last time I had to buy a cheap GPU from China. I hope they don't screw me over again.

By the way, is this GPU good enough to play around with or is it too weak? My initial goal is to generate some assets for my game, nothing professional, it would be more for prototyping, nothing 4k.
>>
>>107653014
I don't understand how they could make that connection. The colonel has a puny beard compared to the real claus.
>>
>>107653018
Wait for Z-Image base and edit, and for ubergarm to wake up
>>
>>107653045
maybe I just made up that part. Seems to be all marketing https://old.reddit.com/r/Tokyo/comments/1hmg29n/dont_people_eat_kentucky_fried_chicken_on/
>>
>>107653041
>My initial goal is to generate some assets for my game
iirc Hunyuan3D only needs 10GB so it should be fine. Definitely fine if your game is 2D.
>>
>>107652978
Everything.
>>
>>107653047
Models that will never be released:
- WizardLM
- Meta Movie Gen
- Mistral Medium
- GLM 4.6 Air
- Z-Image base and edit
>>
File: wclivocw8e631.jpg (65 KB, 600x450)
65 KB
65 KB JPG
>>107653073
>>
File: 1765345142756066.jpg (16 KB, 326x326)
16 KB
16 KB JPG
>>107653073
512 MB of RAM isn't nearly enough to host most LLMs but I'm sure you could fit one in there if you really tried.
>>
>>107653073
you could do a q2 of gpt2
>>
>>107653080
you forgot llama2 34b
>>
Does running an LLM cause your GPU and CPU to run hot?
>>
>>107653129
It can if it has to process a lot of tokens at once, but it shouldn't be much worse than running a demanding video game for a long time
>>
>>107653129
It causes my cock to run hot.
>>
>>107653151
use lube, baka
>>
>>107653154
I don't need to, I have a foreskin.
>>
>>107653041
Amazon drivers will unironically steal your GPU and give a bogus delivery or error code.
>>
>>107653169
Then it becomes Amazon's problem to replace or reimburse.
>>
>>107653005
KFC is unironically a Japanese tradition thanks to a very successful marketing campaign several decades ago.
>>
File: メニュー .png (2.84 MB, 1500x1061)
2.84 MB
2.84 MB PNG
>>107653211
Don't forget to make your reservation, lines will be long
>>
GLM 4.7 GGJTs when?
>>
>>107653129
it takes 100% of anything it can get even more so than the most demanding video games
>>
bros.. i have to confess.. i had my first LLM coom today.
around a year and a half ago i complained about it and some anon recommended twin cow girl erp cards, but i couldnt be serious in any kind of erp
but after not jerking off for two weeks (reasons), i talked about reasons with deepseek ( for like a week ), and then told it about not jerking off
dipsy told me about joi, and then did it
crazy.
that was absolutely crazy. insane. never knew my balls could store that much cum
>>
>>107653257
Nips are fucking weird. Meat on Christmas is literal blasphemy in the first place.
>>
>>107653468
They're not Christians thougheverbeit
>>
>>107653466
Cooming after weeks of not cooming always feels amazing but sometimes it can be annoying to clean up the sheer volume of cum, especially since it usually shoots out further as well.
>>
>>107653177
>The driver marked it as delivered
>The driver took a picture on their phone to authenticate it as delivered before pocketing it
Enjoy the hassle of convincing Amazon what happened. Niggers abusing the system have completely eroded Amazon's consumer-facing trust in incidents like this in recent years.
>>
>>107653468
>Meat on Christmas is literal blasphemy in the first place.
what the fuck are you on about? that's for easter you TERTIARY.
>>
>>107653468
Christmas in December is blasphemy, you unorthodox fucks
>>
>>107653479
Why are you cumming all over the place instead of in a tissue or some toilet paper?
>>
>>107653479
>especially since it usually shoots out further as well.
this is so true, the most annoying part is when the cum lands on your hair and then you have to wash it (easy part)
the hard part is drying the hair, i bought a 2.6kW hair dryer and it still takes 30 minutes (best case) up to an hour :*
>>
>>107653476
Which is why them celebrating the birth of Christ makes no sense.
>>
>>107653510
nta but i literally just cummed on my hoodie and all over my thighs and the floor and the wall
usually you cant exactly focus on aiming and feeling good at thd same time
>>
>>107653493
Filthy heretic.

>>107653495
I blame the pope.
>>
File: w8qpc6yt919g1.jpg (172 KB, 1179x1864)
172 KB
172 KB JPG
ARC-AGI 2 has been finished.

What are the implications of this and how fast do you assume ARC-AGI 3 (Which will by dynamic real time environments instead of static puzzles) will be solved by LLMs with agentic scaffolding?
>>
>>107653466
My BROTHER you have no idea what you've been missing.

Think about every crush you've ever had. Every fetish you've got. Every single sexual fantasy and scenario you've ever thought about. Well now you can simulate them with a good model and system prompt.
>scifi rouge AI who inhabits a space pirates body who wants to rape you (her captain)
>big tiddied dark elf pampers you after a quest
>cute femboy with AGP who dresses up like princess peach needs help with homework
>loli childwife on a deserted island
>kobold adventure party shortstacks with fat butts get stuck in the wall of a dungeon
>rouge the bat JOI instructor
>Pinkie Pie PISS simulator
>busty anthro girl traps you in her trenchcoat

You are only limited by your imagination and the models SLOP. Truly the greatest coom tool in all of human history, iykyk.
>>
>>107653579
heh, i think i didnt make it clear
i was erping, a lot of the time not serious
but usually it was erp to get me hard and horny for regular porn
man, to think we've come so far since vicuna-unlocked
i wonder what gozfarb is up to nowadays
these scenarios are sick, ill have to try them out
>>
>>107653521
Were you backed up for like a week or something?
>>
>>107653641
around 2 weeks >>107653466
>>
File: 1761117575776279.png (10 KB, 957x596)
10 KB
10 KB PNG
>>107653614
>but usually it was erp to get me hard and horny for regular porn
Fair, I do this quite often as well. I find that text based porn fills a different need. Sometimes I'll load up a model and start RP for a few turns before I realize I'm only in the mood for standard pornography.

>man, to think we've come so far since vicuna-unlocked
I remember my first COOM to pyg6b. Its easy to shit on the state of local right now but the fact is that if you can run at minimum Nemo you essentially have the ability to RP any scenario you want. Pretty powerful stuff.
>>
File: 1753160740517797.png (469 KB, 853x1000)
469 KB
469 KB PNG
>>107653671
Damn, I feel antsy after just 2 days
>>
File: cumcupTM.png (23 KB, 1829x430)
23 KB
23 KB PNG
>>107653521
I'm gonna teach you a method I came up with in my teens.

Take two joined squares of toilet paper and roll them up in a cylinder a bit wider than your dick.
Fold the top third inwards. This keeps if from unrolling. Don't fold it all way, you want to form a closed end. Palm the folded then push your fingers through the open end and press the folded bits against your palm to bunch them up and form the closed end.

It only takes a few seconds and now you have a disposable cup you can cum in.

You can just plop it over your dick if it's wide enough or you can hold it with your other hand.
You can even use it with one hand if you only grip the very bottom of it with your thumb and index finger and move it along as you stroke. I prefer this because it's a complete seal even though it restricts your stroke length a bit. I think it only works with a foreskin because otherwise you'd be rubbing your dick with paper as opposed to it gliding along with the foreskin.
>>
File: 1737154283289681.png (505 KB, 860x720)
505 KB
505 KB PNG
>>107653692
Toilet paper is thin and completely dissolves when exposed to moisture. If you edge then your pre-cum is going to make your dick just rip through it after a few strokes.
Get a few tissues, ideally something soft like Kleenex™ and use 3-4 of them. Layer them on top of each other, then wrap it around your dick. It will feel great and your dick won't rip through it, and you won't end up with cum inside your belly button.
>>
File: 1763494921517727.jpg (83 KB, 500x642)
83 KB
83 KB JPG
>>107653692
>>107653705
>>
>>107653692
>>107653705
thank you anons, this is amazing
absolutely amazing.
another day thankful to have a foreskin
>>
>>107653705
It's not an issue. I only needs to hold for a few seconds.
Also I find that it feels so much worse if the cumshot is impeded by something covering the tip. The cup shape is great because it leaves a few cm of free space in front.
>>
>>107652999
good girl, consume the goyslop
>>
File: 1760737325627226.jpg (104 KB, 856x734)
104 KB
104 KB JPG
>>107653727
>I only needs to hold for a few seconds.
Oh, I like to go for a bit longer, and I tend to leak during.
>Also I find that it feels so much worse if the cumshot is impeded by something covering the tip.
My mother fell for the circumcision jew so this might also be different. Tapping a soft tissue with the tip feels nice for me.
>>
>>107653556
I'm no researcher or learned person so I could be completely wrong, but IMO the saturation of these benchmarks are less of the models being able to generalize better between releases and more of the researchers applying RL to specifically saturate the benchmarks. Impressive, but not the kind of improvements we need to increase the intelligence of the model.

This doesn't mean the models are dumb, or that RL isn't good enough to make models better than humans at certain tasks. It does mean that something fundamental needs to improve outside of RL in order to give these things the spark needed to be considered "AGI".

I think ARC-AGI 3 will follow a similar path, although I would expect it to take a little bit longer since the challenges are significantly harder. I don't think people will consider it AGI when ARC-AGI 3 gets saturated.

If anyone here actually knows what they're talking about please correct me
>>
File: 1741684311191967.png (195 KB, 894x281)
195 KB
195 KB PNG
>>107652767
>"official" /lmg/ mascot
>mentally ill tranny/tranny-adjacent retards trying to cram their generic tranime girl obsession into any and all online communities regardless of relevance, just like lgbt fags and pedo groomers
>also purely coincidentally its one of the most generic and popular normie npc tranime girls every single time, like troonku, lain, makise or whatever was the tranime girl of the year in the groups they hanged out at the point in their life where their life peaked (highschool) which doesnt speak to their low IQ and unironic mental illnesss and autism at all of course, they are completely normal and well adjusted 30+ year old men spamming the same generic copy paste tranime girl in every conversation online

Also reminder for the newfags that the main terminally online troonku baker of /lmg/ was exposed as a 4chan moderator, who literally instantly deleted messages posted in the general that made him look bad while he himself posted miku porn on /ldg/ that stayed up after being reported for multiple hours, multiple times. He was also exposed as an autogynephilia fetishist, and a poopdick fetishist, completely coincidentally again of course.
>>
>>107653692
>>107653705
I just finish jerking off on the toilet.
All of my important files are on my desktop with an automatic sync to my NAS.
So after edging for some time I can move seamlessly from my desktop to my laptop.
One hand for stimulation, one hand to hold a bit of toilet paper, and one foot to control my laptop.
>>
>>107653757
ARC-AGI 2 is a "generalization benchmark". They are hard puzzles that need a sort of generalization in logic for you to solve. It's correlated with better reasoning in for example coding but NOT creative endeavors like roleplaying.

ARC-AGI 3 will actually be worse because it will be dynamic puzzles for testing agents. So while that is probably a step towards AGI it isn't as correlated with generalization. For example an AI that is just better at being dynamic might score better than an AI that is actually smart purely because the dynamic shitty AI is better for real time environments. Humans would still consider the lower scoring one to be more general.
>>
>107653768
This is what you're spending Christmas Eve doing?
>>
You just want to cum.
I want a companion.
We are not the same.
>>
>>107653816
I want both
>>
For the deluded minds who think they can solo RP models, from the GLM 4.7 AMA:

https://www.reddit.com/r/LocalLLaMA/comments/1ptxm3x/ama_with_zai_the_lab_behind_glm47/nvkhgjk/

>I can analyze this from the perspective of post-training. At present, due to differences in compute reserves across organizations, the amount of compute invested in post-training also varies significantly. One clear trend we observe is that Chinese large model providers still invest substantially less compute in post-training compared with their U.S. counterparts, although this gap is gradually narrowing.
>
>For post-training, the compute consumed by experimentation is often much higher than that used in the final training runs. For example, during the post-training of GLM-4.7, the compute cost spent on post-training experiments was likely dozens of times higher than that of the final GLM-4.7 post-training run itself.
>
>Returning to the original question, in my view, building a reasonably strong model team for post-training requires at least a dozen highly talented researchers, along with compute resources equivalent to roughly 2,000 H100/H800 GPUs.
>>
>>107653829
A cumpanion, if you will
>>
>>107653848
heh
>>
>>107653815
It's always a good deed to publicly humiliate and shit on the disgusting mentally ill faggots like yourself.
>>
>>107653884
being mean on christmas? you're gonna get a lump of COAL
>>
>>107653903
what about a lump of CUM
>>
>>107653884
>mentally ill faggots
Look into a mirror. You went through the effort of posting your seethe in an otherwise friendly and on topic thread because you have an axe to grind. No one cares about your irrational grudge against the dumbest shit. Maybe reflect and become better instead of letting your anger dictate your actions. Hope you have a nice Christmas with your friends and family.
>>
>>107653930
>axe to grind
heh, this is funny because trannies have an axe wound between their legs LOL
>>
>>107653930
miku ritual posts > seething posts.
>>
Seethejeet is by far the brimmiest poster in these threads and it's not even close.
>>
>>107653936
Yeah, troons castrating themselves is quite funny, but not as funny as untermensch exposing their infantile ego and subhuman level emotional control for everyone to laugh at, kek.
>>
OK, I got 4.7 working on my new setup but it's kinda slow (3.5 T/s) but holy fuck! Coming from nemo and mistral small, it's like a another world.

Is there anyway to limit the amount of reasoning it does in sillytavern? Do I need it at all if I'm mainly using it for RP or will it drop the quality significantly?
>>
File: 1.png (33 KB, 420x229)
33 KB
33 KB PNG
>>107654025
you can just disable it
>>
>>107654033
yeah, but does it kill the creativity?
>>
>>107654054
I don't know because I always turn it off
>>
>>107654065
>I always chop my peen off
>>
.
>>
File: 2025-12-24_11-54-50.png (70 KB, 939x470)
70 KB
70 KB PNG
>>107654193
FUCK ME
>>
>>107654199
the point is hf has/had? a limit of 50gb per file
>>
>>107654199
there is no need to be upset. learn how2use hf cli
>>
just cat them, bwo
>>
>>107654222
https://huggingface.co/blog/rearchitecting-uploads-and-downloads
goddamit zer fucking juden every fucking time with their fucking hooknosed bullshit im fucking sick of it everything is a fucking pile of shit nothing fucking works easier to be a caveman and make everything from scratch then put up with all this bullshit anymore i hope aws goes offline again and deletes all theri fucking bullshit

does modelscope have the same ? can someone reupload there ?
>>107654280
may your cock rot and maggots eat it from the inside you son of a whore
>>
>>107654199
I can't believe those evil jannies robbed us of your high-quality post.
>>
>>107653512
What fucking bubble do you live in where you think everyone is just celebrating christmas as the birth of christ or think that's weird
>>
>>107652836
>>107652840
I was more thinking of comparing, say, running smaller MoE with the normal number of activated params vs running a larger MoE with a cut down number of activated experts to match the number of activated params with the smaller MoE.
Also, Mixtral is so old. I can only imagine that the numbers look pretty different for modern models.
I do remember that Qwen 3 30B got a nice bump in PPL when using 10 experts instead of 8.
>>
>>107653492
>Niggers abusing the system have completely eroded Amazon's consumer-facing trust in incidents like this in recent years.
amazon itself has also eroded its own reputation for me
I live in a country where delivery was never, and I mean it, never reliable (France) so I always get my packages delivered at pickup locations (think services like UPS Access Point if you're a burger) which 1/ gives me the time to inspect the package properly and reject it if I see signs of tempering - damage and 2/ avoid porch pirates because it's not uncommon for lazy delivery drivers to just leave the package in the building hall and it gets stolen by the neighbors, I've known people it kept happening to

a couple years ago, amazon has made it impossible to order packages over something like 400 bucks through this method. They only allow ordering this shit to your home. I absolutely refuse! no relay pickup no order period. I haven't ordered a single thing from amazon since then. I don't know why they did this to valuable packages because issues are more likely to happen when you have them delivered to your home than to a retailer pickup location, but they certainly lost me forever as a customer.
>>
>>107654401
how much is the pickuper getting paid to deal with 1000 amazon packages a day?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.