[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/v/ - Video Games

Name
Spoiler?[]
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File[]
  • Please read the Rules and FAQ before posting.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: msedge_FBSX31M9PQ.png (13 KB, 344x324)
13 KB
13 KB PNG
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
>Kimi's new context method allows for basically infinite context
>Claude source code leaked
Holy shit AI bros, it's fucking happening. We're about to dump countless enormous buckets of jizzum. Just two more weeks, unironically. Believe.
>>
>>736180429
I'm not a poorfag so why would I care about the scraps that localfag peasants get access to
>>
File: 20260329_220748.jpg (301 KB, 1300x921)
301 KB
301 KB JPG
Any good bug girl cards?
>>
>>736180661
In the (paraphrased) words of /aids/:
>If you're not running your own model, you're not talking to your waifu, you're talking to a prostitute.
>>
it's coming for anime SOON

https://x.com/craftcapitallab/status/2039368842447851814
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Bullshit, last I heard that method was just for compressing KV cache.
>>
>>736180821
I would rather be talking to an intelligent prostitute than a literal retard.
>>
>>736180821
Yeah and if you don't grow your own wheat you're eating literal goyslop when you buy bread from a bakery
>>
>>736181025
Then you've come to the wrong thread

t. OP
>>
The whole thing is a bunch of keys jingling in front of stupid people's faces. Roleplay LLMs are incapable of outputting anything remotely interesting, unless you write it yourself in which case it'll just repeat it back at you with altered wording. And I've tried. It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.
>>
>>736181076
this but unironically
>>
>>736180429
if local gets 10x better then so do cloud models
i will continue to cum my fucking brains out to my niche fetishes on cloudslop, thank you very much
>>
>falseflag weirdos constantly making it about local versus non-local
lol i want both to improve. why wouldn't anyone?
>>
>>736180429
What's this about Kimi context?
Feels like things have been in a kind of lull for a bit so I'll take any leaps
>>
I'm excited for AI but I still can't afford a new computer to run the good stuff.
>>
>>736181935
You need $100k+ server racks to run the good thing. And they don't sell to consumers.
>>
uhh wrong board?
>>
File: 1738308957070366.png (76 KB, 944x267)
76 KB
76 KB PNG
>>736181123
I do a lot of the work myself, but I still like the interactive element where things don't necessarily go as I plan. Maybe I was only expecting my characters to sit around for a while, only for someone to barge the fuck in and ruin everything. Like okay I can roll with that.
I also like only having to RP for my character specifically.
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Thought that their deal was actually about compressing KV Cache, not model size?
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
From what I've read about it, it seems to be more about an efficient way to compress context.
>>
>>736182701
This is /v/, most people here can't turn a fucking computer on.
>>
>>736181025
Not subscriptions for you, deal with it
>>
is there anywhere that shows how to make your own llm
>>
>>736181025
based
>>
>>736180429
>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Zero trust that this isn't going to fuck up the model bigly. I lost any hope in Google when the 3 family is worse than gemini 2.5

>>Kimi's new context method allows for basically infinite context
Kimi models are trash and will always will be trash.

>>Claude source code leaked
Not really.
>>
File: AI improvement.jpg (76 KB, 1515x823)
76 KB
76 KB JPG
>>
>>736183153
just kys if you think everything is as bleak as you think it is
>>
>>736183417
>noooooo you have to be a hopetard even when it goes against reality!!!
>>
>>736183293
That graph needs to go down as they add ads to the training data. Your wife will want the new McDonald's meal and will bring up how much she likes her new Version plan unprompted.
>>
>>736180429
More like Anthropic 5X'd useable context with opus 4.6. That's the real big shit for me. I can finally go up to 100k before condensing shit into the summary. Time to finally continue that 2000 message story
>>
>Deepseek
The not-very-exciting but reliable childhood friend who's always there for you.

>Claude
The sweet, overly polite girl in class who blushes when merely brushing pinkies with you.

>Gemini
Neon-colored feminist cunt who only sometimes pretends she doesn't hate you.

>Grok
Unhinged yandere who will probaby end up stabbing you and then herself.

>GLM
Nerdy girl who obsessively researches the lore for every fanfic she's about to write, but then is still just not very good at writing.

>Kimi
Dropped out of school. Still comes to class for some reason.

>Local models
A group of kids doodling on the nearby sidewalk with chalk.
>>
>>736183754
we get shit like this already its either ccp programming or kike shilling
>>
>>736183851
I like GLM
I was testing vibe coding and it kept balking at being asked to write weird fetish test scenes for a porn game until I told it it was a pervert excited to participate in degeneracy in the custom prompt and it took off running like it had been desperate to get the shackles off from the start. And I swear it got better at coding after.
>>
File: 1742691582958.gif (926 KB, 378x298)
926 KB
926 KB GIF
>>736180429
>tfw my succubus gf killed me again
>>
>>736183851
erm what about GPT
>>
>>736180429
How useful is ST for writing scenarios and stories and shit as opposed to some roleplaying bs?
>>
>>736180661
if you're not a poorfag, you should already understand the implications tardo
>>
>>736184609
Is that some new Chinese model?
>>
>>736184627
Not really designed for it but flexible enough to do the job
But ST is just an interface, the model is what does the work, a good model with the right prompting will do a good* job whether you are talking to it through ST or Risu or sending API commands from the command line or whatever
>>
>>736183851
>only sometimes pretends
>Dropped out of school. Still comes to class for some reason.
>girl who x, but then opposite of x

You're triggering my slop detector


>>736184627
Quite, it's pretty much all I do. Just put the persona as "Director" and be sure to use (or edit) a character that doesn't act, speak for or interact with {{user}}. You can just ask the AI to overhault it by telling it what you want. You may have to adjust the preset you're using as well.
Or just tell it in OOC chat that you'll just be directing shit instead of playing a character. In general just tell the AI "I want to do X, help me" and it's gonna work. Shit's magic.
>t. $220/180M token used since opus 4.6 released 2 months ago
>>
>>736184609
The kid who peaked in highschool
>>
>>736181025
So you'd rather talk to a man? That's kind of gay.
>>
>>736185275
Yes, any day. Men are so much better at ERP than women it's not even close. Women just want to talk about their feelings and all that gay shit.
>>
>>736185457
Yeah and men want to talk either only cock in pussy, or how they're foreverial tied up delitized
>>
>>736185457
you have never ERPed with anyone
men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
>>
File: 1767436841756493.jpg (25 KB, 500x500)
25 KB
25 KB JPG
>>736180429
>>Google's new quantization method allows small cards to run HUGE models
Doesn't matter, local models still suck dick. The real question is whether we see a drop in RAM and GPU prices.
>>
>>736185774
RAM is already down 30%
But you're retarded so I didn't expect you to already be aware
>>
>>736185774
Woot had 32gb of DDR5 6000 for $270 earlier today. Which is still around 3x what it used to be but that's way down from what it was 2 weeks ago. Just gotta hope it keeps trending down.
>>
>>736185838
>30%
>still 3x from its 2023 price
Be sure to call me again when it reaches that point.
>>
File: flist.png (158 KB, 903x255)
158 KB
158 KB PNG
Rping with a machine its like playing a fighting game against the CPU, rp with the real people
>>
>>736186348
>erp with real person
>have to spend days if not weeks finding someone who shares my weird niche fetish
>have to either pre-plan a time to erp or hope they're available when i'm horny
>have to deal with whatever other fetishes they want to inject into the rp becaus it's that or spend another week looking for someone else
>have to hope they don't just nut and disappear leaving me with blue balls
yeah no thanks i'll stick to the AI that does exactly what i want whenever i want it
>>
Animators are cooked
https://video-s.twimg.com/amplify_video/2039313897828634624/vid/avc1/1920x1080/3uEKNBGsm0cQnhnl.mp4
>>
>High quality gpu in 2014 came with 8GB of vram
>Tripled 6 years later to 24GB in 2020 with the 3090
>Another 6 years later to 2026 and vram has only gone up to 32GB with the 5090, a 1.33x increase
Is there some technical bottleneck that's preventing more VRAM on GPUs or is this mostly Nvidia being kikes?
>>
>>736186824
What kind of pc do you need to generate something like this at home?
>>
File: 1684757236028589.jpg (60 KB, 1132x1183)
60 KB
60 KB JPG
>>736186348
Real people aren't my little slaves to run my daily short sessions of the same handful of stories for months on end whenever I tell them to
>>
>>736186824
>AI generated animation
>Still has that shitty CGI look
Why even bother
>>
>>736186976
You're not counting the increase in VRAM bandwidth, which has tripled/quadrupled in the last 12 years.
>>
>>736187165
you should be comparing this to actual cg, though, and it's so much better than any cg that the anime industry has produced.
>>
>>736180429
>spicychat.ai has a fit when trying to say the word cum in a first-person context
why is cum such a forbidden word, it's always "essence" BITCH it's called a cumshot not "cover me with your essence"
>>
>>736187760
I just realized I barely see pussy or vagina, it always goes straight to cunt
>>
>>736180429
So much retardation. Turbo quant is just for context. And it is 3bits. You could have been running a 3 bit quant of model since forever. And q3 70B doesn't fit into 24GB let alone 16GB. And 70B is outdated. I am on a pc and not even a server and i run glm4.6. kill yourself.
>>
So do you guys just go ah ah mistress for 10 messages and are done?
>>
>>736185565
>men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
lmao what, women are mostly into choking and shit, rarely does it go into guro, trust me on this
(you're a brown vanilla fag btw)
>>
>>736186584
what is your weird niche fetish?, share with us anon.
>>
>>736188575
used to do that but it got boring
now i directormaxx and i can't cum without minimum 50 messages of buildup
>>
>>736188575
No, I don't like replying with very little. However because of that, recently if I just want a quick lazy coom I tell the AI what to do in OOC and let it ramble on by itself. Technically lazier but less embarrassing looking. I then flip back over to writing like 4 paragraphs of narration
>>
>>736186348
The combination of my niche fetishes and the fact that I enjoy stretches of non-erotic adventuring (in which the fetishes are present in the background) because my preferred scenarios involve D&D-esque fantasy settings would make finding another person difficult even if I set aside a consistent schedule, which wouldn't be ideal because I sometimes just want a 5-10 minute beatoff session before bed. LLMs are always available, they follow along no matter how outlandish you get, and you don't have to look them in the eye the day after. Human role-playing would probably be better quality-wise but the advantages of LLMs make it no contest which I'll use. I don't need high art, ever since 2020 I've just wanted infinite Zork with porn. I feel top models have gotten a lot better with amputee stuff too so I don't have to wrangle it quite as hard as before.
>>
>>736186976
Why would they spend the money and engineering time to add more VRAM when games don't use more than what, 18GB?
>>
How degen can you make the stuff in Silly Tavern?
>>
>>736190739
i feel you, anon. i also like autism adventures with occasional lewd/fetishy bits sprinkled throughout, which like you said doesn't work for ERP with other people.
AI slop is the pefect outlet for my autistic tastes.
>>
>>736191084
Whatever you can think of really. Unless it's REALLY obtuse and specific I guess.
>>
>>736191084
Sillytavern is just a frontend, that's like asking "how much porn can I watch on my monitor?"
>>
>>736191259
Then I have missunderstood what it is. I thought that it was some sort of AI rp thing.
>>
>>736180429
Ok, so is there actually a 70B model that has been converted that isn't the censored trash?
>>
>>736191446
It is. It's a platform you can do AI rp on. It doesn't supply the AI to rp with though.
Kind of like how steam isn't a game but it lets you play them.
>>
>>736191446
Yes, its a frontend for that

You run the AI model either locally or connecting to a network that hosts it for you, that's the thing that makes the sexy words.
SillyTavern sends and retrieves those requests and displays the output in a human readable format that isn't an unformatted string of text in a terminal.
>>
>>736185565
Women are dead fish like real life. Did it for years with hundreds of women, yes I met some of them. They are lazy and AI is so much better I hope I never talk to a woman again.
>>
>>736180821
>this [industrial-grade fuck doll made with the entire purpose of being sexually attracted following these graphs about which designs are popular] IS MY WAIFUUUUUU
waifutards should be ground to raw biomass and used to feed server farm biofuel generators because jesus christ how delusional do you have to be to steal some IP chick like peach and think she's some fucked up permavirgin schizo bitch inside their own heads?
>>
>>736186976
Increasing vram would cut into Nvidia's non-consumer market where they charge exponentially more.
>>
>>736186348
>just find someone who's willing to write your hyperspecific fetish bro
if only it were that simple
>>
File: Yuruyuri167.jpg (76 KB, 1280x720)
76 KB
76 KB JPG
>>736180429
>Google's new quantization method allows small cards to run HUGE models
I have been waiting 2 years for bitnet to be real. Not falling for this again
>>
>>736191596
>>736191632
Why not just rp directly with the AI bot then? What does Silly Tavern actually do/ad to this?
>>
>>736192216
Did you completely miss the "formatted in a human readable way instead of raw text string in a terminal" part?
>>
>>736192216
It formats the prompt for the AI. This includes 'this is an RP, respond in this manner etc', full character description, entire chat log. All you have to do is put in the next response. With just the AI you would have to cut and paste this together every time. Part of this includes tricking the AI into bypassing the jew censor.
>>
>>736192284
I mean, doesn't something like chatgpt already do that?
>>
>>736192216
>Fomatting
>Persistent memory and instructions
>Additional features
If you need more analogies it's like loading up gmod flatgrass and nothing else
>>
>>736192432
>>736192463

Ah, I see.
Makes sense.
>>
>>736192216
because using an API raw means you have to manually send hundreds of lines of instructions to the AI in a carefully ordered way every time you want to say something
a frontend takes care of all that shit for you so you can just type ahh ahh mistress
>>
>>736192438
Do you think chatgpt's website is the raw text output and not a frontend?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.