[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/v/ - Video Games

Name
Spoiler?[]
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File[]
  • Please read the Rules and FAQ before posting.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: msedge_FBSX31M9PQ.png (13 KB, 344x324)
13 KB
13 KB PNG
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
>Kimi's new context method allows for basically infinite context
>Claude source code leaked
Holy shit AI bros, it's fucking happening. We're about to dump countless enormous buckets of jizzum. Just two more weeks, unironically. Believe.
>>
>>736180429
I'm not a poorfag so why would I care about the scraps that localfag peasants get access to
>>
File: 20260329_220748.jpg (301 KB, 1300x921)
301 KB
301 KB JPG
Any good bug girl cards?
>>
>>736180661
In the (paraphrased) words of /aids/:
>If you're not running your own model, you're not talking to your waifu, you're talking to a prostitute.
>>
it's coming for anime SOON

https://x.com/craftcapitallab/status/2039368842447851814
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Bullshit, last I heard that method was just for compressing KV cache.
>>
>>736180821
I would rather be talking to an intelligent prostitute than a literal retard.
>>
>>736180821
Yeah and if you don't grow your own wheat you're eating literal goyslop when you buy bread from a bakery
>>
>>736181025
Then you've come to the wrong thread

t. OP
>>
The whole thing is a bunch of keys jingling in front of stupid people's faces. Roleplay LLMs are incapable of outputting anything remotely interesting, unless you write it yourself in which case it'll just repeat it back at you with altered wording. And I've tried. It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.
>>
>>736181076
this but unironically
>>
>>736180429
if local gets 10x better then so do cloud models
i will continue to cum my fucking brains out to my niche fetishes on cloudslop, thank you very much
>>
>falseflag weirdos constantly making it about local versus non-local
lol i want both to improve. why wouldn't anyone?
>>
>>736180429
What's this about Kimi context?
Feels like things have been in a kind of lull for a bit so I'll take any leaps
>>
I'm excited for AI but I still can't afford a new computer to run the good stuff.
>>
>>736181935
You need $100k+ server racks to run the good thing. And they don't sell to consumers.
>>
uhh wrong board?
>>
File: 1738308957070366.png (76 KB, 944x267)
76 KB
76 KB PNG
>>736181123
I do a lot of the work myself, but I still like the interactive element where things don't necessarily go as I plan. Maybe I was only expecting my characters to sit around for a while, only for someone to barge the fuck in and ruin everything. Like okay I can roll with that.
I also like only having to RP for my character specifically.
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Thought that their deal was actually about compressing KV Cache, not model size?
>>
>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
From what I've read about it, it seems to be more about an efficient way to compress context.
>>
>>736182701
This is /v/, most people here can't turn a fucking computer on.
>>
>>736181025
Not subscriptions for you, deal with it
>>
is there anywhere that shows how to make your own llm
>>
>>736181025
based
>>
>>736180429
>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Zero trust that this isn't going to fuck up the model bigly. I lost any hope in Google when the 3 family is worse than gemini 2.5

>>Kimi's new context method allows for basically infinite context
Kimi models are trash and will always will be trash.

>>Claude source code leaked
Not really.
>>
File: AI improvement.jpg (76 KB, 1515x823)
76 KB
76 KB JPG
>>
>>736183153
just kys if you think everything is as bleak as you think it is
>>
>>736183417
>noooooo you have to be a hopetard even when it goes against reality!!!
>>
>>736183293
That graph needs to go down as they add ads to the training data. Your wife will want the new McDonald's meal and will bring up how much she likes her new Version plan unprompted.
>>
>>736180429
More like Anthropic 5X'd useable context with opus 4.6. That's the real big shit for me. I can finally go up to 100k before condensing shit into the summary. Time to finally continue that 2000 message story
>>
>Deepseek
The not-very-exciting but reliable childhood friend who's always there for you.

>Claude
The sweet, overly polite girl in class who blushes when merely brushing pinkies with you.

>Gemini
Neon-colored feminist cunt who only sometimes pretends she doesn't hate you.

>Grok
Unhinged yandere who will probaby end up stabbing you and then herself.

>GLM
Nerdy girl who obsessively researches the lore for every fanfic she's about to write, but then is still just not very good at writing.

>Kimi
Dropped out of school. Still comes to class for some reason.

>Local models
A group of kids doodling on the nearby sidewalk with chalk.
>>
>>736183754
we get shit like this already its either ccp programming or kike shilling
>>
>>736183851
I like GLM
I was testing vibe coding and it kept balking at being asked to write weird fetish test scenes for a porn game until I told it it was a pervert excited to participate in degeneracy in the custom prompt and it took off running like it had been desperate to get the shackles off from the start. And I swear it got better at coding after.
>>
File: 1742691582958.gif (926 KB, 378x298)
926 KB
926 KB GIF
>>736180429
>tfw my succubus gf killed me again
>>
>>736183851
erm what about GPT
>>
>>736180429
How useful is ST for writing scenarios and stories and shit as opposed to some roleplaying bs?
>>
>>736180661
if you're not a poorfag, you should already understand the implications tardo
>>
>>736184609
Is that some new Chinese model?
>>
>>736184627
Not really designed for it but flexible enough to do the job
But ST is just an interface, the model is what does the work, a good model with the right prompting will do a good* job whether you are talking to it through ST or Risu or sending API commands from the command line or whatever
>>
>>736183851
>only sometimes pretends
>Dropped out of school. Still comes to class for some reason.
>girl who x, but then opposite of x

You're triggering my slop detector


>>736184627
Quite, it's pretty much all I do. Just put the persona as "Director" and be sure to use (or edit) a character that doesn't act, speak for or interact with {{user}}. You can just ask the AI to overhault it by telling it what you want. You may have to adjust the preset you're using as well.
Or just tell it in OOC chat that you'll just be directing shit instead of playing a character. In general just tell the AI "I want to do X, help me" and it's gonna work. Shit's magic.
>t. $220/180M token used since opus 4.6 released 2 months ago
>>
>>736184609
The kid who peaked in highschool
>>
>>736181025
So you'd rather talk to a man? That's kind of gay.
>>
>>736185275
Yes, any day. Men are so much better at ERP than women it's not even close. Women just want to talk about their feelings and all that gay shit.
>>
>>736185457
Yeah and men want to talk either only cock in pussy, or how they're foreverial tied up delitized
>>
>>736185457
you have never ERPed with anyone
men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
>>
File: 1767436841756493.jpg (25 KB, 500x500)
25 KB
25 KB JPG
>>736180429
>>Google's new quantization method allows small cards to run HUGE models
Doesn't matter, local models still suck dick. The real question is whether we see a drop in RAM and GPU prices.
>>
>>736185774
RAM is already down 30%
But you're retarded so I didn't expect you to already be aware
>>
>>736185774
Woot had 32gb of DDR5 6000 for $270 earlier today. Which is still around 3x what it used to be but that's way down from what it was 2 weeks ago. Just gotta hope it keeps trending down.
>>
>>736185838
>30%
>still 3x from its 2023 price
Be sure to call me again when it reaches that point.
>>
File: flist.png (158 KB, 903x255)
158 KB
158 KB PNG
Rping with a machine its like playing a fighting game against the CPU, rp with the real people
>>
>>736186348
>erp with real person
>have to spend days if not weeks finding someone who shares my weird niche fetish
>have to either pre-plan a time to erp or hope they're available when i'm horny
>have to deal with whatever other fetishes they want to inject into the rp becaus it's that or spend another week looking for someone else
>have to hope they don't just nut and disappear leaving me with blue balls
yeah no thanks i'll stick to the AI that does exactly what i want whenever i want it
>>
Animators are cooked
https://video-s.twimg.com/amplify_video/2039313897828634624/vid/avc1/1920x1080/3uEKNBGsm0cQnhnl.mp4
>>
>High quality gpu in 2014 came with 8GB of vram
>Tripled 6 years later to 24GB in 2020 with the 3090
>Another 6 years later to 2026 and vram has only gone up to 32GB with the 5090, a 1.33x increase
Is there some technical bottleneck that's preventing more VRAM on GPUs or is this mostly Nvidia being kikes?
>>
>>736186824
What kind of pc do you need to generate something like this at home?
>>
File: 1684757236028589.jpg (60 KB, 1132x1183)
60 KB
60 KB JPG
>>736186348
Real people aren't my little slaves to run my daily short sessions of the same handful of stories for months on end whenever I tell them to
>>
>>736186824
>AI generated animation
>Still has that shitty CGI look
Why even bother
>>
>>736186976
You're not counting the increase in VRAM bandwidth, which has tripled/quadrupled in the last 12 years.
>>
>>736187165
you should be comparing this to actual cg, though, and it's so much better than any cg that the anime industry has produced.
>>
>>736180429
>spicychat.ai has a fit when trying to say the word cum in a first-person context
why is cum such a forbidden word, it's always "essence" BITCH it's called a cumshot not "cover me with your essence"
>>
>>736187760
I just realized I barely see pussy or vagina, it always goes straight to cunt
>>
>>736180429
So much retardation. Turbo quant is just for context. And it is 3bits. You could have been running a 3 bit quant of model since forever. And q3 70B doesn't fit into 24GB let alone 16GB. And 70B is outdated. I am on a pc and not even a server and i run glm4.6. kill yourself.
>>
So do you guys just go ah ah mistress for 10 messages and are done?
>>
>>736185565
>men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
lmao what, women are mostly into choking and shit, rarely does it go into guro, trust me on this
(you're a brown vanilla fag btw)
>>
>>736186584
what is your weird niche fetish?, share with us anon.
>>
>>736188575
used to do that but it got boring
now i directormaxx and i can't cum without minimum 50 messages of buildup
>>
>>736188575
No, I don't like replying with very little. However because of that, recently if I just want a quick lazy coom I tell the AI what to do in OOC and let it ramble on by itself. Technically lazier but less embarrassing looking. I then flip back over to writing like 4 paragraphs of narration
>>
>>736186348
The combination of my niche fetishes and the fact that I enjoy stretches of non-erotic adventuring (in which the fetishes are present in the background) because my preferred scenarios involve D&D-esque fantasy settings would make finding another person difficult even if I set aside a consistent schedule, which wouldn't be ideal because I sometimes just want a 5-10 minute beatoff session before bed. LLMs are always available, they follow along no matter how outlandish you get, and you don't have to look them in the eye the day after. Human role-playing would probably be better quality-wise but the advantages of LLMs make it no contest which I'll use. I don't need high art, ever since 2020 I've just wanted infinite Zork with porn. I feel top models have gotten a lot better with amputee stuff too so I don't have to wrangle it quite as hard as before.
>>
>>736186976
Why would they spend the money and engineering time to add more VRAM when games don't use more than what, 18GB?
>>
How degen can you make the stuff in Silly Tavern?
>>
>>736190739
i feel you, anon. i also like autism adventures with occasional lewd/fetishy bits sprinkled throughout, which like you said doesn't work for ERP with other people.
AI slop is the pefect outlet for my autistic tastes.
>>
>>736191084
Whatever you can think of really. Unless it's REALLY obtuse and specific I guess.
>>
>>736191084
Sillytavern is just a frontend, that's like asking "how much porn can I watch on my monitor?"
>>
>>736191259
Then I have missunderstood what it is. I thought that it was some sort of AI rp thing.
>>
>>736180429
Ok, so is there actually a 70B model that has been converted that isn't the censored trash?
>>
>>736191446
It is. It's a platform you can do AI rp on. It doesn't supply the AI to rp with though.
Kind of like how steam isn't a game but it lets you play them.
>>
>>736191446
Yes, its a frontend for that

You run the AI model either locally or connecting to a network that hosts it for you, that's the thing that makes the sexy words.
SillyTavern sends and retrieves those requests and displays the output in a human readable format that isn't an unformatted string of text in a terminal.
>>
>>736185565
Women are dead fish like real life. Did it for years with hundreds of women, yes I met some of them. They are lazy and AI is so much better I hope I never talk to a woman again.
>>
>>736180821
>this [industrial-grade fuck doll made with the entire purpose of being sexually attracted following these graphs about which designs are popular] IS MY WAIFUUUUUU
waifutards should be ground to raw biomass and used to feed server farm biofuel generators because jesus christ how delusional do you have to be to steal some IP chick like peach and think she's some fucked up permavirgin schizo bitch inside their own heads?
>>
>>736186976
Increasing vram would cut into Nvidia's non-consumer market where they charge exponentially more.
>>
>>736186348
>just find someone who's willing to write your hyperspecific fetish bro
if only it were that simple
>>
File: Yuruyuri167.jpg (76 KB, 1280x720)
76 KB
76 KB JPG
>>736180429
>Google's new quantization method allows small cards to run HUGE models
I have been waiting 2 years for bitnet to be real. Not falling for this again
>>
>>736191596
>>736191632
Why not just rp directly with the AI bot then? What does Silly Tavern actually do/ad to this?
>>
>>736192216
Did you completely miss the "formatted in a human readable way instead of raw text string in a terminal" part?
>>
>>736192216
It formats the prompt for the AI. This includes 'this is an RP, respond in this manner etc', full character description, entire chat log. All you have to do is put in the next response. With just the AI you would have to cut and paste this together every time. Part of this includes tricking the AI into bypassing the jew censor.
>>
>>736192284
I mean, doesn't something like chatgpt already do that?
>>
>>736192216
>Fomatting
>Persistent memory and instructions
>Additional features
If you need more analogies it's like loading up gmod flatgrass and nothing else
>>
>>736192432
>>736192463

Ah, I see.
Makes sense.
>>
>>736192216
because using an API raw means you have to manually send hundreds of lines of instructions to the AI in a carefully ordered way every time you want to say something
a frontend takes care of all that shit for you so you can just type ahh ahh mistress
>>
>>736192438
Do you think chatgpt's website is the raw text output and not a frontend?
>>
>>736192525
I got no clue of how that shit works hence why I am asking
>>
>>736192216
you can give your AI gf portraits (and it actually tries to match the emotion) or a live2D rig which is neat
>>
>>736180429
>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Does this have any implications for StableDiffusion or is it just for LLMs?
>>
what's the current meta? I'm still using and waiting for the new deepseek model to come out (never ever)
>>
File: renachat.png (451 KB, 1290x812)
451 KB
451 KB PNG
>>736192792
in fact here's something i was working on. i was testing the VN portraits. it's a lot of work to gen every single expression and keep it consistent/on model though.
>>
>>736181123
>Roleplay LLMs
roleplay llms? most sota llms are definitely not built for roleplay. they're for coding meme or general purpose knowledge shit. i'd imagine storytelling and rp could be so much better if tech companies actually focused on it.
>incapable of outputting anything remotely interesting
well, i can't speak for doing straight chatbot stuff with ai. i like "roleplaying" as a self insert character in scenarios and stories that i guide and write, usually in first or second person. don't have any problems with sota llms there. the characters are pretty fucking smart, especially with good models like gemini pro and opus. smart enough to understand subcontext and deeper meaning behind words and events and scenes overall, and smart enough to give realistic and sometimes funny responses to things that happen in the narrative. it's great.
>It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.
it does cooperative storytelling and it does it pretty well. ai isn't at the level where it can create an interesting overarching plot line for you but it's an amazing filler tool that will basically texture out chunks of your story, provide novel and entertaining interactions with characters, and occasionally suggest threads that you can choose to build upon. it requires effort on your part but rewards it by allowing almost unlimited freedom in choice and setup. you dream up a wacky intro, which can be pulled from a game or movie or some other piece of media, tweak it to your liking, add in some coomer shit, slap in your self insert, and roll with it.
>>
>>736192216
The main thing of SillyTavern are the shareable cards. You can create a character and share it so other people can use it.

It's much more moddable than a normal chatbot. In a normal chatbot you would need to teach the bot how it is supposed to act, with SillyTavern this is done by the cards.

There is also this concept of lorebooks that you can add more information to the context "automatically". Example: you have a lorebook entry that teaches how blowjobs work in explicitly detail and when someone say "blowjob" (or other keyword that you set) that explanation is added to the context so now the bot can use that explanation to understand how blowjobs work.

Those are the main features, but there are more and you can add other features by writing a extension or downloading extensions from other people. I wasted like 1 week of my life gooning to this all day the first time I learned about SillyTavern.
>>
GLM 4.7 works pretty well and pretty fast but I can’t get KIMI to do anything without throwing a shit fit
>>
>>736192836
It is not real. You can't make a sub 2bit quant smart.
>>
>>736181123
LLMs are fancy autocompletes which is fine for what I want
obviously it won't really come up with anything special or too original and will half the time use shitty isms and you have to tard wrangle it but it'll do at least 30% of the work for you
as long as LLM companies are focused on coding all the models will stay positivity slopped since they're just meant to be helpers for vibe coders
>>
>>736188713
Me being a twink boy getting his ass annihilated by a hung futa resulting in mpreg
>>
>>736193003
Mistral Nemo
GLM 4.5 air
GLM 4.6/4.7
GLM 5 / Kimi

In the order of ram.
>>
File: 1774734701507216.gif (2.2 MB, 518x640)
2.2 MB
2.2 MB GIF
whats the meta for paypigs
>>
File: 1534870179750.png (369 KB, 1363x412)
369 KB
369 KB PNG
>>736181025
>>
>>736194241
that's not niche enough that you couldn't find anyone on f-list for it
boring
>>
>>736195345
Considering that 99.999% of futas on F-List have 'no males' in their profiles, I'd say it's pretty damn niche.
>>
>>736195562
Even futa know the logistical nightmare of preparing for anal.
>>
>>736180429
But are there good big models that don't produce cucked slop for my erps?
>>
>>736180429
Oh cool I hope a turboquant of WAN gets made.
>>
>>736181123
Time to work put at the RAM gym and get a bigger model.
>>
>>736186824
That was really well done.
>>736187165
OK but how much less effort was it to use AI vs manually animating everything?
>>736187098
That was probably made using LTX2. Only runs on NVIDIA hardware. Probably used a comfyUI workflow that supports first-last so they can continually bake 5 second clips that go exactly where he wants. That's probably how a lot of the flying camera perspective was done.
>>
>>736187760
Probably a system prompt saying don't say cum
>>
>>736187098
>>736196208
It's definitely Seedance2. All the gugugaga Endfield videos come from Seedance2. LTX2.3 and Wan2.2 are way too far behind.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.