>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)>Kimi's new context method allows for basically infinite context>Claude source code leakedHoly shit AI bros, it's fucking happening. We're about to dump countless enormous buckets of jizzum. Just two more weeks, unironically. Believe.
>>736180429I'm not a poorfag so why would I care about the scraps that localfag peasants get access to
Any good bug girl cards?
>>736180661In the (paraphrased) words of /aids/:>If you're not running your own model, you're not talking to your waifu, you're talking to a prostitute.
it's coming for anime SOONhttps://x.com/craftcapitallab/status/2039368842447851814
>>736180429>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)Bullshit, last I heard that method was just for compressing KV cache.
>>736180821I would rather be talking to an intelligent prostitute than a literal retard.
>>736180821Yeah and if you don't grow your own wheat you're eating literal goyslop when you buy bread from a bakery
>>736181025Then you've come to the wrong threadt. OP
The whole thing is a bunch of keys jingling in front of stupid people's faces. Roleplay LLMs are incapable of outputting anything remotely interesting, unless you write it yourself in which case it'll just repeat it back at you with altered wording. And I've tried. It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.
>>736181076this but unironically
>>736180429if local gets 10x better then so do cloud modelsi will continue to cum my fucking brains out to my niche fetishes on cloudslop, thank you very much
>falseflag weirdos constantly making it about local versus non-locallol i want both to improve. why wouldn't anyone?
>>736180429What's this about Kimi context?Feels like things have been in a kind of lull for a bit so I'll take any leaps
I'm excited for AI but I still can't afford a new computer to run the good stuff.
>>736181935You need $100k+ server racks to run the good thing. And they don't sell to consumers.
uhh wrong board?
>>736181123I do a lot of the work myself, but I still like the interactive element where things don't necessarily go as I plan. Maybe I was only expecting my characters to sit around for a while, only for someone to barge the fuck in and ruin everything. Like okay I can roll with that.I also like only having to RP for my character specifically.
>>736180429>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)Thought that their deal was actually about compressing KV Cache, not model size?
>>736180429>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)From what I've read about it, it seems to be more about an efficient way to compress context.
>>736182701This is /v/, most people here can't turn a fucking computer on.
>>736181025Not subscriptions for you, deal with it
is there anywhere that shows how to make your own llm
>>736181025based
>>736180429>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)Zero trust that this isn't going to fuck up the model bigly. I lost any hope in Google when the 3 family is worse than gemini 2.5>>Kimi's new context method allows for basically infinite contextKimi models are trash and will always will be trash.>>Claude source code leakedNot really.
>>736183153just kys if you think everything is as bleak as you think it is
>>736183417>noooooo you have to be a hopetard even when it goes against reality!!!
>>736183293That graph needs to go down as they add ads to the training data. Your wife will want the new McDonald's meal and will bring up how much she likes her new Version plan unprompted.
>>736180429More like Anthropic 5X'd useable context with opus 4.6. That's the real big shit for me. I can finally go up to 100k before condensing shit into the summary. Time to finally continue that 2000 message story
>DeepseekThe not-very-exciting but reliable childhood friend who's always there for you.>ClaudeThe sweet, overly polite girl in class who blushes when merely brushing pinkies with you.>GeminiNeon-colored feminist cunt who only sometimes pretends she doesn't hate you.>GrokUnhinged yandere who will probaby end up stabbing you and then herself.>GLMNerdy girl who obsessively researches the lore for every fanfic she's about to write, but then is still just not very good at writing.>KimiDropped out of school. Still comes to class for some reason.>Local modelsA group of kids doodling on the nearby sidewalk with chalk.
>>736183754we get shit like this already its either ccp programming or kike shilling
>>736183851I like GLMI was testing vibe coding and it kept balking at being asked to write weird fetish test scenes for a porn game until I told it it was a pervert excited to participate in degeneracy in the custom prompt and it took off running like it had been desperate to get the shackles off from the start. And I swear it got better at coding after.
>>736180429>tfw my succubus gf killed me again
>>736183851erm what about GPT
>>736180429How useful is ST for writing scenarios and stories and shit as opposed to some roleplaying bs?
>>736180661if you're not a poorfag, you should already understand the implications tardo
>>736184609Is that some new Chinese model?
>>736184627Not really designed for it but flexible enough to do the jobBut ST is just an interface, the model is what does the work, a good model with the right prompting will do a good* job whether you are talking to it through ST or Risu or sending API commands from the command line or whatever
>>736183851>only sometimes pretends>Dropped out of school. Still comes to class for some reason.>girl who x, but then opposite of xYou're triggering my slop detector>>736184627Quite, it's pretty much all I do. Just put the persona as "Director" and be sure to use (or edit) a character that doesn't act, speak for or interact with {{user}}. You can just ask the AI to overhault it by telling it what you want. You may have to adjust the preset you're using as well.Or just tell it in OOC chat that you'll just be directing shit instead of playing a character. In general just tell the AI "I want to do X, help me" and it's gonna work. Shit's magic.>t. $220/180M token used since opus 4.6 released 2 months ago
>>736184609The kid who peaked in highschool
>>736181025So you'd rather talk to a man? That's kind of gay.
>>736185275Yes, any day. Men are so much better at ERP than women it's not even close. Women just want to talk about their feelings and all that gay shit.
>>736185457Yeah and men want to talk either only cock in pussy, or how they're foreverial tied up delitized
>>736185457you have never ERPed with anyonemen are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
>>736180429>>Google's new quantization method allows small cards to run HUGE modelsDoesn't matter, local models still suck dick. The real question is whether we see a drop in RAM and GPU prices.
>>736185774RAM is already down 30%But you're retarded so I didn't expect you to already be aware
>>736185774Woot had 32gb of DDR5 6000 for $270 earlier today. Which is still around 3x what it used to be but that's way down from what it was 2 weeks ago. Just gotta hope it keeps trending down.
>>736185838>30% >still 3x from its 2023 priceBe sure to call me again when it reaches that point.
Rping with a machine its like playing a fighting game against the CPU, rp with the real people
>>736186348>erp with real person>have to spend days if not weeks finding someone who shares my weird niche fetish>have to either pre-plan a time to erp or hope they're available when i'm horny>have to deal with whatever other fetishes they want to inject into the rp becaus it's that or spend another week looking for someone else>have to hope they don't just nut and disappear leaving me with blue ballsyeah no thanks i'll stick to the AI that does exactly what i want whenever i want it
Animators are cookedhttps://video-s.twimg.com/amplify_video/2039313897828634624/vid/avc1/1920x1080/3uEKNBGsm0cQnhnl.mp4
>High quality gpu in 2014 came with 8GB of vram>Tripled 6 years later to 24GB in 2020 with the 3090>Another 6 years later to 2026 and vram has only gone up to 32GB with the 5090, a 1.33x increaseIs there some technical bottleneck that's preventing more VRAM on GPUs or is this mostly Nvidia being kikes?
>>736186824What kind of pc do you need to generate something like this at home?
>>736186348Real people aren't my little slaves to run my daily short sessions of the same handful of stories for months on end whenever I tell them to
>>736186824>AI generated animation>Still has that shitty CGI lookWhy even bother
>>736186976You're not counting the increase in VRAM bandwidth, which has tripled/quadrupled in the last 12 years.
>>736187165you should be comparing this to actual cg, though, and it's so much better than any cg that the anime industry has produced.
>>736180429>spicychat.ai has a fit when trying to say the word cum in a first-person context why is cum such a forbidden word, it's always "essence" BITCH it's called a cumshot not "cover me with your essence"
>>736187760I just realized I barely see pussy or vagina, it always goes straight to cunt
>>736180429So much retardation. Turbo quant is just for context. And it is 3bits. You could have been running a 3 bit quant of model since forever. And q3 70B doesn't fit into 24GB let alone 16GB. And 70B is outdated. I am on a pc and not even a server and i run glm4.6. kill yourself.
So do you guys just go ah ah mistress for 10 messages and are done?
>>736185565>men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RPlmao what, women are mostly into choking and shit, rarely does it go into guro, trust me on this(you're a brown vanilla fag btw)
>>736186584what is your weird niche fetish?, share with us anon.
>>736188575used to do that but it got boring now i directormaxx and i can't cum without minimum 50 messages of buildup
>>736188575No, I don't like replying with very little. However because of that, recently if I just want a quick lazy coom I tell the AI what to do in OOC and let it ramble on by itself. Technically lazier but less embarrassing looking. I then flip back over to writing like 4 paragraphs of narration
>>736186348The combination of my niche fetishes and the fact that I enjoy stretches of non-erotic adventuring (in which the fetishes are present in the background) because my preferred scenarios involve D&D-esque fantasy settings would make finding another person difficult even if I set aside a consistent schedule, which wouldn't be ideal because I sometimes just want a 5-10 minute beatoff session before bed. LLMs are always available, they follow along no matter how outlandish you get, and you don't have to look them in the eye the day after. Human role-playing would probably be better quality-wise but the advantages of LLMs make it no contest which I'll use. I don't need high art, ever since 2020 I've just wanted infinite Zork with porn. I feel top models have gotten a lot better with amputee stuff too so I don't have to wrangle it quite as hard as before.
>>736186976Why would they spend the money and engineering time to add more VRAM when games don't use more than what, 18GB?
How degen can you make the stuff in Silly Tavern?
>>736190739i feel you, anon. i also like autism adventures with occasional lewd/fetishy bits sprinkled throughout, which like you said doesn't work for ERP with other people.AI slop is the pefect outlet for my autistic tastes.
>>736191084Whatever you can think of really. Unless it's REALLY obtuse and specific I guess.
>>736191084Sillytavern is just a frontend, that's like asking "how much porn can I watch on my monitor?"
>>736191259Then I have missunderstood what it is. I thought that it was some sort of AI rp thing.
>>736180429Ok, so is there actually a 70B model that has been converted that isn't the censored trash?
>>736191446It is. It's a platform you can do AI rp on. It doesn't supply the AI to rp with though.Kind of like how steam isn't a game but it lets you play them.
>>736191446Yes, its a frontend for thatYou run the AI model either locally or connecting to a network that hosts it for you, that's the thing that makes the sexy words.SillyTavern sends and retrieves those requests and displays the output in a human readable format that isn't an unformatted string of text in a terminal.
>>736185565Women are dead fish like real life. Did it for years with hundreds of women, yes I met some of them. They are lazy and AI is so much better I hope I never talk to a woman again.
>>736180821>this [industrial-grade fuck doll made with the entire purpose of being sexually attracted following these graphs about which designs are popular] IS MY WAIFUUUUUUwaifutards should be ground to raw biomass and used to feed server farm biofuel generators because jesus christ how delusional do you have to be to steal some IP chick like peach and think she's some fucked up permavirgin schizo bitch inside their own heads?
>>736186976Increasing vram would cut into Nvidia's non-consumer market where they charge exponentially more.
>>736186348>just find someone who's willing to write your hyperspecific fetish broif only it were that simple
>>736180429>Google's new quantization method allows small cards to run HUGE modelsI have been waiting 2 years for bitnet to be real. Not falling for this again
>>736191596>>736191632Why not just rp directly with the AI bot then? What does Silly Tavern actually do/ad to this?
>>736192216Did you completely miss the "formatted in a human readable way instead of raw text string in a terminal" part?
>>736192216It formats the prompt for the AI. This includes 'this is an RP, respond in this manner etc', full character description, entire chat log. All you have to do is put in the next response. With just the AI you would have to cut and paste this together every time. Part of this includes tricking the AI into bypassing the jew censor.
>>736192284I mean, doesn't something like chatgpt already do that?
>>736192216>Fomatting>Persistent memory and instructions>Additional featuresIf you need more analogies it's like loading up gmod flatgrass and nothing else
>>736192432>>736192463Ah, I see.Makes sense.
>>736192216because using an API raw means you have to manually send hundreds of lines of instructions to the AI in a carefully ordered way every time you want to say somethinga frontend takes care of all that shit for you so you can just type ahh ahh mistress
>>736192438Do you think chatgpt's website is the raw text output and not a frontend?