/v/ - >Google's new quantization method allows small car - Video Games


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
04/01/26(Wed)15:11:36 No.736180429

File: msedge_FBSX31M9PQ.png (13 KB, 344x324)

Anonymous 04/01/26(Wed)15:11:36 No.736180429

>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
>Kimi's new context method allows for basically infinite context
>Claude source code leaked
Holy shit AI bros, it's fucking happening. We're about to dump countless enormous buckets of jizzum. Just two more weeks, unironically. Believe.

Anonymous
04/01/26(Wed)15:15:32 No.736180661

Anonymous 04/01/26(Wed)15:15:32 No.736180661

>>736180429
I'm not a poorfag so why would I care about the scraps that localfag peasants get access to

Anonymous
04/01/26(Wed)15:17:09 No.736180762

Anonymous 04/01/26(Wed)15:17:09 No.736180762

File: 20260329_220748.jpg (301 KB, 1300x921)

301 KB JPG

Any good bug girl cards?

Anonymous
04/01/26(Wed)15:18:02 No.736180821

Anonymous 04/01/26(Wed)15:18:02 No.736180821

>>736180661
In the (paraphrased) words of /aids/:
>If you're not running your own model, you're not talking to your waifu, you're talking to a prostitute.

Anonymous
04/01/26(Wed)15:19:38 No.736180925

Anonymous 04/01/26(Wed)15:19:38 No.736180925

it's coming for anime SOON

https://x.com/craftcapitallab/status/2039368842447851814

Anonymous
04/01/26(Wed)15:21:04 No.736181015

Anonymous 04/01/26(Wed)15:21:04 No.736181015

>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Bullshit, last I heard that method was just for compressing KV cache.

Anonymous
04/01/26(Wed)15:21:14 No.736181025

Anonymous 04/01/26(Wed)15:21:14 No.736181025

File: Screenshot_20260222_052935_X.jpg (275 KB, 935x647)

275 KB JPG

>>736180821
I would rather be talking to an intelligent prostitute than a literal retard.

Anonymous
04/01/26(Wed)15:22:17 No.736181076

Anonymous 04/01/26(Wed)15:22:17 No.736181076

>>736180821
Yeah and if you don't grow your own wheat you're eating literal goyslop when you buy bread from a bakery

Anonymous
04/01/26(Wed)15:22:26 No.736181089

Anonymous 04/01/26(Wed)15:22:26 No.736181089

>>736181025
Then you've come to the wrong thread

t. OP

Anonymous
04/01/26(Wed)15:22:53 No.736181123

Anonymous 04/01/26(Wed)15:22:53 No.736181123

The whole thing is a bunch of keys jingling in front of stupid people's faces. Roleplay LLMs are incapable of outputting anything remotely interesting, unless you write it yourself in which case it'll just repeat it back at you with altered wording. And I've tried. It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.

Anonymous
04/01/26(Wed)15:23:27 No.736181168

Anonymous 04/01/26(Wed)15:23:27 No.736181168

>>736181076
this but unironically

Anonymous
04/01/26(Wed)15:23:46 No.736181190

Anonymous 04/01/26(Wed)15:23:46 No.736181190

>>736180429
if local gets 10x better then so do cloud models
i will continue to cum my fucking brains out to my niche fetishes on cloudslop, thank you very much

Anonymous
04/01/26(Wed)15:29:24 No.736181584

Anonymous 04/01/26(Wed)15:29:24 No.736181584

>falseflag weirdos constantly making it about local versus non-local
lol i want both to improve. why wouldn't anyone?

Anonymous
04/01/26(Wed)15:33:26 No.736181838

Anonymous 04/01/26(Wed)15:33:26 No.736181838

File: And gandalf the grey and (...).png (276 KB, 662x268)

276 KB PNG

>>736180429
What's this about Kimi context?
Feels like things have been in a kind of lull for a bit so I'll take any leaps

Anonymous
04/01/26(Wed)15:35:07 No.736181935

Anonymous 04/01/26(Wed)15:35:07 No.736181935

I'm excited for AI but I still can't afford a new computer to run the good stuff.

Anonymous
04/01/26(Wed)15:42:43 No.736182378

Anonymous 04/01/26(Wed)15:42:43 No.736182378

>>736181935
You need $100k+ server racks to run the good thing. And they don't sell to consumers.

Anonymous
04/01/26(Wed)15:46:28 No.736182598

Anonymous 04/01/26(Wed)15:46:28 No.736182598

uhh wrong board?

Anonymous
04/01/26(Wed)15:48:01 No.736182694

Anonymous 04/01/26(Wed)15:48:01 No.736182694

File: 1738308957070366.png (76 KB, 944x267)

76 KB PNG

>>736181123
I do a lot of the work myself, but I still like the interactive element where things don't necessarily go as I plan. Maybe I was only expecting my characters to sit around for a while, only for someone to barge the fuck in and ruin everything. Like okay I can roll with that.
I also like only having to RP for my character specifically.

Anonymous
04/01/26(Wed)15:48:11 No.736182701

Anonymous 04/01/26(Wed)15:48:11 No.736182701

>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Thought that their deal was actually about compressing KV Cache, not model size?

Anonymous
04/01/26(Wed)15:49:16 No.736182771

Anonymous 04/01/26(Wed)15:49:16 No.736182771

>>736180429
>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
From what I've read about it, it seems to be more about an efficient way to compress context.

Anonymous
04/01/26(Wed)15:52:35 No.736182984

Anonymous 04/01/26(Wed)15:52:35 No.736182984

>>736182701
This is /v/, most people here can't turn a fucking computer on.

Anonymous
04/01/26(Wed)15:53:52 No.736183063

Anonymous 04/01/26(Wed)15:53:52 No.736183063

>>736181025
Not subscriptions for you, deal with it

Anonymous
04/01/26(Wed)15:54:19 No.736183105

Anonymous 04/01/26(Wed)15:54:19 No.736183105

is there anywhere that shows how to make your own llm

Anonymous
04/01/26(Wed)15:55:05 No.736183149

Anonymous 04/01/26(Wed)15:55:05 No.736183149

>>736181025
based

Anonymous
04/01/26(Wed)15:55:08 No.736183153

Anonymous 04/01/26(Wed)15:55:08 No.736183153

>>736180429
>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Zero trust that this isn't going to fuck up the model bigly. I lost any hope in Google when the 3 family is worse than gemini 2.5

>>Kimi's new context method allows for basically infinite context
Kimi models are trash and will always will be trash.

>>Claude source code leaked
Not really.

Anonymous
04/01/26(Wed)15:57:27 No.736183293

Anonymous 04/01/26(Wed)15:57:27 No.736183293

File: AI improvement.jpg (76 KB, 1515x823)

76 KB JPG

Anonymous
04/01/26(Wed)15:59:38 No.736183417

Anonymous 04/01/26(Wed)15:59:38 No.736183417

>>736183153
just kys if you think everything is as bleak as you think it is

Anonymous
04/01/26(Wed)16:01:50 No.736183534

Anonymous 04/01/26(Wed)16:01:50 No.736183534

>>736183417
>noooooo you have to be a hopetard even when it goes against reality!!!

Anonymous
04/01/26(Wed)16:05:19 No.736183754

Anonymous 04/01/26(Wed)16:05:19 No.736183754

>>736183293
That graph needs to go down as they add ads to the training data. Your wife will want the new McDonald's meal and will bring up how much she likes her new Version plan unprompted.

Anonymous
04/01/26(Wed)16:06:14 No.736183818

Anonymous 04/01/26(Wed)16:06:14 No.736183818

>>736180429
More like Anthropic 5X'd useable context with opus 4.6. That's the real big shit for me. I can finally go up to 100k before condensing shit into the summary. Time to finally continue that 2000 message story

Anonymous
04/01/26(Wed)16:06:46 No.736183851

Anonymous 04/01/26(Wed)16:06:46 No.736183851

File: anime-school-building-ent(...).jpg (63 KB, 1024x768)

63 KB JPG

>Deepseek
The not-very-exciting but reliable childhood friend who's always there for you.

>Claude
The sweet, overly polite girl in class who blushes when merely brushing pinkies with you.

>Gemini
Neon-colored feminist cunt who only sometimes pretends she doesn't hate you.

>Grok
Unhinged yandere who will probaby end up stabbing you and then herself.

>GLM
Nerdy girl who obsessively researches the lore for every fanfic she's about to write, but then is still just not very good at writing.

>Kimi
Dropped out of school. Still comes to class for some reason.

>Local models
A group of kids doodling on the nearby sidewalk with chalk.

Anonymous
04/01/26(Wed)16:08:03 No.736183932

Anonymous 04/01/26(Wed)16:08:03 No.736183932

>>736183754
we get shit like this already its either ccp programming or kike shilling

Anonymous
04/01/26(Wed)16:14:54 No.736184375

Anonymous 04/01/26(Wed)16:14:54 No.736184375

>>736183851
I like GLM
I was testing vibe coding and it kept balking at being asked to write weird fetish test scenes for a porn game until I told it it was a pervert excited to participate in degeneracy in the custom prompt and it took off running like it had been desperate to get the shackles off from the start. And I swear it got better at coding after.

Anonymous
04/01/26(Wed)16:17:11 No.736184536

Anonymous 04/01/26(Wed)16:17:11 No.736184536

File: 1742691582958.gif (926 KB, 378x298)

926 KB GIF

>>736180429
>tfw my succubus gf killed me again

Anonymous
04/01/26(Wed)16:18:30 No.736184609

Anonymous 04/01/26(Wed)16:18:30 No.736184609

>>736183851
erm what about GPT

Anonymous
04/01/26(Wed)16:18:45 No.736184627

Anonymous 04/01/26(Wed)16:18:45 No.736184627

>>736180429
How useful is ST for writing scenarios and stories and shit as opposed to some roleplaying bs?

Anonymous
04/01/26(Wed)16:18:48 No.736184630

Anonymous 04/01/26(Wed)16:18:48 No.736184630

>>736180661
if you're not a poorfag, you should already understand the implications tardo

Anonymous
04/01/26(Wed)16:20:16 No.736184718

Anonymous 04/01/26(Wed)16:20:16 No.736184718

>>736184609
Is that some new Chinese model?

Anonymous
04/01/26(Wed)16:21:41 No.736184802

Anonymous 04/01/26(Wed)16:21:41 No.736184802

>>736184627
Not really designed for it but flexible enough to do the job
But ST is just an interface, the model is what does the work, a good model with the right prompting will do a good* job whether you are talking to it through ST or Risu or sending API commands from the command line or whatever

Anonymous
04/01/26(Wed)16:23:42 No.736184921

Anonymous 04/01/26(Wed)16:23:42 No.736184921

>>736183851
>only sometimes pretends
>Dropped out of school. Still comes to class for some reason.
>girl who x, but then opposite of x

You're triggering my slop detector

>>736184627
Quite, it's pretty much all I do. Just put the persona as "Director" and be sure to use (or edit) a character that doesn't act, speak for or interact with {{user}}. You can just ask the AI to overhault it by telling it what you want. You may have to adjust the preset you're using as well.
Or just tell it in OOC chat that you'll just be directing shit instead of playing a character. In general just tell the AI "I want to do X, help me" and it's gonna work. Shit's magic.
>t. $220/180M token used since opus 4.6 released 2 months ago

Anonymous
04/01/26(Wed)16:25:55 No.736185059

Anonymous 04/01/26(Wed)16:25:55 No.736185059

>>736184609
The kid who peaked in highschool

Anonymous
04/01/26(Wed)16:29:55 No.736185275

Anonymous 04/01/26(Wed)16:29:55 No.736185275

>>736181025
So you'd rather talk to a man? That's kind of gay.

Anonymous
04/01/26(Wed)16:32:46 No.736185457

Anonymous 04/01/26(Wed)16:32:46 No.736185457

>>736185275
Yes, any day. Men are so much better at ERP than women it's not even close. Women just want to talk about their feelings and all that gay shit.

Anonymous
04/01/26(Wed)16:34:56 No.736185564

Anonymous 04/01/26(Wed)16:34:56 No.736185564

>>736185457
Yeah and men want to talk either only cock in pussy, or how they're foreverial tied up delitized

Anonymous
04/01/26(Wed)16:34:56 No.736185565

Anonymous 04/01/26(Wed)16:34:56 No.736185565

>>736185457
you have never ERPed with anyone
men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP

Anonymous
04/01/26(Wed)16:38:47 No.736185774

Anonymous 04/01/26(Wed)16:38:47 No.736185774

File: 1767436841756493.jpg (25 KB, 500x500)

25 KB JPG

>>736180429
>>Google's new quantization method allows small cards to run HUGE models
Doesn't matter, local models still suck dick. The real question is whether we see a drop in RAM and GPU prices.

Anonymous
04/01/26(Wed)16:39:40 No.736185838

Anonymous 04/01/26(Wed)16:39:40 No.736185838

>>736185774
RAM is already down 30%
But you're retarded so I didn't expect you to already be aware

Anonymous
04/01/26(Wed)16:41:35 No.736185972

Anonymous 04/01/26(Wed)16:41:35 No.736185972

>>736185774
Woot had 32gb of DDR5 6000 for $270 earlier today. Which is still around 3x what it used to be but that's way down from what it was 2 weeks ago. Just gotta hope it keeps trending down.

Anonymous
04/01/26(Wed)16:42:29 No.736186024

Anonymous 04/01/26(Wed)16:42:29 No.736186024

>>736185838
>30%
>still 3x from its 2023 price
Be sure to call me again when it reaches that point.

Anonymous
04/01/26(Wed)16:47:42 No.736186348

Anonymous 04/01/26(Wed)16:47:42 No.736186348

File: flist.png (158 KB, 903x255)

158 KB PNG

Rping with a machine its like playing a fighting game against the CPU, rp with the real people

Anonymous
04/01/26(Wed)16:51:43 No.736186584

Anonymous 04/01/26(Wed)16:51:43 No.736186584

>>736186348
>erp with real person
>have to spend days if not weeks finding someone who shares my weird niche fetish
>have to either pre-plan a time to erp or hope they're available when i'm horny
>have to deal with whatever other fetishes they want to inject into the rp becaus it's that or spend another week looking for someone else
>have to hope they don't just nut and disappear leaving me with blue balls
yeah no thanks i'll stick to the AI that does exactly what i want whenever i want it

Anonymous
04/01/26(Wed)16:55:59 No.736186824

Anonymous 04/01/26(Wed)16:55:59 No.736186824

Animators are cooked
https://video-s.twimg.com/amplify_video/2039313897828634624/vid/avc1/1920x1080/3uEKNBGsm0cQnhnl.mp4

Anonymous
04/01/26(Wed)16:58:11 No.736186976

Anonymous 04/01/26(Wed)16:58:11 No.736186976

>High quality gpu in 2014 came with 8GB of vram
>Tripled 6 years later to 24GB in 2020 with the 3090
>Another 6 years later to 2026 and vram has only gone up to 32GB with the 5090, a 1.33x increase
Is there some technical bottleneck that's preventing more VRAM on GPUs or is this mostly Nvidia being kikes?

Anonymous
04/01/26(Wed)17:00:05 No.736187098

Anonymous 04/01/26(Wed)17:00:05 No.736187098

>>736186824
What kind of pc do you need to generate something like this at home?

Anonymous
04/01/26(Wed)17:00:34 No.736187117

Anonymous 04/01/26(Wed)17:00:34 No.736187117

File: 1684757236028589.jpg (60 KB, 1132x1183)

60 KB JPG

>>736186348
Real people aren't my little slaves to run my daily short sessions of the same handful of stories for months on end whenever I tell them to

Anonymous
04/01/26(Wed)17:01:36 No.736187165

Anonymous 04/01/26(Wed)17:01:36 No.736187165

>>736186824
>AI generated animation
>Still has that shitty CGI look
Why even bother

Anonymous
04/01/26(Wed)17:03:59 No.736187308

Anonymous 04/01/26(Wed)17:03:59 No.736187308

>>736186976
You're not counting the increase in VRAM bandwidth, which has tripled/quadrupled in the last 12 years.

Anonymous
04/01/26(Wed)17:09:13 No.736187579

Anonymous 04/01/26(Wed)17:09:13 No.736187579

>>736187165
you should be comparing this to actual cg, though, and it's so much better than any cg that the anime industry has produced.

Anonymous
04/01/26(Wed)17:12:24 No.736187760

Anonymous 04/01/26(Wed)17:12:24 No.736187760

>>736180429
>spicychat.ai has a fit when trying to say the word cum in a first-person context
why is cum such a forbidden word, it's always "essence" BITCH it's called a cumshot not "cover me with your essence"

Anonymous
04/01/26(Wed)17:14:29 No.736187858

Anonymous 04/01/26(Wed)17:14:29 No.736187858

>>736187760
I just realized I barely see pussy or vagina, it always goes straight to cunt

Anonymous
04/01/26(Wed)17:17:14 No.736188003

Anonymous 04/01/26(Wed)17:17:14 No.736188003

>>736180429
So much retardation. Turbo quant is just for context. And it is 3bits. You could have been running a 3 bit quant of model since forever. And q3 70B doesn't fit into 24GB let alone 16GB. And 70B is outdated. I am on a pc and not even a server and i run glm4.6. kill yourself.

Anonymous
04/01/26(Wed)17:26:38 No.736188575

Anonymous 04/01/26(Wed)17:26:38 No.736188575

So do you guys just go ah ah mistress for 10 messages and are done?

Anonymous
04/01/26(Wed)17:28:38 No.736188712

Anonymous 04/01/26(Wed)17:28:38 No.736188712

>>736185565
>men are the ones who want aappy romance shit, women are only into shit like incest rape cannibalism RP
lmao what, women are mostly into choking and shit, rarely does it go into guro, trust me on this
(you're a brown vanilla fag btw)

Anonymous
04/01/26(Wed)17:28:40 No.736188713

Anonymous 04/01/26(Wed)17:28:40 No.736188713

>>736186584
what is your weird niche fetish?, share with us anon.

Anonymous
04/01/26(Wed)17:35:40 No.736189130

Anonymous 04/01/26(Wed)17:35:40 No.736189130

>>736188575
used to do that but it got boring
now i directormaxx and i can't cum without minimum 50 messages of buildup

Anonymous
04/01/26(Wed)17:37:06 No.736189220

Anonymous 04/01/26(Wed)17:37:06 No.736189220

>>736188575
No, I don't like replying with very little. However because of that, recently if I just want a quick lazy coom I tell the AI what to do in OOC and let it ramble on by itself. Technically lazier but less embarrassing looking. I then flip back over to writing like 4 paragraphs of narration

Anonymous
04/01/26(Wed)18:02:11 No.736190739

Anonymous 04/01/26(Wed)18:02:11 No.736190739

>>736186348
The combination of my niche fetishes and the fact that I enjoy stretches of non-erotic adventuring (in which the fetishes are present in the background) because my preferred scenarios involve D&D-esque fantasy settings would make finding another person difficult even if I set aside a consistent schedule, which wouldn't be ideal because I sometimes just want a 5-10 minute beatoff session before bed. LLMs are always available, they follow along no matter how outlandish you get, and you don't have to look them in the eye the day after. Human role-playing would probably be better quality-wise but the advantages of LLMs make it no contest which I'll use. I don't need high art, ever since 2020 I've just wanted infinite Zork with porn. I feel top models have gotten a lot better with ~~amputee~~ stuff too so I don't have to wrangle it quite as hard as before.

Anonymous
04/01/26(Wed)18:06:24 No.736190963

Anonymous 04/01/26(Wed)18:06:24 No.736190963

>>736186976
Why would they spend the money and engineering time to add more VRAM when games don't use more than what, 18GB?

Anonymous
04/01/26(Wed)18:08:41 No.736191084

Anonymous 04/01/26(Wed)18:08:41 No.736191084

How degen can you make the stuff in Silly Tavern?

Anonymous
04/01/26(Wed)18:09:13 No.736191128

Anonymous 04/01/26(Wed)18:09:13 No.736191128

>>736190739
i feel you, anon. i also like autism adventures with occasional lewd/fetishy bits sprinkled throughout, which like you said doesn't work for ERP with other people.
AI slop is the pefect outlet for my autistic tastes.

Anonymous
04/01/26(Wed)18:10:45 No.736191228

Anonymous 04/01/26(Wed)18:10:45 No.736191228

>>736191084
Whatever you can think of really. Unless it's REALLY obtuse and specific I guess.

Anonymous
04/01/26(Wed)18:11:22 No.736191259

Anonymous 04/01/26(Wed)18:11:22 No.736191259

>>736191084
Sillytavern is just a frontend, that's like asking "how much porn can I watch on my monitor?"

Anonymous
04/01/26(Wed)18:14:22 No.736191446

Anonymous 04/01/26(Wed)18:14:22 No.736191446

>>736191259
Then I have missunderstood what it is. I thought that it was some sort of AI rp thing.

Anonymous
04/01/26(Wed)18:14:31 No.736191457

Anonymous 04/01/26(Wed)18:14:31 No.736191457

>>736180429
Ok, so is there actually a 70B model that has been converted that isn't the censored trash?

Anonymous
04/01/26(Wed)18:16:43 No.736191596

Anonymous 04/01/26(Wed)18:16:43 No.736191596

>>736191446
It is. It's a platform you can do AI rp on. It doesn't supply the AI to rp with though.
Kind of like how steam isn't a game but it lets you play them.

Anonymous
04/01/26(Wed)18:17:24 No.736191632

Anonymous 04/01/26(Wed)18:17:24 No.736191632

>>736191446
Yes, its a frontend for that

You run the AI model either locally or connecting to a network that hosts it for you, that's the thing that makes the sexy words.
SillyTavern sends and retrieves those requests and displays the output in a human readable format that isn't an unformatted string of text in a terminal.

Anonymous
04/01/26(Wed)18:19:06 No.736191728

Anonymous 04/01/26(Wed)18:19:06 No.736191728

>>736185565
Women are dead fish like real life. Did it for years with hundreds of women, yes I met some of them. They are lazy and AI is so much better I hope I never talk to a woman again.

Anonymous
04/01/26(Wed)18:20:43 No.736191829

Anonymous 04/01/26(Wed)18:20:43 No.736191829

>>736180821
>this [industrial-grade fuck doll made with the entire purpose of being sexually attracted following these graphs about which designs are popular] IS MY WAIFUUUUUU
waifutards should be ground to raw biomass and used to feed server farm biofuel generators because jesus christ how delusional do you have to be to steal some IP chick like peach and think she's some fucked up permavirgin schizo bitch inside their own heads?

Anonymous
04/01/26(Wed)18:21:51 No.736191894

Anonymous 04/01/26(Wed)18:21:51 No.736191894

>>736186976
Increasing vram would cut into Nvidia's non-consumer market where they charge exponentially more.

Anonymous
04/01/26(Wed)18:21:59 No.736191903

Anonymous 04/01/26(Wed)18:21:59 No.736191903

>>736186348
>just find someone who's willing to write your hyperspecific fetish bro
if only it were that simple

Anonymous
04/01/26(Wed)18:24:43 No.736192056

Anonymous 04/01/26(Wed)18:24:43 No.736192056

File: Yuruyuri167.jpg (76 KB, 1280x720)

76 KB JPG

>>736180429
>Google's new quantization method allows small cards to run HUGE models
I have been waiting 2 years for bitnet to be real. Not falling for this again

Anonymous
04/01/26(Wed)18:27:11 No.736192216

Anonymous 04/01/26(Wed)18:27:11 No.736192216

>>736191596
>>736191632
Why not just rp directly with the AI bot then? What does Silly Tavern actually do/ad to this?

Anonymous
04/01/26(Wed)18:28:13 No.736192284

Anonymous 04/01/26(Wed)18:28:13 No.736192284

>>736192216
Did you completely miss the "formatted in a human readable way instead of raw text string in a terminal" part?

Anonymous
04/01/26(Wed)18:30:18 No.736192432

Anonymous 04/01/26(Wed)18:30:18 No.736192432

>>736192216
It formats the prompt for the AI. This includes 'this is an RP, respond in this manner etc', full character description, entire chat log. All you have to do is put in the next response. With just the AI you would have to cut and paste this together every time. Part of this includes tricking the AI into bypassing the jew censor.

Anonymous
04/01/26(Wed)18:30:29 No.736192438

Anonymous 04/01/26(Wed)18:30:29 No.736192438

>>736192284
I mean, doesn't something like chatgpt already do that?

Anonymous
04/01/26(Wed)18:30:49 No.736192463

Anonymous 04/01/26(Wed)18:30:49 No.736192463

>>736192216
>Fomatting
>Persistent memory and instructions
>Additional features
If you need more analogies it's like loading up gmod flatgrass and nothing else

Anonymous
04/01/26(Wed)18:31:30 No.736192512

Anonymous 04/01/26(Wed)18:31:30 No.736192512

>>736192432
>>736192463

Ah, I see.
Makes sense.

Anonymous
04/01/26(Wed)18:31:33 No.736192513

Anonymous 04/01/26(Wed)18:31:33 No.736192513

>>736192216
because using an API raw means you have to manually send hundreds of lines of instructions to the AI in a carefully ordered way every time you want to say something
a frontend takes care of all that shit for you so you can just type ahh ahh mistress

Anonymous
04/01/26(Wed)18:31:44 No.736192525

Anonymous 04/01/26(Wed)18:31:44 No.736192525

>>736192438
Do you think chatgpt's website is the raw text output and not a frontend?

Anonymous
04/01/26(Wed)18:32:40 No.736192580

Anonymous 04/01/26(Wed)18:32:40 No.736192580

>>736192525
I got no clue of how that shit works hence why I am asking

Anonymous
04/01/26(Wed)18:36:19 No.736192792

Anonymous 04/01/26(Wed)18:36:19 No.736192792

>>736192216
you can give your AI gf portraits (and it actually tries to match the emotion) or a live2D rig which is neat

Anonymous
04/01/26(Wed)18:37:14 No.736192836

Anonymous 04/01/26(Wed)18:37:14 No.736192836

>>736180429
>>Google's new quantization method allows small cards to run HUGE models (16gb card running a 70B)
Does this have any implications for StableDiffusion or is it just for LLMs?

Anonymous
04/01/26(Wed)18:40:37 No.736193003

Anonymous 04/01/26(Wed)18:40:37 No.736193003

what's the current meta? I'm still using and waiting for the new deepseek model to come out (never ever)

Anonymous
04/01/26(Wed)18:40:45 No.736193012

Anonymous 04/01/26(Wed)18:40:45 No.736193012

File: renachat.png (451 KB, 1290x812)

451 KB PNG

>>736192792
in fact here's something i was working on. i was testing the VN portraits. it's a lot of work to gen every single expression and keep it consistent/on model though.

Anonymous
04/01/26(Wed)18:54:08 No.736193743

Anonymous 04/01/26(Wed)18:54:08 No.736193743

>>736181123
>Roleplay LLMs
roleplay llms? most sota llms are definitely not built for roleplay. they're for coding meme or general purpose knowledge shit. i'd imagine storytelling and rp could be so much better if tech companies actually focused on it.
>incapable of outputting anything remotely interesting
well, i can't speak for doing straight chatbot stuff with ai. i like "roleplaying" as a self insert character in scenarios and stories that i guide and write, usually in first or second person. don't have any problems with sota llms there. the characters are pretty fucking smart, especially with good models like gemini pro and opus. smart enough to understand subcontext and deeper meaning behind words and events and scenes overall, and smart enough to give realistic and sometimes funny responses to things that happen in the narrative. it's great.
>It's all shit storytelling unless you do all the work yourself, in which case why the fuck aren't you just opening notepad instead.
it does cooperative storytelling and it does it pretty well. ai isn't at the level where it can create an interesting overarching plot line for you but it's an amazing filler tool that will basically texture out chunks of your story, provide novel and entertaining interactions with characters, and occasionally suggest threads that you can choose to build upon. it requires effort on your part but rewards it by allowing almost unlimited freedom in choice and setup. you dream up a wacky intro, which can be pulled from a game or movie or some other piece of media, tweak it to your liking, add in some coomer shit, slap in your self insert, and roll with it.

Anonymous
04/01/26(Wed)18:59:28 No.736194041

Anonymous 04/01/26(Wed)18:59:28 No.736194041

>>736192216
The main thing of SillyTavern are the shareable cards. You can create a character and share it so other people can use it.

It's much more moddable than a normal chatbot. In a normal chatbot you would need to teach the bot how it is supposed to act, with SillyTavern this is done by the cards.

There is also this concept of lorebooks that you can add more information to the context "automatically". Example: you have a lorebook entry that teaches how blowjobs work in explicitly detail and when someone say "blowjob" (or other keyword that you set) that explanation is added to the context so now the bot can use that explanation to understand how blowjobs work.

Those are the main features, but there are more and you can add other features by writing a extension or downloading extensions from other people. I wasted like 1 week of my life gooning to this all day the first time I learned about SillyTavern.

Anonymous
04/01/26(Wed)19:00:49 No.736194098

Anonymous 04/01/26(Wed)19:00:49 No.736194098

GLM 4.7 works pretty well and pretty fast but I can’t get KIMI to do anything without throwing a shit fit

Anonymous
04/01/26(Wed)19:01:24 No.736194134

Anonymous 04/01/26(Wed)19:01:24 No.736194134

>>736192836
It is not real. You can't make a sub 2bit quant smart.

Anonymous
04/01/26(Wed)19:03:14 No.736194240

Anonymous 04/01/26(Wed)19:03:14 No.736194240

>>736181123
LLMs are fancy autocompletes which is fine for what I want
obviously it won't really come up with anything special or too original and will half the time use shitty isms and you have to tard wrangle it but it'll do at least 30% of the work for you
as long as LLM companies are focused on coding all the models will stay positivity slopped since they're just meant to be helpers for vibe coders

Anonymous
04/01/26(Wed)19:03:15 No.736194241

Anonymous 04/01/26(Wed)19:03:15 No.736194241

>>736188713
Me being a twink boy getting his ass annihilated by a hung futa resulting in mpreg

Anonymous
04/01/26(Wed)19:04:30 No.736194305

Anonymous 04/01/26(Wed)19:04:30 No.736194305

>>736193003
Mistral Nemo
GLM 4.5 air
GLM 4.6/4.7
GLM 5 / Kimi

In the order of ram.

Anonymous
04/01/26(Wed)19:15:43 No.736194898

Anonymous 04/01/26(Wed)19:15:43 No.736194898

File: 1774734701507216.gif (2.2 MB, 518x640)

2.2 MB GIF

whats the meta for paypigs

Anonymous
04/01/26(Wed)19:15:51 No.736194907

Anonymous 04/01/26(Wed)19:15:51 No.736194907

File: 1534870179750.png (369 KB, 1363x412)

369 KB PNG

>>736181025

Anonymous
04/01/26(Wed)19:24:48 No.736195345

Anonymous 04/01/26(Wed)19:24:48 No.736195345

>>736194241
that's not niche enough that you couldn't find anyone on f-list for it
boring

Anonymous
04/01/26(Wed)19:28:59 No.736195562

Anonymous 04/01/26(Wed)19:28:59 No.736195562

>>736195345
Considering that 99.999% of futas on F-List have 'no males' in their profiles, I'd say it's pretty damn niche.

Anonymous
04/01/26(Wed)19:30:47 No.736195665

Anonymous 04/01/26(Wed)19:30:47 No.736195665

>>736195562
Even futa know the logistical nightmare of preparing for anal.

Anonymous
04/01/26(Wed)19:31:04 No.736195680

Anonymous 04/01/26(Wed)19:31:04 No.736195680

>>736180429
But are there good big models that don't produce cucked slop for my erps?

Anonymous
04/01/26(Wed)19:33:25 No.736195818

Anonymous 04/01/26(Wed)19:33:25 No.736195818

>>736180429
Oh cool I hope a turboquant of WAN gets made.

Anonymous
04/01/26(Wed)19:34:46 No.736195895

Anonymous 04/01/26(Wed)19:34:46 No.736195895

>>736181123
Time to work put at the RAM gym and get a bigger model.

Anonymous
04/01/26(Wed)19:40:25 No.736196208

Anonymous 04/01/26(Wed)19:40:25 No.736196208

>>736186824
That was really well done.
>>736187165
OK but how much less effort was it to use AI vs manually animating everything?
>>736187098
That was probably made using LTX2. Only runs on NVIDIA hardware. Probably used a comfyUI workflow that supports first-last so they can continually bake 5 second clips that go exactly where he wants. That's probably how a lot of the flying camera perspective was done.

Anonymous
04/01/26(Wed)19:42:38 No.736196325

Anonymous 04/01/26(Wed)19:42:38 No.736196325

>>736187760
Probably a system prompt saying don't say cum

Anonymous
04/01/26(Wed)19:43:08 No.736196354

Anonymous 04/01/26(Wed)19:43:08 No.736196354

>>736187098
>>736196208
It's definitely Seedance2. All the gugugaga Endfield videos come from Seedance2. LTX2.3 and Wan2.2 are way too far behind.

Name
Spoiler?	[Spoiler?]
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File	[Spoiler?]
Please read the Rules and FAQ before posting.