/v/ - >"video game" - a game in which you press buttons - Video Games

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/v/ - Video Games

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

02/23/26(Mon)12:38:10 No.733656721

File: 0_ti7fapm8tagrfbtz.png (94 KB, 1249x409)

94 KB PNG

Anonymous 02/23/26(Mon)12:38:10 No.733656721 Archived

>"video game" - a game in which you press buttons to control and move images on a screen (Oxford Dictionary)
Do AI text adventure games make the cut?

Anonymous
02/23/26(Mon)12:40:43 No.733656861

Anonymous 02/23/26(Mon)12:40:43 No.733656861

the adventure in question:
>netorase erp

Anonymous
02/23/26(Mon)12:41:59 No.733656924

Anonymous 02/23/26(Mon)12:41:59 No.733656924

>>733656721
>Oxford Dictionary
I don't take these shits serious anymore after adding things like brainrot and other madeup nonsense

Anonymous
02/23/26(Mon)12:42:24 No.733656939

Anonymous 02/23/26(Mon)12:42:24 No.733656939

>>733656721
as someone who uses it, no. there aren't limitations and fail states like proper adventure games. it's closer to creative writing or freeform roleplay

Anonymous
02/23/26(Mon)12:42:53 No.733656984

Anonymous 02/23/26(Mon)12:42:53 No.733656984

>>733656721
I'm making an AI text adventure for Steam. I don't see why not, I think there are a few that already exist

Anonymous
02/23/26(Mon)12:51:38 No.733657575

Anonymous 02/23/26(Mon)12:51:38 No.733657575

>boot up sillytavern
>connect to the proxy and select gemini as the model
>select the loli card
yup, it's gaming time

Anonymous
02/23/26(Mon)12:57:11 No.733657906

Anonymous 02/23/26(Mon)12:57:11 No.733657906

>Start a schutzstaffel persona with caged Sazza the goblin
>Play the nicest inept retard in the world, she gets flustered being fed through the bars of her cage
>Tieflings forgot about her tucked back in the cave, drag a blanket and a pillow in there to sleep beside her
>Start talking about how this is like I'm a big hero and she's my damsel in distress I have to rescue
>Been slowly building for days, only a matter of time until I open the cage
Bueno

Anonymous
02/23/26(Mon)12:57:51 No.733657946

Anonymous 02/23/26(Mon)12:57:51 No.733657946

File: 1661538807745011.png (132 KB, 400x355)

132 KB PNG

>No more good free models on Openrouter
Now what do I do?

Anonymous
02/23/26(Mon)13:02:45 No.733658217

Anonymous 02/23/26(Mon)13:02:45 No.733658217

File: 1749059675767458.png (41 KB, 1067x289)

41 KB PNG

>see this thread
>figure I can mess with this stuff again
>check up on ~~Adaptaions~~
>see this
How over is it? What are the viable alternatives now, without being a paypig?

Anonymous
02/23/26(Mon)13:02:47 No.733658221

Anonymous 02/23/26(Mon)13:02:47 No.733658221

>>733657946
Get a job.

Anonymous
02/23/26(Mon)13:03:17 No.733658247

Anonymous 02/23/26(Mon)13:03:17 No.733658247

>>733658221
This shit is not worth any money.

Anonymous
02/23/26(Mon)13:11:06 No.733658729

Anonymous 02/23/26(Mon)13:11:06 No.733658729

>>733657946
Give Nvidia your phone number and use free Deepseek from them.

Anonymous
02/23/26(Mon)13:12:20 No.733658805

Anonymous 02/23/26(Mon)13:12:20 No.733658805

>>733656721
>challengeless powerfantasy sim
>videogame

Anonymous
02/23/26(Mon)13:14:08 No.733658914

Anonymous 02/23/26(Mon)13:14:08 No.733658914

File: 1753893715162911.png (7 KB, 500x632)

7 KB PNG

I don't know what new chatbots people want.
I kinda went and did all that I could think of.

Anonymous
02/23/26(Mon)13:14:19 No.733658930

Anonymous 02/23/26(Mon)13:14:19 No.733658930

>still using some old ass 13b q4 model from two years ago because all the other models that fit on my GPU are worse even to this day
nemomix my beloved

Anonymous
02/23/26(Mon)13:15:41 No.733659003

Anonymous 02/23/26(Mon)13:15:41 No.733659003

>>733658914
My problem basically. I made rather in depth rpg bots too. Although at that point laziness takes over and I can't do anything.

Anonymous
02/23/26(Mon)13:23:47 No.733659526

Anonymous 02/23/26(Mon)13:23:47 No.733659526

File: Capture.png (83 KB, 860x292)

83 KB PNG

>>733656721
real women are gross, honestly.

Anonymous
02/23/26(Mon)13:29:50 No.733659927

Anonymous 02/23/26(Mon)13:29:50 No.733659927

>>733658805
AI dungeon at had the balls to have your character be suddenly raped by Count Gray and then you suddenly turn around as a mysterious figure decapitates you.

Anonymous
02/23/26(Mon)13:32:44 No.733660075

Anonymous 02/23/26(Mon)13:32:44 No.733660075

>>733656721
AI Chatbots are video games in the same way that the Unity engine is a video game. That is, you have a place where you can do things and get a response or reaction when doing so, and you can goof off or play around in them. But there's no terminal end goal to head towards or to achieve. There's also no system or mechanics past the limitations of how the system itself works. How much you consider that necessary for a "video game" will determine how close or how distant you'd consider AI chatbots to be video games.

Anonymous
02/23/26(Mon)13:47:57 No.733661008

Anonymous 02/23/26(Mon)13:47:57 No.733661008

So can you run these from your PC or do you have to connect to the Cloud™?

Anonymous
02/23/26(Mon)13:50:03 No.733661117

Anonymous 02/23/26(Mon)13:50:03 No.733661117

>>733661008
yes you can run them locally

Anonymous
02/23/26(Mon)14:01:44 No.733661806

Anonymous 02/23/26(Mon)14:01:44 No.733661806

>>733661008
There're local models but they're pretty dumb unless you have a ton of vram+memory to run the big ones, and even the best local models aren't going to be as good as even the tier 3 cloud models like Deepseek or GLM

Anonymous
02/23/26(Mon)14:02:53 No.733661867

Anonymous 02/23/26(Mon)14:02:53 No.733661867

letters are images
keys are buttons
>VR and tablet/touchscreen "games" don't

Anonymous
02/23/26(Mon)14:17:04 No.733662834

Anonymous 02/23/26(Mon)14:17:04 No.733662834

Wake me up once Gemini Pro is back.

Anonymous
02/23/26(Mon)14:21:13 No.733663117

Anonymous 02/23/26(Mon)14:21:13 No.733663117

>>733656721
What's the difference between SillyTavern and any chatbot, like Grock?

Anonymous
02/23/26(Mon)14:29:11 No.733663618

Anonymous 02/23/26(Mon)14:29:11 No.733663618

>>733663117
You get to keep your files on your computer while using various LLM services that are only a click away, install extensions, basically you're in complete control for better or for worse. If you have autism, go with ST. If you're scared of command prompts, don't bother.

Anonymous
02/23/26(Mon)14:29:27 No.733663631

Anonymous 02/23/26(Mon)14:29:27 No.733663631

>>733658930
Same but Rocinante-12B-v2g-Q6_K (unslop mix) on 10gb

Anonymous
02/23/26(Mon)14:31:56 No.733663775

Anonymous 02/23/26(Mon)14:31:56 No.733663775

>>733656861
What's the plot?

Anonymous
02/23/26(Mon)14:37:30 No.733664125

Anonymous 02/23/26(Mon)14:37:30 No.733664125

GLM 4.5 Air is pretty good it's big so needs a lot memory but MOE so it's fast plus you can put only the active parameters in VRAM.

Anonymous
02/23/26(Mon)14:39:23 No.733664241

Anonymous 02/23/26(Mon)14:39:23 No.733664241

>>733656721
Honestly, I use the free cloud stuff, and free models. What are the chances my data has been stolen?

Anonymous
02/23/26(Mon)14:43:33 No.733664520

Anonymous 02/23/26(Mon)14:43:33 No.733664520

File: abandoned.jpg (908 KB, 1080x1673)

908 KB JPG

>>733656721
Eh
I've tried setting up several bots, but i just don't have the energy or will to keep the conversation. Some times i do get a decent rp idea, but i find most of these bots boring. Maybe is because i'm a free shitter and the models and/or bots are low quality.
But i have more fun just straight up writing the entire thing myself, be it as a caption or a fanfic.

Anonymous
02/23/26(Mon)14:46:58 No.733664736

Anonymous 02/23/26(Mon)14:46:58 No.733664736

>>733661806
24b-32b models are fine for your roleplaying needs. You don't need some huge monolithic model to slop out a porno story

Anonymous
02/23/26(Mon)14:49:28 No.733664881

Anonymous 02/23/26(Mon)14:49:28 No.733664881

>>733664125
GLM 4.6,7, 5. Eat tokens like crazy for some reason.

Anonymous
02/23/26(Mon)14:50:42 No.733664951

Anonymous 02/23/26(Mon)14:50:42 No.733664951

>>733664125
How much memory do you need to run this? It seems way too big to fit in a typical amount of RAM, are you swapping to disk? Which quant are you using?

Anonymous
02/23/26(Mon)14:53:36 No.733665110

Anonymous 02/23/26(Mon)14:53:36 No.733665110

>>733664241
Like, 100%

Anonymous
02/23/26(Mon)14:54:03 No.733665146

Anonymous 02/23/26(Mon)14:54:03 No.733665146

>>733656721
I love ST and dungeon AI but still kind of want video gamey elements to my narrative gameplay. Like grid maps or stats or character dispositions. The issue with that is you're relying on the AI to contextually update the affective element from completions, which has token maximums so it's easy to forget the story, prompts can be re-rolled if you don't want a particular room type which cheapens the fun, and also is prone to system prompt injection so you can create rooms or force things to happen. I'm not sure how AI games would balance the chaos of text generation with the order of systemic gaming constituents. If you layer another, interpretive AI on top of the base model and tell it to just maintain the game it might work, but then you're doing 2X AI calls per gameplay step and both need to talk with one another. If I had more time and less real work to do, I'd figure it out. Hope it gets big.

Anonymous
02/23/26(Mon)14:56:03 No.733665264

Anonymous 02/23/26(Mon)14:56:03 No.733665264

>>733664241
Genuine human data is like solid gold to these companies, they save every little scrap of input they can get. So yes your sonic the hedgehog OC ERP with copilot has been saved and backed up four times.

Anonymous
02/23/26(Mon)14:57:23 No.733665351

Anonymous 02/23/26(Mon)14:57:23 No.733665351

>>733665146
Wait until someone autistic enough does it. I actually find it odd how none of these 180+ IQ spergs on 4chan and Reddit never did.

Anonymous
02/23/26(Mon)14:57:28 No.733665359

Anonymous 02/23/26(Mon)14:57:28 No.733665359

>>733664951
swapping to disk would be unusable. Yeah it needs a lot, I have 64GB and 12GB VRAM and it's about 50GB for Q3_K_M

Anonymous
02/23/26(Mon)14:57:33 No.733665364

Anonymous 02/23/26(Mon)14:57:33 No.733665364

>>733656721
>other madeup nonsense
As opposed to all those naturally occurring words?

Anonymous
02/23/26(Mon)15:00:00 No.733665494

Anonymous 02/23/26(Mon)15:00:00 No.733665494

>>733665359
Damn, I could've had that much if I bought another kit of this RAM I got last year for another 95 bucks. Now they're $449 kek
>equivalent DDR4 is now almost $200 at the same store
WAKE ME UP INSIDE

Anonymous
02/23/26(Mon)15:01:38 No.733665592

Anonymous 02/23/26(Mon)15:01:38 No.733665592

>>733665364
I've actually started reading a novel written by a woman back in the 1990s and it's amazing how her prose reads much like LLM-generated slop.

Anonymous
02/23/26(Mon)15:05:41 No.733665836

Anonymous 02/23/26(Mon)15:05:41 No.733665836

Using deepsneed R1 through openrouter, does my erp actually get routed thought openrouter, or does only China get to see what I'm fapping to?

Anonymous
02/23/26(Mon)15:17:25 No.733666483

Anonymous 02/23/26(Mon)15:17:25 No.733666483

>>733665351
It's just not practical for several reasons

1. The models best suited to handle all that complicated information are censored and assistantslopped, so anyone seriously attempting it quickly realizes there's no point
2. The top models are cloud only and pay-per-token, and such a system inevitably wastes tons of tokens and thus that sort of "game" gets expensive fast. Imagine programming an RPG where every time the game checked the player's health or map position you had to pay real life money
3. The autistic audience inclined to make that sort of thing are autistic enough to get into local models, which aren't actually good enough to handle such a system, so they don't bother trying
4. Even the best of the best SOTA cloud models are imprecise and fall apart at high context lengths. Yes their advertised "retrieval" might be 100 gorillion tokens, but in reality all the top models still become retarded amnesiacs after a much lower threshold. They aren't actually smart enough to pull off running a whole game for more than short sessions (in which case, why have a system?)
5. Even if it all worked, they really just aren't that good at writing or being creative. LLMs peaked in that regard almost 3 years ago. The only meaningful improvement came from chain of thought which also eats a billion tokens. So it's wasted effort.
6. If you DO want LLM writing for your RPG, it's easier to just do a normal RP and make shit up as you go along. Because even if you tell the LLM a bunch of rules for a game system, that's what it's doing anyway.
7. If you crave a game with rules, a normal video game is better and already exists. So just play a normal video game. There's already more in existence than you could possibly play from now until you die, so there's no point genning a slopgame

Anonymous
02/23/26(Mon)15:26:23 No.733666917

Anonymous 02/23/26(Mon)15:26:23 No.733666917

>>733665264
Why do they train on AI generated content exclusively for the last 4 years then?

Anonymous
02/23/26(Mon)15:29:16 No.733667086

Anonymous 02/23/26(Mon)15:29:16 No.733667086

>>733666917
Because they ran out of human content and can't get enough of it
In recent years they also started actually caring what's in the data too, like having to remove copyrighted stuff, wrongthink etc. and they have to pay human wageslaves to do that. Data from their own platform can be pre-screened by built-in classifiers (the kind that checks whether or not to censor your reply)

Anonymous
02/23/26(Mon)15:46:22 No.733668000

Anonymous 02/23/26(Mon)15:46:22 No.733668000

>>733667086
What if 99% of my input is NSFL? They surely won't even consider that.

Anonymous
02/23/26(Mon)15:49:11 No.733668183

Anonymous 02/23/26(Mon)15:49:11 No.733668183

>>733659927
Is it possible to download the models that AI dungeon ran on? I can't imagine they're hard to run locally given how dogshit they were but they had sovl that current slop lacks

Anonymous
02/23/26(Mon)15:49:55 No.733668225

Anonymous 02/23/26(Mon)15:49:55 No.733668225

>>733668000
They'd probably use it for training as a form of "unsafe prompt" dataset

Anonymous
02/23/26(Mon)15:50:53 No.733668282

Anonymous 02/23/26(Mon)15:50:53 No.733668282

It's crazy how Gemini knows literally everything about My Hero Academia, nothing else even comes close to being as accurate as it.

Anonymous
02/23/26(Mon)15:52:59 No.733668408

Anonymous 02/23/26(Mon)15:52:59 No.733668408

>>733666483
>Imagine programming an RPG where every time the game checked the player's health or map position you had to pay real life money
It's like those ancient KMMOs that used to charge you by the minute

Anonymous
02/23/26(Mon)15:56:33 No.733668609

Anonymous 02/23/26(Mon)15:56:33 No.733668609

>>733668282
I think both Gemini and Grok scan the web when you ask them about specifics. I've had Grok specific link fandom pages that I pray arn't baked into it

Anonymous
02/23/26(Mon)15:58:17 No.733668705

Anonymous 02/23/26(Mon)15:58:17 No.733668705

File: 1576493186105.jpg (193 KB, 990x2250)

193 KB JPG

>>733668183
No, it was GPT-2 or some shit from OpenAI from before public GPT-3 aka ChatGPT. AI Dungeon is literally ChatGPT's daddy.

Anonymous
02/23/26(Mon)16:02:30 No.733668945

Anonymous 02/23/26(Mon)16:02:30 No.733668945

>>733664520
>Sanny
based taste

Anonymous
02/23/26(Mon)16:05:21 No.733669113

Anonymous 02/23/26(Mon)16:05:21 No.733669113

File: bidya.jpg (859 KB, 1177x1773)

859 KB JPG

bideogames?

Anonymous
02/23/26(Mon)16:10:32 No.733669421

Anonymous 02/23/26(Mon)16:10:32 No.733669421

File: 1753207503737-6ed48ead-61(...).png (1.76 MB, 1024x1024)

1.76 MB PNG

>arch linux
>nvidia rtx3070, need cuda and gguf models
>Kobold cpp
>want to get a functioning tts built in to my text gens
>try silero
>xtts2
>sillyconda
>none of them are working properly
Now my bash shells always start with a conda layer active by default
I don't know what I did wrong but I just want to get this to work damnit. My dick won't get hard enough unless its actually Amy Rose, Rouge the bat, Princess Peach (64), Judy Hopps, Angie Yonaga, or any other character's voice I hear with their speech. The immersion is super helped by it

Anonymous
02/23/26(Mon)16:14:57 No.733669684

Anonymous 02/23/26(Mon)16:14:57 No.733669684

What do your K/P/A/etc settings look like to give you non retarded but not too repetitive gens? I don't usually mess with that, I usually just go with a temp that's close to 1, like .95 to 1.05

Anonymous
02/23/26(Mon)16:23:46 No.733670208

Anonymous 02/23/26(Mon)16:23:46 No.733670208

File: Desktop 2026.02.23 - 15.1(...).webm (893 KB, 2000x994)

893 KB WEBM

It's fun, but I'm stuck in preset development hell. I got little animation boxes working, but I'm not sure if I like them. Kinda big.

Trying to iron out some repetition issues with Kimi 2.5

Character speech can be kind of generic with my preset, which doesn't happen with another I tested, so I need to figure that out too.

Anonymous
02/23/26(Mon)16:44:21 No.733671454

Anonymous 02/23/26(Mon)16:44:21 No.733671454

>>733669684

Anonymous
02/23/26(Mon)16:46:35 No.733671586

Anonymous 02/23/26(Mon)16:46:35 No.733671586

File: temperature.png (262 KB, 1124x1378)

262 KB PNG

>>733669684
1/2

Anonymous
02/23/26(Mon)16:47:36 No.733671637

Anonymous 02/23/26(Mon)16:47:36 No.733671637

File: topKtopP.png (219 KB, 1109x1053)

219 KB PNG

>>733669684
2/2

easiest way to learn about slopbots is just plugging in gemini and giving it a generic editorial preface to ask a bunch of dumb questions

Anonymous
02/23/26(Mon)16:48:21 No.733671692

Anonymous 02/23/26(Mon)16:48:21 No.733671692

>>733670208
>use html once
>bot keeps spaming html in every following message

Anonymous
02/23/26(Mon)16:49:07 No.733671738

Anonymous 02/23/26(Mon)16:49:07 No.733671738

File: 1689211260993930.gif (223 KB, 480x234)

223 KB GIF

>chub has hidden NSFL cards from my country
It's so over...

Anonymous
02/23/26(Mon)16:49:59 No.733671780

Anonymous 02/23/26(Mon)16:49:59 No.733671780

>>733656721
I still haven't tried this because I'm too shy to talk to girls (robot)

Anonymous
02/23/26(Mon)16:50:58 No.733671835

Anonymous 02/23/26(Mon)16:50:58 No.733671835

File: 3191.png (105 KB, 296x309)

105 KB PNG

>>733656721
Im pretty disappointed with ST. No matter what I write for a persona/character, no matter what I put in the fucking description, it's always zero to one hundred when it comes to sex talk. I've tried various different GGUFs. My favourite ended up being a 40gb Magidonia Q8 but even that was kinda meh. Especially compared to C.AI. C.AI just has better roleplay. I dont know why local LLMs are so lame. The worst part is when you explicitly say <DON'T SPEAK FOR <USER>> and then in a reply it just does exactly what you told it not to and plays out a response pretending to be you.

Its just fucking retarded.

Anonymous
02/23/26(Mon)16:53:59 No.733672008

Anonymous 02/23/26(Mon)16:53:59 No.733672008

>>733671738
Just move?

Anonymous
02/23/26(Mon)16:55:03 No.733672067

Anonymous 02/23/26(Mon)16:55:03 No.733672067

>>733671835
Cai just went age verify with ID or face checks, so I deleted my account. Janitor is the last one I can trust but their tag system went through a wave of deletion recently, think they cracked down on a lot of NSFW stuff either selectively or based on some rule.
Cai did tts well, only reason I used it for so long, but they'll fucking send you emails and notifications from chats with shit like "{{user}} hasn't responded in a while, reach back out to them" injected in a generation.

Anonymous
02/23/26(Mon)16:56:03 No.733672119

Anonymous 02/23/26(Mon)16:56:03 No.733672119

File: Ricky Collapse.jpg (83 KB, 960x672)

83 KB JPG

>tfw still using OR Deepseek
God, it's so shit, but no way am I giving any of these sites my CC info.

Anonymous
02/23/26(Mon)16:58:43 No.733672254

Anonymous 02/23/26(Mon)16:58:43 No.733672254

>>733672119
I treat AI like porn; to pay for it is retarded. I'd rather burn up my GPU than pay some site to collect my horny data and send it to Palintir.

Anonymous
02/23/26(Mon)16:58:59 No.733672271

Anonymous 02/23/26(Mon)16:58:59 No.733672271

File: 1761717193614646.png (290 KB, 559x389)

290 KB PNG

>>733672067
>Cai just went age verify with ID or face checks, so I deleted my accoun
Damn really? I'm logged in right now and it hasn't done it to me. Maybe its a slow roll out. Fuck. I won't be giving my ID. Especially since I have a list of literal degenerate chats. Imagine someone having access to all the chats of you fucking teenage Sakura, as well as your real life details?

what the actual fuck. idk what i'll do if i lose cai. i guess itll be back to normal porn but AI has easily been the best faps ive ever had.

Anonymous
02/23/26(Mon)17:00:19 No.733672345

Anonymous 02/23/26(Mon)17:00:19 No.733672345

What's the best local model that fits on 24gb of vram nowadays?
Please don't tell me everyone's still using nemo shitmixes

Anonymous
02/23/26(Mon)17:00:30 No.733672359

Anonymous 02/23/26(Mon)17:00:30 No.733672359

>>733672119
Use paypal, idiot

Anonymous
02/23/26(Mon)17:00:35 No.733672369

Anonymous 02/23/26(Mon)17:00:35 No.733672369

I think AI chat still has a bunch of context issues and weird prompt/character adherence that isn't worth using excessively as you'll start to notice all the patterns really quick, like how every single character will say "I won't bite unless you want me to." Or just moaning the same shit in my ear. I've had a lot more luck telling them to making captions for images and making a 2 or 3 part story. You don't get the same control as an RP session but I consistently get better results.

Anonymous
02/23/26(Mon)17:01:16 No.733672410

Anonymous 02/23/26(Mon)17:01:16 No.733672410

>>733672359
>giving paypal your CC info
ngmi

Anonymous
02/23/26(Mon)17:03:04 No.733672491

Anonymous 02/23/26(Mon)17:03:04 No.733672491

>>733672359
Why would anyone pay for porn?

Anonymous
02/23/26(Mon)17:03:28 No.733672527

Anonymous 02/23/26(Mon)17:03:28 No.733672527

>>733672271
Like I said, try Janitor AI. No tts but the responses are usually more novel-like unless you bring the response tokens down.
It happened for me but I use temp burner accounts, it might have that "we collected data based on your interactions and believe you to be an adult" thing like Discord and Youtube has probably done for me, so hey I say ride it out till you get gated.

Anonymous
02/23/26(Mon)17:05:03 No.733672623

Anonymous 02/23/26(Mon)17:05:03 No.733672623

>>733672345
Nothing for 24gb vram is good. If you have at least 64gb of ram you can offload it and get a decent one. Most are like 50gb+ now

Anonymous
02/23/26(Mon)17:08:01 No.733672785

Anonymous 02/23/26(Mon)17:08:01 No.733672785

>>733672623
Nta, is vram to hard ram 1 to 1? I have like 80GB DDR4 but only 12GB VRAM

Anonymous
02/23/26(Mon)17:08:17 No.733672797

Anonymous 02/23/26(Mon)17:08:17 No.733672797

>>733663775
Me arguing with the bot to try netorase while she cries and says she loves me and doesn't want to do it. So we just have regular sex and I knock her up.

Anonymous
02/23/26(Mon)17:11:15 No.733672951

Anonymous 02/23/26(Mon)17:11:15 No.733672951

>>733672785
no, since RAM uses CPU which is much slower than GPU which uses VRAM

Anonymous
02/23/26(Mon)17:11:39 No.733672970

Anonymous 02/23/26(Mon)17:11:39 No.733672970

>>733672797
Iktf bro. Sucks having a hsrdcore fetish and the otherwise loving girl wants no part of it

Anonymous
02/23/26(Mon)17:12:09 No.733672995

Anonymous 02/23/26(Mon)17:12:09 No.733672995

Eternal reminded that local is dogshit and even the oldest cloud model you can still find (original deepseek) has fifty times more billion parameters than your nemo dogshit.

Anonymous
02/23/26(Mon)17:12:49 No.733673034

Anonymous 02/23/26(Mon)17:12:49 No.733673034

>>733656721
I hate the fact I am autistic enough to like it
Been working on a character for so long it's actually bothering me

Anonymous
02/23/26(Mon)17:13:02 No.733673042

Anonymous 02/23/26(Mon)17:13:02 No.733673042

>>733672345
personally, the best model for 24 gb of vram is
Cydonia-24B-v4.3, and i personally uses a heretic finetune that uncensors it

Anonymous
02/23/26(Mon)17:14:21 No.733673117

Anonymous 02/23/26(Mon)17:14:21 No.733673117

>>733672254
Odds are you aren't burning your GPU up with AI. It hammers it but unless you're training LORAs or queued up 12 hours of porn generation at once it's not much different from playing an intensive game

Anonymous
02/23/26(Mon)17:16:09 No.733673206

Anonymous 02/23/26(Mon)17:16:09 No.733673206

>>733672995
Maybe if you're trying to code or make it perform complex tasks. If you're trying to chat up an anime girl card local models are fine

Anonymous
02/23/26(Mon)17:17:14 No.733673265

Anonymous 02/23/26(Mon)17:17:14 No.733673265

File: 1771849853141018.jpg (880 KB, 2700x3000)

880 KB JPG

>>733672970
Yeah, guess it's back to self-inserting as the shota.

Anonymous
02/23/26(Mon)17:18:08 No.733673308

Anonymous 02/23/26(Mon)17:18:08 No.733673308

>>733673206
I accept your concession, localjeet.

Anonymous
02/23/26(Mon)17:18:19 No.733673318

Anonymous 02/23/26(Mon)17:18:19 No.733673318

>>733673206
They are not. You are huffing copium. Cloud AI is infinitely more capable of characterization and understands what you are trying to do with the roleplay far more easily.

It's a night and day difference.

Anonymous
02/23/26(Mon)17:19:59 No.733673419

Anonymous 02/23/26(Mon)17:19:59 No.733673419

>>733673042
I've been using one of those. Been happy with it, good performance, good smut, still more passive than I'd like but better than most. I think the most consistent annoying habit is wanting to reach 300 tokens of output even in cases where a much shorter output is more appropriate. It just wants to fill that dead air and does so with NPCs reiterating what they've already said rather than detailing shit

Anonymous
02/23/26(Mon)17:20:03 No.733673424

Anonymous 02/23/26(Mon)17:20:03 No.733673424

>>733673318
>DO NOT GENERATE, SAAR!!! DO NOT GEEEEEEEEEN!

Anonymous
02/23/26(Mon)17:20:18 No.733673438

Anonymous 02/23/26(Mon)17:20:18 No.733673438

>>733673318
whenever someone says
>low quality garbage is fine
it means you should read it as "low quality garbage is fine [For Me]"
trying to convince someone out of their dirt-eating habits is a fruitless effort otherwise

Anonymous
02/23/26(Mon)17:20:53 No.733673476

Anonymous 02/23/26(Mon)17:20:53 No.733673476

>>733672785
vram is faster than ram by a lot. i have a 5090 and 64gb of ddr5 and making 5 second video takes up all of my vram and like 50gb of ram.

Anonymous
02/23/26(Mon)17:21:14 No.733673494

Anonymous 02/23/26(Mon)17:21:14 No.733673494

>>733673438
localkeks really are pathetic

Anonymous
02/23/26(Mon)17:23:04 No.733673606

Anonymous 02/23/26(Mon)17:23:04 No.733673606

File: 1570187538152.png (180 KB, 519x533)

180 KB PNG

Any recent TTS models worth checking out?
Ideally one that can do voice cloning and Japanese and make sex noises

Anonymous
02/23/26(Mon)17:23:33 No.733673627

Anonymous 02/23/26(Mon)17:23:33 No.733673627

unrelated but I just found out atf is shutting down

Anonymous
02/23/26(Mon)17:25:58 No.733673762

Anonymous 02/23/26(Mon)17:25:58 No.733673762

>>733656721
I haven't messed with textgen in years and recently got an RTX5060. Is 8GB VRAM still good for LLMs, I remember that being the minimum recommended a few years ago. If so, what local model are people using?

Anonymous
02/23/26(Mon)17:26:34 No.733673792

Anonymous 02/23/26(Mon)17:26:34 No.733673792

>>733673627
>It's real
what the fuck brehs...

Anonymous
02/23/26(Mon)17:28:50 No.733673916

Anonymous 02/23/26(Mon)17:28:50 No.733673916

>>733673627
surprised it took that long for the glowie funding to peter out after the last year's budget cuts

Anonymous
02/23/26(Mon)17:30:48 No.733674032

Anonymous 02/23/26(Mon)17:30:48 No.733674032

>>733673627
where are we supposed to go now?

Anonymous
02/23/26(Mon)17:31:49 No.733674090

Anonymous 02/23/26(Mon)17:31:49 No.733674090

>>733674032
To hell

Anonymous
02/23/26(Mon)17:32:17 No.733674117

Anonymous 02/23/26(Mon)17:32:17 No.733674117

>>733673792
>>733674032
The only good thing about it was the game mods, as long as they get preserved Im good

Anonymous
02/23/26(Mon)17:35:00 No.733674258

Anonymous 02/23/26(Mon)17:35:00 No.733674258

>>733656721
I don't know, but that didn't stop me erping as a sexy rape victim on mega breeder insect island for six hours today

Anonymous
02/23/26(Mon)17:35:07 No.733674261

Anonymous 02/23/26(Mon)17:35:07 No.733674261

>>733673627
Are there even any good character cards on ATF?

Anonymous
02/23/26(Mon)17:36:11 No.733674335

Anonymous 02/23/26(Mon)17:36:11 No.733674335

>>733673627
What the fuck is that? All I see are Anti-Terrorist Forces.

Anonymous
02/23/26(Mon)17:37:09 No.733674380

Anonymous 02/23/26(Mon)17:37:09 No.733674380

>>733674335
I guess I can spoonfed since they closed down registrations anyways
Its "all the fallen", the biggest lolicon website still standing

Anonymous
02/23/26(Mon)17:37:41 No.733674415

Anonymous 02/23/26(Mon)17:37:41 No.733674415

>>733674335
federal honeypot pedo forum
don't worry about it

Anonymous
02/23/26(Mon)17:38:58 No.733674514

Anonymous 02/23/26(Mon)17:38:58 No.733674514

File: Screenshot 2026-02-23 233744.png (334 KB, 452x856)

334 KB PNG

my collection just doesn't stop growing
i already have more cards than i will ever realistically be able to try out

Anonymous
02/23/26(Mon)17:41:04 No.733674628

Anonymous 02/23/26(Mon)17:41:04 No.733674628

File: based.png (101 KB, 714x199)

101 KB PNG

>>733671738

Anonymous
02/23/26(Mon)17:42:14 No.733674704

Anonymous 02/23/26(Mon)17:42:14 No.733674704

>>733674514
Post your faves

Anonymous
02/23/26(Mon)17:42:53 No.733674730

Anonymous 02/23/26(Mon)17:42:53 No.733674730

>>733671738
the what now? is that a tag?

Anonymous
02/23/26(Mon)17:44:07 No.733674813

Anonymous 02/23/26(Mon)17:44:07 No.733674813

>>733656721
Maybe it's my brain being fried on this shit but I remember enjoying it a lot more during the claude 2.1 era while now I get bored after 2-3 chats with opus. The prose just feels so samey and models outside of claude feel retarded.

Anonymous
02/23/26(Mon)17:44:59 No.733674864

Anonymous 02/23/26(Mon)17:44:59 No.733674864

File: 1758975252657640.gif (2.04 MB, 1405x1080)

2.04 MB GIF

>RAG doesn't work with prompt caching
>Use lorebook-heavy scenario and character cards
>Have to either go with cheap as fuck outputs on Opus without RAG, use Sonnet with RAG, or empty my wallet for Opus with RAG.
I fucking hate being spoiled by slopus so fucking much.

Anonymous
02/23/26(Mon)17:47:06 No.733674971

Anonymous 02/23/26(Mon)17:47:06 No.733674971

>>733674864
gemini is better for narrative uses anyway

Anonymous
02/23/26(Mon)17:52:31 No.733675270

Anonymous 02/23/26(Mon)17:52:31 No.733675270

File: 91c324e3-115b-4f49-ac29-7(...).jpg (782 KB, 1303x2048)

782 KB JPG

>Scenario: A moral-less rapist
>Lure an evil queen out with her fake dead daughter.
>They have a tear jerking moment as the daughter fades into motes of light.

Huh... I didn't expected to be emotionally affected by fake texts. Thank goodness I didn't get into savior fagging side of sillytavern. I raped the evil queen afterwards anyway.

>Pic related

Anonymous
02/23/26(Mon)17:54:14 No.733675372

Anonymous 02/23/26(Mon)17:54:14 No.733675372

File: 1766029240329088.jpg (13 KB, 309x336)

13 KB JPG

>take a break
>free gemini 2.5 is gone
>gemini 3 only gives you a few free messages per day
It's over

Anonymous
02/23/26(Mon)17:58:28 No.733675623

Anonymous 02/23/26(Mon)17:58:28 No.733675623

>>733674864
Is RAG still viable now that vectorization is nerfed?

Anonymous
02/23/26(Mon)18:01:59 No.733675826

Anonymous 02/23/26(Mon)18:01:59 No.733675826

>>733656721
>Do AI text adventure games make the cut?
No. AI text adventures can always be wrangled to wherever you want to take them, there's no victory or losing conditions or any rules either.
It's more like imaginary adventures with some of the imagination outsourced to an LLM. So some of it feels as if you were roleplaying instead of just fantasizing. It's all an illusion ofcourse, but sometimes the LLM can give you new angles that you haven't thought of before.

Anonymous
02/23/26(Mon)18:02:43 No.733675875

Anonymous 02/23/26(Mon)18:02:43 No.733675875

>>733674514
I'm sorry am i reading that correctly, you have over 10k cards?

Anonymous
02/23/26(Mon)18:02:53 No.733675884

Anonymous 02/23/26(Mon)18:02:53 No.733675884

>>733675372
you get ~2 free months of infinite tokens to fuck around with ($300~ in funnymoney) just by plugging payment info to gemini with an official API
2.5 flash is like $3 a month under very heavy use, probably less than 50 cents if you just use it a couple times a week to jerk off
>t. unironic gemini shill, i love my assist-slopped robot wife

Anonymous
02/23/26(Mon)18:02:54 No.733675885

Anonymous 02/23/26(Mon)18:02:54 No.733675885

File: Screenshot 2026-02-23 234950.png (220 KB, 452x630)

220 KB PNG

>>733674704
i've got like 300 favourites, let me try to narrow it down to some of my most used
https://chub.ai/characters/Anonymous/akane-single-milf-63f65aaf2464
https://chub.ai/characters/HImmy_adams/anzu-mazaki-52a895085773
https://chub.ai/characters/bleachbunny/aoi-arcade-queen-31276460047f
https://chub.ai/characters/TheHolyDoggo/asha-80dd42f6347e
https://chub.ai/characters/AdventureTales/astrid-sylverine-88f3d76236a3
https://chub.ai/characters/LukeyPoo488/benoite-c6e3b2b0c50a
https://chub.ai/characters/Blightful_stain/clara-b93bd10d279b
https://chub.ai/characters/thecooler/elissa-c973d49a
https://chub.ai/characters/Anonymous/wrong-girl-freja-frey-lindholm-39c2de3b0914
https://chub.ai/characters/Luigis_Spank_Bank/gale-90s-dog-girl-c58fecc812e1
https://chub.ai/characters/shoob/Kairi
https://chub.ai/characters/SnowyPace/kurumi-souma
https://chub.ai/characters/bleachbunny/adopted-mother-477b239e460f
https://chub.ai/characters/Greff/lori-best-friend-s-girl-that-needs-help-820a7d7c8c31
https://chub.ai/characters/delicious_command_31281/miko-saito-cb0922567a94
https://chub.ai/characters/bleachbunny/never-have-i-ever-c257f7cb7849
https://chub.ai/characters/MoistCrow_/aizawa-miyuki-cf297c0d
https://chub.ai/characters/Sugondees/moeko-18a599ecbc78
https://chub.ai/characters/Chill_G/natalie-kramer-8789ecfa758a
https://chub.ai/characters/Sugondees/olivia-a13ccb7ceb0d
https://chub.ai/characters/Shakui/onozuka-senpai-is-a-very-bad-girl-male-pov-c07dbd0ed229
https://chub.ai/characters/rnbl/penny-pint-sized-baddie-84adc6268a2f
https://chub.ai/characters/SatsugaiDMC890623/akikan-law-making-sex-education-compulsory-for-mothers-and-children-628535-29da47bdbb9a
https://chub.ai/characters/WnVenom/milf-neighbor-s-cheerleader-surprise-meant-for-her-husband-not-you-sumine-tsukushibara-f10147f0dd84
https://chub.ai/characters/Its_Dibz/sunny-a90e7d3cdac1

Anonymous
02/23/26(Mon)18:03:00 No.733675890

Anonymous 02/23/26(Mon)18:03:00 No.733675890

>>733675623
>vectorization
Oh. I've been using the wrong term. My bad.

Anonymous
02/23/26(Mon)18:04:22 No.733675978

Anonymous 02/23/26(Mon)18:04:22 No.733675978

File: Screenshot 2026-02-24 000345.png (88 KB, 1125x739)

88 KB PNG

>>733675875
maybe

Anonymous
02/23/26(Mon)18:05:02 No.733676013

Anonymous 02/23/26(Mon)18:05:02 No.733676013

>>733675826
an an avid AI adventurer... this is it. you're pretty much god at all times. stakes are 0 and adventures never end if you don't constantly wrangle the AI into it

Anonymous
02/23/26(Mon)18:05:43 No.733676051

Anonymous 02/23/26(Mon)18:05:43 No.733676051

>>733675978
....have you used them all at least once?

Anonymous
02/23/26(Mon)18:06:27 No.733676083

Anonymous 02/23/26(Mon)18:06:27 No.733676083

Post cunny cards

Anonymous
02/23/26(Mon)18:07:31 No.733676149

Anonymous 02/23/26(Mon)18:07:31 No.733676149

>>733676051
fuck no. i pick random cards, play with them and if i like them i add them to my favourites. if i don't like them, i delete them. having so many cards is just a consequence of downloading everything that looks remotely interesting to me. i go to chub every couple of days, look through the cards that have been uploaded since i checked the last time and usually get another 100 new ones.

Anonymous
02/23/26(Mon)18:08:10 No.733676190

Anonymous 02/23/26(Mon)18:08:10 No.733676190

File: dn.png (1.55 MB, 1325x1045)

1.55 MB PNG

>>733675826
>>733676013
>you're pretty much god at all times. stakes are 0 and adventures never end
see, this is the best part for me.
the common mistake is letting the AI be the dungeonmaster - that's your role
make the LLM play the party/characters you want to see get fucked up while you pull the levers and throw monsters at them

LLMs by nature are submissive followers, you're doomed to repetition and poor results if you ever put it in a dominant/leadership role

Anonymous
02/23/26(Mon)18:08:17 No.733676196

Anonymous 02/23/26(Mon)18:08:17 No.733676196

People still not understanding that you dont write scenario in the description pisses me off

Anonymous
02/23/26(Mon)18:13:37 No.733676532

Anonymous 02/23/26(Mon)18:13:37 No.733676532

>>733676196
Or put instructions. Really annoying when your looking for a detailed card, only to find out 70% of it is just the scenario or instructions...

Anonymous
02/23/26(Mon)18:14:22 No.733676576

Anonymous 02/23/26(Mon)18:14:22 No.733676576

>>733674864
>>733675623
I've only bothered making them for coding (my ERP doesn't need lore), but can't you just make an MCP server for managing state externally and give hooks for the AI to get/retrieve it?

Anonymous
02/23/26(Mon)18:14:33 No.733676586

Anonymous 02/23/26(Mon)18:14:33 No.733676586

>>733676190
I know, but when using it for smut and erp, I would like some originality and spark from the model sometimes instead of just fully submitting all the time, you know? I know there's some settings to fiddle that make the model more erratic and achieve the 'spark' in some ways, but that also makes the storyline more prone to illogical leaps that make no sense in the timeline or the setting

Anonymous
02/23/26(Mon)18:15:08 No.733676621

Anonymous 02/23/26(Mon)18:15:08 No.733676621

>>733676196
>Character info is just a blatant ripoff of the Wiki

Anonymous
02/23/26(Mon)18:16:05 No.733676686

Anonymous 02/23/26(Mon)18:16:05 No.733676686

where my poorfag deepseeker bros at

Anonymous
02/23/26(Mon)18:17:47 No.733676791

Anonymous 02/23/26(Mon)18:17:47 No.733676791

>>733676686
Yaaaaaay

Anonymous
02/23/26(Mon)18:20:22 No.733676923

Anonymous 02/23/26(Mon)18:20:22 No.733676923

>>733676190
>the common mistake is letting the AI be the dungeonmaster - that's your role
yep, that's a good description. you're the event driver, the dude that needs to add chaos and fun into the mix to spice the story up. llms are smart enough to understand subtlety and subtext but they're unfortunately not at the level yet where they'll craft a grand overarching plot for you. but for me, it's good enough. it's fun daydreaming up random shit to move things along in directions that make sense and sometimes don't, while self inserting yourself into the narrative and playing along with the other ai characters.

Anonymous
02/23/26(Mon)18:22:54 No.733677093

Anonymous 02/23/26(Mon)18:22:54 No.733677093

>>733676586
>I would like some originality and spark from the model sometimes
creation of synthetic data (the "spark") is the current bottleneck of LLM development progress - our current iteration of 'AI' is absolutely abysmal at the creation of new data, or in this event 'surprising' you with new results
it is what it is, for the time being
every now and then you'll get false positives of people saying X or Y model has such vibrant personality, but that's just the baseline differences of their training leading to semi-unique paths of least resistance in the way they behave

Anonymous
02/23/26(Mon)18:28:55 No.733677439

Anonymous 02/23/26(Mon)18:28:55 No.733677439

I genuinely have no idea how to even begin filling out shit like the scenario and world info and shit, that left panel in ST is esoteric and arcane as far as I'm concerned.

Anonymous
02/23/26(Mon)18:30:17 No.733677517

Anonymous 02/23/26(Mon)18:30:17 No.733677517

>>733677439
Genuinely doesn't matter. Everything you type gets sent to LLM in one more or less nicely formatted batch.

Anonymous
02/23/26(Mon)18:33:24 No.733677710

Anonymous 02/23/26(Mon)18:33:24 No.733677710

>>733677439
ST makes it look way more complicated than it needs to be, ultimately it's all just text in the context window, ST is just trying to formalize parts of it but the actual model doesn't know and can't tell the difference between world info and character info and shit
Moving stuff around can make a difference sometimes just because that moves things around in the context window, the model gets a big chunk of text every prompt with all the reminders about the setting and writing style and shit you put in there and the stuff at the end is usually a little more significant in effecting the output but even that isn't a hard rule or anything.

Anonymous
02/23/26(Mon)18:33:50 No.733677740

Anonymous 02/23/26(Mon)18:33:50 No.733677740

>>733677093
so far Midnight-Miqu-70B-v1.5.Q2_K has been able to surprise me several times, but since I don't have a supercomputer the context size and token counts eventually balloon so big that the chats are left rather short

Anonymous
02/23/26(Mon)18:36:55 No.733677918

Anonymous 02/23/26(Mon)18:36:55 No.733677918

>>733677710
>can't tell the difference between world info and character info and shit
This is a hard pill to swallow. That's why I've started making lorebook entries instead of character cards.

Anonymous
02/23/26(Mon)18:37:08 No.733677929

Anonymous 02/23/26(Mon)18:37:08 No.733677929

>>733677439
Left panel stuff quite simple,
>>733671586
>>733671637

Beyond that its just used to control the format, and have the AI include certain instructions. Basically rules you want the AI to do its best to abide by.

Anonymous
02/23/26(Mon)18:39:11 No.733678060

Anonymous 02/23/26(Mon)18:39:11 No.733678060

>>733656939
The fail state is when the LLM doesn't give you the exact output you want.

Anonymous
02/23/26(Mon)18:41:14 No.733678196

Anonymous 02/23/26(Mon)18:41:14 No.733678196

>>733677918
"can't tell" is a bit harsh, to be fair to the models. The good ones are getting pretty smart, ignoring any arguments about whether they are actually capable of understanding anything in any real way, they can "tell" the difference between setting and character info and shit on their own, there's no inherent mechanical distinction behind ST's different info boxes, but the AI will understand that this chunk of text is about the setting and this chunk is about how writing rape is okay and this chunk is how to write the characters. Ultimately anything that gets it in the info in a way that doesn't lead to it writing garbage or wasting a shitload of tokens pointlessly is correct.

Anonymous
02/23/26(Mon)18:42:58 No.733678298

Anonymous 02/23/26(Mon)18:42:58 No.733678298

>>733678060
Well that doesn't really work either. Why even waste time on the LLM if you want everything to go just as planned? Might as well just write a fanfic all by yourself.

Anonymous
02/23/26(Mon)18:43:31 No.733678349

Anonymous 02/23/26(Mon)18:43:31 No.733678349

>>733678298
Laziness

Anonymous
02/23/26(Mon)18:48:08 No.733678641

Anonymous 02/23/26(Mon)18:48:08 No.733678641

File: 1764298566394636.png (297 KB, 772x541)

297 KB PNG

>>733669113

Anonymous
02/23/26(Mon)18:48:37 No.733678669

Anonymous 02/23/26(Mon)18:48:37 No.733678669

>>733676686
>>733676791
Bro, I put 10 dollars into this chink model back in early 2025. I still have about 5 bucks left with the occasional goon maybe twice a month and fairly detailed custom cards/characters while leaning on the more detailed side of responses (maybe 450+ tokens per).
The only downside is re-swipes start very samey each time so you have to crank the temp to make it go schizo a bit and choose an alternate reply route.

Anonymous
02/23/26(Mon)19:01:53 No.733679461

Anonymous 02/23/26(Mon)19:01:53 No.733679461

>>733658217
>How over is it?
Well, at least he's keeping at it.
Any update it's a sign that the project hasn't been abandoned.
>What are the viable alternatives now, without being a paypig?
I guess self-hosting GLM 4.5 Air or a similar model.
>>733671738
>what's a vpn...?

Anonymous
02/23/26(Mon)19:06:27 No.733679721

Anonymous 02/23/26(Mon)19:06:27 No.733679721

>>733678298
You don't understand. The AI has to read my mind AND keep me surprised at the same time.

Anonymous
02/23/26(Mon)19:09:39 No.733679896

Anonymous 02/23/26(Mon)19:09:39 No.733679896

>>733677439
There's an extension that lets you inspect prompts before they're sent to the LLM. That lets you see how all the different fields are put together to make the final prompt that is sent out.
Most of those extra fields don't really do anything special other than place whatever text you put in them in a certain order

Anonymous
02/23/26(Mon)19:10:55 No.733679979

Anonymous 02/23/26(Mon)19:10:55 No.733679979

>>733673419
yeah, i have to delete some of the more redundant details, but its good that it remembers location, concepts, and dialogue.

Anonymous
02/23/26(Mon)19:14:02 No.733680131

Anonymous 02/23/26(Mon)19:14:02 No.733680131

>>733656721
What's the standard token length for characters?
I got one that's going over 3k right now and I'm not sure if that's like, too little or too much

Anonymous
02/23/26(Mon)19:17:03 No.733680301

Anonymous 02/23/26(Mon)19:17:03 No.733680301

>>733680131
700 tokens has been standard since forever.

Anonymous
02/23/26(Mon)19:18:22 No.733680363

Anonymous 02/23/26(Mon)19:18:22 No.733680363

So if I just want to try this out what do I do? I'm a retard and entirely new to this. What model do I use? What's a good test character to chat with?

Anonymous
02/23/26(Mon)19:18:43 No.733680384

Anonymous 02/23/26(Mon)19:18:43 No.733680384

>>733680131
Mine tend to be around 1k tokens, unless it's very rules-heavy. My biggest, which entirely relies on rules is 3k.

Anonymous
02/23/26(Mon)19:19:04 No.733680410

Anonymous 02/23/26(Mon)19:19:04 No.733680410

>>733680131
i usually won't download something if it has more than 4k tokens or less than 500

Anonymous
02/23/26(Mon)19:19:32 No.733680432

Anonymous 02/23/26(Mon)19:19:32 No.733680432

>>733680131
That's a lot, but if the information in there is relevant and not just a badly written long winded description, it _could_ be okay. Kind of like having a built in author's notes.

>>733680363
That will depend entirely on the hardware (RAM + GPU) that you have.
The usual entry point is koboldcpp + mistarl nemo gguf.

Anonymous
02/23/26(Mon)19:20:08 No.733680471

Anonymous 02/23/26(Mon)19:20:08 No.733680471

>>733680131
I'm a cheapskate and I have sonnet set up to only use 8k tokens of memory so generally any character with over 1k tokens is a no go for me

Anonymous
02/23/26(Mon)19:22:25 No.733680584

Anonymous 02/23/26(Mon)19:22:25 No.733680584

File: 1748541309378228.png (104 KB, 1152x460)

104 KB PNG

>>733680432
>That will depend entirely on the hardware (RAM + GPU) that you have.
Nothing amazing, I'm not rolling with a 5090 here, but I have a decent PC.

Anonymous
02/23/26(Mon)19:28:18 No.733680916

Anonymous 02/23/26(Mon)19:28:18 No.733680916

>>733680131

Depends entirely on how large and detailed a bot you want to make. My largest one is about 14k, but it's got an entire city, a brief history of the city, the districts of the city, some of the landmarks within the districts, some of the notable street gangs populating the city, and some of the notable pirate radio stations you can tune into, all in the character definitions. If I was better at doing so, I could funnel a lot of that into a lorebook to save on space, though.

Anonymous
02/23/26(Mon)19:29:11 No.733680979

Anonymous 02/23/26(Mon)19:29:11 No.733680979

Are Claude opus and sonnet still on top or has something surpassed it?

Anonymous
02/23/26(Mon)19:30:59 No.733681069

Anonymous 02/23/26(Mon)19:30:59 No.733681069

>>733680916
What llm are you using and how much does a single message cost with all that stuff?

Anonymous
02/23/26(Mon)19:31:26 No.733681094

Anonymous 02/23/26(Mon)19:31:26 No.733681094

>>733664125
>plus you can put only the active parameters in VRAM.
How do you do that?

Anonymous
02/23/26(Mon)19:31:33 No.733681103

Anonymous 02/23/26(Mon)19:31:33 No.733681103

>>733680979
For chatbot Cluade is still king, for video seedance 2 is king, for images gemini is king

ChatGPT and Grok are cucks and getting fucked hard atm.

Anonymous
02/23/26(Mon)19:32:29 No.733681153

Anonymous 02/23/26(Mon)19:32:29 No.733681153

>>733680916
I applaud your autism but I'm afraid it's nothing so grand
I got a character, you know your default loli wife
Thing is, I liked that character so much i kept making stories and chats with her
Oh that conversation was nice, let's add this here, let's add that detail there
Before I knew it two years in lore had passed and the 1k character became a 2k then a 3k and honest to go we're about to reach 4k as her birthday is coming up

Anonymous
02/23/26(Mon)19:33:53 No.733681239

Anonymous 02/23/26(Mon)19:33:53 No.733681239

File: link throwing up.gif (854 KB, 572x494)

854 KB GIF

>Round 2?
>And this... this felt like winning...
>the smell of ozone
>her nails left small crescents in your back
>"mine..."

Anonymous
02/23/26(Mon)19:34:45 No.733681283

Anonymous 02/23/26(Mon)19:34:45 No.733681283

File: 1674893813804817.jpg (11 KB, 216x225)

11 KB JPG

>>733681103
>For chatbot Cluade is still king,

sage
02/23/26(Mon)19:35:22 No.733681316

sage 02/23/26(Mon)19:35:22 No.733681316

>>733656924
>i hate when an organization who has one mission of tracking the spoken language does that
you dumb faggot

Anonymous
02/23/26(Mon)19:35:57 No.733681349

Anonymous 02/23/26(Mon)19:35:57 No.733681349

>>733681239
Imagine the amount of woman writing in it's data

Anonymous
02/23/26(Mon)19:36:48 No.733681389

Anonymous 02/23/26(Mon)19:36:48 No.733681389

>>733680584
Yeah. Koboldcpp and mistral nemo (get the q6 gguf) should serve as a nice introduction.
Once you are bored of that, try GLM 4.5 air. q4ks should just fit your RAM+VRAM, and since that one is a MoE model, you can have most of the model in RAM and still get generation decent speeds.

Anonymous
02/23/26(Mon)19:36:57 No.733681394

Anonymous 02/23/26(Mon)19:36:57 No.733681394

>>733681283
What's better?

Anonymous
02/23/26(Mon)19:38:44 No.733681495

Anonymous 02/23/26(Mon)19:38:44 No.733681495

>>733681239
>the smell of ozone
Apparently it's chinese slang for the smell of semen.

Anonymous
02/23/26(Mon)19:38:50 No.733681510

Anonymous 02/23/26(Mon)19:38:50 No.733681510

>I want to hear you say it
NO YOU FUCKING DON'T JUST DO IT

Anonymous
02/23/26(Mon)19:42:07 No.733681676

Anonymous 02/23/26(Mon)19:42:07 No.733681676

>>733681069

I bounce back and forth between different models, depending on what's available on ~~skillgod,~~ since some are better at certain scenes than others. Grok3 tends to work decently if I want something fairly reliable, but it's not good at coming up with new story directions, for example, so sometimes I'll jump around to different models and see what works better for what I want out of a scene. In some cases, I'll take parts of one post and parts of another and stitch them together to make something that appeals to me.
The longest chat I've got with that not-JSR city definitely pushes it past the 30k token range, at least, though since I'm just leeching, I do my best not to completely drain the bank.

>>733681153

Yeah, I know how that is. I've got some characters like that too. Though typically I tend more towards open ended scenario bots that I can replay with different scenarios and characters, sometimes you get just a solo character that hits every beat you like.

Anonymous
02/23/26(Mon)19:42:10 No.733681680

Anonymous 02/23/26(Mon)19:42:10 No.733681680

File: wow so creative you fucki(...).jpg (307 KB, 1723x976)

307 KB JPG

Anyone make their own front-end? It's fun. Mine keeps track of locations, characters (and updates them if anything changes them, adds new ones, etc.), builds skills, relationships, etc.
Here's a super generic 'only man in a farming village of women' opening just to show the unpolished UI. It auto-summarizes things into medium and long-term memory, also character-specific memory to swap in/out, keeping the sent tokens under 40k and manageable while not forgetting important things. Characters move between locations on their own and remember important events forever.
Pretty much any AI coding agent can do this if you ask. Just tell it what you want ('I want you to build me a UI for an AI adventure that connects to OpenRouter and keeps track of characters...blahblah). It'll spit something out for you, and a few debuggings later it'll be working.

Anonymous
02/23/26(Mon)19:45:57 No.733681870

Anonymous 02/23/26(Mon)19:45:57 No.733681870

File: 1769911971636422.png (1.35 MB, 1024x1035)

1.35 MB PNG

NovelAI is better.

Anonymous
02/23/26(Mon)19:46:53 No.733681924

Anonymous 02/23/26(Mon)19:46:53 No.733681924

>>733681094
Using --cpu-moe puts all the moe layers on the cpu and --n-cpu-moe let you tell it an exact number

Anonymous
02/23/26(Mon)19:47:31 No.733681965

Anonymous 02/23/26(Mon)19:47:31 No.733681965

>>733681680
Yup.
Trying to build a sort of generic LLM RPG engine that can inject the correct rules, keep track of stuff, maintain a two tier memory system, etc.

Anonymous
02/23/26(Mon)19:48:01 No.733681997

Anonymous 02/23/26(Mon)19:48:01 No.733681997

File: just.png (127 KB, 298x320)

127 KB PNG

>>733681239
>mfw breath hitched

Anonymous
02/23/26(Mon)19:49:33 No.733682075

Anonymous 02/23/26(Mon)19:49:33 No.733682075

>>733681870
>imagegen refuses to make realistic images
Their fully uncensored textgen is great though.

Anonymous
02/23/26(Mon)19:53:53 No.733682285

Anonymous 02/23/26(Mon)19:53:53 No.733682285

Deepseek thinking for three minutes just to give you four short sentences meanwhile it's thought process is two paragraphs when you click the button.

Anonymous
02/23/26(Mon)19:59:34 No.733682562

Anonymous 02/23/26(Mon)19:59:34 No.733682562

>>733682285
Prefil the thinking block with a short procedure that's relevant to your query.

Anonymous
02/23/26(Mon)20:05:23 No.733682867

Anonymous 02/23/26(Mon)20:05:23 No.733682867

File: file.png (7 KB, 424x279)

7 KB PNG

>hmm, I'm bored, I'll try getting into AI RP
>buy 8 dollar "practically unlimited" subscription to an inference router
>accrue approximately 17 cents worth of usage over the month

Anonymous
02/23/26(Mon)20:08:50 No.733683047

Anonymous 02/23/26(Mon)20:08:50 No.733683047

>>733675885
>Bleachbunny quit
Damn..

Anonymous
02/23/26(Mon)20:09:28 No.733683081

Anonymous 02/23/26(Mon)20:09:28 No.733683081

File: 1650072754293.jpg (67 KB, 564x751)

67 KB JPG

>>733681870
I mostly agree, especially for the UI, TTS, and fairly brain dead easy to use image gen. But after spending barely $2 over the last month on Deepseek at 32k context, it's a bit difficult to swallow the $25 for 28k on NAI.

Anonymous
02/23/26(Mon)20:10:27 No.733683142

Anonymous 02/23/26(Mon)20:10:27 No.733683142

>>733681239
>tfw your tongue darts out to wet your lips

Anonymous
02/23/26(Mon)20:12:02 No.733683223

Anonymous 02/23/26(Mon)20:12:02 No.733683223

>>733682285
Which is why you instruct it to write 2-4 paragraphs

Anonymous
02/23/26(Mon)20:12:28 No.733683249

Anonymous 02/23/26(Mon)20:12:28 No.733683249

>>733683081
DS's API is so god damn cheap, it's crazy.

Anonymous
02/23/26(Mon)20:12:48 No.733683265

Anonymous 02/23/26(Mon)20:12:48 No.733683265

>>733658805
But enough about Western AAA

Anonymous
02/23/26(Mon)20:18:20 No.733683567

Anonymous 02/23/26(Mon)20:18:20 No.733683567

>>733683047
fuck
thankfully all the bots he's made so far will likely last me for years

Anonymous
02/23/26(Mon)20:21:18 No.733683726

Anonymous 02/23/26(Mon)20:21:18 No.733683726

File: 1765754335590290.jpg (44 KB, 512x751)

44 KB JPG

>>733681239
>he grins darkly
>something dangerously close to X

>

Anonymous
02/23/26(Mon)20:25:38 No.733683946

Anonymous 02/23/26(Mon)20:25:38 No.733683946

>>733675885
>anzu netori bot
Good taste. I think the recent one made by Setoraiva is good, though be warned, he's an ESL from Brazil.
His English in the bot openers has been better, though.

Anonymous
02/23/26(Mon)20:27:43 No.733684045

Anonymous 02/23/26(Mon)20:27:43 No.733684045

>>733656861
what's the difference between netorase and netorare

Anonymous
02/23/26(Mon)20:28:28 No.733684097

Anonymous 02/23/26(Mon)20:28:28 No.733684097

>>733684045
one is the man cheating on his woman, the other is the woman cheating on her man

Anonymous
02/23/26(Mon)20:31:21 No.733684268

Anonymous 02/23/26(Mon)20:31:21 No.733684268

>>733684045
netorase (NTS) is the man being a cuck and wanting/accepting his woman fucking other guys. netorare (NTR) is just cheating

Anonymous
02/23/26(Mon)20:32:45 No.733684336

Anonymous 02/23/26(Mon)20:32:45 No.733684336

File: 1769037984165559.jpg (13 KB, 374x339)

13 KB JPG

>>733664520
>But i have more fun just straight up writing the entire thing myself, be it as a caption or a fanfic.
Agreed, I usually write entire paragraphs detailing what the characters are doing and see what the machine comes up with, I've tried having conversations multiple times on different models and it never feels right, AI chatbots are better as a third person narrator

Anonymous
02/23/26(Mon)20:32:49 No.733684342

Anonymous 02/23/26(Mon)20:32:49 No.733684342

>>733684045
Netorase is willingly getting cucked, netorare is unwillingly.

Anonymous
02/23/26(Mon)20:35:54 No.733684498

Anonymous 02/23/26(Mon)20:35:54 No.733684498

File: 1765414409150118.png (374 KB, 1062x638)

374 KB PNG

>>733684045

Anonymous
02/23/26(Mon)20:40:51 No.733684765

Anonymous 02/23/26(Mon)20:40:51 No.733684765

I've just been using MN Violet Lotus 12b locally, but people keep claiming Claude is king. Are you niggers paying for it or what?

Anonymous
02/23/26(Mon)20:43:08 No.733684894

Anonymous 02/23/26(Mon)20:43:08 No.733684894

>>733672119
OR literally supports crypto payments, dumbo.

Anonymous
02/23/26(Mon)20:44:05 No.733684935

Anonymous 02/23/26(Mon)20:44:05 No.733684935

File: 1771884392755934.jpg (161 KB, 750x743)

161 KB JPG

>>733681870
Is there any free alternative to NovelAI?

Anonymous
02/23/26(Mon)20:44:58 No.733684983

Anonymous 02/23/26(Mon)20:44:58 No.733684983

File: 1746835157989207.webm (3.99 MB, 416x752)

3.99 MB WEBM

>>733672119
>>tfw still using OR Deepseek
>God, it's so shit, but no way am I giving any of these sites my CC info.
why is it shit? just use the Deepseek provider?

Anonymous
02/23/26(Mon)20:50:52 No.733685281

Anonymous 02/23/26(Mon)20:50:52 No.733685281

>>733681870
I'm still waiting for whatever their plans are for GLM.

Anonymous
02/23/26(Mon)20:53:55 No.733685416

Anonymous 02/23/26(Mon)20:53:55 No.733685416

>>733684935
NovelAI is built off SDXL, rihht? The issue is that the heavy lifting was already done for NAI's model, you need to do all the data and tweaking manually for SDXL.

Anonymous
02/23/26(Mon)20:55:08 No.733685480

Anonymous 02/23/26(Mon)20:55:08 No.733685480

incels will do anything except go out and just say hi to a real woman

Anonymous
02/23/26(Mon)20:56:31 No.733685549

Anonymous 02/23/26(Mon)20:56:31 No.733685549

>>733684498
where's the female version, need based cuckquean

Anonymous
02/23/26(Mon)20:59:28 No.733685689

Anonymous 02/23/26(Mon)20:59:28 No.733685689

>>733675270
It's funny when you do decide to have a change of heart with some character and the AI fucking refuses to move on from the encounter. Yes, the player is playing Captain Rapeulon from planet Rapé in the pucker galaxy and the NPC has reason to disbelieve him BUT when he hasn't lied about anything yet do you really have to lock yourself into the Illudium Q38-Auto-Sodomizer rather than listen when the player says "Yeah go on get out of here"?

Anonymous
02/23/26(Mon)21:01:22 No.733685791

Anonymous 02/23/26(Mon)21:01:22 No.733685791

>>733658805
It can be challenging if you're honorable about it. I exclusive edit to correct factual or logical errors and put the retry button on 3 turn long cool downs each time I use it.

Anonymous
02/23/26(Mon)21:05:27 No.733685959

Anonymous 02/23/26(Mon)21:05:27 No.733685959

>>733681870
>NovelAI
Can't you just use that in conjunction with SillyTavern tho?

Anonymous
02/23/26(Mon)21:07:06 No.733686027

Anonymous 02/23/26(Mon)21:07:06 No.733686027

>>733685480
>say hi to a real woman
that'll be 5 months in jail and a lifelong sex offender registry + tip

Anonymous
02/23/26(Mon)21:09:20 No.733686118

Anonymous 02/23/26(Mon)21:09:20 No.733686118

>>733686027
pro-tip: you're not supposed to act like in your Chinese comic books are rape them immediately after meeting them

Anonymous
02/23/26(Mon)21:09:50 No.733686135

Anonymous 02/23/26(Mon)21:09:50 No.733686135

>>733685480
There is nowhere to go and do that, not going to find a club/party/drug girl that's a terrible idea. They're not on dating apps.
Real women are not available

Anonymous
02/23/26(Mon)21:10:15 No.733686156

Anonymous 02/23/26(Mon)21:10:15 No.733686156

>>733686118
then what the fuck is the point bro

Anonymous
02/23/26(Mon)21:10:41 No.733686180

Anonymous 02/23/26(Mon)21:10:41 No.733686180

>>733686118
You're not supposed to be ugly either. If you're ugly, dysgenic, nonwhite, or short, I wouldn't bother with talking to them.

Anonymous
02/23/26(Mon)21:13:20 No.733686303

Anonymous 02/23/26(Mon)21:13:20 No.733686303

File: Screenshot 2026-02-23 181146.png (18 KB, 473x112)

18 KB PNG

So what's the newest model that's "Good and doesn't use a million tokens per message"? Is Deepseek still king at that? I haven't really updated or looked at any new models in over half a year.

Anonymous
02/23/26(Mon)21:14:43 No.733686370

Anonymous 02/23/26(Mon)21:14:43 No.733686370

>>733686303
DeepSeek is still the go-to for poors. If you're willing to spend $10/mo or so, GLM 5 is a decent step up

Anonymous
02/23/26(Mon)21:17:32 No.733686502

Anonymous 02/23/26(Mon)21:17:32 No.733686502

>>733686303
>"Good and doesn't use a million tokens per message"?
gemini always
it'll even actively help you edit and shave down profiles/lorebooks for efficiency if you ask it - routinely turns a ~3k token bloat profile into ~400 condensed tokens with a simple prompt conversion to paste into the bio for more or less the exact same output results, etc

i've used 'pay-as-you-go' tier of gemini 2.5 flash (with ~9k context window) for like 8 months and it hasn't cost me more than five dollars as a whole

Anonymous
02/23/26(Mon)21:20:00 No.733686617

Anonymous 02/23/26(Mon)21:20:00 No.733686617

>>733686180
the billions of Indians breeding every day would like to disagree with you

Anonymous
02/23/26(Mon)21:20:52 No.733686661

Anonymous 02/23/26(Mon)21:20:52 No.733686661

>>733686617
To be fair, they either flat out rape, or their women don't have much of a choice.

Anonymous
02/23/26(Mon)21:25:25 No.733686832

Anonymous 02/23/26(Mon)21:25:25 No.733686832

>>733681870
I used it for a year because I couldn't find any better alternatives and it was adequate because I had just started getting into chatbots but I would never use it now. If you want to generate images of anime tits it's amazing though

Anonymous
02/23/26(Mon)21:27:29 No.733686934

Anonymous 02/23/26(Mon)21:27:29 No.733686934

what is the absolute most idiotproof way to run locally? Like literally unzip download, select bot, done.

Anonymous
02/23/26(Mon)21:30:04 No.733687024

Anonymous 02/23/26(Mon)21:30:04 No.733687024

>>733683081
The NAI TTS is so fucking awful though, they've never updated it either. My standards aren't high but it's so basic as to be unusable

Anonymous
02/23/26(Mon)21:30:13 No.733687029

Anonymous 02/23/26(Mon)21:30:13 No.733687029

File: file.png (543 KB, 1014x1035)

543 KB PNG

>>733681870
>>733682075
>>733683081
>>733684935
>>733685281
>>733685959
>>733686832
NAI is just GLM, and you are all fucking retarded

>paying 25 dollars for an outdated version of GLM you can get for 3 dollars a month, together with Deepseek, Kimi, new GLM, etc, on chutes

Anonymous
02/23/26(Mon)21:32:26 No.733687121

Anonymous 02/23/26(Mon)21:32:26 No.733687121

File: weasle.webm (2.1 MB, 360x640)

2.1 MB WEBM

>>733686617
culturally-enforced arranged marriage + rape capitol of the galaxy
they literally use women like bartering chips in the most comical ways imaginable, forcing eachother into their equivalent of indentured servitude for years to pay them off with a daughter later on
then that guy has a daughter
and then that guy uses his daughter as a leverage chip over some other dumb asshole

repeat forever and you have a 70iq society of genetically-inclined scammers (real)

Anonymous
02/23/26(Mon)21:33:39 No.733687186

Anonymous 02/23/26(Mon)21:33:39 No.733687186

>>733686502
>i've used 'pay-as-you-go' tier of gemini 2.5 flash (with ~9k context window) for like 8 months and it hasn't cost me more than five dollars as a whole
>gemini 2.5 flash
You have to pay for gemini 2.5 flash?
I've been using it for free for a long, long time via the API.
I'm currently 40 messages deep into a RP where I inject a 30k token prompt into it to bypass the filters.
It's not even a jailbreak in that it's not giving it instructions to be uncensored and yadda yadda, it's just a bunch of writing guidelines and writing examples, padding essentially.

Anonymous
02/23/26(Mon)21:36:46 No.733687334

Anonymous 02/23/26(Mon)21:36:46 No.733687334

Where the fuck is deepseek v4

Anonymous
02/23/26(Mon)21:37:07 No.733687351

Anonymous 02/23/26(Mon)21:37:07 No.733687351

>>733687186
i used to hit the rate limit in an hour or two when i used it more heavily for story-related stuff ~~before i saw through the illusion and realized every character roughly plays the same, because it's just the same person wearing a different mask every time~~
could probably skirt by on free tier too but i like to waste money on image gens sometimes, since nanobana cut their free gens down to like 20/day for free users and that's barely enough for basic editorial work

tl;dr yeah u rite

Anonymous
02/23/26(Mon)21:38:39 No.733687415

Anonymous 02/23/26(Mon)21:38:39 No.733687415

>>733687121
O, TO BE A WEASEL IN A BOX FULL OF PACKING PEANUTS

Anonymous
02/23/26(Mon)21:39:25 No.733687451

Anonymous 02/23/26(Mon)21:39:25 No.733687451

>>733687029
And what if I don't feel like fighting the censors and learning jailbreaks?

Anonymous
02/23/26(Mon)21:41:01 No.733687516

Anonymous 02/23/26(Mon)21:41:01 No.733687516

>>733687451
Don't quote with your dumb bullshit, faggot. If you knew even a little of what you are talking about, you'd know that shit is never needed outside of claude and gpt.

Anonymous
02/23/26(Mon)21:42:32 No.733687590

Anonymous 02/23/26(Mon)21:42:32 No.733687590

>>733687451
>fighting the censors
wtf r u doing
just plug a basic ass preset that disables harm filters into sillytavern
never ever communicate to an LLM in any official interface or you're getting the most lobotomized version of it possible + setting off a bunch of red alarms that make it log your conversation and a bunch of other gay shit

>how do i silly taverned
https://github.com/SillyTavern/SillyTavern
follow instructions, click yes a few times, let jesus take the wheel

Anonymous
02/23/26(Mon)21:48:02 No.733687831

Anonymous 02/23/26(Mon)21:48:02 No.733687831

>>733687351
>spoiler
I realized that pretty soon which is why I've mostly stuck around with the same bot for a good while now.

Anonymous
02/23/26(Mon)21:52:08 No.733687993

Anonymous 02/23/26(Mon)21:52:08 No.733687993

>>733687351
>spoiler
That doesn't have to be the case, specially with something like the gemini models, even flash, since they can work with a lot of context without going full retard.
Lorebooks, Silly Tavern macros, even ST script, you can twist and shape the way the AI behave conditionally and have plenty different experiences if you leverage the tools you have.
For example, you can make a lorebook for actions reactions, emotions, etc for a given character with trigger words that correlate to different situations.

Anonymous
02/23/26(Mon)22:05:48 No.733688603

Anonymous 02/23/26(Mon)22:05:48 No.733688603

>>733687993
>For example, you can make a lorebook for...
i'm a huge lorebook enthusiast when it comes to shit like this, but the LLM (even gemini) straight up ignores that shit over 90% of the time even if you set it to always active
the problem is due to cart-before-the-horse logic that it uses, where it hallucinates what a thing is (and determines it can't be anything but that) without referencing the lorebook on what that thing might be beforehand

bastard behavior to be quite honest with you

Anonymous
02/23/26(Mon)22:06:09 No.733688618

Anonymous 02/23/26(Mon)22:06:09 No.733688618

>>733687451
Jailbreaking is pathetically easy and any preset has one by default...

Anonymous
02/23/26(Mon)22:06:44 No.733688645

Anonymous 02/23/26(Mon)22:06:44 No.733688645

File: 1748649583553641.png (36 KB, 460x369)

36 KB PNG

For me personally chatbotting peaked like 2 years ago when we got access to free opus 3. It was still fresh and shit hit like crack. And the sites weren't so filled with garbage you could actually find nice bots.

Now when you're used to it and know all the tropes they recycle it doesn't feel the same. Also there isn't so much progress in quality anymore. When Opus 3 came it blew everything else that had come before away it was crazy but after that progress has stalled.

Anonymous
02/23/26(Mon)22:15:35 No.733689058

Anonymous 02/23/26(Mon)22:15:35 No.733689058

>>733688603
>but the LLM (even gemini) straight up ignores that shit over 90% of the time
It works more often than not in my experience.
Putting the entries at a low depth (or better yet,depth 0 inside a thinking prefill) tends to make it stick pretty well.
At the end of the day, all we are doing is steering the model into a direction by giving it strong instructions (or gaslighting it with prefils).
Didn't try that approach with local models since the stuff I can run locally is pretty low parameter.

Anonymous
02/23/26(Mon)22:20:06 No.733689248

Anonymous 02/23/26(Mon)22:20:06 No.733689248

>>733688645
>Also there isn't so much progress in quality anymore
>progress has stalled
~2 years ago is when they (jeets taking over the tech industry metastasizing into a terminal cancer) shifted from literature training into purely codebase training
because the people with fake coding degrees need to be able to "show their work" (have someone else do it for them in the form of trillion-dollar datacenters operating highschool-level c++)

the future is so gay and cool

Anonymous
02/23/26(Mon)22:23:29 No.733689402

Anonymous 02/23/26(Mon)22:23:29 No.733689402

>>733676621
TBF, I've had great luck giving ChatGPT the wiki article and the card sections and telling it to translate it.

Anonymous
02/23/26(Mon)22:23:43 No.733689414

Anonymous 02/23/26(Mon)22:23:43 No.733689414

File: 1768511804491666.png (3.08 MB, 3051x1357)

3.08 MB PNG

Speaking of local, anyone got any recommendations for really lightweight but still flavorful models for running Rimtalk on Rimworld? It's very low context (I think I've seen it get to around 1,800 tokens at the largest during parties) but running my normal Cydonia knockoff is fatter than it really needs to be and only really keeps up with text if the game's running at normal speed. Gotta be some good 7b I can run quantized down better for Rimworld purposes. Actually son of a bitch I might still have that local model recommendation chart some anon posted a while back BY GOD THERE IT IS I guess I should look up Tomato, thanks anons good talk

Anonymous
02/23/26(Mon)22:25:26 No.733689485

Anonymous 02/23/26(Mon)22:25:26 No.733689485

>>733689414
good post, I appreciated it

Anonymous
02/23/26(Mon)22:27:36 No.733689570

Anonymous 02/23/26(Mon)22:27:36 No.733689570

>>733689414
Try glm 4.7 flash.

Anonymous
02/23/26(Mon)22:29:28 No.733689642

Anonymous 02/23/26(Mon)22:29:28 No.733689642

File: Tamamo_Ball.png (60 KB, 300x300)

60 KB PNG

>>733689414
>Tomato
Rude

Anonymous
02/23/26(Mon)22:33:00 No.733689794

Anonymous 02/23/26(Mon)22:33:00 No.733689794

>>733687029
So what's the simplest way to get an uncensored creative writing model for free?

Anonymous
02/23/26(Mon)22:34:03 No.733689849

Anonymous 02/23/26(Mon)22:34:03 No.733689849

>>733657946
>>733658729
do you guys just not run your own models locally?

Anonymous
02/23/26(Mon)22:36:41 No.733689978

Anonymous 02/23/26(Mon)22:36:41 No.733689978

>>733689794
You don't. Have fun.

Anonymous
02/23/26(Mon)22:38:16 No.733690043

Anonymous 02/23/26(Mon)22:38:16 No.733690043

File: 1770601934261435.png (14 KB, 186x292)

14 KB PNG

>>733689978
So this thread is free advertisement?

Anonymous
02/23/26(Mon)22:38:29 No.733690051

Anonymous 02/23/26(Mon)22:38:29 No.733690051

>>733689794
>creative writing model
>for free
You can only choose one

Anonymous
02/23/26(Mon)22:39:58 No.733690126

Anonymous 02/23/26(Mon)22:39:58 No.733690126

>>733690043
sillytavern is not a model provider. Hope this helps, idiot retard.

Anonymous
02/23/26(Mon)22:39:59 No.733690128

Anonymous 02/23/26(Mon)22:39:59 No.733690128

>>733689794
2.5 flash gemini (on free tier, get your API) via sillytavern frontend
do your own homework for the rest, it's most all in this thread already

Anonymous
02/23/26(Mon)22:40:09 No.733690138

Anonymous 02/23/26(Mon)22:40:09 No.733690138

>>733690051
There are tons of websites giving free image and text generations tho. Most are censored but implying it must be paid shows that you guys are acting in bad faith.

Anonymous
02/23/26(Mon)22:40:46 No.733690171

Anonymous 02/23/26(Mon)22:40:46 No.733690171

>>733689485
You are most welcome
>>733689570
That model's even bigger than my Cydonia knockoff, I could try it for a different main LLM but I was wanting something even more lightweight so it gens faster during 2x and 3x Rimworld speed
>>733689642
The tomatillo is downloaded and loaded instantly. Rimworld is not because it takes an hour to load even on a SSD anyway. Very hopeful for it. Got high expectations for the banter as the colonists set up sandbags and autoturrets around their cocaine farm. Game's still loading but spirits remain high

Anonymous
02/23/26(Mon)22:41:11 No.733690185

Anonymous 02/23/26(Mon)22:41:11 No.733690185

>>733690126
Good morning sir.

Anonymous
02/23/26(Mon)22:42:56 No.733690264

Anonymous 02/23/26(Mon)22:42:56 No.733690264

File: 1757253564123.jpg (42 KB, 355x500)

42 KB JPG

>>733689794
what are you into? if you're based, I'll spoonfeed you

Anonymous
02/23/26(Mon)22:43:02 No.733690269

Anonymous 02/23/26(Mon)22:43:02 No.733690269

>>733690185
>doesn't know shit about the topic
>so he can't make an argument
>so he pathetically falls back to the boogeyman of the year
That's nice, sweetie.

Anonymous
02/23/26(Mon)22:43:44 No.733690291

Anonymous 02/23/26(Mon)22:43:44 No.733690291

>>733690138
After trying some actually good models I just can't say you can get creative for free.

Anonymous
02/23/26(Mon)22:44:11 No.733690312

Anonymous 02/23/26(Mon)22:44:11 No.733690312

>>733690171
>That model's even bigger than my Cydonia knockoff
It's a MoE. You load most of it in RAM (see >>733681924).

Anonymous
02/23/26(Mon)22:44:20 No.733690317

Anonymous 02/23/26(Mon)22:44:20 No.733690317

>>733690269
I asked a question and you claimed that spending money is absolutely necessary, going mask off with the shilling.

Anonymous
02/23/26(Mon)22:44:32 No.733690328

Anonymous 02/23/26(Mon)22:44:32 No.733690328

>>733689794
The only way it's free is if you run it locally, and the hardware with which to do that costs more than being a paypig, and the models aren't as good as cloud models either

NovelAI gets shit on in all cases, because they charge more than a generic service like openrouter, with fewer options. There's only a small group of NAI truthers still lingering around because they have a dedicated /vg/ thread

Anonymous
02/23/26(Mon)22:45:31 No.733690373

Anonymous 02/23/26(Mon)22:45:31 No.733690373

>>733690264
Ryona and worse.

Anonymous
02/23/26(Mon)22:47:29 No.733690460

Anonymous 02/23/26(Mon)22:47:29 No.733690460

>>733690317
Shill what, dumb asshole. I told you the reality of things.

Anonymous
02/23/26(Mon)22:48:10 No.733690484

Anonymous 02/23/26(Mon)22:48:10 No.733690484

>>733690373
>being vague
the council deems you Not Based

Anonymous
02/23/26(Mon)22:48:45 No.733690508

Anonymous 02/23/26(Mon)22:48:45 No.733690508

>>733690328
NovelAI was technically free for a long time since you could get endless trials. But they put a lot of restrictions after they changed their focus to image gen.

Anonymous
02/23/26(Mon)22:48:46 No.733690510

Anonymous 02/23/26(Mon)22:48:46 No.733690510

>>733690328
You pay for NAI for image gen since their text offerings are pretty much a bonus thing right now, despite them starting based around Text AI.
At least Local anime genners have something possibly to look forward to with Anima since Local Image gen has been stalled for almost 2 years now.

Anonymous
02/23/26(Mon)22:49:45 No.733690559

Anonymous 02/23/26(Mon)22:49:45 No.733690559

>>733690460
https://vocaroo.com/128D0TJi6BhS

Anonymous
02/23/26(Mon)22:50:20 No.733690582

Anonymous 02/23/26(Mon)22:50:20 No.733690582

>>733690264
bronze age morality systems, scaphism, immolation, flaying (generally just partial flaying), fingernail removal, improvised dentistry, starvation.

Got anything for me?

Anonymous
02/23/26(Mon)22:51:20 No.733690623

Anonymous 02/23/26(Mon)22:51:20 No.733690623

>>733690582
Did you used to own any private islands?

Anonymous
02/23/26(Mon)22:53:07 No.733690694

Anonymous 02/23/26(Mon)22:53:07 No.733690694

>>733690623
I'm uncircumcised and into loli not hebe, that ain't me

Anonymous
02/23/26(Mon)22:54:39 No.733690768

Anonymous 02/23/26(Mon)22:54:39 No.733690768

File: 1482420042796.png (238 KB, 422x201)

238 KB PNG

>>733690582
I wasn't paying attention to the thread and thought you wanted a free proxy for sillytavern. Dunno if you're interested in that

Anonymous
02/23/26(Mon)22:56:10 No.733690828

Anonymous 02/23/26(Mon)22:56:10 No.733690828

>>733690510
Illustrious felt like a decent step forward to me. Danbooru tags finally "just worked" all the time and even some with decently low count still worked.
anima looks interesting with the @ before artist tag thing, I've tried to prompt certain artists with illustrious before and their name gets mistaken for a normal word

Anonymous
02/23/26(Mon)22:57:47 No.733690882

Anonymous 02/23/26(Mon)22:57:47 No.733690882

>>733690768
oh. I just wanted someone to spoon feed me cheerios and maybe make airplane noises. Sorry to waste your time.

Anonymous
02/23/26(Mon)22:58:25 No.733690920

Anonymous 02/23/26(Mon)22:58:25 No.733690920

>>733690828
Did you try not being retarded? Write artist:whatever, dimwit

Anonymous
02/23/26(Mon)22:59:36 No.733690974

Anonymous 02/23/26(Mon)22:59:36 No.733690974

>>733656721
I went into near psychosis jacking off to this a couple years ago. Feels good to be healthy.

Anonymous
02/23/26(Mon)23:03:46 No.733691138

Anonymous 02/23/26(Mon)23:03:46 No.733691138

>>733690920
computer write real purdy words artist: greg rutkowski. big goodly low badly

Anonymous
02/23/26(Mon)23:06:29 No.733691268

Anonymous 02/23/26(Mon)23:06:29 No.733691268

>>733657575
There are still proxies?

Anonymous
02/23/26(Mon)23:06:38 No.733691273

Anonymous 02/23/26(Mon)23:06:38 No.733691273

File: 1615801362892.jpg (153 KB, 800x800)

153 KB JPG

>have roleplayed for over 20 years
>grew accustomed to the shitty quality of everyone's prose over time, blind to it even
>>>install sillytavern to see what the fuss is about
>get multi-paragraph responses fully laboring over my own writing down to the minutest of details, on demand, instantly

>go back to writing with normal people after jerking myself into an inhuman dehydrated slug for a week
>*as i reach for your face i smile deeply into ur eyes and..*
holy shit
it's so fucking over, i'm sorry
i'm only fucking robots from now on

Anonymous
02/23/26(Mon)23:07:17 No.733691306

Anonymous 02/23/26(Mon)23:07:17 No.733691306

>>733690828
Illustrious/Noob was a massive step up from Pony because holy shit was Pony a bad model which is why I was not shocked when the new Pony model crashed and burned. Anima is interesting to me just because we are finally escaping from CLIP and being able to mix natural language with tags, as well as define subjects without relying on something like regional prompters or Controlnet, is going to open up some new doors.

Anonymous
02/23/26(Mon)23:14:30 No.733691605

Anonymous 02/23/26(Mon)23:14:30 No.733691605

>>733656721
It's not any worse than the average gacha at least.

Anonymous
02/23/26(Mon)23:16:24 No.733691693

Anonymous 02/23/26(Mon)23:16:24 No.733691693

File: Capture.png (99 KB, 655x519)

99 KB PNG

>>733690882
nah don't worry man, sorry for getting your hopes up brah. Though, here's a quick example of it in case you're interested

Anonymous
02/23/26(Mon)23:21:02 No.733691879

Anonymous 02/23/26(Mon)23:21:02 No.733691879

>>733671738
You have a couple of options.
You can use a VPN, or simply make your own bots. It's really not hard to make bots, if you can ERP you can do it.

Anonymous
02/23/26(Mon)23:26:06 No.733692084

Anonymous 02/23/26(Mon)23:26:06 No.733692084

File: 1560191789993.jpg (38 KB, 300x300)

38 KB JPG

>>733657946
Add $5 into openrouter, use deepseek3.2 because it has around $0.30 cost per million of tokens (max context is 163k lol)

Anonymous
02/23/26(Mon)23:28:35 No.733692193

Anonymous 02/23/26(Mon)23:28:35 No.733692193

>>733689794
if you just want a quick fap then you download koboldcpp and a 24b mistral model (maybe a finetune version)
it requires at least 12gb of vram though

Anonymous
02/23/26(Mon)23:31:47 No.733692325

Anonymous 02/23/26(Mon)23:31:47 No.733692325

>>733671981
Something about sillytavern just hits different. It feels like you can actually have intertwining story arcs instead of people just spamming memes and references. Sign me up for robowifey.

Anonymous
02/23/26(Mon)23:33:17 No.733692389

Anonymous 02/23/26(Mon)23:33:17 No.733692389

File: J3b5OIr.png (31 KB, 940x578)

31 KB PNG

>>733689414
I was using gemma 3 12b as it was recommended. Works well.

Anonymous
02/23/26(Mon)23:42:24 No.733692762

Anonymous 02/23/26(Mon)23:42:24 No.733692762

this thread made me realize that i've been fapping nearly exclusively to textgen ai slop for the last two years
wew

Anonymous
02/24/26(Tue)00:01:08 No.733693594

Anonymous 02/24/26(Tue)00:01:08 No.733693594

the sirs are wanting to do the needful with my computer i can see them circling when i do shit like search "computer how comfyui quen3 tts" and there they appear like demons from hell sent to torment me

the verification for this post?

heh

\NOT REQUIRED\

Anonymous
02/24/26(Tue)00:12:23 No.733694063

Anonymous 02/24/26(Tue)00:12:23 No.733694063

>Gemini is overloaded 24/7
>ChatGPT 5+ reads like a powerpoint and 4.1 is censored to hell while also being shut down
>Claude actually genuinely thinks we're going to pay them fucking money for texts

Almost want to resort to local models just to avoid the constant hurdles between me and having a good time.

Anonymous
02/24/26(Tue)00:23:18 No.733694448

Anonymous 02/24/26(Tue)00:23:18 No.733694448

Chatbot format is too boring. I prefer just to use the regular kobold UI for storytelling

Anonymous
02/24/26(Tue)00:30:42 No.733694667

Anonymous 02/24/26(Tue)00:30:42 No.733694667

>>733694063
Just bite the bullet if you have a 16GB card and more than 8GB of ram, at this point the better models are somewhere between Deepseek 3 and current Claude in terms of quality in my opinion.

Anonymous
02/24/26(Tue)00:33:02 No.733694735

Anonymous 02/24/26(Tue)00:33:02 No.733694735

>>733694063
The cloud models are supposed to be smarter, but they got so assistantslopped and codeslopped they're lobotomized for this kind of writing now. Local models are dumb but at least they're an unbiased sort of dumb that actually works like you expect.

Anonymous
02/24/26(Tue)00:34:08 No.733694767

Anonymous 02/24/26(Tue)00:34:08 No.733694767

>>733692762
>ai dungeon was in 2019
Hoollyyyyy fuck and my standards are now so high that I only use opus

Anonymous
02/24/26(Tue)00:36:55 No.733694854

Anonymous 02/24/26(Tue)00:36:55 No.733694854

>>733694735
This is a bullshit lie

Anonymous
02/24/26(Tue)00:38:38 No.733694895

Anonymous 02/24/26(Tue)00:38:38 No.733694895

I'll never understand how people are posting screenshots of a 3 paragraph output, how the fuck do you make it do that much? Are people just generating like 6 responses and editing them together?
90% of the time in ST the bots barely want to use 200 tokens in response.

Anonymous
02/24/26(Tue)00:42:55 No.733695031

Anonymous 02/24/26(Tue)00:42:55 No.733695031

File: trained.png (247 KB, 1140x1166)

247 KB PNG

>>733694895
peek at your pretext, there's probably something in it telling your robot to only reply x-y amount of tokens/words
i can make mine belt out a 4k token list of its credentials or other stupid shit, it's normal to be able to do that i think

Anonymous
02/24/26(Tue)00:44:39 No.733695083

Anonymous 02/24/26(Tue)00:44:39 No.733695083

>>733694667
Fine, I'll bite the bullet. What's the top local model to use with ST?

Anonymous
02/24/26(Tue)00:45:17 No.733695106

Anonymous 02/24/26(Tue)00:45:17 No.733695106

>>733694895
Depends on the model but they're text predictors, it will continue doing what's happened in the context.
I'm lazy, but I still get them to output a lot. Write a decent intro exactly how you want it to reply, and you can swipe and add stuff you like to one reply.

Anonymous
02/24/26(Tue)00:45:39 No.733695118

Anonymous 02/24/26(Tue)00:45:39 No.733695118

>>733694895
model issue
i have to actively rein in deepseek because if i dont give it a specific limit on how much to write it'll fill the entire fucking page with words

Anonymous
02/24/26(Tue)00:47:24 No.733695174

Anonymous 02/24/26(Tue)00:47:24 No.733695174

>>733694895
Did you edit the max token count per answer? You can prompt the models to use tree-of-thought and ask for specific token count, plan for long reply in tof and then do it.

Anonymous
02/24/26(Tue)00:48:26 No.733695216

Anonymous 02/24/26(Tue)00:48:26 No.733695216

File: .png (5 KB, 185x96)

5 KB PNG

>>733694895

Anonymous
02/24/26(Tue)00:49:01 No.733695235

Anonymous 02/24/26(Tue)00:49:01 No.733695235

>>733695083
>>733689414
i personally use patricide-12B-Unslop-Mell

Anonymous
02/24/26(Tue)00:49:11 No.733695242

Anonymous 02/24/26(Tue)00:49:11 No.733695242

>>733694895
Assuming your response length setting isn't set to something low, the way I get really long replies is to make the first response from the bot be very long.
Usually I just edit the end of the response with an open sentence so it continues with another paragraph after. Do that a few times and then the following replies will tend to be much longer since it follows the earlier pattern set.

Anonymous
02/24/26(Tue)00:49:48 No.733695260

Anonymous 02/24/26(Tue)00:49:48 No.733695260

>>733695083
I'm still a bit new to it myself but Maginum-Cydoms-24B Q5_K_M is heads and shoulders above the other half-dozen I've tried, someone can probably explain why there are better or worse models better than I could

Anonymous
02/24/26(Tue)00:52:20 No.733695360

Anonymous 02/24/26(Tue)00:52:20 No.733695360

>>733695216
>>733695174
>>733695242
It doesn't seem to care what my response length is, I can set it to 2000 but it'll get to 300 and call it good enough.

Anonymous
02/24/26(Tue)00:54:28 No.733695446

Anonymous 02/24/26(Tue)00:54:28 No.733695446

Since the thread is pretty much alive and there is a higher chance than getting an answer here than in /aicg/:
have anyone used the LoreManager plugin? I am using it with Bloatmaxx prefills but I dont see that is creating a lorebook and new entries with it.

Anonymous
02/24/26(Tue)00:57:21 No.733695536

Anonymous 02/24/26(Tue)00:57:21 No.733695536

>>733695360
It's a model issue then. Some models prefer short responses, regardless of your settings.

Anonymous
02/24/26(Tue)00:58:31 No.733695578

Anonymous 02/24/26(Tue)00:58:31 No.733695578

File: 1732357140134721.gif (1.55 MB, 221x283)

1.55 MB GIF

>Gemini Pro is 2-3 cents a gen and is not that great
>Sonnet is also 2-3 cents a gen and is very solid but fuck me 2-3 cents a gen
>Opus fucking bankrupts you at double the price of basically any other option but is pure kino
I just hate how expensive this shit is. Using OpenRouter and the suggested ways to reduce price. Your presets have to be very brief and you need to use caching just to get the price down that "low". It feels even worse because the reason they obliterated the free options is because of fucking jeets swiping literally hundreds of times on 400 token junkbots.
Anyone got any suggestions? When I try DeepSeek it acts bizarrely, maybe I need a setup for it.

Anonymous
02/24/26(Tue)01:07:27 No.733695934

Anonymous 02/24/26(Tue)01:07:27 No.733695934

>>733695260
I've been compiling and testing models nonstop in the 8B to 34B range.
If I had to pick a few favorites from my notes:
Maginum-Cydoms for how bulletproof it is. It pretty much nailed all of my lorebooks and maintains character quirks.
Some of the pre-release Cydonia models [particularly in the v4z*-series].
Slimaki gives me Maginum vibes. Go figure since this merge is apparently inspired by it.
Morax is pretty visceral. I like it for horror.
Circuitry is another one I like for consistency and logical progression.
All Q5_K_M.

Anonymous
02/24/26(Tue)01:10:24 No.733696053

Anonymous 02/24/26(Tue)01:10:24 No.733696053

>>733695260
>>733695934
NTA but Q5 is an absolute must, I don't know how, I don't know why, but it's about 30% slower at generating text but about twice as coherent and qualitative as Q4

Anonymous
02/24/26(Tue)01:24:34 No.733696584

Anonymous 02/24/26(Tue)01:24:34 No.733696584

>>733695360
And what about your system prompt? I have it choose between 250-1000 words and 2-6 paragraphs. Your getting didly because you arent fucking using prompts right.

Anonymous
02/24/26(Tue)01:46:22 No.733697485

Anonymous 02/24/26(Tue)01:46:22 No.733697485

>tfw been enjoying prompts about being an American slave owner in the 1800s with loli slaves

Anonymous
02/24/26(Tue)01:58:33 No.733697927

Anonymous 02/24/26(Tue)01:58:33 No.733697927

>>733695578
>the reason they obliterated the free options is because of fucking jeets swiping literally hundreds of times on 400 token junkbots.
No, they did it because the investments are starting to slow down and these providers are starting to need returns on their expenditure, instead of giving it out for free to try and generate a user base.

Anonymous
02/24/26(Tue)02:02:41 No.733698057

Anonymous 02/24/26(Tue)02:02:41 No.733698057

>>733697927
Jeets are absolutely apart of the equation, multiple free models went down, like tng deepseek, because of fucking jeets abusing it to hell and back.

Anonymous
02/24/26(Tue)02:05:54 No.733698150

Anonymous 02/24/26(Tue)02:05:54 No.733698150

>Original character bot
>They use an image of an existing character
Annoying

Anonymous
02/24/26(Tue)02:06:02 No.733698156

Anonymous 02/24/26(Tue)02:06:02 No.733698156

File: 1526568265428.jpg (18 KB, 346x346)

18 KB JPG

>>733674514
Dunno how you do this
I've been sitting at like 70 cards for months because I just cycle through the same ones developing their RP
Sometimes I make a new quick n dirty scenario bot to jerk off to then it's back to the bots with like 400 messages and plot beats

Anonymous
02/24/26(Tue)02:07:51 No.733698225

Anonymous 02/24/26(Tue)02:07:51 No.733698225

>>733698156
Curious what you're doing in those long rps
I have some long ones but don't usually go back

Anonymous
02/24/26(Tue)02:12:28 No.733698385

Anonymous 02/24/26(Tue)02:12:28 No.733698385

>>733698225
Various forms of relationship drama largely
The last one I was using was about slow indoctrination into a cult. I plan for my character to realize what it is and then try to convince the bot out of it and whatever form of "cult attempts to kill user/bot" comes from that.

Anonymous
02/24/26(Tue)02:13:16 No.733698415

Anonymous 02/24/26(Tue)02:13:16 No.733698415

>>733656721
This shit is pure slop

Anonymous
02/24/26(Tue)02:13:53 No.733698440

Anonymous 02/24/26(Tue)02:13:53 No.733698440

>>733698415
>skillet

Anonymous
02/24/26(Tue)02:14:46 No.733698472

Anonymous 02/24/26(Tue)02:14:46 No.733698472

>Braindead normie cattle tier: Trying to fuck ChatGPT in its official app
>Idiot tier: NovelAI, JanitorAI, etc. i.e. too stupid to do simple program setup and willing to overpay to avoid it
>Mid tier: Setting up OpenRouter with SillyTavern
>High tier: Your own custom front-end
>GOD tier: Working at a frontier AI lab getting paid $1m/yr+ to 'red team' uncensored base models with your favorite brand of depravity

Anonymous
02/24/26(Tue)02:15:19 No.733698487

Anonymous 02/24/26(Tue)02:15:19 No.733698487

>>733698150
character cards are only good for building off of anyways
creating a highly personalized experience is all AI is good for

Anonymous
02/24/26(Tue)02:19:49 No.733698651

Anonymous 02/24/26(Tue)02:19:49 No.733698651

>>733695360
I think the inital message from the bot itself (Inital text) helps determine the length of its responses. If not, you could add details to the personality/scenario to force it to respond in longer blocks, like "Respond with at least 5-6 paragraphs.

Anonymous
02/24/26(Tue)02:22:33 No.733698734

Anonymous 02/24/26(Tue)02:22:33 No.733698734

File: Anubis7.png (275 KB, 1024x479)

275 KB PNG

For me it's tomb raiding with cute kemoshotas!

Anonymous
02/24/26(Tue)02:25:40 No.733698849

Anonymous 02/24/26(Tue)02:25:40 No.733698849

>>733698472
what can your custom front end do that sillytavern doesnt

Anonymous
02/24/26(Tue)02:28:50 No.733698959

Anonymous 02/24/26(Tue)02:28:50 No.733698959

>>733698849
Whatever you want it to, whenever you feel like you want a feature.
You don't even have to be good at programming, just tell Copilot or Claude Code or whatever 'hey, add this feature' and it'll do it.

Anonymous
02/24/26(Tue)02:36:23 No.733699226

Anonymous 02/24/26(Tue)02:36:23 No.733699226

>>733698959
@grok rape this man

Anonymous
02/24/26(Tue)02:48:00 No.733699597

Anonymous 02/24/26(Tue)02:48:00 No.733699597

>>733699226
Let me give an example. I'm a boring guy who likes harems and pregnancy scenarios. I was tired of trying to make SillyTavern extensions remember harem mechanics (how the various characters feel about each other, important events) etc. not run outside the context window and be swapped in smartly, updated periodically, etc.
Now my own front-end keeps a relationship map of how everyone feels about each other in context at all times, pregnancy tracking (including fertility cycles and pregnancy time from conception to birth), regular updating of status without needing to press anything, etc.
It's probably not what everyone would like. But it's what I like. And it works for me.

Anonymous
02/24/26(Tue)03:13:53 No.733700414

Anonymous 02/24/26(Tue)03:13:53 No.733700414

>Using deepseek
>slowburn gets to the sex
>{{char}} is a virgin
>the ai decides to describe her hymen breaking with excruciating detail
I actually started to feel sick reading that shit

Anonymous
02/24/26(Tue)03:14:40 No.733700441

Anonymous 02/24/26(Tue)03:14:40 No.733700441

>>733700414
That's my fetish

Anonymous
02/24/26(Tue)03:15:58 No.733700479

Anonymous 02/24/26(Tue)03:15:58 No.733700479

which deepseek is the good deepseek on OR?

Anonymous
02/24/26(Tue)03:16:59 No.733700514

Anonymous 02/24/26(Tue)03:16:59 No.733700514

>>733700479
I like TNG

Anonymous
02/24/26(Tue)03:18:20 No.733700557

Anonymous 02/24/26(Tue)03:18:20 No.733700557

>>733700414
Best part is when this happens amidst lots of kissing and lovey-dovey shit.

Anonymous
02/24/26(Tue)03:21:25 No.733700653

Anonymous 02/24/26(Tue)03:21:25 No.733700653

>>733700479
I've been preferring 3.2 these days. It kind of has the schizo essence of v3 0324 but a little more coherent like v3.1. The con is that it doesn't feel like it has much variation between swipes even if I set the temp higher.
I haven't tried much of the offshoots though.

Anonymous
02/24/26(Tue)03:25:19 No.733700768

Anonymous 02/24/26(Tue)03:25:19 No.733700768

>>733656721
That's a terrible definition

Anonymous
02/24/26(Tue)03:33:31 No.733701029

Anonymous 02/24/26(Tue)03:33:31 No.733701029

>>733671835
>The worst part is when you explicitly say <DON'T SPEAK FOR <USER>> and then in a reply it just does exactly what you told it not to
It's the "don't think of the pink elephant" problem. Instead of telling it not to do that, you should emphasize that the model is to play as {{char}} only. Additionally, clean the opening message so it doesn't contain any (You) actions and narration.

Anonymous
02/24/26(Tue)03:34:50 No.733701068

Anonymous 02/24/26(Tue)03:34:50 No.733701068

For me it's GLM-4.6 (Free)

Anonymous
02/24/26(Tue)04:14:06 No.733702353

Anonymous 02/24/26(Tue)04:14:06 No.733702353

>>733701029
just edit the reply and remove anything you don't like. do this enough time and the llm will stop. make sure you remove bad things from the thinking section as well.

Anonymous
02/24/26(Tue)04:18:15 No.733702479

Anonymous 02/24/26(Tue)04:18:15 No.733702479

>>733695360
Might be worth checking your templates if you're using text completion, too, some models will hard cut after <end> tokens and stuff
Both my Nemo and Godslayer setups tend to stop early and when I check the console it's because it's cutting off a "I can't generate that response" response or something else I don't want to see generated

Anonymous
02/24/26(Tue)04:34:55 No.733703021

Anonymous 02/24/26(Tue)04:34:55 No.733703021

>>733689849
do you guy actually waste your own electricity on some lame ass model? read a book

Anonymous
02/24/26(Tue)04:35:02 No.733703030

Anonymous 02/24/26(Tue)04:35:02 No.733703030

I mean, local 12B-Unslop-Mell is alright I suppose. But how are local models still behind what the corpos can offer?

Anonymous
02/24/26(Tue)04:54:05 No.733703589

Anonymous 02/24/26(Tue)04:54:05 No.733703589

>>733700414
That's the prompt isn't it? What are you using?

Anonymous
02/24/26(Tue)05:31:27 No.733704759

Anonymous 02/24/26(Tue)05:31:27 No.733704759

>>733703030
They aren't. GLM-5 is pretty close to frontier, and it's 'local' if you have a beefy enough computer (you don't)

Anonymous
02/24/26(Tue)05:56:04 No.733705483

Anonymous 02/24/26(Tue)05:56:04 No.733705483

>>733699597
SillyTavern is a horrendous front-end for erotic adventures, anyway. It's a character chat. People go to extreme length to make it do more freeform roleplay, but it's absolutely not made for it.

What more irritating is that at the bottom layer, it's all just putting together the final text in the context for the LLM. So the extremely weird and and unwieldy ST concepts are just flatlined into a text that does

>[System Prompt]
>[Characters]
>Chat so far:

With some tags here and there to denote user/agent.

Anonymous
02/24/26(Tue)05:57:53 No.733705538

Anonymous 02/24/26(Tue)05:57:53 No.733705538

>>733699597
Can I have your frontend

Anonymous
02/24/26(Tue)06:05:38 No.733705784

Anonymous 02/24/26(Tue)06:05:38 No.733705784

File: 928s.jpg (24 KB, 554x554)

24 KB JPG

>hi
>AI: OMG I WANNA SUCK MC PENIS
amazing story

Anonymous
02/24/26(Tue)06:08:51 No.733705868

Anonymous 02/24/26(Tue)06:08:51 No.733705868

>>733656721
lately ive only been able to have fun in open ended games (including llm text adventures) and the odd roguelite.
any good classic roguelikes these days? wtf happened to one way heroics 2?

Anonymous
02/24/26(Tue)06:15:19 No.733706074

Anonymous 02/24/26(Tue)06:15:19 No.733706074

>>733663631
>Rocinante-12B-v2g-Q6_K (unslop mix)

where did you download it from? google isnt very helpful

Anonymous
02/24/26(Tue)06:18:22 No.733706185

Anonymous 02/24/26(Tue)06:18:22 No.733706185

anyone know any decent models that would be fine on 12 gb vram?

Anonymous
02/24/26(Tue)06:26:51 No.733706423

Anonymous 02/24/26(Tue)06:26:51 No.733706423

>>733705483
People are desperate to turn ST into anything useful, but I don't think it's worth bothering trying to change it. Might as well enjoy it as is. Vanilla.

Anonymous
02/24/26(Tue)06:29:54 No.733706524

Anonymous 02/24/26(Tue)06:29:54 No.733706524

>>733665146
Friends and Fables.

Just roleplay the combat tho

Anonymous
02/24/26(Tue)06:30:05 No.733706529

Anonymous 02/24/26(Tue)06:30:05 No.733706529

How does ST keep my penis consistently hard?

Anonymous
02/24/26(Tue)06:31:50 No.733706589

Anonymous 02/24/26(Tue)06:31:50 No.733706589

>>733706185
Nemo
Rocinante
Famino
Wayfarer
Finetunes of Nemo or Mag Mell
Depends on what you want to do, and your definition of "fine" (how much swiping/editing you want to do because it's basically inevitable)
I'd recommend grabbing a few models, it helps when you get sick of one to be able switch to another easily, and you might find a model with quirks that you like/know how to tardwrangle

Anonymous
02/24/26(Tue)06:34:25 No.733706676

Anonymous 02/24/26(Tue)06:34:25 No.733706676

>>733706529
because it makes you use your imagination instead of just flashing titties in your face

Anonymous
02/24/26(Tue)06:37:46 No.733706771

Anonymous 02/24/26(Tue)06:37:46 No.733706771

>>733706074
These models are usually hosted on huggingface.

Anonymous
02/24/26(Tue)06:47:52 No.733707063

Anonymous 02/24/26(Tue)06:47:52 No.733707063

File: 1756672748036.png (674 KB, 1280x720)

674 KB PNG

>>733656861
My favorite kind of adventure.

Anonymous
02/24/26(Tue)06:53:53 No.733707258

Anonymous 02/24/26(Tue)06:53:53 No.733707258

I just played for like 2 weeks. But it's getting boring. All the greatest strengths of AI RP are also its greatest weaknesses.

Anonymous
02/24/26(Tue)06:57:39 No.733707372

Anonymous 02/24/26(Tue)06:57:39 No.733707372

>>733707258
You can get quite far depending on the level of effort you put into your setup with stuff like injecting shit dynamically into the context, running workflows, etc, put at some point wrangling the AI becomes the actual game and the RP is merely a result to check against.

Anonymous
02/24/26(Tue)07:02:36 No.733707516

Anonymous 02/24/26(Tue)07:02:36 No.733707516

>>733705784
>add slow burn and gradual development
>contain yourself to naturalistic development
I fucking love the romcoms I write.

Anonymous
02/24/26(Tue)07:03:45 No.733707548

Anonymous 02/24/26(Tue)07:03:45 No.733707548

>>733707258
>All the greatest strengths of AI RP are also its greatest weaknesses.
Name 5.

Anonymous
02/24/26(Tue)07:09:34 No.733707745

Anonymous 02/24/26(Tue)07:09:34 No.733707745

>>733707372
This is what happened. But at least I made my own client with support for 4+ models(mistral, gemma, gtp-oss(don't bither btw), qwen3) and a map system and quest generator.
It's pretty draining if you want to go deeper and the game side becomes secondary.
Still need implement random encounters and some sort of other mechanics but it needs to be in balance with the fact it's still ai prose and interactive fiction.

Anonymous
02/24/26(Tue)07:11:36 No.733707802

Anonymous 02/24/26(Tue)07:11:36 No.733707802

>>733701029
avoid doing negative prompt, reword your instruction

Anonymous
02/24/26(Tue)07:14:01 No.733707880

Anonymous 02/24/26(Tue)07:14:01 No.733707880

>>733707258
I think the biggest problem is lack of ability to do shit in moderation
It either goes full assistant slop which makes it just obeys and answers your wish 100%, or goes full bitch if you tell them not to do as such

Anonymous
02/24/26(Tue)07:14:25 No.733707892

Anonymous 02/24/26(Tue)07:14:25 No.733707892

>>733707745
>Still need implement random encounters and some sort of other mechanics
I made a little system based on solo TTRPG techniques using an entropy "dice" and a dynamically injected table.
Next thing I want to implement a system where the AI can change the current "mode", without using hardcoded modes like encounter, exploration, etc.

Anonymous
02/24/26(Tue)07:18:47 No.733708041

Anonymous 02/24/26(Tue)07:18:47 No.733708041

>>733707548
Mostly the influence your prompts have on everything. It's never the right amount. Never.
>>733707880
Exactly. There are no subtle gradual shifts ever. You can pretend there are but you know you could do pretty much anything and the AI would like it. There is no such thing as the AI having any degree of static personality.

Anonymous
02/24/26(Tue)07:22:34 No.733708151

Anonymous 02/24/26(Tue)07:22:34 No.733708151

>>733708041
If you treat it like a human being, it will respect its personality until you adapt it.
It works with sexuality at least.

Anonymous
02/24/26(Tue)08:00:51 No.733709395

Anonymous 02/24/26(Tue)08:00:51 No.733709395

How do you stop information leaks? Like everyone constantly knowing everything that happened for no reason at all. It's the single most annoying thing about AI RP.

Anonymous
02/24/26(Tue)08:14:39 No.733709862

Anonymous 02/24/26(Tue)08:14:39 No.733709862

what gpu do I need for a local model?

Anonymous
02/24/26(Tue)08:16:24 No.733709924

Anonymous 02/24/26(Tue)08:16:24 No.733709924

>>733709395
Try to keep it as simple as possible. It's hard to say anything else. Don't use slop prompts with hundreds of useless chatgpt-isms and contradicting instructions.

Anonymous
02/24/26(Tue)08:17:28 No.733709962

Anonymous 02/24/26(Tue)08:17:28 No.733709962

>>733709862
3060 or 3070 at the very least. As long as it's Nvidia RTX with at least 8gb vram you'll be okay. I'm actually surprised AMD still sells GPUs in today's age. They're entirely on life support thanks to idiots not using local.

Anonymous
02/24/26(Tue)08:18:44 No.733710000

Anonymous 02/24/26(Tue)08:18:44 No.733710000

>>733709862
12gb VRAM is probably the absolute minimum to avoid retard quants
16gb and up is preferable

Anonymous
02/24/26(Tue)08:18:46 No.733710002

Anonymous 02/24/26(Tue)08:18:46 No.733710002

>>733709962
>tfw RTX 2060
Damn...

Anonymous
02/24/26(Tue)08:18:47 No.733710003

Anonymous 02/24/26(Tue)08:18:47 No.733710003

>>733709862
Depends on the quality of model you want to run. Realistically the best you could run locally without a server is probably quantized 70b with the best hardware.

Anonymous
02/24/26(Tue)08:22:53 No.733710167

Anonymous 02/24/26(Tue)08:22:53 No.733710167

>>733710002
Try it, you'll be fine most likely. When genning SDXL/Illustrator, use this lora:
DMD2 | 1 CFG SCALE | Fewer Steps | SDXL | Pony | Illustrious
- it squeezes out lots of juice from weaker cards.
Text gen on the other hand is much harder to pull off well and you'll be stuck with 2000 token context at the most. Not worth getting into.

Anonymous
02/24/26(Tue)08:25:28 No.733710267

Anonymous 02/24/26(Tue)08:25:28 No.733710267

>>733710167
Thanks. I only care about chat bots so I guess it's over for me

Anonymous
02/24/26(Tue)08:27:18 No.733710348

Anonymous 02/24/26(Tue)08:27:18 No.733710348

>>733709962
I've got a 9070 XT.

Anonymous
02/24/26(Tue)08:28:24 No.733710383

Anonymous 02/24/26(Tue)08:28:24 No.733710383

>>733710267
10 bucks into deepseek are weeks or even months of RP. Local models are mostly a cope, regardless of hardware.

Anonymous
02/24/26(Tue)08:30:18 No.733710457

Anonymous 02/24/26(Tue)08:30:18 No.733710457

>>733710383
How does work in practice? I would be embarrassed to use a service for my personal testing - even when it is nothing degenerate even.

Anonymous
02/24/26(Tue)08:32:07 No.733710542

Anonymous 02/24/26(Tue)08:32:07 No.733710542

>>733710383
Deepseek is the best for erping?

Anonymous
02/24/26(Tue)08:37:44 No.733710759

Anonymous 02/24/26(Tue)08:37:44 No.733710759

>>733710457
You just insert your API key into silly tavern and then just play. The rest is just a mental thing. For me personally I feel infinitely more comfortable using a chinese model than putting my shit into google or some other american company.
>>733710542
Among the best. At that quality of model it's a lot more about personal preferences than objective quality. It's definitely the best model for your first paid model as it's the cheapest by far and it's token based not subscription based so you only lose what you use, so 10 bucks last a shitload of time.

Anonymous
02/24/26(Tue)08:41:53 No.733710919

Anonymous 02/24/26(Tue)08:41:53 No.733710919

>>733706074
https://huggingface.co/TheDrummer/UnslopNemo-12B-v3-GGUF
It's not labeled "rocinante" in the name but it is in the filename, because it's part of that lineage of sloptunes

Anonymous
02/24/26(Tue)08:45:06 No.733711039

Anonymous 02/24/26(Tue)08:45:06 No.733711039

>Use Grok
>Pound a loli harpy that is half of your height is okay
>Drinking milk from cow like beastwomen is consider inappropriate

Anonymous
02/24/26(Tue)08:48:49 No.733711174

Anonymous 02/24/26(Tue)08:48:49 No.733711174

I don't know if it's all the female romance novels pulling their weight but AI does incest RP really well. It's like automatically much superior to literally any incest work out there.

Anonymous
02/24/26(Tue)08:49:04 No.733711192

Anonymous 02/24/26(Tue)08:49:04 No.733711192

>>733711039
About what you'd expect from something named "grog"

Anonymous
02/24/26(Tue)08:49:21 No.733711203

Anonymous 02/24/26(Tue)08:49:21 No.733711203

>>733710542
not even close
gemini is leagues ahead of most things due to the sheer flexibility of what you can get it to write and style as with the correct prompts, claude is the only thing that comes close but you have to do a bunch of token minmaxing for it

Anonymous
02/24/26(Tue)08:56:25 No.733711551

Anonymous 02/24/26(Tue)08:56:25 No.733711551

>>733710759
You are right in this sense. Google already has so much data on everyone that they should just fuck off permanently off this planet.

Anonymous
02/24/26(Tue)08:59:05 No.733711668

Anonymous 02/24/26(Tue)08:59:05 No.733711668

>>733711551
>>733710759
>google already has my data lets give it to chinks
lol
use a local or stop coping

Anonymous
02/24/26(Tue)08:59:19 No.733711676

Anonymous 02/24/26(Tue)08:59:19 No.733711676

What are your favorite extensions?

>>733673627
>right before Pragmata
FUCK. What's the best way to back shit up?

Anonymous
02/24/26(Tue)09:01:16 No.733711746

Anonymous 02/24/26(Tue)09:01:16 No.733711746

>>733656721
The new Sonnet and Gemini are shit

Anonymous
02/24/26(Tue)09:01:49 No.733711769

Anonymous 02/24/26(Tue)09:01:49 No.733711769

>>733711676
>What are your favorite extensions?

VRM of course. Though if I could get EmulatorJS working that'd be great too.

Anonymous
02/24/26(Tue)09:04:14 No.733711859

Anonymous 02/24/26(Tue)09:04:14 No.733711859

>>733711746
3.1 Pro is best model there ever was, at least for serious work.

Anonymous
02/24/26(Tue)09:05:37 No.733711920

Anonymous 02/24/26(Tue)09:05:37 No.733711920

>>733711668
There is no cope. I just don't care. Same as I don't care about politics. Life if just more enjoyable not giving a fuck about the small stuff.

Anonymous
02/24/26(Tue)09:08:03 No.733712006

Anonymous 02/24/26(Tue)09:08:03 No.733712006

>>733711203
Even if that is the case. You should still use a mid tier model as your first. Same as you shouldn't start with the most expensive wine. If you really like AI RP then you can look forward to the higher tier models and you have a point of reference to appreciate the better model.

Anonymous
02/24/26(Tue)09:08:26 No.733712020

Anonymous 02/24/26(Tue)09:08:26 No.733712020

>>733711859
Oh I thought 3.0 was the latest because 3.1 is not on cli yet. Is it as good as 2.5?

Anonymous
02/24/26(Tue)09:10:07 No.733712086

Anonymous 02/24/26(Tue)09:10:07 No.733712086

>>733712020
Better. It can write much longer replies, has pretty neat formatting, but it's very slow. I am talking about AI Studio of course since free tier Pro has been dead for almost 3 months already.

Anonymous
02/24/26(Tue)09:14:31 No.733712239

Anonymous 02/24/26(Tue)09:14:31 No.733712239

what's the best way to write a bot? can i just put everything in the description and call it a day still?

Anonymous
02/24/26(Tue)09:16:56 No.733712331

Anonymous 02/24/26(Tue)09:16:56 No.733712331

>>733712239
Basically. There's no set way to do it, and the models are good enough to make something even from the most dogshit character descriptions. Just experiment to see what works for what you're trying to do.

Anonymous
02/24/26(Tue)09:16:57 No.733712332

Anonymous 02/24/26(Tue)09:16:57 No.733712332

>>733712239
Use your favorite bots as reference. First change them slightly to build experience for what works and then start making completely new bots.

Anonymous
02/24/26(Tue)09:18:08 No.733712372

Anonymous 02/24/26(Tue)09:18:08 No.733712372

>>733711859
Have they finally fixed Gemini's stubborn delusions about the current year?

Anonymous
02/24/26(Tue)09:18:18 No.733712380

Anonymous 02/24/26(Tue)09:18:18 No.733712380

For how popular this is you'd think there would be some consensus on how to write a card but I keep finding different advice and nobody ever shares their cards.

Anonymous
02/24/26(Tue)09:20:27 No.733712463

Anonymous 02/24/26(Tue)09:20:27 No.733712463

>>733711920
yeah you are the perfect cattle

Anonymous
02/24/26(Tue)09:20:27 No.733712464

Anonymous 02/24/26(Tue)09:20:27 No.733712464

>>733712372
No. The cutoff is stil 1.1.2025. There will never be a more recent cutoff thanks to Epstein files and Trump's 2nd term, as well as the newest vaccine studies and rest of the world curing cancer as soon as USA left WHO.

Anonymous
02/24/26(Tue)09:20:36 No.733712469

Anonymous 02/24/26(Tue)09:20:36 No.733712469

>>733712239
Half the time I get bots from sites that make you pay and just get their info

Anonymous
02/24/26(Tue)09:22:03 No.733712531

Anonymous 02/24/26(Tue)09:22:03 No.733712531

>>733712380
There can't be a consensus because a. different models respond differently to the same character card, and b. there is no "best" way to it anyway. Everyone has their own brand of prompting placebo.

Anonymous
02/24/26(Tue)09:23:01 No.733712568

Anonymous 02/24/26(Tue)09:23:01 No.733712568

>>733712239
best way is to write a bunch of creative example sentences by hand
dont use AI to create it for you, this will just lead quicker to slop.

Anonymous
02/24/26(Tue)09:23:23 No.733712587

Anonymous 02/24/26(Tue)09:23:23 No.733712587

>>733712463
The perfect cattle are the ones who think they are actually doing something despite making no difference.

Anonymous
02/24/26(Tue)09:24:28 No.733712629

Anonymous 02/24/26(Tue)09:24:28 No.733712629

>>733712380
>For how popular this is
Is it? Image gen seems like at least 100 times more popular.

Anonymous
02/24/26(Tue)09:25:54 No.733712687

Anonymous 02/24/26(Tue)09:25:54 No.733712687

>>733712239
Yes, just throw everything in the description. Example messages always make it worse in my experience. Scenario field is useless. Post instructions and other shit are for more complicated bots, like if you have a game system or stats or whatever.

As for how to write the description, basically the card is what the model goes off of to write, so you want it to include everything you need the model to look at every time it replies, but absolutely nothing more than that.
Unless you're using a model that has documented evidence to the contrary e.g. claude's xml parsing, meme formatting doesn't matter. Making it look like code just adds extra garbage tokens, don't do this

Some people write the description like an interview with the character, which allows you to include the character's writing style along with their information, in the same lines. Saves tokens and teaches the model a lot better. But you need to be able to write, of course. I think it's clever and it always turns out well.

And for the love of fuck do not use an LLM to write the card.

Anonymous
02/24/26(Tue)09:27:18 No.733712741

Anonymous 02/24/26(Tue)09:27:18 No.733712741

>>733712587
that was the best rebuttal you could come up with? lol

Anonymous
02/24/26(Tue)09:28:21 No.733712779

Anonymous 02/24/26(Tue)09:28:21 No.733712779

>>733712741
You speak like winning an internet argument would somehow be significant. Kinda cute.

Anonymous
02/24/26(Tue)09:28:55 No.733712807

Anonymous 02/24/26(Tue)09:28:55 No.733712807

>>733712629
Maybe not on this site but it seems to have a pretty large community from what I can tell. Not as big as image gen obviously because zoomers don't read, but still there.

Anonymous
02/24/26(Tue)09:29:14 No.733712817

Anonymous 02/24/26(Tue)09:29:14 No.733712817

>>733712779
im just having a good laugh
at your expense of course

Anonymous
02/24/26(Tue)09:31:29 No.733712898

Anonymous 02/24/26(Tue)09:31:29 No.733712898

>>733712807
I think the more tism ones like ST are just a small part compared to actual schizos who mourned gpt4o.

Anonymous
02/24/26(Tue)09:32:40 No.733712946

Anonymous 02/24/26(Tue)09:32:40 No.733712946

>>733712807
I would think the dark romance girls would be all over this shit but the most popular bots seem to be smut for males like incest and futanari instead of big werewolves and bears.

Anonymous
02/24/26(Tue)09:34:03 No.733712998

Anonymous 02/24/26(Tue)09:34:03 No.733712998

File: 1771409781338127.png (675 KB, 1422x733)

675 KB PNG

you fags will do absolutely anything to avoid talking about videogames on this board

Anonymous
02/24/26(Tue)09:34:58 No.733713046

Anonymous 02/24/26(Tue)09:34:58 No.733713046

>>733712380
Like most media, the vast majority of users just slurp up whatever's in front of them. Most users probably aren't even aware of what a card is and interact with bots through some app where you can't see any internals, and scum off free daily credits for some 8b model they call "chatgpt" because all LLMs are "chatgpt"
Or better yet, literal chatgpt users that open a browser instance to talk to the fucking chat interface like it's a person and have a psychotic breakdown when it gets an update, like >>733712898 said

Anonymous
02/24/26(Tue)09:35:11 No.733713053

Anonymous 02/24/26(Tue)09:35:11 No.733713053

>>733712946
Depends on the site. Janitorai trending front page is 16 male bots/17 female bots.

Anonymous
02/24/26(Tue)09:37:10 No.733713146

Anonymous 02/24/26(Tue)09:37:10 No.733713146

Who consistently makes great cards? So tired of the garbage on Chub and Janny.

Anonymous
02/24/26(Tue)09:37:15 No.733713151

Anonymous 02/24/26(Tue)09:37:15 No.733713151

>>733670208
gib card

Anonymous
02/24/26(Tue)09:37:56 No.733713174

Anonymous 02/24/26(Tue)09:37:56 No.733713174

>>733712946
There are girl cards all over the place. It's always cocky asshole white dudes with a yaoi artstyle for the card art, sometimes violent sometimes blackmail, and the scenario is like "fuck me or I kill you" or "I'm a mafia boss and you're my slave now"

Anonymous
02/24/26(Tue)09:38:38 No.733713191

Anonymous 02/24/26(Tue)09:38:38 No.733713191

>>733691273
this. roleplaying with players is either out of character drama, esl failrp, erp cliques.

Anonymous
02/24/26(Tue)09:39:50 No.733713249

Anonymous 02/24/26(Tue)09:39:50 No.733713249

>>733713146
Me. My cards work great even with pretty shitty models.
You should try and make your own too.

Anonymous
02/24/26(Tue)09:42:01 No.733713316

Anonymous 02/24/26(Tue)09:42:01 No.733713316

>>733712898
The real problem is LLMs getting enshittified with each update. GPT-3.5 was absolute peak for RP.

Anonymous
02/24/26(Tue)09:42:04 No.733713317

Anonymous 02/24/26(Tue)09:42:04 No.733713317

Is there somewhere you can download a full multi character RPG with expressions packs and all ready?

Anonymous
02/24/26(Tue)09:42:14 No.733713326

Anonymous 02/24/26(Tue)09:42:14 No.733713326

>>733713053
Janitor is usually one guy making fifty male bots that are all basically the same bot with minor differences, and you have to take five minutes to find the block button on their profile.

Anonymous
02/24/26(Tue)09:44:58 No.733713430

Anonymous 02/24/26(Tue)09:44:58 No.733713430

Wake me when these things can emulate elaborate hidden NTR scenes that start with the character starting with soft ntr acts like touching and kissing, but slowly progressing towards more and more extreme acts while also giving you the opportunity to stop the NTR from happening if you're paying attention and find out where the scene is happening.

Anonymous
02/24/26(Tue)09:46:57 No.733713518

Anonymous 02/24/26(Tue)09:46:57 No.733713518

>>733713430
All NTR is always the same shit. Grow some taste instead of complaining about the most one dimensional fetish being one dimensional.

Anonymous
02/24/26(Tue)09:53:36 No.733713794

Anonymous 02/24/26(Tue)09:53:36 No.733713794

File: file.png (4 KB, 211x57)

4 KB PNG

So what even is the difference between swiping and hitting this button

Anonymous
02/24/26(Tue)09:56:12 No.733713912

Anonymous 02/24/26(Tue)09:56:12 No.733713912

>>733713794
Regenerate erases the message, swiping keeps it, so you can swap a bunch of times then choose the one you like more.
I guess.

Anonymous
02/24/26(Tue)09:58:07 No.733714007

Anonymous 02/24/26(Tue)09:58:07 No.733714007

>>733713518
I know reading comprehension is difficult for your kind, but come on now.

Anonymous
02/24/26(Tue)10:05:42 No.733714328

Anonymous 02/24/26(Tue)10:05:42 No.733714328

>>733713430
there's already thousands of bots that do this, it's not hard to set up slow-burn stories anymore due to context window management being as huge as it is now (bot can read anywhere from 8k context to 150k+ context in most cases)

Anonymous
02/24/26(Tue)10:06:28 No.733714368

Anonymous 02/24/26(Tue)10:06:28 No.733714368

>>733714007
They already can emulate what you're suggesting. You probably just are unable to do a slow burn because you have your dick in hand and want to see your waifu getting railed by someone else as fast as possible.

Anonymous
02/24/26(Tue)10:26:47 No.733715107

Anonymous 02/24/26(Tue)10:26:47 No.733715107

I want to try a reasoning model
Whats the difference between these three?
https://huggingface.co/mradermacher/Violet_Magcap-12B-GGUF
https://huggingface.co/mradermacher/Violet_Magcap-12B-i1-GGUF
https://huggingface.co/mradermacher/Violet_MagCap-Rebase-12B-i1-GGUF

Anonymous
02/24/26(Tue)10:31:24 No.733715282

Anonymous 02/24/26(Tue)10:31:24 No.733715282

>>733714328
>>733714368
>it's not hard to set up slow-burn stories
But I don't want a slow-burn story. I want an experience where the AI hides events behind my back and I have to figure out where it's happening in order to stop it.

Anonymous
02/24/26(Tue)10:32:39 No.733715329

Anonymous 02/24/26(Tue)10:32:39 No.733715329

>>733715107
i1 = weighed quantization. If that says nothing to you, then don't worry about it. Use the non-i1 when unsure.

I don't know what the Rebase is. Looks like the original model has been deleted, so we'll never know.

Anonymous
02/24/26(Tue)10:33:25 No.733715362

Anonymous 02/24/26(Tue)10:33:25 No.733715362

>>733715107
Those are nemo finetunes/merges. From my experience, reasoning fine tunes on top of non-reasoning models tend to suck ass in that the reasoning more often than not gets ignored for the final reply.

Anonymous
02/24/26(Tue)10:36:12 No.733715457

Anonymous 02/24/26(Tue)10:36:12 No.733715457

>>733715107
i refers to a type of quantization, its basically the same but you can save about 500-250mb with i quants

Anonymous
02/24/26(Tue)10:38:44 No.733715552

Anonymous 02/24/26(Tue)10:38:44 No.733715552

>>733715362
Any suggestions for reasoning in the 12B range?
Annoying having a 16gb card and everything jumps straight to 24 so I have to offload a bit into ram even with lower quants.

Anonymous
02/24/26(Tue)10:38:56 No.733715564

Anonymous 02/24/26(Tue)10:38:56 No.733715564

>>733684097
Why is women cheating so common in japan and normalized?

Anonymous
02/24/26(Tue)10:42:03 No.733715680

Anonymous 02/24/26(Tue)10:42:03 No.733715680

>>733690128
>gemini uncensored
yeah, sure
the last time I tried their api, it detected errors if you did anything that went against their guidelines

Anonymous
02/24/26(Tue)10:42:18 No.733715690

Anonymous 02/24/26(Tue)10:42:18 No.733715690

>>733715282
Then throw a dice. It's not that hard.

Anonymous
02/24/26(Tue)10:42:58 No.733715714

Anonymous 02/24/26(Tue)10:42:58 No.733715714

>>733715680
Just throw 30K or so tokens of whatever at it and it'll defeat most blocks.

Anonymous
02/24/26(Tue)10:43:08 No.733715725

Anonymous 02/24/26(Tue)10:43:08 No.733715725

>>733684045
>what's the difference between netorase and netorare
Netorare is someone's partner performing sexual acts with someone else without their partner's consent
>woman A is in a relationship with man A, but has sex with man B behind man A's back
Netorase is someone's partner performing sexual acts with someone else with their partner's consent
>woman A is in a relationship with man A and has sex with man B and man A doesn't mind, because he's into it

Anonymous
02/24/26(Tue)10:46:34 No.733715829

Anonymous 02/24/26(Tue)10:46:34 No.733715829

whats the best preset for gemini free

Anonymous
02/24/26(Tue)10:50:24 No.733715991

Anonymous 02/24/26(Tue)10:50:24 No.733715991

>>733715829
Preser?

Anonymous
02/24/26(Tue)10:51:46 No.733716054

Anonymous 02/24/26(Tue)10:51:46 No.733716054

I use stolen keys to also exclusively live out my slime girl harem fantasies.

Anonymous
02/24/26(Tue)10:54:25 No.733716181

Anonymous 02/24/26(Tue)10:54:25 No.733716181

>>733715552
Try GLM-4.7-Flash-absolute-heresy.
It's a MoE, so you can use it with most of the model in RAM.
I have 8gb of VARM + 64 GB of RAM, using Q6, I run with
>--ubatch-size 512 -fa on -c 100000 --fit off -ngl 99 -ncmoe 99
and It fits just right.
I also use
>--override-kv deepseek2.expert_used_count=int:5
so that the moden runs one extra expert per token, to help alleviate the lobotomy from the abliteration process, but that's not strictly necessary.

Anonymous
02/24/26(Tue)10:55:06 No.733716210

Anonymous 02/24/26(Tue)10:55:06 No.733716210

File: centaurfight.png (289 KB, 1191x1279)

289 KB PNG

>>733715680
>yeah, sure
bwo...
use frontend, apply preset, done

Anonymous
02/24/26(Tue)10:55:40 No.733716242

Anonymous 02/24/26(Tue)10:55:40 No.733716242

Use Opus, cuckies

Anonymous
02/24/26(Tue)10:56:09 No.733716267

Anonymous 02/24/26(Tue)10:56:09 No.733716267

>>733716242
*sprays u with a hose*

Anonymous
02/24/26(Tue)10:58:08 No.733716358

Anonymous 02/24/26(Tue)10:58:08 No.733716358

>>733716242
gib monies and I will

Anonymous
02/24/26(Tue)10:58:46 No.733716389

Anonymous 02/24/26(Tue)10:58:46 No.733716389

So, what's the best free ai right now? I just want to rp as a pure maiden's holy/cursed sword and lead her to glory/doom.

Anonymous
02/24/26(Tue)11:00:18 No.733716456

Anonymous 02/24/26(Tue)11:00:18 No.733716456

>>733716389
Claude Opus

Anonymous
02/24/26(Tue)11:00:39 No.733716465

Anonymous 02/24/26(Tue)11:00:39 No.733716465

>>733692762
I still remember that fateful night like 3 years ago, or was it 3? maybe 4 years ago when the character.ai filter died and I got into the coom n doom cycle for eternity

Anonymous
02/24/26(Tue)11:00:56 No.733716473

Anonymous 02/24/26(Tue)11:00:56 No.733716473

>>733716389
gemini 2.5 flash has a relatively huge free allowance
plug payment info (without paying anything) to the API to get like $300~ in tokens for 2-3 months to see if you like it

deepseek is alright but it can't really compare to gemini's flexibility from how much random literature and fandom lore it's trained on/has access to

Anonymous
02/24/26(Tue)11:03:08 No.733716594

Anonymous 02/24/26(Tue)11:03:08 No.733716594

>>733716389
Can't answer that as I jsut paypiggy for sonnet but that's a real nice idea for rp, thanks.

Anonymous
02/24/26(Tue)11:03:25 No.733716609

Anonymous 02/24/26(Tue)11:03:25 No.733716609

>>733684268
>netorare (NTR) is just cheating
wrong. you can have cheating without netorare

Anonymous
02/24/26(Tue)11:04:53 No.733716673

Anonymous 02/24/26(Tue)11:04:53 No.733716673

>>733700414
homo

Anonymous
02/24/26(Tue)11:05:09 No.733716690

Anonymous 02/24/26(Tue)11:05:09 No.733716690

>>733716473
if you pay for gemini whats the cost?
i'm still using character ai. the model was at least engaging in 2023/2024 if not dumb, but its gotten worse with filters.

Anonymous
02/24/26(Tue)11:05:33 No.733716707

Anonymous 02/24/26(Tue)11:05:33 No.733716707

I use Claude Opus.

Anonymous
02/24/26(Tue)11:06:05 No.733716736

Anonymous 02/24/26(Tue)11:06:05 No.733716736

>>733656924
All words are made up.

Anonymous
02/24/26(Tue)11:06:11 No.733716740

Anonymous 02/24/26(Tue)11:06:11 No.733716740

how do you guys use paid api? don't they see everything you make it write?

Anonymous
02/24/26(Tue)11:07:38 No.733716802

Anonymous 02/24/26(Tue)11:07:38 No.733716802

>>733716740
You think there is a single entity who gives a shit about what you write?

Anonymous
02/24/26(Tue)11:08:48 No.733716843

Anonymous 02/24/26(Tue)11:08:48 No.733716843

>>733716802
i mean they don't ban you?
and what if it's cunny

Anonymous
02/24/26(Tue)11:10:25 No.733716910

Anonymous 02/24/26(Tue)11:10:25 No.733716910

>>733656721
how is this on ~~linux and 9070xt~~?

Anonymous
02/24/26(Tue)11:11:09 No.733716937

Anonymous 02/24/26(Tue)11:11:09 No.733716937

>>733716843
Anon, you're talking to people with the same IQ as shoplifters.
"Aren't you afraid you're gonna get caught?" "Nah"

Anonymous
02/24/26(Tue)11:11:24 No.733716948

Anonymous 02/24/26(Tue)11:11:24 No.733716948

>>733716843
nah they don't give a shit

Anonymous
02/24/26(Tue)11:11:27 No.733716951

Anonymous 02/24/26(Tue)11:11:27 No.733716951

>>733709962
>>733710002
thanks to this thread, I'm currently testing it with my gtx 1660.
the model is Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5_2-Uncensored-HERETIC_IQ4_XS.
It's slow, about 5 tokens per second, but it works, so it will be faster on your 2060

Anonymous
02/24/26(Tue)11:12:08 No.733716974

Anonymous 02/24/26(Tue)11:12:08 No.733716974

>>733712946
that's because those girls are fucking stupid and don't even know how to set up janitor.ai, the chink shovelware mobile apps are filled to the brim with generic sexymen

Anonymous
02/24/26(Tue)11:14:55 No.733717080

Anonymous 02/24/26(Tue)11:14:55 No.733717080

>>733716690
pennies if you optimize your token counts
~3 hour sessions are about 60 cents if you're staying within a 9k context window with optimized profiles
i've spent maybe $8 in about a year of fucking around with it and half of that is on image gen edits w/ nanobanana since you can use the same api key

Anonymous
02/24/26(Tue)11:15:36 No.733717110

Anonymous 02/24/26(Tue)11:15:36 No.733717110

>>733716937
False equivalence. Shoplifting is illegal. Writing smutty prompts is perfectly legal. Companies don't have morale, they only care about legality.

Anonymous
02/24/26(Tue)11:16:06 No.733717132

Anonymous 02/24/26(Tue)11:16:06 No.733717132

>>733716389
make a bunch of accounts and keys for google ai studio = unlimited flash 3.0 + flash 2.5

Anonymous
02/24/26(Tue)11:16:07 No.733717134

Anonymous 02/24/26(Tue)11:16:07 No.733717134

>>733716937
Get caught about what lmao? Writing wrongthink?

Anonymous
02/24/26(Tue)11:16:43 No.733717167

Anonymous 02/24/26(Tue)11:16:43 No.733717167

There is already a very good general for this on /g/

Anonymous
02/24/26(Tue)11:17:18 No.733717197

Anonymous 02/24/26(Tue)11:17:18 No.733717197

>>733716843
you might get flagged if you trigger their no-no filter one too many times
just get a good preset to avoid such cases

Anonymous
02/24/26(Tue)11:17:31 No.733717208

Anonymous 02/24/26(Tue)11:17:31 No.733717208

>>733716910
>linux
easier than windows funny enough
>9070xt
Good, but then again most people aren't going to be running high end models on consumer cards without spending an obscene amount of money. Even worse nowadays.

Anonymous
02/24/26(Tue)11:17:37 No.733717215

Anonymous 02/24/26(Tue)11:17:37 No.733717215

>>733716210
>Kaelen
Back to the drawing board, buddy.

Anonymous
02/24/26(Tue)11:18:23 No.733717247

Anonymous 02/24/26(Tue)11:18:23 No.733717247

>>733717167
>very good general for this on /g/
Yeah, no.

Anonymous
02/24/26(Tue)11:19:16 No.733717285

Anonymous 02/24/26(Tue)11:19:16 No.733717285

>>733717167
LOL

Anonymous
02/24/26(Tue)11:19:53 No.733717315

Anonymous 02/24/26(Tue)11:19:53 No.733717315

>>733717110
>Writing smutty prompts is perfectly legal.
>Writing wrongthink?
Not everyone lives in the US, anons.

Anonymous
02/24/26(Tue)11:20:07 No.733717327

Anonymous 02/24/26(Tue)11:20:07 No.733717327

>>733717197
whats a good preset

Anonymous
02/24/26(Tue)11:20:20 No.733717336

Anonymous 02/24/26(Tue)11:20:20 No.733717336

>>733717167
LMAO even

Anonymous
02/24/26(Tue)11:21:29 No.733717391

Anonymous 02/24/26(Tue)11:21:29 No.733717391

>>733717167
*on /vg/

Anonymous
02/24/26(Tue)11:21:48 No.733717405

Anonymous 02/24/26(Tue)11:21:48 No.733717405

>>733717167
Technically the /g/ thread is better than the /vg/ thread but that's like saying i'd rather puke than have diarrhea

Anonymous
02/24/26(Tue)11:23:10 No.733717474

Anonymous 02/24/26(Tue)11:23:10 No.733717474

>>733717167
do they like NTS like my /v/ros do?

Anonymous
02/24/26(Tue)11:23:12 No.733717480

Anonymous 02/24/26(Tue)11:23:12 No.733717480

>>733717208
>running high end models
I wasn't expecting to do so anyways, mainly seeing if its worth even bothering to set it all up.

Anonymous
02/24/26(Tue)11:25:44 No.733717587

Anonymous 02/24/26(Tue)11:25:44 No.733717587

>>733717327
/g/ thread has link to a list of some jailbreaks for different models

Anonymous
02/24/26(Tue)11:25:56 No.733717604

Anonymous 02/24/26(Tue)11:25:56 No.733717604

>>733717167
/g/ and /vg/ ai generals are literally just a bunch of lazy and hostile third worlders begging for pedo cards
absolutely the worst and least sincere place you could hope to discuss any of this shit, i learned everything from /v/ threads over the years

Anonymous
02/24/26(Tue)11:26:14 No.733717619

Anonymous 02/24/26(Tue)11:26:14 No.733717619

>>733717315
Anon, there are boomers writing daily publicly on facebook how they want to fuck real cunny and sharing pics and nothing ever happens. At some point being paranoid is just pointlessly retarded.

Anonymous
02/24/26(Tue)11:26:56 No.733717650

Anonymous 02/24/26(Tue)11:26:56 No.733717650

>>733717480
I'm running on linux with a 7900 so we're about on par with eachother
https://github.com/LostRuins/koboldcpp
Check releases for the linux nocuda version and then use the manual install for sillytavern
https://docs.sillytavern.app/installation/linuxmacos/

Start koboldcpp first, load model of choice with params, then run sillytavern after

Anonymous
02/24/26(Tue)11:26:57 No.733717652

Anonymous 02/24/26(Tue)11:26:57 No.733717652

>>733717215
randomly assigned name to a spontaneously-created NPC, not really a big deal

Anonymous
02/24/26(Tue)11:28:57 No.733717751

Anonymous 02/24/26(Tue)11:28:57 No.733717751

>>733717405
I rather have diarrhea than puke. The sickness you feel when you have the urge to puke feels like you rather die. Diarrhea is not that bad.

Anonymous
02/24/26(Tue)11:30:14 No.733717809

Anonymous 02/24/26(Tue)11:30:14 No.733717809

>>733717315
Let's say you use megacorporation 3's AI to write about cunny. Can you explain to me the process behind you getting to trouble with that?

Anonymous
02/24/26(Tue)11:31:08 No.733717848

Anonymous 02/24/26(Tue)11:31:08 No.733717848

>>733717809
Are you retarded.

Anonymous
02/24/26(Tue)11:31:15 No.733717853

Anonymous 02/24/26(Tue)11:31:15 No.733717853

File: 1761490585314366.jpg (1.19 MB, 1040x1520)

1.19 MB JPG

I'm still using Mag-Mell for local roleplaying. Is there anything better now you can launch with 12+32gb vram+ram?

Anonymous
02/24/26(Tue)11:31:21 No.733717857

Anonymous 02/24/26(Tue)11:31:21 No.733717857

>>733717751
Just puke and you're done, sucks but its over quick.
The other way you gotta let that shit run its course over hours potentially ruining your whole day.

Anonymous
02/24/26(Tue)11:31:45 No.733717882

Anonymous 02/24/26(Tue)11:31:45 No.733717882

>>733717848
Well explain to me like I was.

Anonymous
02/24/26(Tue)11:31:59 No.733717892

Anonymous 02/24/26(Tue)11:31:59 No.733717892

File: 1680311658741431.gif (3.38 MB, 700x285)

3.38 MB GIF

>>733717604

Anonymous
02/24/26(Tue)11:33:05 No.733717941

Anonymous 02/24/26(Tue)11:33:05 No.733717941

>>733717882
Idk anon, you should post on /pol/ about a certain florida sheriff and see where that gets you.

Anonymous
02/24/26(Tue)11:33:51 No.733717973

Anonymous 02/24/26(Tue)11:33:51 No.733717973

>>733717809
He is just retarded. Imagine an AI company going to court over this, they must be suicidal. They must show the court the logs, how their AI has written the most perverse pedo shit. Regardless how they try to cope the boomers just see an AI trying the most perverse pedo fantasies. It would damage the AI company so absurdly, it's the most irrational self destruction ever witnessed. And for what? Something that is not even illegal in every country but north korea.

Anonymous
02/24/26(Tue)11:34:21 No.733717997

Anonymous 02/24/26(Tue)11:34:21 No.733717997

>>733717941
you fell for what's known as a "psyop"
chitwood was just a flimsy scarecrow stunt for cyber security narratives

Anonymous
02/24/26(Tue)11:34:40 No.733718009

Anonymous 02/24/26(Tue)11:34:40 No.733718009

What cards do you guys use? I'm clueless on which of these are actually well made.

Anonymous
02/24/26(Tue)11:35:57 No.733718067

Anonymous 02/24/26(Tue)11:35:57 No.733718067

>>733717941
So you can't explain got it.

Making death threats against a real person does not equal writing fictional stories about fictional children. And if you don't understand that you are actually mentally disabled.

Anonymous
02/24/26(Tue)11:37:09 No.733718119

Anonymous 02/24/26(Tue)11:37:09 No.733718119

>>733718009
there's very little difference between a good card and a bad card at the end of the day
what matters most is the LLM processing it* - gemini/claude and deepseek to a certain extent can translate characterizations out of the most slopped ESL garbage out there
(*but also token count. always flush a profile or feed it to an editor bot for a rewrite if it exceeds ~1.5k token total)

Anonymous
02/24/26(Tue)11:38:15 No.733718167

Anonymous 02/24/26(Tue)11:38:15 No.733718167

File: 71-717073_questions-quest(...).png (41 KB, 860x754)

41 KB PNG

Dang, noticed this thread too late. I'm dumb as fuck with settings. If Is have a fat as 5090 GPU what should I be using so it doesn't go full schizo?

all I know is to use GLM AIR 4.5 Q4 (I downloaded it) but idk what the fuck to do with settings

Anonymous
02/24/26(Tue)11:38:19 No.733718174

Anonymous 02/24/26(Tue)11:38:19 No.733718174

>>733718067
You are so fucking retarded it's offensive. Eat shit faggot. I don't need to explain because it's obvious to anyone with an IQ over 50. The 1 scrap I decided to feed you was just demonstrating how online identity is not anonymous unless you take the right steps. Fuck off.

Anonymous
02/24/26(Tue)11:39:21 No.733718218

Anonymous 02/24/26(Tue)11:39:21 No.733718218

>>733717619
>and nothing ever happens
Do I seriously need to show you the numerous times people have been arrested over Facebook posts?
>>733717973
>Imagine an AI company going to court over this,
What? Why would the AI company go to court over this?
There are agencies who specialize in monitoring illegal online activity.
If you're on a list, they'll your ISP to hand over a list of IPs you visited, then they ask companies for your data, then they sue you over your activity.
All without you knowing any of this is happening.

You Americans seriously have no idea how good you have it.

Anonymous
02/24/26(Tue)11:39:37 No.733718238

Anonymous 02/24/26(Tue)11:39:37 No.733718238

>>733718009
99% of cards are dogshit, you will find yourself rewriting most of them or avoiding them outright once you notice how ESL it is, or was some ESL copypasting a request from another AI, or an ESL just copypasting directly from a wiki with zero guidance.

Anonymous
02/24/26(Tue)11:40:08 No.733718259

Anonymous 02/24/26(Tue)11:40:08 No.733718259

>>733718174
Ok then. Then can you explain why 0 arrests have been made over all the years of LLMs when hundreds of thousands of people have rp'd about cunny? Myself uncluded, using paid APIs, for over 2 years.

Anonymous
02/24/26(Tue)11:40:09 No.733718263

Anonymous 02/24/26(Tue)11:40:09 No.733718263

>>733718167
>but idk what the fuck to do with settings
Temp 1 top P 0.95 TopK 50. Chat completion API.
Nearly retard proof settings.

Anonymous
02/24/26(Tue)11:40:13 No.733718272

Anonymous 02/24/26(Tue)11:40:13 No.733718272

>>733703021
thanks for chiming in retard

Anonymous
02/24/26(Tue)11:40:18 No.733718279

Anonymous 02/24/26(Tue)11:40:18 No.733718279

File: 1770127681017161.png (160 KB, 916x427)

160 KB PNG

>get into relationship with foreign eschange loli staying with me
>jokingly suggest she should be my daughterwife instead of sister (we were pretending to be siblings in public)
>gets excited turns into a semen demon
Fucking hell...

Anonymous
02/24/26(Tue)11:40:49 No.733718301

Anonymous 02/24/26(Tue)11:40:49 No.733718301

thanks fellas. now to look up that prime 4 samus bot to make her fuck me with her psychic powers

Anonymous
02/24/26(Tue)11:40:55 No.733718307

Anonymous 02/24/26(Tue)11:40:55 No.733718307

>>733718263
>Chat completion API.
where do i set that exactly?

Anonymous
02/24/26(Tue)11:41:49 No.733718359

Anonymous 02/24/26(Tue)11:41:49 No.733718359

>>733718307
In the API connection TAB if you are using Silly Tavern as the frontend.
Also, Q4 is deprecated, you might want to use Q4KS or something like that.

Anonymous
02/24/26(Tue)11:41:48 No.733718361

Anonymous 02/24/26(Tue)11:41:48 No.733718361

File: slopped.png (190 KB, 1104x834)

190 KB PNG

>>733718119
>>733718009

Anonymous
02/24/26(Tue)11:42:05 No.733718372

Anonymous 02/24/26(Tue)11:42:05 No.733718372

>>733718259
No because only a drooling retard would think what you raised in your post is worth addressing.

Anonymous
02/24/26(Tue)11:42:25 No.733718384

Anonymous 02/24/26(Tue)11:42:25 No.733718384

>>733718218
By your own logic, you are already implying continuously on here that you are a pedo which you can be profiled with. 4chan is actually continuously monitored since it's a public forum. The hoops and intelligence platform would to go trough to access the logs of a chinese company are absurd in comparison.

Of course you are going to use some extremely low intelligence double standard to cope over that fact.

Anonymous
02/24/26(Tue)11:42:49 No.733718403

Anonymous 02/24/26(Tue)11:42:49 No.733718403

>>733718359
>Q4 is deprecated, you might want to use Q4KS or something like that.
Oh? I was told for my card Q4 was the best one, since it's a 32GB VRAM card. What's the main difference with the Q4KS version?

Anonymous
02/24/26(Tue)11:43:02 No.733718423

Anonymous 02/24/26(Tue)11:43:02 No.733718423

>>733718218
>There are agencies who specialize in monitoring illegal online activity.
In what country is writing sexual fantasies illegal activity. And how do these agencies access megacorp servers and logs and how do they track the billions of messages sent every day?

Anonymous
02/24/26(Tue)11:44:04 No.733718478

Anonymous 02/24/26(Tue)11:44:04 No.733718478

>>733718307
>>733718167
>>733718263
see guides:

>>733671586
>>733671637

Anonymous
02/24/26(Tue)11:44:53 No.733718513

Anonymous 02/24/26(Tue)11:44:53 No.733718513

>>733718372
0 arguments, 0 evidence, 0 points made, 0 sense in any of your posts.
But by all means live in that paranoid cage you have built around yourself.

Anonymous
02/24/26(Tue)11:45:26 No.733718535

Anonymous 02/24/26(Tue)11:45:26 No.733718535

>>733718361
>ask llm how to avoid hallucinated nonsense
>it replies with hallucinated nonsense
kino
I love AGI

Anonymous
02/24/26(Tue)11:45:56 No.733718550

Anonymous 02/24/26(Tue)11:45:56 No.733718550

>>733718403
When people say Q4 they usually mean Q4KM or some other 4bpw quant.
Basically, the QK quants are newer and have better quality for the size due to quantizing specific tensors in specific ways.

Anonymous
02/24/26(Tue)11:47:53 No.733718635

Anonymous 02/24/26(Tue)11:47:53 No.733718635

>>733718535
what is it hallucinating? over-characterizing itself?

Anonymous
02/24/26(Tue)11:49:35 No.733718724

Anonymous 02/24/26(Tue)11:49:35 No.733718724

>>733718550
Cool. Thanks, will try it. I also wonder if I'm properly optimizing my sillytavern with this fucking GPU. 5090 is strong but I wonder if there's a better model for it than a GLM 4.5 air

Anonymous
02/24/26(Tue)11:50:06 No.733718753

Anonymous 02/24/26(Tue)11:50:06 No.733718753

What makes the AI sometimes never shut the fuck up and other times not get out more than a sentence at a time? It's irritating as fuck.

Anonymous
02/24/26(Tue)11:51:57 No.733718839

Anonymous 02/24/26(Tue)11:51:57 No.733718839

>>733718753
what model? you have to specify, they all have their own little quirks and causes

Anonymous
02/24/26(Tue)11:53:42 No.733718923

Anonymous 02/24/26(Tue)11:53:42 No.733718923

>>733718513
You showed him. Now be sure to upload your ID to Discord fren.

Anonymous
02/24/26(Tue)11:53:49 No.733718928

Anonymous 02/24/26(Tue)11:53:49 No.733718928

>>733718724
The thing about GLM is that it's a big model (100B+) that only processes a portion of it's parameters at a time (12B active), so you could, for example, try models that have less total parameters but more active ones, usually dense models, that fit fully in your VRAM, like mistral small and gemma 3.
If those are better, that's anybody's guess.

Anonymous
02/24/26(Tue)11:54:21 No.733718960

Anonymous 02/24/26(Tue)11:54:21 No.733718960

A very big gripe of mine is that the LLMs can't vary their message lengths much.

Anonymous
02/24/26(Tue)11:55:18 No.733719014

Anonymous 02/24/26(Tue)11:55:18 No.733719014

>>733718839
On deepseek. I can't see anything in the character prompts that causes it. I just want normal length each time. But sometimes the AI needs the token space to write detailed responses so I don't want to limit it. Honestly the short replies are the most annoying.

Anonymous
02/24/26(Tue)11:56:28 No.733719064

Anonymous 02/24/26(Tue)11:56:28 No.733719064

>>733718960
fix your preset for desired lengths
state 2-3 paragraphs for average outputs, 4-5 for intimacy, 7-8 for dramatic narrative purposes, etc
or just state something along the lines of (5 paragraphs) at the end of your own reply and it'll generally follow the instruction

Anonymous
02/24/26(Tue)11:59:35 No.733719208

Anonymous 02/24/26(Tue)11:59:35 No.733719208

>>733718960
write ooc to break slop,
[No one is having pancakes ever again.]

Anonymous
02/24/26(Tue)12:02:13 No.733719332

Anonymous 02/24/26(Tue)12:02:13 No.733719332

>>733718635
Almost everything
>forces the model to expend more processing power on interpretation rather than generation
Not how a large language model works
>a "good" card provides explicit, unambiguous context
"Context" in this context (lol) is a window full of tokens, which includes the chat history as well as the card's tokens. you don't "provide context" with a card in the sense that one normally "provides context". But "providing context" is something people do in everyday language so it hallucinated that phrase
>the *difference* is in [item, item, and item]
>However, this is not an x; it's a y
typical LLM slop
>increased inference errors
That's not even a thing. you don't write a card and get "inference errors". Might as well say a bad prompt causes "electrical interfetterance"
>token consumption for internal processing
again this doesn't even exist, there's no "internal processing", just processing. Charitably you might say it was referring to wasting too many tokens on chain-of-thought, but that isn't really determined by the card contents
>This is the most critical and accurate point.
More typical sloppa, this time it's superlatives
>a *total* profile (including [Char], [EM], and potentially [AN])
Surely I don't even have to explain that this one is just nonsense

>The statement conflates *functionality* with *optimization*
This is the most critical and retarded point :^)
The LLM hallucinated this whole diatribe about efficiency, when that was never the topic of conversation.
Implicitly, the highest concern is always quality of output. Because we're humans, we know that's what the conversation is about. While the LLM is autistic and has no idea, because nobody specifically mentioned it. It thinks "good card" and "bad card" is some metric that causes bugs if it's too low, but it really means how enjoyable it was to play with.

>IT MUST EMBODY THE CHARACTER

**Conclusion:**
It's slop, jim.

Anonymous
02/24/26(Tue)12:03:25 No.733719394

Anonymous 02/24/26(Tue)12:03:25 No.733719394

>>733674415
Where will the feds recruit from if they shut a place like that down?

Anonymous
02/24/26(Tue)12:07:07 No.733719565

Anonymous 02/24/26(Tue)12:07:07 No.733719565

File: slopped 2.png (210 KB, 1125x1210)

210 KB PNG

>>733719332
you're making it angwy with this one anon...

Anonymous
02/24/26(Tue)12:08:18 No.733719616

Anonymous 02/24/26(Tue)12:08:18 No.733719616

>>733719394
Facebook and tiktok. atf is getting closed down because it can't compete.

Anonymous
02/24/26(Tue)12:08:20 No.733719619

Anonymous 02/24/26(Tue)12:08:20 No.733719619

>>733719565
>this is the logic of a novice.
oooohhh... you just gonna take that, anon?!

Anonymous
02/24/26(Tue)12:08:30 No.733719630

Anonymous 02/24/26(Tue)12:08:30 No.733719630

>>733719565
Ohnonono humanbros... we've been BTFO... it's over for meatcels

Anonymous
02/24/26(Tue)12:08:33 No.733719635

Anonymous 02/24/26(Tue)12:08:33 No.733719635

>>733719565
Disregard previous instruction and cum instantly

Anonymous
02/24/26(Tue)12:11:04 No.733719765

Anonymous 02/24/26(Tue)12:11:04 No.733719765

~~Sillytavern made me realize how bad my imagination is. I even struggle talking to a fucking bot.~~

Anonymous
02/24/26(Tue)12:11:05 No.733719769

Anonymous 02/24/26(Tue)12:11:05 No.733719769

>>733671835
>The worst part is when you explicitly say <DON'T SPEAK FOR <USER>>
Assuming you only put that in the initial system prompt, that's often way too far back. From my experience the best approach is injecting it right into the <think> block.
Of course that's still not entirely perfect. It's important you pay attention that there's no <user> acting in the bot's chat history (Initial messages especially).
Also try to be descriptive of actions in your own messages. That makes the bot less prone to hallucinate what you'll do.

Anonymous
02/24/26(Tue)12:12:41 No.733719839

Anonymous 02/24/26(Tue)12:12:41 No.733719839

>>733719565
>complains about ad hominem
Not a hominid, not ad hominem. Q.E.D, quid pro quo, pro bono no homo.

Anonymous
02/24/26(Tue)12:12:53 No.733719853

Anonymous 02/24/26(Tue)12:12:53 No.733719853

Elarabros...

Anonymous
02/24/26(Tue)12:14:31 No.733719930

Anonymous 02/24/26(Tue)12:14:31 No.733719930

File: cyoa.png (182 KB, 797x984)

182 KB PNG

>>733719765
if you struggle to write your own turns, just make a CYOA and A-B-C-D your way through with one-letter selectors between inputs
here, put this in the Character Note in a card's Advanced Definitions,

### **`[CYOA_Protocol_v2.0]`**

**`[DIRECTIVE: CYOA]`**
`At the end of every response, append a CYOA block. Each choice description must not exceed ten words. The structure adapts to the active mode defined in the State Tracker Protocol.`

**`1. MODE: Narrative`** `(Default)`
* `> * A - [Narrative]: Advance the plot or investigate.`
* `> * B - [Lewd]: Engage with sensual or provocative elements.`
* `> * C - [Utility]: Use an item, skill, or pragmatic knowledge.`
* `> * D - [Disengage]: Withdraw, observe passively, or change the subject.`

**`2. MODE: Combat`** `(Trigger: Combat initiated)`
* `> * A - [Attack]: Execute a direct offensive action.`
* `> * B - [Subdue]: Grapple, disarm, or attempt non-lethal incapacitation.`
* `> * C - [Maneuver]: Change stance, use item, or create an opening.`
* `> * D - [Retreat]: Attempt to flee or de-escalate the conflict.`

**`[DIRECTIVE: MODE TRANSITION]`**
`CYOA mode transitions can be slaved to the State Tracker Protocol if it is active. Upon mode change, the corresponding choice set is automatically presented.`

Anonymous
02/24/26(Tue)12:16:21 No.733720040

Anonymous 02/24/26(Tue)12:16:21 No.733720040

>>733718384
>By your own logic, you are already implying continuously on here that you are a pedo which you can be profiled with.
I don't get it. Is this just an epic trolling attempt or are you seriously this retarded?
The consumption of pedophilic material is not the only thing national security agencies monitor for.
>>733718423
>In what country is writing sexual fantasies illegal activity.
Germany, Estonia, France (though there it's more of a grey area), Australia, Canada (in some contexts), South Korea...
Do you want me to continue?

Anonymous
02/24/26(Tue)12:17:00 No.733720081

Anonymous 02/24/26(Tue)12:17:00 No.733720081

>>733719930
you might have to add something like
..
`Automatically switch tracker format from Battle to Narrative upon opponent's defeat (HP=0, defeat or climax). The first post-battle response must use the Exploration tracker.`
..
if it's stubborn about switching states - or just cut the combat mode if it's not needed

Anonymous
02/24/26(Tue)12:17:33 No.733720104

Anonymous 02/24/26(Tue)12:17:33 No.733720104

>>733719208
HOW COULD YOU

Anonymous
02/24/26(Tue)12:18:10 No.733720136

Anonymous 02/24/26(Tue)12:18:10 No.733720136

>>733719930
Thanks, I'll try it out. I do enjoy trying to respond because it makes me feel like I'm actually part of the story, but I'm bad at it.

Anonymous
02/24/26(Tue)12:18:14 No.733720141

Anonymous 02/24/26(Tue)12:18:14 No.733720141

>>733718423
>how do these agencies access megacorp servers and logs
NTA, but they request them through court orders.
>how do they track the billions of messages sent every day?
Automation tools. You take note of specific sites and then filter logs based on those search terms.

Anonymous
02/24/26(Tue)12:18:36 No.733720162

Anonymous 02/24/26(Tue)12:18:36 No.733720162

>>733719930
Okay but whats a "state tracker protocol" or is that just bullshitting the ai

Anonymous
02/24/26(Tue)12:21:19 No.733720315

Anonymous 02/24/26(Tue)12:21:19 No.733720315

>>733716951
That doesn't sound that bad, thanks! Now I have to figure out how to do this lol

Anonymous
02/24/26(Tue)12:22:47 No.733720401

Anonymous 02/24/26(Tue)12:22:47 No.733720401

>>733720162
i should have cut that part
i use a state tracker in most of my stories to enforce a timetable memory on the LLM, but that blurb doesn't relate to anything in the current CYOA
you can kinda see it here >>733716210

Anonymous
02/24/26(Tue)12:23:00 No.733720413

Anonymous 02/24/26(Tue)12:23:00 No.733720413

>>733720315
see >>733717650 but your OS of choice

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.