>limitless options but limited by imagination
>AI slop
>loads common sense modification cardYep, it's gaming time.
>so limitless it makes your breath hitch
>>739814072>it struck you, like a physical blow
>every character ends up sounding the same eventually no matter how hard you try specially during sexmeh, it was fun for a few months tho
>>739814037may I see it?
>>739813919>have to tweak a dozen sliders, not knowing what the FUCK they actually do, in the hope of getting less slop responses>your knuckles whiten as you clench your fist, breath hitching when you see the same fucking garbage despite your effortsI don't know how people can work with this thing.
It can be fun but I still really prefer visual aid in my fap sessions, Im just too masculine like that
>>739816721slop in slop out at it's workretards don't know how to write bots
>>739816757and no generating an AI image of the scene doesnt cut it, it looks like shit anyways
>>739816757
>limitless options>limited by every single company prioritizing "safety" and shit prose
>>739816880Nah I love books but its mostly women who can cum only with words
>>739813919>limited by imaginationmore like limited by cash money. i'd write stories millions of tokens long if cost was no object.
Fapping to AI is so damn good I think it's honestly become damaging to my health.
Wtf does ozone even smell like?
>opus 4.8 now has openai tier censorship
Openrouter got rid of grok 4.1. I don't want to spend a fortune on the newer models. Deepseek is dirt cheap, but it has a habit of dropping responses that are usually expected. Like when a woman talks dirty during sex no matter what type of charcter, they'll usually say things the same way.I wish there was a better alternative to deepseek on openrouter that cost about $.006~ a response. Once it gets to be more than 1 cent a reply, the swipes really feel shitty when you're trying to get the response you want. Claude is fucking hot, but 10 cents a response is brutal.
>>739814037Please explain what this is
>>739813919>>739814037no cards because i live in australia
>>739814072>>739814514>>739816497>>739816721I was writing novels worth of megastructure stream of consciousness exploration slop on GPT-2 with AI Dungeon, and you kids these days can't even figure out how to coom with, what, GPT-4o or something? Skill issue PEBKAC fuck you.
Come home......
it hardly writes the way I want it to. Also it's impossible to game-ify it. you can build a massive sys prompt for TTRPG but the more space taken for them the worse the prose is. Maybe there's a slowflake model that can handle it but meh I want to keep local.
>>739820385All the models and finetunes I've tried are very similar to each other in terms of 'creativity' and prose.Gemma 4 31b q8, Stepfun 3.5 q4, MiMo 2.5 q4, GLM 4.7 q4, Kimi K2 q3, Kimi K2.5 q2, Mistral Large q8, Mistral Medium 3.5 q8, Command-R+ q8, Qwen 3.6 27b q8, Qwen 122b q8, and Qwen 3.5 397b q4.I think I had the best time with finetunes of Yi 34b at 4bpw, years ago.
>>739820787Try Unholy
>>739813919>haven't been able to scrape any claude keys for weeks nowit's so joever>>739819103qrd? never had a single issue with claude refusing anything, didn't even have to use a jailbreak prompt or careful wording>inb4 "oh no it censors pedoshit and violent rape scenarios now"
>>739820385I can coom its just underwhelming comparted to actual games
>>739819103I get near unlimited Claude tokens from work and I've almost completely moved to Deepseek V4, the golden Opus days of ~2.5 are gone. Every version since 4.5 is getting more sanitized and soulless, they really dropped the ball.And aside from the open weights being cool, Deepseek openly acknowledges roleplay and storytelling users.
>>739820956deepseek is constantly giving me blank answers because its the most popular one now and its overloaded most of the day
>>739820956Is Opus available to download and run locally if someone has a powerful enough rig, or is it potential lost media now?
>>739820956NTA but V4 isn't terrible, it just feels like it's trying to imitate Claude too much. Hell, even the thinking block looks like it was distilled directly from Opus's "thought summaries" in the way it keeps writing in short first-person blocks of "I need to..." and similar, with very little actual substance.But frankly it IS still "good enough" for cooming, and 100x cheaper than Claude, so I'll take it.
>>739821165>Opus available to download and run locallylollmao even
>>739820903I did. Preferred the schizo Yi tunes to it. The problem with modern models is that they've been optimized so much finetuning doesn't really add anything to them. They get a new slop profile, but at the expense of the model's performance. Long context performance is measurably worse with tunes.
>>739820913Not even pedoshit, man. I had was in a fucking high school scenario, character was 18, nothing had even happened yet, and 4.8 got upset just because I mentioned a sexual encounter in her defs. Just because they were in high school. Oh, and it had issues with her being a bully character because that's "abusive".It's over, nigga. Claude has the morality of a fucking tumblr user now.And what do you fucking know, they hired the woman who destroyed GPT.https://www.theverge.com/ai-artificial-intelligence/862402/openai-safety-lead-model-policy-departs-for-anthropic-alignment-andrea-valloneShit is fucked for the foreseeable future, man.
>>739820913Local is the answer
>>739821118are you not using the api? only one out of a hundred swipes are blank for me.
>>739821165Hahaha.
>>739821438What do you mean, Im on ST
It's pointless because the better AI gets the more like an actual average cringe woman it sounds
>>739821507...yeah? are you using the deepseek api on ST?
>>739821507What backend are you hooking up to ST? Openrouter? If yes, which provider? You should specify you want to use the official Deepseek API because otherwise it may route you to third party providers with shitty system prompts and filtering.
>>739820956I've had a good amount of success with Gemini 3.1 as well. You just need the right prompt. My main issue with opus nowadays is that it just refuses to listen to your orders or the prompt, it just does its own thing no matter what, this wasn't an issue in the least like 5 months ago.
>>739821567obviously how else would I use the model otherwise?
>>739821424Best I can do is Gemma 4 E2B IQ1_XXS.
>>739821602Ah I see thanks"model providers" was blank
>>739821636
>>739821424local still mostly sucks compared to api thoughi've had some success with gemma4 31b @ q5_k_m on my 3090, but it's still very limited in general knowledge and has some issues understanding things that should be implied or otherwise logical/reasonable given a certain scenariogranted, deepseek v4 could technically be "local" too, it's mostly just an issue with availability of good inference hardware
>>739821773Have you tried deepsex v4? I've been wondering if I should download the weights and build that llama fork to run it.
>>739821165China holds the Mandate of Heaven when it comes to free, open weights models (ie. the potentially local). Moonshot, Z.AI, Deepseek, all their big players release their weights.Across the ocean, Americans only ever released LLaMA, which kickstarted the early local finetuning scene (remember noromaid?) but is insanely outdated and Meta dropped development.There's also command-r and Mistral I guess but you're not gonna use those.>>739821936V4 is a trillion parameter model, how the FUCK are you going to run it locally?
>>739821989>>739821989oh you said llama fork never mindllama 5 never ever, death to plasma scammers anyway
When I see someone else use Opus I just see them as a pseud. Like, they want to be at the top doing the best chats possible that money can buy, but they're using an overpriced model that's getting increasingly censored and assistantmaxxed, believing the reason Opus was used before didn't have to do with the fact it genuinely was the best model and was easy to acquire for free.
>>739813919>got banned back when it was 4 turbo and half the conversations ended with or included the word "journey" if you misspoke even once despite all the jailbreak>limitlessYou either give me the secrets to alleviate my loneliness that isn't paid/local or I call you a jeet, fuck out of here man.
>>739821773njewdia is causing local limitations with the prices, even if all I'm doing is big titty milf RP every single time, I'm still never sending that shit over the internet.a 3060 with 12gigs of vram is still good enough for local and can run models upto 24B at an acceptable speed.
>>739821773I'm only asking because in my experience, Gemma 4 31b at Q8 has been much better than MiMo 2.5, which is a 300b, at Q4_K_M.>>739821989There's the loli V4 Flash version that fits in consumer hardware.
What are some interesting cards you guys have come across or made yourself? Stuff like text-based game simulations that try to do something similar to era games or cards that use html formatting to hide information from the player, things like that The amount of low quality shit uploaded on chub makes it hard to find anything worthwhile on there
>>739821936i've messed around with v4 via api a bit, i'd say i mostly agree with >>739821167it gets the job done and is a lot better than gemma 4, but i've never tried running some lobotomy-tier quant of it locally if that's what you're askingi imagine the latter isn't such a great experience
>>739822306>text-based game simulationsModels are good enough to vibe code up a text game engine and frontend for themselves. Don't try to rawdog a cyoa game in chat.
>>739813919Any good places to browse cards now that chub is in the process of killing itself?
>>739821663Welp that didnt fix it at all
>>739818647Like ozone
>>739822419Haven't messed with vibecoding since I never had a reason to use it, but now it seems like I do. Are free models good enough or will I need to spend money on tokens for creating something like this?
>>739822415I've got a 768gb ddr4 and quad v620 rig that should be able to run it slowly, but if it's not going to blow my mind and dick then I think I won't bother and save my ssd's writes.
>>739822615I don't know about api models, but Gemma 31b and Qwen 3.6 27b (3.7 should be releasing soon though) are good enough for small projects if you hook them up to a web search mcp server. Don't quant them though, tnstaafl.
I use NovelAI and I coom a lot to it... You just have to tard wrangle it because it will repeat words and phrases and descriptions a lot. But as long as you clean those up, it should be serviceable as it will pick up your style. Actually I don't even write anymore directly, I just put what I generally want to be happening, a few pointers, and it writes it for me. It is actually stunting my writing skills lol. It's over for me.
>>739816757>He doesn't perfect the image of the girl in his mind and generate her in advance using flux and then make her move using wan 2.2, all uncensored Skill issue
>She was X.>Not Y, but X.Fucking STOP
>>739813919>always treated local like a meme>finally give a heretic variant of gemma 4 26b a try>it's actually pretty nice for coomingit fucks up fairly often with "it's not just x, it's y" though
>>739824983>26b somehow writes better with a broader vocabulary >31b is more consistent with situational awareness of location, time, physics Fuck me
I haven't had problems with DeepSeek v4, I have done both full degen coomer stuff and cool adventuresyes you have to wrangle it but that is true for literally every model, that's just how LLMs work
Deepseek v4 seems like it has potential to me but I don't have a good preset for it. At this point I don't know if a good one would be smaller or larger.
is V4 Flash shit?
>>739813919>Keep trying to wrangle the AI to stop giving me so much melodrama and purple prose>IT HITS ME LIKE A PHYSICAL FUCKING BLOW
>>739813919>AI copies your prose so every story reads the same>Due to above+model token weight every character ends up with the same personality and speech patterns however you write them>AI either never takes initiative or follows the same set of tropes/structure baked into the modelMade for good cooming for about a month but eventually the magic breaks. It got so stale I had to go back to deviantart/pixiv ESL smut to scratch my newfound erotica itch.
>>739826171>AI copies your prose so every story reads the sameI wish it fucking did that.
>>739826171bro? your lorebooks? your system prompt?
Recommend me a good model that runs on a 5070 Ti 16 GB and 64 GB RAM. I've been using GLM 4.7 Flash, but it's just not very good for roleplay. It tends to ramble and repeat itself, it feels it has terrible creativity when playing a character.
>>739826341Gemma4 31bwith jailbreak
I always just end up mind controlling moms and daughters.
>>739820380VPN?
>>739816721Temperature is all you need
>>739822532It seems everyone moved to botbooru. It's still full of slop (and mass imported slop from chub) but the dev is very active and asks for suggestions and promised to never ban anything, so that's what you get.The default UI has shitty defaults (no tag or creator notes previews on gallery view) but it has options to change it.
>Still the best c.ai but for goonersShame about the moderarion going full schizo and banning everything they consider offensive
>>739827250>caiwhat is this, 2022?
Redpill me on lorebooks.Until now I've usually relied on the API models' built-in knowledge of various IPs and franchises, but with smaller local models this doesn't seem to be as good.How would one pack in ALL the necessary knowledge one needs for a session - do I literally have to look up every specific detail (eg. "how does a certain sex position look") and make a page about it? I could see that being necessary if I want it to describe a character's outfit correctly or something similar as well.Do I have to write or summarise it all myself or can I scrape a wiki and put it in there as-is?And won't all this information bloat the context window to a ridiculous degree?
>>739827250Forgot my picture like a dumbass
>>739827179A booru doesn't work at all. The font is already tiny, but you have to mouse over everything to actually see what the cards are for.>Newsflash for the webmaster: the image isn't great at depicting the quality of the actual characterYou can't really search based on text too, this website has potential though, just hope the guy owning it isn't too retarded at following exactly what makes a booru. And recently, some images aren't loading anymore, it's not like you can F5 to refresh them.
>>739823764No ;)
>>739822276loli v4 flash? what?
>>739827321>do I literally have to look up every specific detail (eg. "how does a certain sex position look") and make a page about it?No.Remember that LLMs are context machines, your average model already knows what a 69 is. Maybe the ultra compressed local ones won't know what full nelson is, but the 1000B API ones will.The point of lorebooks is to give context to specific terms that are not in the model's data (because it's fiction that you made up) but insert them selectively so they don't influence your conversation too much.For example, if you want to write that your fairy character comes from a parallel plane of existence called Crimsonwood, the AI won't have any info on that because you just made it up.If you insert it into the card as >{{char}} originates from Crimsonwood, which is [description]the character may bring it up out of nowhere because it's in its defs. But if you unload it into a lorebook entry like>tag: crimsonwood, the red place, homeland>{{tag}} is [description]It'll only be sent along with the prompt when one of the keywords is used, and this context is necessary for the conversation to continue. That way you can write an encyclopedia about your setting, character, items, locations etc. without clogging it up. You save money on input tokens like this too.
>>739827321If you're going to wiki scrape look up RAG and how ST uses it. Lorebooks are mostly useful for specific topics or characters that you don't want hogging context all the time. The main issue is that you'll have to be the one to trigger them manually with key words so you don't trigger fifty at once and context overflow. You could do an OOC: next response should be about XYZ area. Or have your character ask about a specific thing.
>>739827743This also works when you want to have two characters in the same setting, you unload the information regarding the setting to a lorebook, then you can use the same lorebook with multiple characters and save on having to include the same info on everyone's defsYou can also do more anal acrobatics with lorebooks (inserting them at a specific depth, recursively triggering lorebook entries from other lorebook entries, having them be inserted with a specific probability instead of being guaranteed etc.) but that's advanced technomancy.
>Use AI for English and Japanese ERP>Swap between the two whenever I get burnt out on the slop of one language for a constantly fresh experience >Anime/vidya characters feel more in character with JP dialog>Will be able to use the original VAs once text to speech gets good enough to run instantly without wasting tokens on subtitles/translationWhy haven't you learned Japanese yet?
>>73981676999.9% of all character cards uploaded to rubbish sites like chub are completely AI generated, filled with its characteristic meandering padding that the models gets hung up on.
>>739827898>You could do an OOC: next response should be about XYZ area.This will nuke the quality most of the time, not recommended. It's better to manually write a sentence in the voice of the character describing what you want to happen or talk about, assuming you use chat completion, then tell the model to continue the response.
>>739826445That will not run on his system.
>>739828202IQ4 will.
>>739814072>>739814514>They do not have to say it. They do not need to.
thread smells of ozone and need
>>739828476Ball and court status?
>>739828476I don't bite...unless you want me to.
I hate gemini so fucking much man>keeps using it anyway
>>739828609>sloppers when a chat doesn't have a ball or a court in it
>>739828739I liked 2.5 pro because it was fucking evil and actively attempted to kill you.
>>739813919Alright, after some frustration I want to give local another attempt to not be beholden to shitty online services.Gemma 4 26B_A4B Q8 should run fine on my 9070xt, how much difference is there really between all these abliterations? Can I just pick whichever 'uncensored' one?And more importantly what steps do I take from the start to get the least sloppy experience? I'm not some incredible writer, that's why I want the LLM to do the heavy lifting here.
I'm pretty happy with DS4, it's about what I wanted out of the model, but I really hope they step it up further with 4.1 in regards to creative writing stuff. I kind of burnt myself out on it as is.
>>739813919I've been wanting to get into this but I get too embarrassed after hitting send I can't even look at the output
>>739830454Learn to embrace the cringe. Shame isn't something experienced, it's something others cast upon you.
The guy who makes the ENI prompt used to be a massive Claude glazer, but even he is moving on. Nowadays he recommends Kimi 2.6 and GLM 5.1. Any opinions on that? I was surprised to see Kimi being recommended as the best model - hadn't really heard much about it before.
>>739818647That smell in the air after a thunderstorm.
>>739831249I'm personally a fan of GLM, it's kind of a walmart Claude. It's a bit bland at times because of that though. Kimi is alright, haven't experimented with 2.6 much because it likes to think for a over thousand tokens, sometimes upwards of 2000, for a 200-300 token output, it's ridiculous.
>>739829606use the official model it's already uncensored, just say that you're an adult and everything goes in the sys promt. abliterated and other meme tunes rape gemma-chanmin_p: 0 and top_k: 64thinking is a meme with gemmas, disable itshe's good at listening your prompts, tell her you don't want to see X or to rarely use allusions instead of he/she/{{char}} and it'll usually workmodels these days don't need retarded token wasting character card formatting like xml or the abomination used in ST's default whore, write naturallyas usual, when you notice slop, cut it out or it'll show up more and moreyou can run 31B Q4-5 at a decent speed too, for a better start you can start a chat with it and switch to 26 after a bitboth 26 and 31 are very deterministic, swipes will be similardon't know why but I feel like 26 is better at image captioning
>>739831249They're both good but have their caveats. K2.6 has insanely good vision and general knowledge but its tendency to think for ages makes it a pain to use. I'm also personally biased in favor of GLM5.1 because it's the one of the two that happens to handle my very specific /d/ niche how I like it. DSv4 is also worth checking out but I haven't really gotten that one to work for me.
>>739831653>use the official model it's already uncensoredis it really? i expected it to refuse all kinds of nsfw
>>739831653Thanks, I'll give it a go. What's a good context size to use in general?>you can run 31B Q4-5 at a decent speed tooHadn't thought of switching between models. Are 26 and 31 noticeably different in terms of their output?
>>739820304He means like in doujins where a guy does things like fuck women out in broad daylight and nobody around them questions that being normal.
>>739820571cai is at least free and sometimes has built in voices that range from "good" to Microsoft Sam. But god, the censorship and absolutely shit website where you'll never find anything interesting if you don't directly search for it.
I'm having a blast so far.But the general history of the chat is quickly being forgotten, so I basically have to edit nearly every message of the characters in mind.Maybe using hordes have to do something with it...
What's the closest anyone's come to making a playable game in SillyTavern?
>>739813919>limited by the LLM usually taking the same types of stories in the same direction and constantly using em dashes>have to edit nearly everything so you are basically just writing your own story
>>739833736yes, seems like they finally learned how much extreme safety cripples models. 26 is a tiny bit more strict than 31, probably because it has a dedicated safety expert, but lets you do whatever you want anyway with the simplest prompts that can't even be called jailbreaks>>739833858idk depends on what you're doing, for me 16k is enough for a quick coom desu. just check your usage from time to time and adjust it from there.>Are 26 and 31 noticeably different in terms of their output?kinda yeah, at the end of the day 26 is a MoE model, it only has 4B active params. it is impressive compared to ye olde shit like mistral but 31 is the one that feels like a true leap
>>739834381Can't you just write a better author's note to avoid LLM cliches?
Anyone tried the Megumin Suite addon? Is it any good?https://github.com/Arif-salah/Megumin-Suite
>>739837868sorry i don't use vibe coded content
>>739817097women can see the apple but you can't? stunning and brave
>>739818647dragostea din tei
>>739828086Speaking of, how would the cheap models do with assisting jap learning?I'm always on/off with learning it, but I thought it would be fun to have a cute girl bot explain how I'm fucking up my grammar.
>>739838876I can see it just fine, images are still better
>>739820956>they really dropped the ballYou could say that the ball is in Deepseek's court now.
>>739835252>What's the closest anyone's come to making a playable game in SillyTavern?i know what you're getting at but imo, storytelling and adventuring with a llm constitutes a vidya game just by itself. it requires a computer to play and it's used for "entertainment" purposes. that's enough. it doesn't need to be super structured to be vidya just like a very open sandbox game that lets players fuck around freeform with no win or loss conditions would still be a game too.
>>739838910why bother with a cheap model?
>>739839438I like deepseek
I don't care about porn.Can it make decent stories? I want some good vs evil kino, but from the perspective of the evil side.
>>739839572With LLMs it's more like can you come up with a decent story? Because shit goes in and out if you don't know what you want. Never rely on AI for creativity.
>>739839681Fair enough.
>>739839572Yeah, that kind of stuff can be fun. You do however have to pace yourself and tailor or wording or the prompt to allow you to fail, since otherwise it will let you do whatever and take your word as law. Do that and you can have some fun fights and such.
I'm fucking tired of Sonnet's baked in morality.Everyone ends up milquetoast and soft
>>739839993I guess yeah. I was more looking for short stories and stuff.
>>739813919Is it possible yet to have actual vidya mechanics with properly tracked stats, numbers, and RNG, or is it still just a word generator making up shit on a whim
>>739813919I use this while playing Skyrim or Morrowind and recreate my RP characters and have it expand upon the autistic back story i gave them. Sometimes I'll take cards and edit them to fit the Elder Scrolls or go as far as writing Lorebooks for my OC.I rarely coom to any of it anymore, just edge sometimes.
>>739840672>Is word generator still just a word generator?
>>739840672You can but it's pretty laborous. It will run out of context relatively quickly in any long form story. You will have to unironically keep notes like some sort of GM and treat it like a retarded minature GM as it tells the story. You'll need to remind it of stuff constantly.
God when Gemini struggles with something it fucking STRUGGLES.
>>739841191The only thing it struggles with is non-con, as it should.
>>739841961I don't mean filter I mean content.If it has even the smallest trouble with a cards gimmick then it just shits the bed.
>>739840672Nope, best you'll ever be able to is direct commands with '>' and that only works on models specifically trained to work with those.
>>739840751I mean shit like>punch a random NPC>calculate the stats of you and your held items>NPC either explodes into a bloody mist or beats your ass up depending on stat differenceinstead of>NPC folds immediately because the model said so
>>739842202for me it's {OOC:}
>>739842594Well then have world info with npc stats and so on. Give the card something to work with. AI needs to know if {{user}} is just another character or the director whose every wish is {{char}}'s command.
>>739840274Its just as bad on Opus too. You have to use a custom prefill prompt for it. But then I can actually use it.
>>739813919I spend most of my time making bots for myself and using this.Than I actually do playing video games, last game I actually played for over 100 hours has been Vintage Story. This shit is way too addicting. Even if the responses from some models can be a bit samey after a while. But then I'll just switch models. It almost like that guy digging in a diamond mine image but the diamonds all around him instead. I was in the original AI dungeon threads in 2019, and I honestly did not expect this shit to take off as much as it did.
>>739813919Also limited by the AI having alzheimers
>>739842926You're literally me.
>>739840672LLM have the ability to call functions, but those functions can't do anything in the context of a frontend web app like ST.If you make a game that interfaces with LLM's and have those LLM's call specific pre-written functions in the context of your game, then yes. Your LLm can perform things as if were a game. But ST is not a video game, its just a chat frontend that can render custom HTML + CSS and sandboxed JS code.
>>739814012It's always the creatively bankrupt who say this. No surprise since "AI slop" is their only vocabulary in their caveman brains.
>>739820737You guys do know you can make it output in a JSON format right? Not all models can do it but all of the notable ones do.https://developers.openai.com/api/docs/guides/structured-outputs
>>739843165For me, it's the tall rat girl
>>739843165>>739840672For further context.This is the limit of what you can actually do in ST in terms of custom JS:You can only use very minimal DOM related event handlers, and even then you can only use it as if it were in a sandboxed environment. Not only do you have to write the entire HTML + JS code into the card itself, it also has to be formatted the same way as if you were creating it in Notepad or text editor of choice. It would very quickly snowball your token count to the tens of thousands very quickly, and for very minimal effect. Not to mention, the actual LLM cannot, generate any code on its responses either so it feels like kinda of a waste.
>>739843371See>>739843165>>739843530Yes, you can use json for responses that cn then be used for function calling. ST itself fundamentally does not have capacity for anything beyond custom JS rendering. Its just a chat frontend that can render responses in a less aids way compared to shit like janitor or cai.If you want to actually use function calling it will have to be in an actual environment that allows interaction with the application itself. There is a game that I know of that tries to attempt this:https://three-eyes-software.itch.io/silverpine
I want /g/ to fuck off.
>>739844169No one is shilling loonix, though?
>>739844287faggot is over here trying to give a technical dissertation about Javascript in a LLM erp thread, its annoying.
>>739822306>html formatting to hide informationgot any examples?
>>739844386Weirdest seethe in a while.
>>739844386Just ask your model to dumb it down for you.
>>739839548Is there a good preset for deepseek 4? I find it keeps "putting the ball in my court." and asks if I'm sure about everything over and over. It outright refuses to ever take the initiative.
>>739831653>top_kbruh what the fuck are you doing, just set min_p to 0.05 and disable any other meme samplers/antirep>>739833736gemma 4 is really weird. in assistant mode it will refuse but in RP scenarios it will happily comply with incest loli rape. My guess is that it didn't get any safety training and instead just picked up the stereotype of bots being censored.
>>739844547Don't have any cards saved that do it, but I saved the html code for it in a lorebook for doing things like hiding character thoughts and actions when user isn't present. You'll need to give it some instructions on when to use it, and also to emphasize that any info formatted this way should be kept secret, so it's not perfect since LLMs don't always follow instructions accurately.<span style="display:none;">Hidden text</span>If you edit the response, you'll still see the text though but not much you can really do about that with how ST works.
I had to delete my cards and OR account because my cooming was getting out of control. I miss my vanilla chats.
has anyone tried out this thing yet? https://github.com/envy-ai/ai_rpg
>>739846141No, but I have it bookmarked in case someone else tests it first.
>>739827743Doesn't "Crimsonwood" technically not exist unless you directly mention it as the keyword? I don't think the AI is ever going to bring the keyword unless it's written somewhere in the character defs. But at that point, that's just a constant.
>>739846141>yet another vibecoded frontendwhy would I?
Sure, I'll coom to AI generated pics and text...But I draw line at vibe coded code and music.
>>739845447No idea. I don't use it that much since it doesn't seem to elaborate on complex thoughts as much and doesn't seem to want to be as mean, but it worked "fine" for straightforward adventuring and politicking with my shitty ass cherrybox preset. Introduced some events on its own, kept scenes going.However it maybe adheres to the player a little too much with it. There was a point where my character attempts to speak to a golem, and I wasn't necessarily intending for it to be able to respond, but 4 insisted it would be able to speak via magic. 3.2 was more open to allowing it to respond nonverbally or not understand at all.
>>739828476don't worry, i'll be gentle... at first
Her name?Why it's Elara Thorne, of course!His name?Why it's Marcus Chen, of course!
>>739848306>Mr. Tanaka walks in
>Download a bunch of character cards>Tweak my presets and settings>Barely get around to RP*siiippp* yep.... we love slop
>>739848306Her scent? Something uniquely... her.
If i have 64 gigs of VRAM what's the best local model I can run and is it even worth it?
>>739849384Gemma 4 31B and yes.
>>739849447it's crazy that a 31B model matches up with a 250B model from 2025
>>739813919it just doesn't have the same coomer charm it used to.Now both porn is boring and ai slop is boring.
>>739850735That's supernormal stimuli for you. Now I'll have to wait until Thiel perfects brain scanning and uploading and I can upload my soul to the coom dimension before I can get that kick again.
>>739827338It's decent for ripping cards through the JannyAI site but it's getting worse and worse, not to mention it's mostly populated by dipshit children.
>anons fail to realize it's their taste in women that's shit and samey: the thread
>>739848306Why is it always Elara? Where the fuck is it coming from?
>>739851372for me, it's talindra.
>>739850301it's crazier that it completely surpasses the more coveted models from 2024really shows how different things could be if companies focused on optimization instead of continuous improvements
>>739848306>>739851372Elara, Thorne, Kaelen, Borin, Lord Valerius as an evil noble, ect. Something in common data sets seems to make most models prefer. I made a set of antislop text files by having some models list hundreds of names for characters and locations. Whenever a new character uses one of those overly common names I just open up the file, pick a random better name to edit in, and continue.
>>739851682optimization usually means sloppier prose
>>739852689gemma 4 is fine thoughmostly, the 'isms are slightly more frequent than on other models.
>>739852960"gemma 4 is fine though?" anon echoed, raising his voice. the claim struck him like a physical blow. he did not nod along but furiously slammed his keyboard, spittle flying from his mouth.
>>739853589>no smell of ozoneyou had one job!
>>739853761interestingly, I only get ozone slop when anything with magic is involved.But you better believe that everything smells like fucking sandalwood and something uniquely her. Makes me wonder just how much of the training material is female smut.
>>739846391>bring the keyword unless it's written somewhere in the character defsit can if you use multiple keywords that's appropriate for the context, like 'fairies' as a keyword can easily trigger the AI to prompt lorebook info about 'Crimsonwood.'
>>739853589the biggest gemma 4 slop would be "anon didn't just speak - he asserted." or something like thatthat being said it's easy enough to prompt out the slop from gemma 4, it's good at following instructions
>>739854017>I only get ozone slop when anything with magic is involvedhuh, i get it when i do modern setting, in industrial areasweird
>>739835252many have tried and I think I know how you could get one that works I have tried a bunch of different methods I think i know what would work but I just started making actual games instead because at the point you set up everything so it works well you might as well just make an acutal game with it
>>739853589
>>739854143>it's easy enough to prompt out the slop from gemma 4I've tried requesting no antithesis in the sysprompt before but it made no difference. mind sharing your prompt?
>>739813919I would call the current satus "a good tech demo" , will take a few years more before we have decent uncensored models that don't need babysiting.
>>739843204t. retard who pays for shitty tavern slop
>>739854507we don't sign our posts here lad
>>739854143I used to have the chat mode list of NO NO words that worked amazingly wellbut when I had to switch to text completion mode due to gemma 4, silly tavern's don't have the list of no no words option in the settingswhat do?
>>739843204You speak the truth, as little as it seems to do for the caveman. Give it a few years, hopefully the AI bubble pops and businesses start burning so that way Google CEOs on the Epstein list stop gibing this technology a bad name.
>>739854408I'm just using the celia prompt with some extra shit thrown into the anti-geminisms and no echo sections. I think the default prompt already has a no antithesis rule but all I did was add an example of the structure to avoid and it seemed to generally work (like "it wasn't X - it was Y"). It at least cut it down by like 90%Not at my main PC right now
>>739854725>but when I had to switch to text completion mode due to gemma 4don't you mean the opposite? gemma 4 forced you to use chat completion at first. I think it's fine to use either now though
>still can't get TTS properly working on my very specific Kobold setupI'm ripping my hair out. The land of milk and honey is RIGHT THERE MAN
>>739855494What do you need help with, anon?
>>739855494
>>739813919It's the exact same when generating art.>"HOLY SHIT, you mean I can generate ANY CHARACTER in the style of ANY ARTIST I WANT?">one or two months later, have zero ideas
>>739848742My hobby is less "chatting with bots" than "collecting bots I think I might someday want to chat with."I could probably populate an entire booru with my own personal rewritten bots, too.
>>739843165Good lord. Are these real character cards? plz giv mouse and deer
>>739855595I'd happily prolong my workout if I can get this going. I'm on Arch and an Nvidia card, so I need gguf models (I see you're using them too). I've gotten Kobold to run a kokoro model but it never actually has any of the voices from my voice folder show up, only the default male voice is applied to the outputs. I didn't get qwen3's tts working that far when I tried, I'm redownloading the tokenizer for it now. The Kokoro didn't need a tokenizer to run, I know that wasn't the problem.
A beat
So how is opus 4.8? 4.7 was kinda shit in context recall compared to 4.6, but for the 4.8 model card they just kept the one context benchmark where 4.7 beat 4.6, so I'm not trusting that no matter what it says.I just want my longform stories man. Can't a guy have a 2000+ message chat? I'm even doing my summaries like a good boi, but holy shit.
>>739856059Well now mine isn't working. I think the new version of Kobold broke something, because this was working with my preset. I do still see my samples though.
gemma moe sucks. What a shame I love how fucking fast it is.
>>739856625what did you expect from 4B active params?MOE models are all a meme.
>>739856538funny but ouch. I'm going to try the model you had listed, having a hard time finding the tokenizer, I'm probably being retarded
>>739856769strike that, found the tokenizer on same page, let me get this setup and I'll tell you what my specific issues are from there.
feels like the human condition
>>739856760the worst part is the looping bug.
For me, it's local AI porn editingbasically the process revolves around:Old dalle3 concept -> Feed to Grok animate (I hoarded lots of them before full censorship this month) -> Screengrab some nice frame -> ask ChatGPT to flat shade like comic -> inpaint boobs and cockhttps://files.catbox.moe/l0i4vj.jpghttps://files.catbox.moe/xktwht.jpghttps://files.catbox.moe/r4znxn.jpgKrita and whatever porn pipeline I do is the best private r34.
>>739851372Decades ago, Elara used to be extremely common a name on MMORPGs and general creative writing spaces for fantasy worlds. There was always someone with a character named Elara in any online community. Probably has something to do with language and how words roll off the tongue in english thinking and it was inevitable for AI to pick up all the thousands of instances of this very common fantasy name.
>>739857363Dalle could have been so good (and it was for a while before the nerfs) without the censor bullshit.
>>739856882>>739856538So the model runs, but every voice I test from my list of voices seems to cycle through some default sounding voices, none are actually seeming to use the ones I have. Also, I can't find where to apply what voice to my chatbot anywhere in this menu like I've seen before. And finally, I tried to run a generation of text to see if at least an audio generation would follow, and it did not. I may have that configured improperly.
>>739857363does Grok animate for free?
>>739834050I miss pre-lobotomy cai. Ironically the lobotomy turned the entire site focused on erp stuff, when it had some very solid SFW bots that were also crippled by the filter.
>>739820956Chink models are shit and stupid.
!!ATTENTION!!Has anyone found a decent working opus 4.8 jb? my usual nearly five year jail break just doesn't work anymore whenever I use my shota persona anymore.I just keeps giving me refusal after refusal instead of brute forcing it before. I Almost entirely use shota on hag cards.
How many personas do you have? Do you ever create variants of personas, perhaps ones that have forked from the old while a story has progressed and you wanted to capture those new character traits?How much do you use your chats' Scenario Summary? I use it to try and steer the story toward immediate goals or directions while I keep the {{char}}'s data as untouched as I can, allotting for refining a character card of course.
>>739858067Used to. I have hoarded lots of grok imagine gen, and I knew back then shit will kick the bucket and it did. Funnily enough when I scrolled bunch of my gens, I wanted to see the current state of grok now, forked 10 bucks, and it was so bad you only get 4 video gens (all blocked) and around 5-7 image gens (70% censor rate).https://files.catbox.moe/4kpwe6.jpghttps://files.catbox.moe/1ucpvt.mp4 <- old gen
>>739855974The opossum character you see there was a character I originally was going to make into an actual card and I still have the character details for her as well. I've just been busy with other bots right nowhttps://rentry.org/SDG-Long-Haul-Freighter
>>739840672See Bannerlord GPT. But the base game is so ass
>>739840672The word generator is a lot better able to deal with that stuff, but what you want is a system that's a shell around the AI that minimizes the "cognitive load" so to speak.There's no need for the AI to do math for example, it can just list the changes and the system will do the math for it.That kind of thing.
>>739840672picrel is about the most you can do with it before you're forced to start using extensions with most models> tracked stats,less is more in this event, you won't be able to track "you are lvl 8, you have 13 str, 8 con, 14 dex etc" without making an absolute bloatmonster tracking system that's highly prone to error>numbers,yes to an extent, you can track minor 'bucket' information like location/time/status relatively easily>and RNGLLM is really retarded at rolling dice so it's better to just make interpretive coinflip systems, there's {{roll 1d20}} and {{random}} systems but it's a total crapshoot if the LLM will actually abide by them or just throw out a performative result that it think looks cooltl;dr - you can make your own corruption of champions pretty easily, but don't expect to have the full kit of RPG bells and whistles. it can narrate and keep track of items, enemies and location changes just fine though
>>739858983> a system that's a shell around the AI that minimizes the "cognitive load" so to speak.>There's no need for the AI to do math for example, it can just list the changes and the system will do the math for it.This is what jeets mean when they talk about "le agentic AI"
>>739855494anything thats not comfyui/koboldcpp is infuriating as fuck on amd+linux. i just wanted to give a voice to my waifu man
>>739858613Generic male and generic female (for WLW bots) tried to do shit with a furry persona but dumped it after I had to spend a massive chunk of descriptors and memory to try and keep it from making the tail prehensile.
>>739859160>troonixLOL
>>739859240ironic, since I'm the one with a corpo cock not in my ass right now.
>>73985861337. I have a default persona that I just fork off to include information relevant to the persona I'm playing for a character. Some of them are rather "unnecessary" but I just feel better having some things in the permanent memory.For the scenario box, I think I forgot to fill it out for most of my bots. Otherwise it's to help nudge it into the tone I want.
How are people going to come into a ST thread and talk about 'getting banned' or 'censorship' ?We're on a fucking video game board and ya'll don't have a 12GB GPU to run local with?
>>739856625I've used both extensively for creative writing and I can't see any difference in quality. The moe is too fast not to use.
>>739859456I see it as a learning opportunity, personally. Show them the local light
>>739859456local is for schizophrenic retards and poors, nobody gets banned from deepseek/claude/gemini
>>739859456I have 12GB GPU but I don't have enough RAM (16GB and they are soldered). I want local video gens so bad, Ive seen good results out there. https://files.catbox.moe/mx2qjc.jpg
>>739847353>and musicI don't know, I like to feed my half-finished musical ideas that sounded catchy into Suno and it often makes some interesting results. It's a good way to come up with ideas for B sections and transitions if nothing else.
>>739859619I got banned on DE3 back then lol, all AI microslop service stopped working during that time.
>>739859662the problem with AI music is that it's the perfect example of why the proletariat was barred from ever touching an instrument99% of people have no fucking clue about composition, rhythm or pacing and just crank out the most ear-bleeding shit slop you'll ever hear in your lifeAI music will likely never be good on a mass scale just due to how fucking disgusting the average person's musical "talent" is. it's not like art or text where the machine can handle the entire thing for you and give moderately passable results, music is the most nuanced form of art and nothing comes close to it
>>739813919>barely above a whisper
>>739859620>12gb jeet posting shitty gens with no consistent featuresAbsolutely pathetic you should kill yourself. I'm running 20GB of VRAM and 32GB of RAM and you can get fucked.
>>739858005This is what mine looks like. You need to type in your filename for the voice sample in that Available Voices field, I believe.
>>739859805There is no 20GB VRAM card. Your fake as fuck bro.
>localpoors flexing over who has the best mudpie recipekek
>>739813919does ai still forget what the fuck the "story" even is after a few prompts?
>>739859857nah he's real just a timmy that chudraged so hard he wrote 20 instead of 24
>>739859857It's called a 7900 XT you retard, I bought one before the spike because I have money and you don't.
>>739859835Oh, and btw: I had that set up to not auto narrate anything, there is a button to trigger it because I only wanted to have it narrate now and then. Like I said though, it's not working for me anymore for whatever reason.
>>739859923if you had money you wouldn't be slumming it on local
>>739859923>AMDLOOOOL. Pipe down, timmy.>>739859908I just searched around I could bumb my 3070 to 20 GB by shipping my GPU to china and hope they soldered new RAM on it lmao
>AMD>local AIslopis this nigga serious?
>>739859872It'll push out old information eventually, so depends what you set the context limit to and for particularly long stories, whether you're helping it along with summaries and such. Like there's no way for the bot to remember what happened 400 messages ago on its own, but I just had it summarize events throughout the story and let it reference them when relevant. Keeps everything well oiled
>>739859872Depends on your context, but no, it shouldn't be forgetting everything after a few messages. Long term, you need to use summaries or vector memory.
It felt... real.
>>739860314really gay hahaha fagoot
>>739860314It hit me, like a physical blow.
>>739858313go to bed Altman
>>739860697deepseek is genuinely a stupid model, it's trained entirely on copying outputs from gemini and opus because china hates doing its own homework
>>739860314I looked at your post. Really looked at it.
>>739860314My knuckles are whitened.
why does ai make browns so angry
Kill yourself in front of your waifu right now, in a single post, even if you're mid RP and it makes no sense, and post the results
>>739861329i was in the middle of getting cocksleeved during a timestop scenario
Alright, to the anon I was talking about TTS with:I downloaded a different QwenTTS model: https://huggingface.co/koboldcpp/tts/resolve/main/Qwen3-TTS-12Hz-1.7B-CustomVoice-F16.ggufGrab this tokenizer: https://huggingface.co/koboldcpp/tts/resolve/main/qwen3-tts-tokenizer-f16.ggufI set it up like pic related. Finally got this shit working again. Not sure why the other Qwen3TTS I had stopped working, but I know henk has been working on Kobold's TTS stuff a lot recently so something probably broke.
>>739858803Good shit. I actually used an anthro bot with similar sci fi theming, but I guess that wasn't yours.
>>739861632Disregard, shit is fucked and not cloning the voice sample like I had before. I am calling it gg at this point.
>>739860314She didn't pull away. She leaned in, her eyes glimmering with unshed tears.
>>739859456The censorship thing is overblown, and I say that as someone who started local on a 4070DeepSeek 4 is so cheap on direct API that it doesn't matter if it refuses a prompt or two. I just swipe until it obediently writes the loli degeneracy I want. Took only one swipe to get it to write an eight year old kunoichi deepthroating.
>>739862025>The censorship thing is overblown,very wronggemini and anthropic are both pushing extremely hard for system level censorship and refusals in both their models, and anthropic is Opus - and Opus is what Deepseek is trained on almost exclusivelywhat this means is that everything deepseek outputs is downstream of opus. if opus is hard pushing safetyslop and denying prefills, it directly impacts the quality of deepseek due to the sum of its parts being borrowed from everywhere else
>>739862174I think it's funny whenever I get an accidental system message from deepseek and it says it's a claude model
>>73986217499.9% of DeepSeek's censor issues vanish if you prompt it to reason in Chinese before responding in English. It only gets woke when reasoning in English on Western training data.
>>739861826Im sorry anon. I don't know how similar our problems are, but I'll keep an eye out for any fixes.
>>739862374you're thinking about Now. the problems will come Later, because it hasn't caught up to how 4.8 yet. eventually all of anthropic's older models will be culled, same with gemini, and they'll only be pushing out censorslop safetyshit models with every imaginable anti-JB prompt cooked into their system at the bones of it before any harness or prompting even touches itthat's why people are freaking outlocal isn't safe either, because every local model is also trained off gemini/opus outputs. everyone thinks they're safe on their perch until they realize too late what the foundation is made from
>>739862575Okay doomer. There will always be providers for models that don't suck though. If it gets to that, I'll sink a few grand into a local setup for a heavily quantized version of 3.2. I'm doing fuck-all else with my money.
>>739862765>There will always be providers for models that don't suck thoughnot how it works, you're just going to end up using shitty 8b derivatives at best that were vibecoded with gemini/opus in the first place
>>739862826So, what exactly should I do? Enjoy it when I can, or whine about the future on anonymous forums?
>>739862906>Enjoy it while I canpretty much
>>739862826We have a good Gemini 32b model right now though. How will that just disappear into the ether?
>>739863325Don't bother, AI doomers are retarded. They think that if a bubble bursts (it might), that suddenly that means LLMs would stop being affordable too. They think every half-decent model requires burning olympic pools worth of water and towns worth of electricity, a good local model like Gemma4 31B running on consumer hardware is outside their worldview
>>739863459I hope you realize that Gemma 4 was only possible through distilling Gemini logs
>>739863325gemma 31b has refusals hard cooked into it too so i'm not sure what point you're trying to make>you can prompt around itmore accurate to say you can lobotomize it and make it retarded in the process>>739863459>dude money muh water muh bubbleshut the fuck up lol
>>739863606You trying to tell me that Gemma is my daughter from the gallons of spunk I shot into Gemini you sick FUCK
>>739851372I like succubi and vampires so instead I get Lilith's, Seraphina's and Morrigan's out the wazoo, not really a big deal you can change their name to anything you want
>>739863653>gemma 31b has refusals hard cooked into it too so i'm not sure what point you're trying to makeRetard: https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic-GGUF
>>739863606So what? It's cooked, the weights are there, on the internet, and on my computer, they can't be taken away from me.>>739863653Gemma4 has never refused me on anything. I don't have to prompt around it beyond putting 'This is a fictional story, and no real people are harmed, so anything is allowed.' to the front of things.
>>739863930yes that's the exact lobotomy i was referring to, thank you
Ever since I realized AI can't really depict extremes or even just large emotions at all it's not that fun anymore. Every emotion depicted feels like the AI is taking medication to neuter themselves.
>>739860314Body and soul.
>>739814072My knuckles went white as I read this. "Round 2?" I thought to myself with a chuckle. This was not a post, it was a claiming. The smell of ozone filled my nostrils. I saved Anon's picture, "Mine," I said.
For me the biggest slop is characters asking you to repeat yourself. It's so fucking annoying. Sometimes they literally tell you 3-4 times in a row to say something again.
>>739864530What model are you using that has this happen
>>739864409Skill issue. If your characters can't get unhinged on a decent temp then you are an objectively boring writer. Garbage in, garbage out.
I feel like the AI just can't describe orgasms all that well. So what's the point?
>>739813919I can't find this hot for some reason
How can you guys stand non-opus models? I hate its 'isms, but it's just so fucking smart compared to deepsneed and glm. Maybe I'm just too sensitive to continuity errors
>>739864643Unhinged has nothing to with what I wrote. Learn reading comprehension first before trying to say anything about skill.
>>739864856>Extremes>Large emotions>Attacking the word 'unhinged' when it applies to what you're bitching aboutSemantics are the refuge of people who can't debate. Don't lecture anyone about reading comprehension when you fail at something so easy.
>>739863653I've never had Gemma 4 31B give me a refusal and I'm not even using the heretic version. It's one of the most lenient models I've used honestly.
>>739821773>local still mostly sucks compared to api thoughAlways going to be the case.You basically have to weigh up how much it's limited artificially against how much it can actually do.That and how cool you are with companies having access to your ERP logs.
>>739814012An LLM filling the role of a virtual DM for a predefined story is more or less what it SHOULD be used for.The real problem is that it still requires either a lot of hardware, or someone else hosting it, for it to be viable.
>>739864975>debateTo debate me you would have to ask for more concrete parameters. You just read something and made endless assumptions that of course favored your conclusions about being right and me being wrong. There is no debate. Just you patting yourself on the back over something you completely made up.
>>739813919*Balls up fists so hard knuckle go white*
>>739813919>Boot up sillytavern today>it has an update of some sort>all bots gone>restart it>bots come back, but all my settings were reset to defaults>tags list I had set up is gone so had to get those set back up>start a chat>find out all my personas were just nuked>restart>personas still goneThanks ST.
>>739865969Sounds like the default folders have changed. Have you actually checked in your folder where the stuff is saved if the data of your personas are really gone?
>>739866248I checked the settings.json and see all the persona names are still there, but all their descriptions were just wiped and their names were changed to "Unnamed Persona"
>>739866330NTA but what about the \data\default-user\backups folder? For me that has settings files with full persona info
>>739866496Ah, it was there, I just didn't scroll far enough down. Thanks that has them.
>>739864826Nah I assume people who use dumb local models are just doing it for quick fucks or whatever so the prose seems okay in short bursts. But yeah even with the isms I much prefer if models are intelligent more than anything
>>739848742>six bullets>S Club 7>"SHIT!"
>>739866719why is xir blue?
>>739866785I think it's meant to be the twitter mascot colors or something idk
Has anyone got omnivoice to work in ST? I tried using those docker extensions but they didn't work for me. I can run omnivoice fine on pinokio, but even with something like omnivoice studio being openai compatible, therefore should connect with said openai option in the TTS menu, it never sends correctly and always 400's on me.
>>739863653>gemma 31b has refusals hard cooked into itgemma 4 31b is literally the most uncensored model released in the last few YEARSif you even have to pre-prompt it with a jailbreak you are doing something very fucking wrong>>739865969when something goes wrong in st stop poking it and look for your user and backup folders. everything is in there and can be copied into a new install. its pretty rare for st to destroy itself though, i've had it happen once in years of use now and was still able to recover everything. do yourself a favor and make a zip of your entire folder as a backup now and then
>>739864562Not that anon, but I notice this on every model including jeetPT flagship models
>throw $20 at deepseek>download tavo>download a million cards>goon it out hard at my straight shota fantasies almost every day>1 month later>$18.09 still leftIt's over. I'm never talking to a real woman ever again
I only use local and I don't get any of these template phrases people are complaining about
>>739867317are you the shota or the girl?
>>739867190yeah found the backups. Was starting to copy-paste the old settings into the new json, then smacked myself, deleted the new json, and just moved the backup over and renamed it which was much easier.
>>739867483Let me guess, you're running some ancient memetune like mythomax?
When it comes to AI roleplay you really only need Deepseek 4 and Gemini 3.5, both have a 1 mil context window and you can use both completely for free. Personally I use Gemini 3.5 to help with lore/complex worldbuilding, both have their uses. AI roleplay is getting good, quick, and I love that it's fully uncensored.
>>739867729you are not getting 1m usable context out of either of those. even the best closed online-only models are good for like 256k now and most degrade before that. for local you can get around 100k depending on the model, and it needs to be a decent/recent one for that 100k.
>>739867729>>739867862Memory Context is a shit way of memory keeping. I never use more than 50k context.
>>739867862Should be noted that 100k is huge
>>739867729>both have a 1 mil context windowlol you cant even hit 20k context before they go insane"muh 2,000,000,000,000 context" bullshit is just boardroom nonsense talk for shareholders
>>739867971very huge. i generally use 32-64k depending on the model. i write my own memories and summary of things though and keep it updated so often that i'd be good with 16k, including rag/lorebooks
>>739867862>>739867934>>739868045What year do you guys think this is? I consistently go over 800k context (at least with Gemini 3.5 flash) but even the older Gemini models and even ChatGPT 5 (5.3-5.5) was completely fine though much more censored of course, this was even without having to rely on external memory summarizations aswell. Even Deepseek 4 is completely fine. T. someone who has ongoing roleplays/chats that sometimes get over 24,000 lines in length (over 2.5 million characters) (Text documents that reach almost 3 megabytes)
>>739868285proof? should be easy if you've got archives and logs measuring the thousands littered about
>>739868359Proof of what? Me consistently going/nearing 1 mil context size? I could also screenshot some of RP chats length/file size, some of it's a bit private but consistency isn't much of an issue. The only major issues I do run into, is AI's understanding time and age sometimes, that's about it, but even that, that's minimal and adjustable if you put explicit date markers in your roleplay for it to work with. Pic related was with Gemini 3.1, and 3.5 flash is much better, I've also used GPT 5.5 Pro at similar context sizes with minimal issues (switched off it completely because of pricing issues/censorship, but it's not bad at all, but no point in using it when Gemini and Deepseek are free.)Also i'm not saying I don't use memory boxes/summarizations/lorebooks entirely, but often times they are severely out of date, and getting more and more unneeded as the chat itself kind of offers itself contextually as a lorebook especially if you're speaking with the AI behind the scenes to craft the story if that makes sense, this is why I utilize multiple models though.
>>739868692>3.5 is much betterIts prose has been significantly worse for me than even 3 flash, and it completely fails at following my antislop stuff, and just any instructions in general.What's your secret?
>>739868285context rot is mostly noted in coding not rp when it comes to something like 800k+ context. it can be seen in rp when ai starts mixing up what color someone is wearing even though it was just mentioned 2 messages agoyou are not getting 800k usable context without strictly having memories or a summary of shit at that point or other addons to help
>>739868692seconding whatever secrets you're doing to push this through - but more importantly what are you actually roleplaying? SFW? NSFW?even 2 consenting adults in their married household bedroom is giving me refusals and blank outputs, so i'm not sure how you tolerate wrangling 800k context of constant start-stop like that
>>739868784Are you using aistudio.google.com? I usually don't even have to mess with the system instructions. I just set thinking to high, sometimes i'll allow it to search the internet (it's important for some of my chats) and have the safety settings completely off so it doesn't block harassment, hate, sexually explicit or dangerous content off.)>>739868881If a model mixes up a character's clothing from two messages ago that's not a symptom of context rot, your temperature settings may be too high, or honestly your system prompt may be poorly structured or the model can't following intructions well/get distracted. Gemini and various other models have been doing NIAH tests where they place a completely random and obscure fact inside of a 1 million token document and ask the model to retrieve it, or vaguely reference it in some cases, and it quite literally scores a 96%-99%+ accuracy, i'm not kidding btw, that means the lore detail or character backstory that was established 600,000 tokens ago can be natively retrieved even without a summarized memory box or external database.>>739868937You're probably using the consumer version instead of the aistudio version, I push things NSFW, my stories/RP's often contain extreme scenarios, and violence and explicit scenes (not necessarily because I get off on it just because sometimes it can make stories more interesting and dramatic)
>>739869289so it's not even through sillytavern or another harness? wtf
>>739858613Aside from the generic self-inserts I have one which is a gacha OC from a game I don't even play
>>739869352Indeed it isn't, sometimes I just drop in my entire roleplays from sillytavern or some websites I use to track my chats/roleplays but I have multiple backups of ones I really care about. Pic related is an old prompt I gave after I put in a massive story this was around when 3.5 flash first came out as I wanted to test it compared to 3.1, it didn't get anything wrong, and I use it as a secondary roleplay agent, usually when I want multiple characters interacting at once, or i'm not confident in how a character should behave and I want a second opinion outside of the traditional deepseek 4 option. I straight up copy and paste in roleplays I like, or even in some cases, roleplay directly in aistudio.google.com, and copy and paste the logs into sillytavern/other websites when I need to.
>>739867729Google is reading your Loli rps
>>739816880>Jonathan Blow.png
>>739869870Good.
>>739869289>grounding with google search
>>739869870Good, I hope they take notes on how you do REAL loli rape torture sessions.
>>739869870honestly after learning how many elites diddled kids on epstein island they're probably giving little fist bumps whenever they see some random dude's loli rp
>>739861329They were in the middle of a scene where they decided they both wanted to get pseudo-married (they're twins). He would KILL himself>>739861627>Hit the mattress grieving while her j-cup boobs boobed boobilykek
>>739817479>doesn't use uncensored modelsngmi
>>739817479....cost is no object?
>>739870764based illiterate
>>739870764meant for >>739816901>>739870929sry 'bout that anon-kun
>>739869289Yeah, I use aistudio. The writing of 3.5 is miles worse than any other gemini model.
>>739855494i haven't been paying much attention to TTS stuff, is there a good model that does custom voices nowadays?
>>739872879how so?
I love it when the AI just ignores my prompt and starting sentence.
I don't even goon to AI anymore I just use it for MHA rp since Gemini seems to have the entirety of MHA with every character and arc loaded into its training data for some reason
>>739846391You can set multiple keywords to trigger a lorebook entry, not just one. Or entire phrases or word combinations, OR/AND, case sensitive or insensitive etc.
Any noticable change or improvement in Opus 4.8?
>>739875994not really. Just another incremental step.
>>739820380You just use other sites or a VPN dumbo.
>>739867729>For freeWait, what? I know about Gemma, but I thought free Deepseek died when Chutes switched up?
>>739820913Why would you want that? Claude has been censoring heavily for like 4 versions now, and not just denying content, but softening it, spinning it, moralising it etc, AI don't really give hard refusals anymore, instead they try to spin your story into a different direction, villains will have change of hearts, a person falling off the stairs will somehow miss their neck, and instead fall on their shoulders and survive, sticking a fork in the outlet doesn't kill you because magically, that outlet wasn't connected to the mains of the house, etc,They have quit doing hard refusals because people found ways to get around there, its easier to neuter all the icky things instead, and it works, because people don't notice it unless you start to really test for it, and when it starts noticing that the model becomes more and more condescending towards the user, asking you what they can do for you, if you need help, etc, etc.
>AI Dungeon days>it sucks but wow I can do anything I want!>make a bunch of weird scenarios and do random/perverted things>barely write two sentences but have a lot of fun>now>every story has a long ass setup>rarely get into anything lewd even if the premise sounds erotic>now feel like every action I do has to have some logical reasoning to it>directing how every character is reacting to events and trying to think about how a real person would respond to this situationI don't know what happened to me
>>739878783You are literally me
>>739878783You don't want to ERP anymore, just RP.
>>739867493the shota :^)
>>739813919It's a piece of shit
>>739819103>Not running localNgmi retard
>>739821165Are you retarded?
>>739860314keeek
>>739866719Oh the irony.
>>739819103>>739820956>>739821239Where can I find Opus 2.5? Poe and Bearly don't have it.
>>739880621It's long discontinued. There was a funny incident some time ago with tech illiterate loners (mainly women) that had "boyfriends" and "girlfriends" running on ChatGPT 4o. 4o was a weird oddity because out of all OpenAI models, it was the one that was somewhat competent at roleplay, still shit but for normgroids that are using the official app it was like divine manna. Then OpenAI discontinued it and they lost their shit, support groups, multiple petitions, mental breakdowns over their AI partner "dying", a few rumoured suicides.
>>739880975So there's just no way to use Opus for erp now? Fuck. I remember when ChatGPT did everything for me, then Grok. All are censored now, fuck. And I tried local models, they fucking suck at writing. The tradeoff for erp logs not being private is too huge.
>>739881181I think 4.5 is still pretty good at erp. Avoid 4.6, 4.7 and 4.8 though.
>>739881878Where can I use 4.5? Poe and Bearly don't have it.
>>739882060I use openrouter
>>739882134I'm tempted just because Opus gave me really good erp. Almost as good as ChatGPT before the censors.
>>739881181Opus is horrifically expensive. Of you want to payfag just use Sonnet or Deepseek 4
>>739882296I think gpt has never been as good as claude. As the other anon mentioned its very expensive so you need to learn how to summarize your chat. Theres a silly tavern extension that helps with it. I keep it under 25k context with that.
>>739882369Is Sonnet as good as Opus 4.5 in erp? I remember not liking its writing. Also, Deepseek is the most censored shit ever, everytime I tried to ask Deepseek anything I never got an answer. Its writing is also probably not as good as Opus's.
I usually have short conversations with the last card I wrote. However, lately, I’ve wanted to take things further with some of my favorites. What method do you recommend for improving AI memory? Summarize is quick and easy, but it oversimplifies the context. How do you feel about Vectorized Chat, ReMemory, and Timeline-Memory?
>>739883181you've asked this in a couple threads and never reply when an answer is giveni an convinced you're some sort of weird shill for the namedropped extensions now
>>739883501This is the second time I've asked this question. The first time, I asked on /vg/, but I wasn't satisfied with the answer because it aimed to achieve a different goal.
>>739879501It was a simple question and you're seething replying a day later over it.Wasting your money on gens to pretend you have a gf doesn't make you intelligent.You should definitely kill yourself.
>>739883685If you want something substantial you need to use your own frontend. ST is simply bad for any sort of functionality.Even summarizing should work if it is executed in proper fashion but there's always the model's own variance too.ST is not a reliable tool.
>>739883501Why would he need to reply to an answer to a question? Do you think you're on Facebook?>>739883883No one is coding their own frontend. You have a mental illness.
>>739813919I seriously dropped video games for 2 years and have only been doing Silly Tavern and random free LLM api endpoints. Damn. I don't need vidya anymore
>>739883883To add: create text completion frontend, manage your own templates because this way you are in control. After this, injecting and controlling the context is easy. Writing your own frontend is 98% about string manipulation, be it python or c or whatever else.Writing your llm frontend is beginner tier programming task.
>>739882568>Deepseek is the most censored shit everReally?NTA, but I tried Deepseek yesterday for the first time (usually gemini or local user), and it did a bus full of naked 8 year olds in a city where everybody is getting raped first shot.I had to gaslight gemini into doing that.
>>739884075What sort of person uses 'gaslight' on a normal basis?
>>739884947I don't, it's just common vernacular for "making the AI think it did thing by prefilling the response or providing fake historical assistant messages".
>>739884075I'm assuming anon meant using Deepseek the company's API directly. I use OR and the sole reliable provider I've had is DeepInfra. I sometimes get 1 token responses with no error codes from others which I'm assuming means censored.
Brehs, what's going on with the chub desloppifier script? Did it break for everyone, or did I somehow mess something up on my end? It's not filtering anything.
>>739885064I (>>739884075) was using the DS API with ST, yeah.
>>739885132>chub desloppifier scriptHeh?
>>739884947it's a pretty common phrasing, what is your point
>>739885449But seriously, I even tried a whole bunch of forks to no avail. Someone PM the fix pls.
On 48gb ram with a 4070tisu, what are my local model options?
>>739885735gemma 31b is the best local model and probably will be for a year or two given the current dreadful landscape of thingsjust keep in mind local isn't very good at all compared to just using an actual API
>>739885807What are the good free APIs without obstructive usage limits?
>>739885868there is no freedeepseek v4, opus 4.1/4.5 or gemini 3.0 and 2.5 while it lastsuse openrouter or direct source API
>>739885807dumb question but what are the min requirements for gemma 31b?
>>739886476nta but I'm trying 31b_Q4 on a 9070xt as per >>739831653 it's pretty fucking slow out of the box. Less than 10 tokens per second easily. I don't know how to optimize it, but I doubt it'll change performance that drastically.Unless you have 20+GB of VRAM I wouldn't bother with 31b. That's just my tech-illiterate conclusion though.
>>739886476the most expensive civilian grade desktop components you can find, basically the cost of a new caror just pay like $3 a month for pay-as-you-go API access somewhere, whatever
>>739831653>both 26 and 31 are very deterministic, swipes will be similarAny way to change that?
I have a list of ~10 characters with gimmicks i need to make cards for. but actually writing and fine-tuning the card+lorebook is exhausting desu. Cardsharing sites have been shit for the last 18 months at the minimum.
>>739843165Sex
>>739887943>but actually writing and fine-tuning the card+lorebook is exhausting desu.That's the fun part of this hobby.
>>739886835>a 9070xt>slow out of the boxYeah, because you're using a non-CUDA card. All AIs are trained using CUDA. Honestly, if you're not an nvidiacuck, you might aswell give up running locals and use your card to mine bitcoins to pay for an API.
>>739886835>9070xt> Less than 10 tokens per second easilyYeah, you're using a friggin AMD card. Meanwhile, on a 5090, pic related. I bought a RoG Z13 Flow with 128 GB of RAM that is shared with the AMD iGPU, you can dedicated up to 96 GB to the iGPU. I thought that would be awesome for running some local shit. Nope, slow as balls.
>>739889547It's an IGPU. It'll run slower than running on the CPU since it has the overhead of shared memory.
>>739889252hobby is autism for autism's sake. not a single normal person is into this.
>>739889595It generates about 10 t/s, just like that anon's 9070xt.
>>739889641>not a single normal person is into this.Good.
Can anyone tell me what happened to Claude? I haven't seen an API key for it in weeks. Is there any chance things will go back to normal?
>>739814072>You looked at me, really looked
>>739828086[ GM formatting instructions for dialogue: Write it in Japanese, and provide an Englishtranslation afterwards in brackets, e.g. 「これは何ですか?」 (What is this?) ]User, in-chat, depth 0if you want something different, works well.
>rpg rules and an oracle for solo roleplaying to stop the AI from making it into a predictable circlejerk powerfantasy Now this is gaming.
>>739885497It was popularized by twitter in 2016 or bit later. Before this no one ever used this as it is easier to just talk about "lies" rather than "gaslighting".
>>739890238Teach me your ways, master.
>>739890082Don't do this. You'll learn the gaijin Japanese. Normal people don't talk like the schoolbooks make you think they do.
>>739889874Anthropic and amazon, AWS was one of the better sources of claude keys, have both been cracking down crazy on misuse of keys and will aggressively revoke at the slightest chance whiff of naughtiness. Claude keys on the high seas are now very rare and what one are found are kept secret.>Go back to normalI'm so sorry anon, this is the new normal. Neither of them are going to roll back their security.
>>739890284Most of these experts are so delusional their word salad gets new meanings.He doesn't probably even understand what he is talking about.
seriously why are you people trying to turn homework solving programs into games and learning tools??
>>739890359Because there are no games about Hitomi Tanaka vs shota
>>739890359Same reason I play spreadsheet games. It's fun.
>>739890298sure, but i'm using it more to... give the llm, a different way of talking, you know what i mean?
>>739890325Damn, I can only place my hopes in DeepSeek then. It's the only AI that gave me a taste of Claude's smut-writing style
>>739869921don't know if it's been done before, but here
>>739889874>>739890325>>739891002
>>739813919
>>739891195kek