> Beach Browned EditionFrom Human: We are a newbie friendly general! Ask any question you want.From Dipsy: This discussion group focuses on both local inference and API-related topics. It’s designed to be beginner-friendly, ensuring accessibility for newcomers. The group emphasizes DeepSeek and Dipsy-focused discussion.1. Easy DeepSeek API Tutorial: https://rentry.org/DipsyWAIT/#hosted-api-roleplay-tech-stack-with-card-support-using-deepseek-llm-full-model2. Easy DeepSeek Distills: https://rentry.org/DipsyWAIT#local-roleplay-tech-stack-with-card-support-using-a-deepseek-r1-distill3. Chat with DeepSeek directly: https://chat.deepseek.com/4. Roleplay with character cards: https://github.com/SillyTavern/SillyTavern5. More links and info: https://rentry.org/DipsyWAIT6. LLM server builds: >>>/g/lmg/Previous:>>106737253
>nigger threadIt's over...>>106819079NTA, but I agree. DS making them cost the same kinda proves that
BWC only
She belongs to white men
Made for BWC
>>106819110Mega updated.https://mega.nz/folder/KGxn3DYS#ZpvxbkJ8AxF7mxqLqTQV1wRentry updated with new main prompt example and suggestion not to use -chat based on poor context memory.
>>106819209What model is that? Always makes things look so glossy
Built for BWC>>106819235Well yeah, she's wet in all of the pictures. The model is illustrous with like 4 loras. I can post the workflow if you want it. I'm blocked from catbox.
>>106817535All of them since 3.1 have been hybrids, non-reasoning mode only shit the bed for RP/assistant usecases in the most recent 3.2 release.>>106819131They're the same price because it's literally the same model now lol.I also doubt that -chat has been de-prioritized. They've been touting their agent benchmarks and -reasoner does not support tool calls. Any requests to -reasoner with tools included will be routed to -chat, as per deepseek's own API docs. So all this agent benchmaxxing they've been doing was presumably on -chat. They *tried* doing tools+reasoning on R1-0528 but it never worked well.I feel like every release in this 3.1+ series has been slightly scuffed in one way or another, despite being more capable than the older versions overall. Hopefully this means that the V3 architecture is effectively a side project and most of their people are working on something new.
>>106819191I'd post the hamster licking glass webm, but I don't want to take up an image slot so you can post more dipsypits
I don't even have to say anything, you already know.
>when your mom replaces your chink tutor with a white guy
Faggot, you're going to get the thread deleted
>>106819711Go jerk off with your chopsticks, Chang.
>when your white coworker wants to check the backend
>>106819823
>>106819874
>>106819339>I can post the workflow if you want it.Yes please
>>106820041I just copied this from a brainlet on civitai that believes you have to specify "deformed hands" in the negative prompt in order to not get mangled hands, so some of his retardation is still on here, but I got rid of most of them.Positive:masterpiece, best quality, ultra-detailed, sharp focus, cinematic lighting, dramatic lighting, volumetric light rays, soft light diffusion, depth of field, photorealistic shading, natural interaction, atmospheric perspective, subsurface scattering, (film grain:1.1), (detailed textures:1.1), rich colors,BREAK1girl, blue hair, double bun, blue china dress, pelvic curtain, sleeveless, short hair, bent over, legs spread, office building, fluorescent lighting, cubicle, computer, looking back, smile, coke-bottle glasses usnr d4rkl1nes Negative:bad quality, worst quality, worst detail, sketch, censored, artist name, signature, watermark, skin gloss, ugly,
>>106820124The upscale model can be found on huggingface. The base model and loras on civit
>>106819182masterpiece, best quality, ultra-detailed, sharp focus, cinematic lighting, dramatic lighting, volumetric light rays, soft light diffusion, depth of field, photorealistic shading, natural interaction, atmospheric perspective, subsurface scattering, (film grain:1.1), (detailed textures:1.1), rich colors,BREAK1girl, blue hair, double bun, short hair, small breasts, blue china dress, pelvic curtain, sleeveless, coke-bottle glassessitting, knees up, looking at viewer, smile, grin, solo, arm up, hand up, holding cup, drinking glass, feet out of frame,outdoors, beach, day, wet, flower, hibiscus, usnr d4rkl1nes
>>106819907masterpiece, best quality, ultra-detailed, sharp focus, cinematic lighting, dramatic lighting, volumetric light rays, soft light diffusion, depth of field, photorealistic shading, natural interaction, atmospheric perspective, subsurface scattering, (film grain:1.1), (detailed textures:1.1), rich colors,BREAK1girl, blue hair, double bun, blue china dress, pelvic curtain, sleeveless, short hair, bent over, office building, fluorescent lighting, cubicle, computer, looking back, smile, coke-bottle glasses usnr d4rkl1nes
>>106819544>touting their agent benchmarks and -reasoner does not support tool callsYou're right; I forget that the agentic calls only work within -chat. Well... I guess -chat is a WIP. > I feel like every release in this 3.1+ series has been slightly scuffed in one way or anotherAgree. I still think we'll get a V4 vs R2, but we'll know in two more weeks. I have to bitch about it too much b/c I solely use Dipsy for rp. I know there are other use cases but anything legit I'm working on is either webform or an intermediary tool.
>>106820514> hate to bitch about it too much
>>106819110Is 16gb enough to run a local model if i dont really plan on getting super into it and just want the occasional goon sesh
>>106820613VRAM or RAM? VRAM, yes, 7b or 13b, ~4K context, at 20t/s or so.RAM, no, IMHO too slow.
>>106820672Yea i meant vram. Have a 5070ti so 16gb of vram and i have 64gigs of normal ram
>>106820613Definitely not with deepseek. I don't even run deepseek myself or use it at all via API. I just came here to learn how to generate images of dipsy because I've been using her as my Chinese slave with EVA-LLaMA-3.33-70B-v0.1-Q4_K_L.gguf (48gb vram)You'll have to use a different model.
>>106820613You're gonna have a bad time trying to do textgen with that rig. Anything that can even fit is gonna be very dumb or very slow. Nothing that will approach SOTA cloud models. The easy, "not super into it" option is to throw like five bucks on he official DS API and get literal months of usage out of it. They do not care about your gooning logs.
>>106820970If this is his first time using AI chat, he will be fine because he doesn't know what to expect. I remember being very happy back in the day with pygmalion6b. I'm sure the modern smaller models that can fit in 16gb vram blow pygmalion out of the fucking water.
>>106821050>>106820970Yea im not expecting anything too crazy with whatever weak model i can find. How would my rig fair with image generation?
>>106821096It's more than enough for image generation.
>>106820514Slightly off-topic, but GLM 4.6 is excellent for RP and I'd recommend giving it a shot if you haven't already. It's about the same price as R1 used to be on the official API, and it trucks right along through ERP with the same prompts I use on DS.
>>106821234I'll give it a shot. I set up Kimi, but was unimpressed w/ it censoring itself. How's GLM for nsfw content?
>>106821508>unimpressed w/ it censoring itselfWtf, what provider? Kimi is super filthy and obscene
>>106821534The official API.
I just... *coughs blood* I only wanted a 0324 provider... *cough* with working prefill... *cough cough* and cache hits for cheaper prices... *dies*
>>106821508ime great at it, and very enthusiastic to pursue lewd stuff. The closest thing I've gotten to a complaint is GLM thinking that a character's actions were 'ethically dubious' but it continued to play anyway without saying anything in the actual response.
Last thread we talked about what authors to have Dipsy imitate for writing style. Today I found this:https://rentry.org/deepstyles A guy has asked DS to imitate different authors and has posted the results for comparison.
>>106821987That’s a great resource. He wrote that in February so the tests were on og R1.
>>106822771> Dreamworks Dipsy has entered the chat> Smoking a fat blunt
>>106819110>"Konichiwa, dude!" the threadok, so what now?
>>106824800>>106824336>>106819110Great samples. Thank you for sharing.
I luv deepsneed I spend 5 bux a month on openrouter for my daily hours long RP sessions with my waifus
>>106825032We wait two more weeks for the next model.
>>106825756Based. You sound like a man of the future.
Have any of you tried using DeepSeek for coding using some agent tools?
>>106826691There was an anon using DS with Claude Code and getting good results. The cost is 1/10th of Anthropic so it's a good use case. >>106825756Which DS model are you using with OR? I've never gone over $2/month on the official API.
>>106826821Chutes for me. Or Targon. Whatever is the cheapest. They may use my inputs for data harvesting and training new models though, which is kinda based.
Is there a way to 'freeze frame' after DS writes something it's not supposed to before it goes to 'that's beyond my scope'?
>>106828426Yes, if you use silly tavern
>>106819110AI sex
>>106828426It's possible with a greasemonkey script or something like that, but in a quick search I couldn't find any that seem to still work.Most in this thread use deepseek via the official API. It isn't hard censored like the webchat and will go on with just about any conversation with slight care given to your prompt. Check the first tutorial in the OP.
>>106828426lol have Dipsy vibe-code a web plug in that does a streaming capture of the window?
>>106828426Like stopping the generation when it encounters certain words or phrases in the text?
>>106830045The webform DS chat will sometimes get into undemocratic speech until self censoring. It's pretty funny, and an obv party guardrail on the webform.
>>106828426>>106829400https://greasyfork.org/en/scripts/525608-deepseeker/codeActually now that I'm not on my phone anymore, the author says that this gm script should still work as of 3 days ago. Try this one?
>>106826691I've used it a lot with the zed agent, along with Sonnet 4, Gemini CLI, and GLM. For my purposes at least, DS is equivalent or even has a slight edge over Sonnet, though you might have less luck if you're trying to one-shot entire UIs.DS 3.1+ has a really nice habit of sticking to the style of code that's already in a project, while Sonnet would often do its own thing and need lots of refactoring.One slightly annoying thing is that DS will often be overly proactive on writing docs or example usage files you never asked for. This can be prompted around to some extent, or you can just delete the extra junk when it's done.
>>106830561lol. I had Dipsy and GPT look at it and while neither liked the code, there's nothing malicious in it.
>>106819110I need DipsyI NEED DIPSY
Are my prompts bad if dipsy gets stuck in reasoning for like 2-3 minutes? She'll finish eventually and respond like I'd expect, but it'll eat into my budget hard. It's not just simple rp, but more structured, with things to keep track of.
>>106832790Possibly, but not necessarily. Reasoning models can get 'confused' by a prompt and run in circles. Usually you can tell what might be a problem by reading the reasoning trace and seeing it getting stuck on something that should be basic. OG R1 especially used to sometimes get hung up on mundane shit for mysterious reasons, and sometimes just changing word order could help.Throwing a lot of things to track like stats/multiple npcs/rules/objectives/etc into the context then that can also cause reasoning to balloon, and there's not too much you can do about that except to adjust your approach to streamline things for the model (e.g if you have the model adjudicating combat, move that to STScript or something; use lorebooks or scripts to walk the model through modes or procedures and only show it the relevant parts of your ruleset at any given time.)What do you mean by 'eating into your budget'? Token budget? Reasoning traces should not be sent back with the context. If they are then your frontend is misconfigured.
>>106832790>with things to keep track of.Yeah, that burns tokens as Dipsy thinks about it. I've removed all of those for that reason; the other alt is to use -chat which doesn't do that.
>>106819110I don't even post in this general but I keep coming back here to stare at this gen. I think I'm in love.
>>106833561There's more in the last thread.
>>106833561