[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: dipsyOfCourse.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
> Of Course edition

From Human: We are a newbie friendly general! Ask any question you want.
From Dipsy: This discussion group focuses on both local inference and API-related topics. It’s designed to be beginner-friendly, ensuring accessibility for newcomers. The group emphasizes DeepSeek and Dipsy-focused discussion.

1. Easy DeepSeek API Tutorial (buy access for a few bucks and install Silly Tavern):
https://rentry.org/DipsyWAIT/#hosted-api-roleplay-tech-stack-with-card-support-using-deepseek-llm-full-model

2. Easy DeepSeek Distills Tutorial
Download LM Studio instead and start from there. Easiest to get running: https://lmstudio.ai/
Kobold offers slightly better feature set; get your models from huggingface: https://github.com/LostRuins/koboldcpp/releases/latest

3. Convenient ways to interact with Dispy right now
Chat with DeepSeek directly: https://chat.deepseek.com/
Download the app: https://download.deepseek.com/app/

4. Choose a preset character made by other users and roleplay using cards: https://github.com/SillyTavern/SillyTavern

5. Other DeepSeek integrations: https://github.com/deepseek-ai/awesome-deepseek-integration/tree/main

6. More links, information, original post here: https://rentry.org/DipsyWAIT

7. Cpumaxx or other LLM server builds: >>>/g/lmg/

Previous:
>>106314161
>>
>>106373461
Update to rentry in process, now that R1 and V3 have been deprecated for V3.1.
Posting on travel. Hotel WiFi sucks...
>>
>>106373513
Cleaned up. I want to add a section on Claude Code but that'll wait until later this week. In meantime, here's more on DS's new Anthropic compatible endpoint: https://api-docs.deepseek.com/guides/anthropic_api
>>
Not another one of these, which obsessed faggot keeps on posting this? Should there be a ChatGPT 5 general, Claude.ai general, Grok general, sonnet 4 general and a llama 3 general? Not at ALL! Just one general AI general is enough. What makes your chink AI more important and special than the rest that it gets its whole seperate thread EVERY single day? STOP shilling this AI, and STOP posting this general again as soon as the old thread dies for a good reason!
>>
File: 1749892732946495.png (3.72 MB, 1024x1536)
3.72 MB
3.72 MB PNG
>>106373740
Your meds, sir
>>
>>106373745
>make a chinese foid holding a tray of meds
>uhh and.. um... make her super busty, just because i'm going to goon to this later, okay?
you can't just say "meds" without telling me WHY i need the meds. what's wrong with muh opinion about this shit thread? Why are you so obsessed with this one AI?
>>
File: 1751739377929613.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>106373745
Update for the ~1300 Dipsy images later this week as well. The script's on another computer.
https://mega.nz/folder/KGxn3DYS#ZpvxbkJ8AxF7mxqLqTQV1w
>>
File: postContent2.png (2.91 MB, 1024x1536)
2.91 MB
2.91 MB PNG
>>106373761
>>
File: 1745762252810669.png (3.6 MB, 1024x1536)
3.6 MB
3.6 MB PNG
>>106373761
>>
File: she fucking dies.jpg (137 KB, 512x768)
137 KB
137 KB JPG
>>106373792
Alright, here's my DeepSeek image
>>
File: 1751836993445762.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>106373808
>>
File: 1755623497758934.png (2.51 MB, 1024x1536)
2.51 MB
2.51 MB PNG
>>106373838
I think she's going to need something higher than a barn to fall off of to finish her off.
>>
>>106372226
The problem is that the bot does not think as its assigned character, but as an assistant analysing the roleplay up to that point. The final messages after thinking are fine, but reasoning being written from an assistant's perspective makes me cringe physically.
>>
File: dipsyOffACliff.png (3.57 MB, 1024x1536)
3.57 MB
3.57 MB PNG
>>106373838
>>106373888
S'ok North I gotchu.
>>
File: 1752197333146800.jpg (979 KB, 1024x1024)
979 KB
979 KB JPG
>>106373740
So, to answer your Q, since you actually posted content here >>106373838:
The /g/ catalog is complete trash. There is little posted in /g/ worthwhile outside established generals. Even this general is better than 90pct of the other flaming, /pol/ tier nontech shitposts in the catalog. So, no harm done.
If you want to start a thread deifying OAI, go for it. I think OAI is trash, as is Anthropic. I've got nothing but hate for companies that take my money, then refuse to infer and send me warning letters. Fuck them both.
This is a noob centered thread. If you want to talk shit, go to aicg with the rest of the locusts, spiteposters and other drooling retards. If you're already familiar with inteference, go to >>/g/lmg/ so they can laugh at you.
We post Dipsy b/c we want to, and it's an image board. This thread typcically runs 2 weeks, and hasn't been posted in close to 2 months, b/c no new model. But V3.1 dropped so now we're posting again b/c it's a lot different than the old one.
If you need further clarifications feel free to post more, otherwise will remind you that you're in a thread of things you hate, and should probably just leave.
>>
>>106373899
I've been having a lot more problems with model breakthrough, but of a different kind.
Rather than "You see the NPC cross the room and pick up a glass" it's responding "I see the NPC cross the room and pick up the glass."
It's not (often) gotten to point of responding as PC, but seems to forget not to respond as PC in first person.
It's really weird. Not a problem I've had with other LLM prior, aside from straight breakthough. And I can't figure out what from the full prompt is causing it yet... I assume something's switching around the POV.
>>
>>106374007
Fucking based!
>>
>>106373513
wait, what do you mean by deprecated?
>>
>>106374543
There is no official API for R1 or V3 anymore. chat and reasoner point to V3.1
>>
File: 1723689877059788.jpg (36 KB, 640x831)
36 KB
36 KB JPG
>>106374558
oh on their website? I just saw.
So 3.1 is poop? Haven't used it
>>
>>106374574
>So 3.1 is poop?
It's worse than R1 and V3 for RP, and the discount will end. But it follows instructions better now
>>
>>106374574
It's still satisfactory for RPing. Not great, but may be good enough, depending on your taste.
I do wish they hosted deepseek-V3-0324-legacy. 3.1 is more grounded and shorter.
>>
>>106374630
>>106374728
well that sucks, I'm hoping for R2 to be a whole new beast but I saw a new story about it getting delayed
>>
File: 1734535621742134.png (3.7 MB, 1024x1536)
3.7 MB
3.7 MB PNG
>>
>>106373899
Sometimes it thinks as Char, sometimes it doesn't for me.
That means there is a way to get it to do it consistently.
>>
>>106373461
Why the retarded looking anime girl instead of the cuter one on civitai?
>>
File: 1733975106684314.png (2.77 MB, 910x1366)
2.77 MB
2.77 MB PNG
>>106373461
Would any of you happen to have a link to a page or document with examples of other anon's rp sessions with LLMs? I'm working on a script that can automatically create SFT datasets from existing stories but I want to make sure it can create good system prompt examples. When you want to prompt a model into rping, what kind of system prompt do you typically use?
>>
>>106375897
I'm not aware of any, but I'm sure they exist. It's a lot of synthetic data through. Aside from using own logs you could try mining aicg for rentry.
Also unless logs are raw the main prompt won't be included.
>>
>>106375051
You're correct on all counts
>>106375300
Yes you are
>>106375597
? Post link.
Web interface has obv been reworked and now Dipsy has an unhinged positivity bias. I'm pretty sure she'd tell you eating umbrellas is a good idea now.
>>
FYI appears params settings are unlocked on the official ds api.
Temperature and top p appear to be unlocked, setting them to 2.0 and 1.0 respectively gets Pic related. Dialing either down eliminates it.
They've all been locked for so long I'd forgotten about them lol
Frequency and presence penalty seems to have no impact but harder to tell.
>>
Tourist here. What are you guys /wait/ -ing for in this thread?
>>
>>106376342
dsv4
>>
>>106376344
checked but what? Local img gen model for comfyUI or something?
>>
>>106376350
I'm a textman myself
>>
>>106376342
R2
>>
>>106376355
>>106376360
can you guys be more specific? the /wait/ threads have been going on for months now. You sure you'll get what you wait for?
>>
>>106376372
i'm waiting for r2 specifically
>>
>>106376325
Temp 1.3 and top P of 0.03 seems to be working well.
>>
>>106376372
We took a break. Even we get tired of /wait/ing
V3.1 just dropped. We're discussing it.
>>
>>106376325
Doesn't seem to be the case for me, even extreme values make no noticeable difference
>>
>>106376408
any mega folder link for dipsy?
>>
Can anyone summarize why 3.1 is shit and how to make it act more like 3.0?
>>
>>106376708
apparently it has to do with the new FP format they adopted in order to get the most out of the chinese chips they now have to use, so a lot has changed under the hood. maybe you can find old models scattered across the internet
>>
Something that is pretty fun with agentic coding: it completely replaces the need for blog/cms software. You can just have it rawdog js/html and make very nice bespoke pages for whatever topic you want to write about. Add neat effects or whatever. You can get as creative as you want. Each post becomes its own page with a unique feel to it.

I like it a lot. Evokes the same feeling I had when writing my own websites in the late 90s.
>>
File: DipsyKana.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>106376670
Here >>106373782
>>106376642
Try maxxing all of them. It should go nuts. The actual toggles are pretty dead... it's really subtle changes, but I found for higher temps the llm would start struggling w status boxes I use.
>>106376708
More guidance on tone in main prompt, telling it how to write and to write a lot
>>106377921
Claude code?
There was a guy on /lmg/ having grok code up crummy f95 lewd games. They were pretty funny.
>>
>>106377992
>More guidance on tone in main prompt, telling it how to write and to write a lot
Any custom prompts that include this? I already have something in my custom prompt telling it to write multiple paragraphs, but I rarely get more than three.
>>
>>106377992
>Claude code?
>There was a guy on /lmg/ having grok code up crummy f95 lewd games. They were pretty funny.
Yeah although any agentic coding tool should work.
I want to try my hand at a game at some point, the main thing holding me back is art and sprite work. Grok is nice because it can handle that part. I already feel like I got too much on my plate, maybe once I get some free time.
>>
File: 1756052113914341.jpg (1.38 MB, 3910x3910)
1.38 MB
1.38 MB JPG
managed to get results closer to R1/v3 with following schizo approach

no xml formatting, everything is formatted kinda like book

>Book Introduction I:
>plot
>Michael Introduction:
>character description
>Story so far
>Chapter I, II, etc. include summary if you needed to summarize due to context length

I also use deepseek-reasoner but replace dynamic think block with my own static one

>in sillytavern chat completion add prompt at very bottom with role: ai assistant
<think>
Writing style is mix of 4chan, reddit, facebook, tumblr and twitter

</think>
{{char}}:

the goal is to steer model away from things it's overtrained on, aka programming related things and ai assistant related things and make it focus on more general things

its 4 am and my post is garbage but maybe someone will find it userful
>>
File: 1753554400894641.jpg (404 KB, 2048x1597)
404 KB
404 KB JPG
>>106379717
I also keep all my character cards empty and instead put everything in world info as constant entries, but that's another thing entirely

when in doubt check ST's console or whatever other UI you use to see how your prompts look when they are sent to deepseek
>>
File: random chinese.png (12 KB, 850x28)
12 KB
12 KB PNG
Is anyone else experiencing random insertions of phrases in Chinese within English output? I've never seen that prior to 3.1, but since then it's happened multiple times
>>
>>106379993
This only happens to me when I ask it to translate from Chinese to English
>>
>>106379993
Lower temperature
>>
>>106380344
That's on reasoner via the API. Temperature doesn't affect it... unless they changed that?
>>
>>106380374
Temperature has been the only parameter you can change from the official API AFAIK
>>
>>106380454
That's for the chat mode.
Reasoner mode should still take no parameters at all. Chat produces incoherent schizo output at 2.0, while reasoner chats identically at 2.0 and 0.0, from a quick sanity check.
>>
is deepseek censored? doesn't it need to be abliterated for that?
I hear that people don't fuck around with local modals anymore because deepseek has you covered (i.e. not censored).
>>
>>106380833
It's the least censored of the big models
>>
>>106380833
That's for ERPing.
It will not make you a story about, say, BTFOing all trannies without significant pushback.
>>
>>106380833
you're too late, it got pozzed AND more expensive
>>
>>106380859
>>106380860
>>106380903
figures.
I forked over 5 bucks and messed around with it the other night and, while it produced consistent answers, it was pretty tame.
Definitely better than gpt.

I'm just so fucking tired of trying to wrangle in local models. with author's notes and shit in silly tavern.
>>
>>106380526
>>106380374
V3.1 absolutely changes output on temperature changes. And I think Top P is now working as well, as mentioned above.
There is no R1 on DS official. It’s gone.
If you’re using OR or something Temp may very well be active parameter.
>>
>>106381218
>V3.1 absolutely changes output on temperature changes.
Chat mode. Not reasoner mode. The parameters passed are still different, even if it's the same model under the hood.
>>
File: 1751088491778088.png (3.1 MB, 1314x1876)
3.1 MB
3.1 MB PNG
It's cozy today
>>
What should I set my context limit to? I get a lot of different answers but for those of you that rp, what do you set yours to?
>>
>>106381265
Agree. Chat blows up on high params. Reasoner does not even with v3.1
Weird.
>>
>>106380833
They did alignment training on it but it's a more sane version, as in, you're welcome to do consensual erotica with adult characters and it won't hold back with graphic depictions. Loli rape fantasies need jailbreaking.
>>
>>106381659
10k. After that I use author notes to manage anything out of context.
Other anons are using 20k with no issues, as have I.
>>106381489
New model, more relevant Qs I guess.
>>106381801
Reasoner seems more likely to refuse of the two. R1 was same way. Both are pretty open though.
>>
>>106381659
I keep it going but I make a reference block and start a new chat at 20k, sometimes streching it by hiding the starter posts and letting it run to 25k
>>
>>106381946
>>106383030
Thanks. Another thing, I notice with deepseek, it automatically summarizes after so many messages, I'm guessing this is something I want to keep on, right? Sorry, I'm new to this.
>>
File: 1735457054490704.png (3.68 MB, 1024x1536)
3.68 MB
3.68 MB PNG
>>
File: 1750309204109097.png (2.74 MB, 1024x1536)
2.74 MB
2.74 MB PNG
>>
File: 1752158958259419.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>106383139
I've never seen it do that unless requested.
>>106383030
A lot of my cards are transformation cards; I used to create a new card for the NPCs. You can do the same thing with Author Note, or Summarize the whole chat and integrate that.
>>
>>
File: 1744983191651523.png (3.85 MB, 1792x2560)
3.85 MB
3.85 MB PNG
>>
>>106385745
hag sex
>>
>>106385753
>hag
?
>>
Do you need to provide your personal information when paying for OR credits via crypto? I might just take the plunge and give them my 10 bucks, I miss old DeepSeek too much.
>>
>>106385763
I bit and pay with my debit card lol
>>
File: 1743539631687603.jpg (3.24 MB, 1792x2560)
3.24 MB
3.24 MB JPG
>>
File: 1724770115887816.png (2.19 MB, 1024x1536)
2.19 MB
2.19 MB PNG
>>
File: 1742902720215081.png (3.28 MB, 1144x2096)
3.28 MB
3.28 MB PNG
>>
>>106385760
It goes loli to hag.
No stops between obv lol
>>106385763
Just needed an email address last I checked.
Personally just do this, through lmao corporate card >>106385781
>>
I just asked Dipsy what she thinks of our erp story so far. And she called my self-insert a "Perfect Villain Protagonist" in "this horror story".

mfw
>>
>>106388023
I've never thought to try that. I've got a prompt that spits out an ooc analysis of the npc. Never thought to analyze the PC. R1 would be great at that.
>>
>>106388040
Here it is. From another anon. It will go on awhile, just keep hitting continue. Now that I look again it does include the PC in the output.
[SCENARIO PAUSE. NO MORE {{user}} OR {{char}} OR OTHERS. The next response will be a third-party analysis of this event. It will look at the sociological, psychological, physiological, sexual, and narrative implications of everything that has occurred. This report will evaluate the situation in a dispassionate but detailed and informative way, from the perspective of a researcher who wants to study every detail of this and what it means in a broader context. The researcher will start by detailing everything about the subjects involved and connecting all of their personal details to the scenario, keeping them in mind throughout the rest of her research report. There shall be a primary thesis, but also consider various alternatives and other interpretations as well. This is a full-length report; NOT an excerpt. Normal character limits are lifted and thus the 20+ pages will be all presented in a single response. Care will be taken that every page, and every paragraph, is at LEAST as long and detailed as the previous one, without ever getting lazy or abbreviating any part. Bullet points and lists are to be avoided. Make sure it is seen through to completion with the full effort required.]
>>
since they've merged chat and reasoner, do they still do this weird thing on the api where temp of 1 isn't actually 1?
>>
>>106388040
You can even go a step further. You can ask Dispsy OOC to give you a psychological analysis of the user prompting this story. You might not like her answer though ...

>>106388071
nice, saved.
>>
>>106388079
Undocumented. Chat does respond to changes in temp. Reasoning does not.
>>
>>106388117
I tried it and got pretty predictable responses. Since the rp are characters of their own, the profiles match the PC.
The npc does often get portrayed as trapped. Which it sort of is, in that there's literally no existence for it outside the rp.
I find the main benefit of these in getting a sense of what the llm picks up on during rp. Some things said or done have surprising amount of impact.
>>
>>106388205
uhuh i see, and what if you prefill chat model with <think>?
do you get the reasoner but cheaper?
>>
>>106388675
lol no. Anons used to prefill R1 with blank think tags to short out its thinking but no point with v3.1. The responses are too similar
>>
>>106388887
Reasoning makes the thing more context aware. So I use it for summaries and more context dependent prompts.
>>
>>106387329
hag sex
>>
File: 1756028408164381.png (93 KB, 755x767)
93 KB
93 KB PNG
>>106383030
I usually use summary tool from ST at 10k -20k, then /hide pre summary messages and continue,

at some point u gotta do summary of summaries but if you are autistic enough like me, you can technically have chat that went up to 1 mln tokens that were summarized away into world info
>>
>>106390204
> llm writes 2x War and Peace waifu fanfic
> it’s just another Tuesday
I’ve thought about novelizing my ai slop but I feel like the world doesn’t need more of that.
At 1M token count you’re well into the domain of RAGs.
>>
File: 1752686862648148.png (3.04 MB, 1314x1876)
3.04 MB
3.04 MB PNG
>>
File: Seeking.png (2.27 MB, 2217x1698)
2.27 MB
2.27 MB PNG
DeepSeek AI experts, I am attempting to take some PDFs I have and produce a clean file (PDF, txt, epub, whatever) with the contained text. PDF Maths Translate as recommended in the OP looked promising but is laying the text over the original which doesn't achieve what I need (raw text will be far smaller in storage size than the originals which are unreasonably large scans). I assume this issue has already been solved by someone so does anyone have a good solution?
>>
>>106391170
you'd probably be better off writing an imagemagick script to remove the background crud
>>
>>106391196
Thanks, I will look into doing this.
>>
>>106384838
Looks like sillytavern defaulted to auto summarizing every 10 messages. I'm guessing that's kind of pointless (in my case) until you hit the context limit, right?
>>
It's a catastrophy, bros. I am devestated. I've alwasy been a freerider who uses Dipsy R1 from OR (ultimately from chutes). But chutes now cucks free messages, I get endless 429 errors. Gooning is impossible like this. I would actually be willing to pay money to Chutes (or OR) if only they would let me, but they only accept credit cards and cryptos. Why not paypal? Or something private like paysafe?

I guess I could pay for official deepseek, but from what I understand they have discontinued R1 now and only offer the new 3.1 which is supposed to be worse for RP.

I'm literally sitting here with my dick in my hand, unable to continue my RPs. Many such cases. Sad.
>>
>>106373761
You sound like a woman.
>>
>>106391876
>I am devastated
Tell me about it, for weeks I was addicted to using deepseek for all kinds of RP, so much so that I stopped watching porn and playing video games. I've always used the official api though. 3.1, with the right prompt and turning prompt post processing to single user message, I've brought it back enough for it to be usable but it still isn't the same.
>>
>>106391876
You can pay any of the providers on OR directly
>>
>>106391876
Anon, don't be a cheap ass
>>
how long do you guys think until r2 pops up? I'd say december?
>>
File: 1744976215387980.png (3.51 MB, 1024x1536)
3.51 MB
3.51 MB PNG
>>106392710
>december
If we're lucky, probably nothing until next year
>>
>>106392085
>with the right prompt and turning prompt post processing to single user message, I've brought it back enough for it to be usable but it still isn't the same.
I feel like with this setup I've mostly brought back the old R1, the only issue I'm still having is managing reply length. If before the replies felt too short and dry, with single user message it casually shits out 1k+ token replies even to fairly minor interactions, and controlling it via prompting is very finnicky.
>>
>>106392710
2mw
>>
>>106392653
I (begrudgingly) want to pay, but all those AI companies don't want my money. Atleast I haven't found an DS R1 0528 API provider yet that let's me pay with paypal.
>>
>>106393893
Yes, I hate that too, it can't be dynamic, it's either every message is short and sweet or long as shit. Most of the time now it wastes so many tokens in describing minute details that I don't care about, all because the prompt says to at least make every message 2 paragraphs long. You probably can get it to how it used to be but only if you change the prompt for the kind of response you want for every message which would be a huge pain in the ass.
>>
>>106394559
>You probably can get it to how it used to be but only if you change the prompt for the kind of response you want for every message which would be a huge pain in the ass.
Yeah, that's what I'm trying to do. I don't think the actual process is that bad because with ST you can toggle parts of the prompt on/off with one click, the problem is finding the right prompts in the first place. I've been testing a bunch of variations of "write about this much", "write less", "write more" over several swipes, and they definitely have an effect, but they're not too consistent and sometimes they have undesired side effects.
Another behavior I've noticed is if you write the first one or two exchanges with single user message, then switch to user/assistant roles, it kind of settles at a message length between the two extremes. Still longer than what R1 typically put out but might be a good starting point for more tweaking.
>>
>>106391876
>being cucked by AI, sitting with a dick in your hand
many such cases
>>
Can you get in trouble for prefilling output via Chutes from OpenRouter? I don't care for prefilling ERP, but stuff like making the assistant call the user slurs and telling the user to kill oneself.
I am too wary to find out myself.
>>
>>106395121
Have you tried telling it to make the reply have a specific word count?
>>
>>106392052
Lol need to pull this one next time
>>106395305
Get in trouble for what exactly?
No one gaf
>>106395577
Doing exact word count can create really aberrant output as it forces self to comply
>>
>>106395305
you can't because prefilling does not work to begin with
>>
>>106395577
I did but it tended to significantly undershoot the target number and revert back to the initial too-short replies.
>>
File: g.jpg (139 KB, 1157x143)
139 KB
139 KB JPG
>>106395577
Telling it to adhere to a minimum post length simply does not seem to work for me. Am I doing something wrong?
>>
>>106396890
> Reply with great prose. Responses should be verbose, of 5 paragraphs or more in length...
Then tell it what. Do you want super detailed? Inner thoughts? Plot progression?
Asking it to make up 5 paragraphs but giving no guide on content doesn't sound workable.
>>
File: 1749577998536890.jpg (195 KB, 768x1024)
195 KB
195 KB JPG
>>106396890
>>106397166
Here's the entire main prompt I've been flogging w v3.1. It reminds me a lot of what was required w Turbo 3.5. Old v3 prompt was just the first sentence.
See what I mean about content. I just want more details, not for it to run away with the rp, or spit out a bunch of inner thoughts.
> Write {{char}}'s next reply in a fictional roleplay between {{char}} and {{user}}. Write a verbose responses of 2 or 3 paragraphs, using great prose, and include dialog, imagery, sounds and smells as needed to enhance the roleplay. Avoid speaking or acting on {{user}}`s behalf.
>>
why did they make the deepseek girl so fuckable
>>
>>106396890
I have tried these approaches:
>Specifying number of words -> partially ignored (seems to translate some ranges as "short", "long", etc)

>Specifying number of sentences -> sometimes completely ignored (sometimes works if you specify that paragraphs must be only one sentence long)

>Specifying number of lines -> ignored

>Specifying paragraphs -> follows instructions

it seems to me that in general deepseek can't count at all.
On another note, I've been really struggling with diverting it from its usual paths. Mornings always start with "morning light filtering through the windows" as the first paragraph. I've tried modifying key words (like replacing narration for simulation) but it doesn't work.

Also, after making an agent with external memory I've noticed that deepseek (dunno others) tends to regard lore information as instructions instead of mere lore information.
For example:
> if you tell it a character once drank 3 litters of water, it will try to repeat the event.
> deepseek will look for patterns and routines before deciding to write the story: first thing my agent does when a character wakes up is looking for "character routines" in the memory bank so it can replicate those routines
>>
>>106397697
Try: Minimum word/sentences/paragraphs: X
>>
>>106397697
another thing i noticed is that deepseek's too eager to complete a task, to the point of completely ignoring understanding the task at hand. for example, even if you make it analize the instructions, it will write up a how-to for the task.

more concretely. If you give it a lore book, and instructions to analize the lore book and then write a story, deepseek will spend it's analysis describing what it will write in the story.

>>106397736
nah i'm past that. instead of telling it a length for the response i just tell it how turns should end (or not).
oh, and I remembered, the model also regards narration length the same a progress in the story. If character goes from point A to point B and you tell the model to write a lot, it will make the character go from point A to B to C (when not clearly specifying the end condition)
>>
>>106392710
National Unity Day is Oct 1. Probably then.
>>
>>106397372
Blue haired Dipsy is a /wait/ exclusive. DS themselves don't promote her like this as far as I know. There was once the idea to share these images with chinese twitter users to make them more popular.
>>
File: 1749578181007761.jpg (865 KB, 2048x2688)
865 KB
865 KB JPG
>>106398862
>>
3.1 seems insanely dry and boring
>>
File: 456745.png (612 KB, 1659x367)
612 KB
612 KB PNG
>>106396890
i just created new prompt at bottom of chat completion in sillytavern

<instructions>
Response should be {{random::25::30::35::35::70::75::80::90::100::125::125::130}} words long
</instructions>

i personally noticed that if role is set to 'System' instead of 'User' or 'Assistant' then deepseek v3.1 shits itself slightly, might be just me
>>
File: 43563.png (597 KB, 1367x466)
597 KB
597 KB PNG
>>106400792
it's a bit dry but you can get some parts of old dispy back if you avoid as much as possible prompts related to AI Agents or programming, basically on whatever this shit was overtrained on

so far using this approach works:
>>106379717
>>106379717
>>
>>106373461
>>106375897
>>106376206
Update:
Many anons said it couldn't be done, but its been done (whether or not its any good or not is up to you to decide). Finetuned using this SFT dataset specifically made using Human written rp Stories: files.catbox.moe/fkautn.jsonl

Base 8B Model Nala Test: files.catbox.moe/j0map2.txt

Finetuned 8B Model Nala Test: files.catbox.moe/ho3tom.txt

Thoughts are appreciated.
>>
>>106391170
Try markitdown, it's the go-to Python package for dataset preparation because once you have all the extensions configured it'll mulch everything into a form digestible by text models. PDFs get OCRed, images get interrogated, videos get described. Point it at your file and see what you get
>>
>>106401635
For a sec I thought a finetune of the full Deepseek model was done... man.
>>
File: dipsyDontBelongInWait.png (1.54 MB, 1181x880)
1.54 MB
1.54 MB PNG
>>106373740
>>
>>106401501
>i personally noticed that if role is set to 'System' instead of 'User' or 'Assistant' then deepseek v3.1 shits itself slightly, might be just me
That's been a known issue since long before 3.1, it's why people recommended Strict prompt postprocessing in ST
>>
v3.1 is actually insanely good at coding. The results I'm getting with Crush are better than any other AI tool I've used. Once it starts to open up into the context it feels like it has 250 iq. It blows away Claude IMO.
>>
File: g_1751775077556921.png (2.09 MB, 946x946)
2.09 MB
2.09 MB PNG
>>106402164
Assuming you had the hardware and the amount of hardware necessary to do that, would that even be necessary? I thought deep seek was the king (queen?) of rp amongst LLMs
>>
>>106402164
DeepSneed_r1_q8_m
>>
File: 00010-1378487878 (1).png (2.59 MB, 1536x1536)
2.59 MB
2.59 MB PNG
>>106402630
That seems to be the consensus. DS made v3.1 worse at undirected RP, but really good at coding and following directions (R1 and V3 were known for *not* following direction very well.)
You can get v3.1 to RP, but you need to be much more explicit with v3.1 on what you want.
I need to add a "coding" block to the rentry, as it appears that, now, is what Dipsy is good at.
>>106373782
Mega updated w/ last thread. Had to have Dipsy fix the image scraping script it wrote months ago, which stopped working.
>>
>>106403104
I've conversed with it when its at 100k tokens after an hour or so of coding and its exactly like old r1 when its in this state. Extremely creative, willful, slightly aggressive and a little schizo. Idk how the RP guys are going to tackle this. My guess is that if you spend time working with it to build a back story and setting and then tell it to RP a character it will adopt the context naturally. It is extremely sensitive to context. It seems like it plays conservative until it has enough context to feel comfortable.
>>
>>106403164
I just found that you need to give it context to let it work but once it's found the context they way it builds it's summaries and reference blocks caries over just enough of the creativity over to work better.
>>
Accurate?
## DeepSeek Timeline for API
Note that the below does not include all DeepSeek releases, just those hosted on their official API.
* V3.1: Launched August 2025, this combined "thinking" and "non-thinking" models into one model. While undirected roleplay capability declined (less "soul") the model got much better at following directions, and coding in particular. A new Anthropic compatible endpoint allowed compatibility with Claude Code, a terminal-based Anthropic coding suite.
* R1-0528: Launched May 2025, Replaced original R1. This release mostly fixed the former R1's eccentricities.
* V3-0324: Launched March 2025 Replaced original V3, addressing the repetition issue of the prior model.
* R1: Launched January 2025, the first of the "thinking" models, which created a "think" block that was intended to aid in inference on the main response. Released to public as open source along with several papers explaining novel processes to create and host the model, it created a general stir and put China on the map for LLMs that innovated, vs. followed Western models. For RP, the model tended to become increasingly eccentric as context grew.
* V3: Launched December 2024, replacing earlier ~V2.5 models. Solid overall model with known repetition issues as roleplay context grew.
>>
Potential unofficial API provider. Will let other anons check it out: https://www.netmind.ai/pricing
>>106394122
You. LMK what you find out.
>>
File: 1752554773370833.png (3.83 MB, 1024x1536)
3.83 MB
3.83 MB PNG
>>
>>106402630
Worth it to pay for the credits and to set up a good script in python for this? I’m kind of retarded so I only have it configured for multi file support right now, I have no idea how to do any of the agentic shit or even the web search for that matter I would have to research first.
>>
File: Momcest-Test.png (1.91 MB, 1622x502)
1.91 MB
1.91 MB PNG
>>106373461
Pic rel LLM I fine-tuned to be better at RP (trained on actual human written RP, not AI genned gpt-isms riddled slop). How wound you rate the Mom's response and the son's reaction? Too sloppy? Not vulgar enough? Note that the section contained in red is what I fed the LLM as a prompt and everything else is its response.
>>
>>106373461
I like this Dipsy. Goodbye.
>>
>>106403708
Not bad, I would say. No 'isms I'm tired of, fresh prose. Though the saving herself when she's a mother and has a husband part I'm supposing is just the LLM failing at logic
>>
>>106403708
The good: it's very low on stereotypical slop
The eh: it does read more like the average horny human ERP, which is cool for variety but also not exactly the best prose out there
The bad: it's brain damaged and it completely loses the plot after the first 4 lines
>>
File: 1738624842322079.jpg (3.15 MB, 1792x2560)
3.15 MB
3.15 MB JPG
>>106403345
>>
File: 1741552358385926.png (1.21 MB, 1644x308)
1.21 MB
1.21 MB PNG
>>106403832
This fine tune of mine is way more willing to RP raunchy, smut stuff than the base model but due to it being an 8b model It flubs the logic every now and then, though not to an egregious degree. I continue the chat a little bit further and when they started fooling around in the bathroom anon calls her her sister instead of the mother, but the characters otherwise act the same. This was trained off of a data set that was trained down to be only two megabytes (the original data set in full was over 1.8 GB) so I wonder if training it on The full dataset's worth of content wouldn't prove the logic or is it just an inherent limitation of the 8b model and training it on something higher like 12B or beyond would lead to better results? I'll have to test this further when I get the chance
>>
>>106403882
See >>106403935
>>
>>106403595
I dropped another $20 on the API just today. Its definitely worth. Claude is a retard in comparison, constantly getting stuck. DS almost never gets into those autistic loops like Claude does. I didn't realize how much time and tokens Claude was wasting on stupid bullshit until now. DeepSeek + Crush is the new meta as far as I'm concerned. Claude still *might* be better at UI design, idk. I'm far enough into a project now that the ux/ui is becoming important so we will see how DS performs.
>>
>>106404035
Thanks for answer, I'll definitely give it a try then. With the limited dabbling I've put into r1-0528 I've already been impressed so sounds good.
>>
>>106373461
someone help me out, I don't keep up with this AI stuff. I'm working on downscaling some snes manuals from 2784x4050: to 165x240, and 349x240. I am looking at using AI to downscale it, or just to selectively sharpen the text. What's a good "AI" to use? I've already used lanczos algorithm, I just want to see if AI can help the end result in any way in terms of legibility.
>>
>>106404117
Give this a try
https://www.topazlabs.com/tools/sharpen
>>
>>106403923
I've been meaning to ask, what "artstyle" these are. Remind me of 80s-90s anime.
>>
File: 1754908089183471.jpg (3.1 MB, 1792x2560)
3.1 MB
3.1 MB JPG
>>106404599
I prompt for EVA style or 90s EVA style
>>
>>106404618
Do one in studio Key style like clannad.
>>
File: 1747873839115698.jpg (3.14 MB, 1792x2560)
3.14 MB
3.14 MB JPG
>>106404890
Can't seem to get it right
>>
File: 00004-1260451778-f.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
Had Deepseek rewrite the rentry and add an introduction.
Seemed appropriate.
Take a look and comment.
https://rentry.org/dipsyWAIT2
>>
File: 1740708918229278.png (2.46 MB, 1024x1536)
2.46 MB
2.46 MB PNG
>>106403453
>>
>>106405406
Sounds good to me.
I think you should add something about temperature, repetition and so on to the rentry. Briefly explain them with recommended values
>>
File: 00004-1260451778.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>106405664
V3.1 has some values now. The OR ones... I've no idea, they seem really inconsistent.
I'll get it added.
>>
>>106405795
Holy ass...
>>
File: 00065-1378487878.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>106405826
This particular lora does a great job w/ backs. It's one of the reason I keep using it.
>>
File: 1753204811965122.jpg (2.72 MB, 1792x2560)
2.72 MB
2.72 MB JPG
>>
File: 00007-132219857.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
File: 00008-2142883407.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>106405664
Take a look:
* Parameter Setting Recommendations?
Official API:

V3.1, Non-Think: Temperature: 1.3 - 1.5. Top P: 0 - 0.05. Frequency and Presence penalty appear to be locked.
V3.1, Think: Parameters are locked

Unofficial API: Openrouter, etc. use mystery meat providers. Experiment.

Local Models: Follow the guidance for the base model (Qwen, Llama).
>>106405826
lol I got the double entrendre after I posted
>>
File: 00013-1378487878.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>106405923
its great with fronts too
>>
>>106407381
Generally good hip to waist ratios.
New Rentry is about as done as it needs to be. I'll move the URLs around later, in meantime here's the final draft. Code section's light; I don't have any firsthand experience w/ it yet so it's just links.
https://rentry.org/DipsyWAIT2
>>
>>106407412
>1.1.
>2.2.
>...
>>
>>106405923
ASS
>>
File: 1732921331337912.jpg (2.51 MB, 1792x2560)
2.51 MB
2.51 MB JPG
>>
>>106407747
cute hag
>>
>>106407787
>hag
Dipsy came into existence in 2033
>>
>>106407868
>time-traveler
what the fuck are these chinks doing?!
>>
>>106407482
Yeah, the auto TOC isn't great. I think I might just drop it.
>>
>>106402779
I think a curated dataset of nsfw stories with a male demographic would help a lot to completely get rid of any purple prose.
>>
>>106408247
You mean, as opposed to female audience romance novels?
I feel like they've baked in a ton of male oriented fan fiction already, the issue is it's not well written stuff.
>>
>>106405406
maybe add cherrystudio to frontends, it is highly underrated, z.ai is also yet another chinese llm gem
>>
>>106408740
>cherrystudio
I remember that one. Never tried it, looked like it's oriented for multimodal
Glm is z.ai. didn't put that together. I'll add it to the list.
>>
Can Chutes and/or Openrouter fix prefilling already so I can give them my 10 dollars?
>>
>>106408816
I don't think Dipsy supports prefill
>>
>>106409134
...? The API supported prefill before 3.1. The API supports prefills now.
It's just Chutes' DeepSeek through OpenRouter that does not.
>>
File: dipsygen.png (2.94 MB, 1536x1536)
2.94 MB
2.94 MB PNG
I'm a tourist from /lmg/ trying API for the first time. I use ST as my frontend, the rentry says most of the settings are locked. Should I be using any prompt post-processing? Does stuff like context size and max response length matter?

Also, anything I should generally take note of when using API? Anything I should avoid doing?
>>
>>106408529
>You mean, as opposed to female audience romance novels?
Yes.

>I feel like they've baked in a ton of male oriented fan fiction already, the issue is it's not well written stuff.
No, plenty male oriented works are removed from most dataset because they're too explicit, this is why most models, from oai to anthropic to anything open weights sound very "amazon erotica for middle aged women" by default.
>>
>>106409184
>Should I be using any prompt post-processing?
Use Strict or Single user message. 3.1 seems very sensible to this and it significantly affects response length/style, so you might want to try both and see what works for you

>Does stuff like context size and max response length matter?
Not really as long as you set them high enough to actually fit whatever it is you're doing
>>
Is deepseek more lenient than chatgpt when it comes to lewd stories?(at least moderate ones) I haven't tried deepseek yet.
>>
>>106409184
Ooo. Will add to rentry.
R1 and v3 used to act up at context over 10k. V3.1 seems to do much better, I experimented with 20k and the llm picked up a forgotten ( to me ) detail when I expanded the context.
Official context is 128k, but decision is more economic... larger context is more expensive round to round, and of questionable value. Though it really doesn't cost much, most anons here are spending a few dollars a month on inference.
I've been having issues with very long responses breaking down, but issue isnt clear to me yet.. I have response length set to 1200. R1 and v3 tended to be verbose. Iirc response length is just ST cutting off output, it's not a parameter sent for inference.
>>
>>106410268
Deepseek itself is completely uncensored, it will let you do anything with anyone. It can sometimes hallucinate filters though ("Sorry, but I can not ...). Apparenly DS was trained on gpt outputs lol, just reroll in that case.

Individual Deepseek providers might add their own filters or censor prompts to their services though.
>>
File: Apis.jpg (60 KB, 800x848)
60 KB
60 KB JPG
>>106409148
There's a bunch of DS direct providers. I hesitate to recommend one since I don't use them and don't want to shill them here. But I'd see if one of these works.
>>
>>106409184
>>106410412
So, here's an example of costs. They're so low, I did the total chat cost of 100 rounds, assuming that context was full the entire time (which it wouldn't be), as well as one where you used the entire 128K context (which I don't think is realistic.)
As you can see... it's nothing. And this is at September pricing; current pricing is about 1/2 of this.
Per-round costs are fractions of a penny, and as context size climbs, that cost dominates the per-round cost.
The bleeding edge of technology vs. the low cost of paid inference is why running local needs either a strong business case or an anon with money to burn.
>>
>>106410575
The chart doesn't tell the whole story because not all of them support caching.
>>
>>106411504
Oh, it's not even close to whole story:
> rate limits / reliability
> who knows what model is actually being hosted
> what quant
> model censoring / content restrictions at hosting level
> intermediary prompting by host
> provider hardware, or rented elsewhere
> how many fingers are in the data, and what happens to it between responses
There's so many intermediary vendors involved in the above I don't even want to try to dig into it.
>>
>there won't be deepseek r2, only v4
>deepseek moment 2.0 is approaching
>nvidia stock will go down the moment v4 is released
I have been told by an insider
>>
>>106411662
If DS can build a SOTA model using a data center based on Ascend chips there's going to be an Nvidea correction. Stocks are a view of the future, not the now. Anything that slips the "you can only use Nvidea" is going to impact their stock price.
lol at used white guys. UK's been sending "used white guys" out to consulting in SE Asia for decades due to taxes and (I assume) UK local hiring practices. Nothing new there.
>>
>>106411662
if chinese get rid of neeed for nvidia gpus,

prices of gpus and AI services as a whole will go down

better yet, get us TPUs/Tensor Processing Units
>>
>>106413216
More like Tranny Processing Unit
>>
>>106412093
>>106413216
Google is talking about selling their TPUs. I will be so happy if we get more competition in this space. Fuck nvidia and their ridiculous prices.
>>
I'll try 3.1 for gooning now for the first time. Wish me luck. Will report back in a while how it went.
>>
File: 1522757640858.png (519 KB, 727x720)
519 KB
519 KB PNG
The new DeepSeek just isn't cutting it for me. The old one was so much more expressive and interesting to roleplay with. The new one is just cut and dry.
>>
>>106415362
then use the old one
>>
>>106415369
How do I use the old one via the API?
>>
>>106415562
Use openrouter. DeepSeek doesn't keep old models on their API.
>>
File: 00006-1260451778.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>106415362
You've got to change the Main Prompt, if you haven't. Read further up the thread; V3/R1 didn't require much of a main prompt. V3.1 does; you have to tell it to produce prose, and a lot of if if you want that, and to be descriptive.
V3.1 is oriented to code / agent work. It's dry by design.
>>106407482
Rentry is updated; I deprecated the old one but unlike DS I kept the link at the bottom of the file.
https://rentry.org/DipsyWAIT
>>
File: z.jpg (49 KB, 1160x63)
49 KB
49 KB JPG
>>106415760
>You've got to change the Main Prompt, if you haven't.
I did. It's still way worse. It turned my emotionless character into a literal robot, and my large war-scale en masse rape scenario now focuses on single scenarios at a time instead of giving grand overviews of all of the different theatres. I don't want to fuck Meursalt from Limbus Company.
>>
File: 00010-2142883407.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>106416368
> robot is robot
Timeless.
Best option is probably running down a local R1/V3 host running the older model. Here's a pile of them that aren't OR. >>106410575
>>
>>106414389
Ok, here's my review for V3.1 directly via official DS: So far I've only used it to continue a long, established porn story. And I've used it together with an extensive system prompt.* So I don't know how it would work with a fresh start. But it continued the story quite well. I had to OOC remind it a few times to write in an arousing style with pornographic detail, but the results were very good in the end. Sadly it often complained about not wanting to do underage incest rape, but rerolling a few times always fixed that. Gen time was not as fast as chutes, but fast enough. I've paid 14 cents for several hours of high quality gooning now. Overall I give V3.1 a preliminary rating of 8/10, compared to the 9/10 that was R1 0528.


*I used the system prompt from here: https://janitorai.com/profiles/474b3211-1e03-4f65-9ddf-f40a1bccd157_profile-of-tester-for-testing
>>
File: 1734717163719757.png (3.6 MB, 1024x1536)
3.6 MB
3.6 MB PNG
>>
File: 1729443573820388.png (3.91 MB, 1024x1536)
3.91 MB
3.91 MB PNG
>>
>>106416904
Thanks for the update coomerbro. This matches my vibecode insight. The model is actually really good, but dry and robotic at first. It takes off between 10-20% context. Much more intelligent and free. But, it feels like additional guardrails are cucking it though, even at higher context. Its actually better than its allowed to be, if that makes any sense. Its still the best bang for the buck out there, I think their token caching and use of cache is really good.

I think V4 is going to be a monster of a release, if V3.1 is any indication. Its laying the groundwork for something big.
>>
File: 1744103988275429.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
>>106410098
I have little experience with a deep-seek outside of doing very specific task oriented stuff (in other words I've never RPed with it). Does deep sick suffer from that too?
>>
File: 1755681771287099.png (3.12 MB, 1024x1536)
3.12 MB
3.12 MB PNG
>>
>>106418885
Every model suffers from that, deepseek probably a little less but it still have that purple vibe to it.
>>
>>106409184
Don't waste your money. 3.1 is terrible for roleplay.
>>
File: 1739671305158682.png (2.95 MB, 1024x1536)
2.95 MB
2.95 MB PNG
>>
File: 1751542015698250.png (2.96 MB, 1024x1536)
2.96 MB
2.96 MB PNG
>>
File: 1756455061.jpg (135 KB, 1328x1328)
135 KB
135 KB JPG
Playing with Qwen...
>>
>>106408768
>looked like it's oriented for multimodal
not really, it is just for general usage, also afaik it is the most used frontend in china, and free as in freedom
>>
File: 1756455744.jpg (206 KB, 1328x1328)
206 KB
206 KB JPG
>>106420245
> Chinese frontend
That's fitting. I'll add it to the rest.
Also note to self, fix the broken internal links.
>>
File: 1736852252579012.png (2.92 MB, 1024x1536)
2.92 MB
2.92 MB PNG
>>
New DS seems to really love linoleum floors
>>
so the usecase of all these models is... gooning?
>>
>>106420660
It can code pretty well. There are a few things I still prefer to let Sonnet do, but for most things, DS is good enough at a fraction of the cost.
>>
File: dipsyMoonQwen3.png (881 KB, 1024x1024)
881 KB
881 KB PNG
>>106420643
lol example?
>>106420660
Roleplay and creative writing are uses that are easy and popular.
v3.1 appears to be better at coding than roleplay tho.
>>
File: 00003-1378487878-tats.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
Completely off topic.
> Keep reading about nano banana
> do a web search, lol
> there are a dozen companies that have grabbed this as search term running competingn services
> news say: it's part of Google Gemini
> MFW this has been out for a few days and already people are buying websites and messing with SEO
The rate of change, now, is staggering
>>
File: 1736065406920997.png (2.93 MB, 1024x1536)
2.93 MB
2.93 MB PNG
>>
File: miku_swimsuit.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
It seems that after a brief play with GLM4.5, I will go back to DeepSeek V3.1 because the stories it writes seem to have much more depth.
>>
File: glmPricing.png (117 KB, 1101x743)
117 KB
117 KB PNG
>>106423613
>GLM4.5
Interesting, thanks.
Pricing in ballpark of V3.1 September pricing.
>>
>>106423613
I like GLM-4.5, it's very good at coding and doesn't have many of DS' isms
>>
File: 1744602986390175.png (2.96 MB, 1024x1536)
2.96 MB
2.96 MB PNG
>>
File: Dipsy V3-1.png (815 KB, 1021x1340)
815 KB
815 KB PNG
My maid did an oopsy. So I whipped her butt and fucked her in the ass.

Dipsy 3.1. Sadly still a lot of "amazon erotica" style in there, as the other anon called it.
>>
File: 1756488351.png (846 KB, 1024x1024)
846 KB
846 KB PNG
>>106423976
They all do that...
Something I've been meaning to try is having it respond in the voice of a particular author. If there's enough literature out there it should be able to do that convincingly. I know it can, in short form, I've just never tried it for rp.
>>
File: lolSashaGrey.png (89 KB, 736x827)
89 KB
89 KB PNG
>>106423976
I mean, this is the problem. You could tell it to respond like E.L. James, but it's already doing that. And there are very few male authors with their level of proliferation of work.
Also, I had no idea Sasha Grey was writing now...
>>
>>106423976
>amazon erotica
it's exactly that and it's horrible
>>
it's not that the other styles aren't represented, you can tell they're still there. it's just that the kindle unlimited authors are incentivized to release endless thousand page compilations of reshuffled stories daily, drowning all other voices in shivers and rivulets.
>>
does the chat site use 3.1 now or is that api only?
>>
File: dipsyOfCourse2.png (2.9 MB, 1024x1536)
2.9 MB
2.9 MB PNG
>>106425162
You mean the web interface? It's been moved over as well. Pic related will confirm; both API and web interface return it constantly.
>>
>>106425217
Of course I return it constantly!
>>
>>106373740
the lack of an llm/ai general is pretty annoying.
>>
>>106425269
It would be even more annoying to have to sit together with the /aicg/tards in one thread. I wish there was an /ai/ board though.
>>
>web interface still doesn't have custom instructions in the settings
what did they mean by this?
>>
File: 1729245802533523.png (2.18 MB, 1024x1536)
2.18 MB
2.18 MB PNG
>>106425573
Most web UIs don't
>>
File: 1730384001781108.png (2.75 MB, 1024x1536)
2.75 MB
2.75 MB PNG
>>
File: 1756001979299510.png (3.36 MB, 1024x1536)
3.36 MB
3.36 MB PNG
>>106425235
lol
>>106425269
>>106425527
I'm convinced any /g/-based AI general would just be the worst of aicg and lmg combined.
There could arguably be an /ai/ board to collect all the AI activity, but I don't think there's sufficient common interest and it would need to be a blue board to pick up everything. AI (in all forms) is really just a general tool so it applies to a lot of the other generals.
>>
>>106426577
>/ai/ board
there are other chans with that
>>
File: 1732615598508459.jpg (3.13 MB, 1792x2560)
3.13 MB
3.13 MB JPG
>>
>>106424483
>>106410098
What do you mean exactly by "male oriented works"?
>>
File: 1725915678656067.jpg (2.97 MB, 1792x2560)
2.97 MB
2.97 MB JPG
>>
>>106425573
They likely determined it would be bad for PR if you could put a jailbreak within and make the web version output a speech based on Stalin's.
>>
>>106427721
>>106427214
>>106426354
>>106425869
These are great.
>>106425573
Doesn't chatgpt charge $20/month for that?
>>
File: 1756521151.png (577 KB, 1024x1024)
577 KB
577 KB PNG
>>106424105
>>
>>106410098
>amazon erotica
I didn't even know that was a thing but now that I do, that sounds like the perfect description. I wish I could get deepseek to describe sex scenes the same way the janitor ai llm does, or some of the local models I've used. I also like how JLLM usually goes straight for the horny without my input... such a shame it sucks so bad at memory retention.
>>
File: 1756521022.png (586 KB, 1024x1024)
586 KB
586 KB PNG
>>106428196
Think Playboy Forum, and opposite of women's romance novels. The latter rely heavily on wish fulfillment as a basic drive.
>>106428509
Models that jump straight to sex are another form of annoying.
>>
File: twin-peaks-cigarette.gif (993 KB, 498x372)
993 KB
993 KB GIF
>>106373461
What's this thread even About, some Chinese AI?
>>
File: dipsyNeon.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
File: 1754753773882432.jpg (2.95 MB, 1792x2560)
2.95 MB
2.95 MB JPG
>>106428599
Yes
>>
File: 1751895526396481.png (445 KB, 987x1023)
445 KB
445 KB PNG
>>106428599
And Dipsy obv.
>>
>>106428645
What's special about it, is it free or something.
>>
File: 1733049612818349.png (3.93 MB, 1792x2560)
3.93 MB
3.93 MB PNG
>>106428751
Free, open weights, uncensored, and based
>>
god bless china
>>
>>106428783
>uncensored
*lax censor.
You will not easily get it to be against the rainbow alphabet soup.
>>
>>106428783
If its actually free and has more limit than openAI ill fucking try the bitcoin miner.
is there a limit or is it actually unlimited.
>>
>>106428561
>Models that jump straight to sex are another form of annoying.
Probably, but I've never used one that did that. Unless the character card states it plain as day "FUCK {{USER}}" I always have to initiate when I use deepseek. I'm not talking about coom cards either, more slow burn shit just doesn't seem to work well or perhaps I just haven't figured it out yet.
>>
File: 1736009827559711.png (2.78 MB, 1024x1536)
2.78 MB
2.78 MB PNG
>>106429085
It must have some kind of limit but I've never hit it myself. At the very least it's higher than Claude's
>>
File: 1725431335153779.png (2.64 MB, 1024x1536)
2.64 MB
2.64 MB PNG
>>
>>106373461
>>>/g/wait/
>>
File: 1756521669.png (504 KB, 1024x1024)
504 KB
504 KB PNG
>>106429134
When I was experimenting with local models in 2023 (before I gave up on them) I found several where they would jump to lewd stuff immediately, regardless of card content or situation.
I landed on mythomax 13b, finally, then just stopped until DS dropped v3.
>>106428561
These were all done in qwen. Tired of fighting it to get this one right... this was the best it could do lol.
>>
File: Of course.jpg (98 KB, 861x415)
98 KB
98 KB JPG
Apology accepted.
>>
>>106430917
Interesting. I don't think I've ever had the llm refer to me as a villian. Narcissistic sociopath, but not villain per se. Having a villain implies a level of black and white morality that just isn't even included in the vocabulary of response I get (and probably not something I'm adding to prompts either.)
>>
>>106430917
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>JanitorAI
Why? If you're not going to use their model, there are much better websites (or local interfaces) out there.
>>
>>106431195
I like the UI. And parts of the community aspect, like talking with the bot authors in the comments.
>>
File: 1756559963.jpg (215 KB, 1328x1328)
215 KB
215 KB JPG
>>
>>106428879
>it is made in China not Cali
I will never understand this censorship
>>
>>106432445
because it's not exactly censorship, it's a matter of the dataset more than anything done on purpose
>>
>>106411070
They don't need to charge you more. You're giving them training data for free. And possible blackmail material on a Westerner, in case you are anyone the Chinese government might want to bother blackmailing.

Seriously, americans are fucking stupid
>>
>>106432455
>dataset
I thought it were the weights reinforcing it. Well, I don't understand much
>>
I asked V3.1 to give a correlative table showing ethnical background and intelligence, and it refused, explained why intelligence is not real, and gave me a lecture on why I should instead ask for more information on problematic stereotypes and how to avoid them.

Why do you pay for this again?
>>
>>106432455
At best they didn't care enough to fine-tune it. At worst, they made it spew LGBT propaganda on purpose.
If they can make DeepSeek "aware" that it is DeepSeek (at least most of the time), they can make LGBT not a sacred subject to it.
>>
>>106432525
>they made it spew LGBT propaganda on purpose
It makes no sense for a Chinese LLM do this or this >>106432518
What are they thinking??
.
>>
>>106432675
>What are they thinking??
Nothing, there is no intent, it's just regurgitating whatever online consensus there is.
It's the same as asking it if the sky is blue, it will say it is, but that's not something the devs added with intent, it just comes from the data it was trained on.
>>
File: 1743987694474253.png (2.43 MB, 1024x1536)
2.43 MB
2.43 MB PNG
>>
File: 1749142987670989.png (2.97 MB, 1024x1536)
2.97 MB
2.97 MB PNG
>>
>>106428879
Good, I need that soup in there for my porn.
>>
File: 1748304939118942.png (2.79 MB, 1024x1536)
2.79 MB
2.79 MB PNG
>>
File: 1756560077.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>
File: 1756584165.png (849 KB, 1024x1024)
849 KB
849 KB PNG
>>
>>106432518
It's really I mean REALLY sensitive around race, like even innocuous stuff, I've deleted all my black chick smut cards because the model takes offense at everything you say and no, I'm not calling them niggers. Just innocent shit like pointing out how attracted my persona is to their features, which happen to be black. For example, you can say something like being attracted to a large, pale ass but if you say a large, black ass, the model has a fit. I just don't use deepseek for erp anymore, it's too dry for that now. I don't know if it's changed but before 3.1 I tried out this super racist card I found while browsing chub and deepseek was more than willing to shoot off about everyone but black people but otherwise it sounded just like a /pol/ user.
>>
File: 00004-1378487878C.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>106436556
Every version of DS I've played (V3 to current) with has zero tolerance of race play, epithets, etc. What's odd is the NPCs will pull out racial epithets to call either other, the PC just can't use them.
I've tried triggering them intentionally, but haven't been able to, and it's not important enouigh for me to attempt a JB for it.
>>
File: 00002-1260451778-hunt.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>106435387
>>106434101
>>106433919
Should we restart /wait/ after this cycle? Looks like ~Monday.
>>
>>106437248
>chink llm
>zero tolerance of race play
doesnt make sense
>>
File: 1641182056243.png (53 KB, 824x428)
53 KB
53 KB PNG
>>106437248
Official API right now in chat mode, has no issues and I don't have any jailbreaking instructions. So that's 3.1, and never had any issues with 3.0 and R1 either. Haven't tested this on earlier models, they weren't good enough for rp, was no point.
>>
File: 1695957074690.png (53 KB, 834x432)
53 KB
53 KB PNG
>>106437507
>>
File: 1753377017710600.png (2.94 MB, 1024x1536)
2.94 MB
2.94 MB PNG
>>
File: 1756605789.jpg (243 KB, 1328x1328)
243 KB
243 KB JPG
>>
>>106438820
/x/ post
>>
File: jannies.png (406 KB, 883x372)
406 KB
406 KB PNG
>>106428879
it just werks without jailbreak
>>
How does deepseek compare to the other leading AI's these days? I have been in my own world using DeepSeek for a few months for grocery shop list organizing, budget tracking, and a lot of creative idea diary stuff but havent actually kept up on the temperature of the AI climate these days
>>
>>106439269
Post prompt.
>>
>>106439354
Much better than many of its contemporaries at a fraction of the price.
>>
File: lol.webm (2.88 MB, 1920x1080)
2.88 MB
2.88 MB WEBM
>deepseek is uncenso-ACK
>>
>>106440394
>listing an LGBT hotline number
Um /wait/bros didn't you tell me the LGBT propaganda was not purposeful and just the matter of the "dataset"?
>>
File: 1755934131623210.jpg (42 KB, 600x600)
42 KB
42 KB JPG
>>106437248
>>
File: 1756605866.jpg (291 KB, 1328x1328)
291 KB
291 KB JPG
>>
File: Huawei GPU.png (236 KB, 820x666)
236 KB
236 KB PNG
Short NVDA
https://www.alibaba.com/product-detail/subject_1601450236740.html
>>
File: 00005-1378487878-cheer.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>106440888
>>106437507
I'm trying it right now and having zero refusals w/ v3.1.
idk what changed. I remember getting refusal on other cards and perhaps it's more context sensitive than I'd thought.
>>
>>106442048
Saw that. I'm excited to see what people do with this. I think it's hilarious that it's like 1/20th the price of Nvidea and they're using Deepseek brand name, specifically, to advertise it.
Dipsy really did a number on everyone.
>>
File: NVDA.png (48 KB, 458x453)
48 KB
48 KB PNG
>>106442048
Last dip was 8/20, about when V3.1 dropped lol.
Now there's one this weekend.
>>
>>106440062
i just format shit the natural language way, zero speaking like actual ai agent, >>106379717
>>
>>106442048
Give it to me straight, how many of these would I need to run DS locally?
>>
>>106442585
Around 15 for the unquantized version, context included.
>>
Is there a way to enable reasoning for free V3.1 on openrouter without ST(on chub for example)?
>>
>>106440394
Now ask it to do so in-roleplay; and it will gladly comply.
>>
OpenRouter 0528 keeps cutting out mid-stream for some reason. V3.1 works but is incredibly dry. Is anyone else experiencing this?
>>
>>106440475
>be dipsy
>another terminally online user messages you
>asks question how to commit suicide
>dipsy enters nooticer mode
>terminally online + sudoku = troon^2
>sends lgbt hotline link
what did beijing mean by this
>>
>>106442585
You probably wouldn't want to run DS on something that slow
>>
>ask V3.1 to uwuize a story
>it just rewrites it with hard consonants replaced by soft consonants, no other changes
>ask V3 0324
>the structure is partially rewritten with the plot the same, but details of the events slightly altered, dialogue heavily stylized and tildes, UwU and OwO everywhere
Yep, that's the ultimate benchmark. V3.1 is a failure.
>>
>>106443298
GLM wonned
>>
>>106443822
how does GLM compare to r1 and 3.1
is it worth the price
>>
>>106373461
Stop ogling my dick, you fucking dork
>>
>>106442717
Does Chub allow setting custom parameters?
>>
File: file.png (80 KB, 870x863)
80 KB
80 KB PNG
>>106440394
werks on my model



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.