Anonymous
12/18/25(Thu)21:45:54 No.107597835 >>107597102
DOES ANYONE HAVE A SUGGESTION FOR A SFW ROLEPLAY DATASET?
Since lmarena doesn't allow nsfw...
The way I was planning to abuse the arena for multi turn data was to first get some conversational multi turn datasets like openassistant or roleplay stuff. The quality doesn't really matter since I'm going to be generating the responses fresh anyway.
Cut off the conversation within range [0, n] for any value of n from 1 to the maximum length.
Flatten the conversation into
User:
...
Assistant:
...
User:
...
Assistant:
Then generate the response to that which will be message n+1.
Even without the right chat template Claude understands perfectly well it's supposed to generate the next turn. I haven't been able to visually distinguish any difference with the right way you're supposed to do it with one message per turn.
In fact with the lower quality models, if you add a line like "Analyze the following conversation/roleplay scenario:" and then the log in that format, it will miss the fact it's supposed to analyze it and will default to continuation mode.
The only issue is most of the time it will also generate the message for the User and continue the conversation for a couple turns, but maybe that's a good thing for distillation. When doing inference you can just cut off the message at "\nUser:\n".
As for my own logs, maybe as a final pass, but first I want to get something more systematic. My own logs are a mix of Opus and Sonnet from web frontend, API, Claude Code proxy, a few messages from other models, etc.
I'm finding it easy to get data from the arena. I'm not sure whether I should try to automate the IP cycling on my phone or join one of those residential proxy botnets.