[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107885702

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107887524
>>107887529
So they didn't give a shit about Qwen Image and Qwen Image Edit, but Z-image was enough to spook them?
I guess it makes sense
>>
>>107887547
File Not found
>>107887541
>>107887540
I checked at CivitAI and there are barely any LTX2 Loras.... are you trolling ?
>>
File: 1750333742531075.png (1.44 MB, 848x1216)
1.44 MB
1.44 MB PNG
>>107887537
the girl on the left is wearing white gundam armor.
>>
>>107887554
are you slow or something
>>
File: 1763462820275090.png (1.38 MB, 848x1216)
1.38 MB
1.38 MB PNG
>>107887546
seems to work
>>
File: 1758054221086937.png (53 KB, 200x200)
53 KB
53 KB PNG
>>107887559
Its been a week since i bought 5070ti and i download Wan2GP but people said Comfy was better and i download it but i have no idea how to use it. I waste like 100gb and still dont get it
>>
Any local model able to create nice music locally? I miss my udio's catchy kpop gen abilities

https://files.catbox.moe/tij84e.mp3
https://files.catbox.moe/ylh0uh.mp3
https://files.catbox.moe/h18hrp.mp3
>>
>>107887535
based collage
>>
>>107887565
very conservative for small panties
>>
File: 1745832634552343.png (1.5 MB, 848x1216)
1.5 MB
1.5 MB PNG
>>107887565
the girls are dressed as hatsune miku.
>>
>>107887568
keep with it youll start to learn
also yes that other anon was trolling
>>
>>107887570
https://github.com/HeartMuLa/heartlib?tab=readme-ov-file

This came out yesterday and has k-pop as a tag, but if I'm being honest. It's hard to control and very hit or miss. The clarity itself is pretty good though. Also takes like 3-5 minutes per gen.
>>
>>107887570
catchy it is, I don't think anything local can do that yet
>>
>>107887568
>>107887568
anon im gonna help you out of pity, people here are too evil for people like you

Get this workflow: https://civitai.com/models/1824027/wan-22-aio-t2v-i2v-s2v-t2i-mmaudio-4-6-stepsloop-svi-video-extendwanvideowrapper-workflowk3nk
and download the nodes and models it tells you to also check this for loras
https://civitai.com/user/K3NK/models?sort=Newest

You don't need to use the actual workflow if it is too complex, but put it in your comfy so it at least tells you what models and stuff to download
>>
File: 1766479750833096.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
replace the text "DEUS EX" with "LDG General". replace the man with sunglasses with hatsune miku wearing the same sunglasses.

it did this prompt better than qwen edit did, I remember trying this one.
>>
File: 1747856660009803.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>107887600
>>
>>107887600
damn nigga this is crazy
>>
>>107887570
it makes irrationally angry that the model behind this quality will never be released
>>
File: 1751615103401558.jpg (5 KB, 250x250)
5 KB
5 KB JPG
>>107887598

Thanks bro. Its been 3 times i uninstall and reinstall ComfyUI now
>>
>>107887610
sudo is better
>>
File: 1763185167171884.png (835 KB, 5360x3137)
835 KB
835 KB PNG
>>107887584
It's supposed to be better than udio, somehow I doubt it but I'll try it, also :

> Release the HeartMuLa-oss-7B version.
Hopefully it'll be good
>>
>>107887619
the superadmin model
>>
File: img_00065_.jpg (671 KB, 1376x1824)
671 KB
671 KB JPG
>>
>>107887612
Use chatgpt with thinking enabled for most common question on how to install comfy and how it works.
>>
File: 1759520505198026.png (537 KB, 1104x928)
537 KB
537 KB PNG
kek

change the black man on the left into a jewish rabbi wearing a yarmulke.
>>
File: 1768629580205425.png (3.03 MB, 1632x928)
3.03 MB
3.03 MB PNG
my champignon wife
>>
>>107887626
>It's supposed to be better than udio
It's not.

All I can say is sometimes the 3B outputs a bop then goes back to being shit. Also I'm willing to bet a stick of RAM that 7B never releases.
>>
File: 1748242895535112.png (2.5 MB, 1152x1312)
2.5 MB
2.5 MB PNG
1girl bros?
>>
File: 1751757223385443.webm (2.96 MB, 1874x666)
2.96 MB
2.96 MB WEBM
>>107887626
>>107887584
>>
File: 1759443664365187.png (690 KB, 1104x928)
690 KB
690 KB PNG
>>107887636
give the black man on the left a baseball cap, white t-shirt, and blue jeans. he is smoking a joint.
>>
File: 1752614773074239.png (3.18 MB, 2896x4096)
3.18 MB
3.18 MB PNG
why didn't you put this in the collage?
>>
>>107887626
It won't beat the mp3 you linked here >>107887570 anon, I know, I tested it, it doesn't sound nearly as good.
>>
File: 1747177242182925.png (2.61 MB, 1280x1184)
2.61 MB
2.61 MB PNG
>>107887652
is she pulling a stallman?
>>
File: 1759314070994625.png (1.16 MB, 1104x928)
1.16 MB
1.16 MB PNG
change the location to a sunny beach.
>>
>>107887648
The sound of silence.
>>
File: 1765235285156769.png (2.53 MB, 1632x928)
2.53 MB
2.53 MB PNG
>>
>>107887648
Funny webm
>>
File: 1747798070842528.png (2.51 MB, 1632x928)
2.51 MB
2.51 MB PNG
I wonder if our resident frieren porn slopper is happy about new episode
>>
>>107887663
who watches that again
>>
>>107887641
>>107887653
OK, back to waiting for a local competition that can't be destroyed again
>>
File: 1766913449121986.png (2.38 MB, 1632x928)
2.38 MB
2.38 MB PNG
>>107887668
I do, I consume around 30~ shows per season.
>>
>coma for 3 years
>sdxl still the best for 2d goon
goddam
>>
File: 1739054407712040.png (2.5 MB, 1504x1024)
2.5 MB
2.5 MB PNG
damn my gens are coming out gigaslopped today. sad.
>>
>>107887677
use chroma
>>
File: 1757942727340196.png (2.03 MB, 1056x1440)
2.03 MB
2.03 MB PNG
>>107887678
I don't want to sit through chroma's gacha lottery + detaling steps. All my gens are basically 1-shot
>>
We're like a week into this shit.

>reported cp and it took over 24hours to get taken down

Jeet staff was a mistake.
>>
File: img_00086_.jpg (521 KB, 1520x1152)
521 KB
521 KB JPG
>>
I've trained ZiT celeb lora (on deTurbo) and while the likeness comes out well the hands are often chroma tier flesh lumps. Is this a sign of overtraining? Or dataset problem, should I just crop the images to leave out hands or something? Or does this simply happen because it's trained on dedistilled Turbo instead of base?
>>
File: 1764704962625934.jpg (81 KB, 850x873)
81 KB
81 KB JPG
>>107887535
I know Flux cant do NSFW
But can Flux2 Klein edit NSFW images like background and costume they wear or something ??

Image for easy (You)
>>
>>107887674
>sdxl + ipadapters
>chroma
>wan2.1/2.2
>maybe qwen edit here and there

im set and don't need any new models, unless much faster versions release without quality loss (IM LOOKING AT YOU CACHEDIT)
>>
What the hell is going on with base? It's not distilled for low steps, but it's still distilled from the larger flux 2 presumably, yet it uses CFG > 1, and yet you are supposed to leave negative prompt empty otherwise it deforms your image... WTF is this?
>>
>>107887680
how original
>>
>>107887717
which chroma though
>>
Has the slow motion curse of lightx2v been broken yet?
>>
>>107887714
I just tried background and it did without changing the naked lady.
>>
>>107887739
>slow motion curse
You mean wan? Slow motion was never an issue for light.
>>
>>107887739
Isn't there a 3 sampler strategy where you do like 4 steps high model without lora and then high model with lora, and then low model with lora?
Never tried myself though.
>>
>>107887674
We're at the dawn of a new age though, either Flux klein 4b will dethrone XL, or Z Image if it's ever released. Coomers will be eating good in 2026
>>
>>107887747
lightx2v 4step distillation loras are the root cause of the slow motion it's known for
>>
>>107887753
lies
>>
>>107887742
Also at least the 4b one looks like it sucks for backgrounds. I am just getting slop.
>>
>>107887753
Oh. I got my names mixed up. No. Probably.
>>
>>107887713
It can be overtrained, or you aren't taking enough steps. You could try adding a simple i2i to your workflow, it can fix hands. I've noticed that backgrounds go messy very easily if you overtrain z lora. It's also possible that the sweetspot for your specific lora is way lower than 1. You might get strong resemblance with 0.7
>>
>>107887728
which ever works best for you. i like exaggerated realism so uncanny photorealism, spark preview and chroma1 base, these have a little less body horror (but it IS still there).

>>107887739
try...

>PainterI2VAdvanced
https://github.com/princepainter/ComfyUI-PainterI2Vadvanced
>Wan Motion Scale
https://github.com/shootthesound/comfyUI-LongLook
>>
>>107887773
can you help me
>>
>>107887626
>>107887584
ACEStep 1.5 already discussed previous thread is on its way there. This gen is from most recent iteration and improvements:
https://files.catbox.moe/jc3fgz.mp3

Now, there's not many kpop gens, but here's one I could find from back in Dec in discord
https://files.catbox.moe/enbzvl.mp3

In terms of potential catchyness ACEStep is already Udio tier, after that it's a matter of good prompts to bring it to be as good as the best Udio gens. Takes more effort or could even take a tune on certain genres, sure, but since it's open source it will always be preferable to a locked down model that you'd have to pay to get more gens.

As for HeartMuLa, I don't think that has the musicality (instrument variety) of ACEStep.
>>
>>107887813
ETA on 1.5?
>>
>>107887819
should release around the time z base rleeases
>>
the people who spam threads on Reddit and cherry-pick the worst Klein gens against the best Z are chinks?
With really bad prompts, they force Klein to produce crap (photorealistic etc), while Z can do nothing but be realistic
Sure, z is better in terms of realism, but small is nowhere near as bad as it's made out to be there.
>>
>>107887830
Can you be more respectful?
>>
What are the crem de la crem NSFW wan loras?
>>
>add the naked body in image 1 to the body in image 2
okay now we're cooking
>>
>>107887834
i dont know
>>
>>107887717
>ipadapters
qrd
>>
>>107887833
use e-hentai and you'll know what I mean
>>
>>107887853
???
>>
File: 55645646546.png (36 KB, 1091x227)
36 KB
36 KB PNG
>>107887819
Unexpectedly found an actual release date
>>
>>107887813
>https://files.catbox.moe/jc3fgz.mp3
Sounds almost ok

>https://files.catbox.moe/enbzvl.mp3
Sounds meh for voice, it has that "metallic" low quality and it's clearly AI, they didn't probably train on non English songs that much, I think udio sounds richer instrumentally and also way less "robotic" :
https://files.catbox.moe/90f0l7.mp3
https://files.catbox.moe/h2qrop.mp3
https://files.catbox.moe/dm8ang.mp3
>>
>>107887830
the ablublu model?
>>
>>107887846
https://github.com/cubiq/ComfyUI_IPAdapter_plus
>>
>>107887863
in 2 more weeks fellas
>>
>>107887871
but what it do
>>
>>107887868
fuck that's catchy, any reason they are only 32s?
>>
>>107887863
>Literally 2 weeks.

You can't make this shit up.
>>
>>107887877
Udio v1 limitation
>>
>>107887876
sd1.5/sdxl, transfer styles, combine images, read it
>>
>>107887626
>mememarks
mememarks also show that GLM Image destroys Z-image turbo, do you also believe that to be the case? keek
>>
prompt literally just THERE'S TOO MANY NIGGERS IN HERE
>>
>>107887868
I actually wonder the size of their model, we don't have enough music models to really compare.
>>
>>
>>107887868
I mean, I've heard good Udio songs, so I know what it's capable of but I don't think you're being objective when you say that ACEStep example is clearly AI but then you link Udio songs that sound low quality. I've got insane Udio songs saved to my drive but I disagree with your assessment here. It should also be noted that the ACEStep examples sound very rich in quality, maybe you can tell the different with quality speakers or headphones. Not quite Udio tier yet in terms of composition, but certainly already better sound quality (though that's probably because they disabled quality downloads).

Here's a decent kpop Udio gen: https://files.catbox.moe/iw5ju4.mp3

Do I think a good ACEStep can do it? Maybe about 90% of it, but not quite there yet with composition. Technically, one thing where Udio really shines is lyrics and adherence to them, E.G.

https://files.catbox.moe/pyxtpi.mp3

ACEStep is still not fully coherent with lyrics and that's concern recognized by the dev plus something they're still working on, but if you've tried Udio long enough you'd know that it also messes up some songs near the end and you'd essentially have to inpaint (which is coming to ACEStep).
>>
File: img_00147_.jpg (827 KB, 1520x1152)
827 KB
827 KB JPG
>>
>>107887934
Here's another catchy Udio kpop gen, nice but messes up lyrics somewhere in the center so it's not infallible

https://files.catbox.moe/svtbkq.mp3
>>
>>107887751
They will do their best to not generate Penis/Vagina/Anus/Nipples bro
>>
>>107887934
To be frank, the Udio niceness is probably just an RLHF tune away with ACEStep. Once we get those sweet weights, if it's missing anything it's very likely given the high audio quality we'll be able to reach the gap with a simple tune on high quality data. ACEStep 1.0 was a meme, we will now have an SD moment for audio, hopefully. There's also that rumored Alibaba model coming, so if they want to give them competition I'm all for it.
>>
>>107887957
depends, look how much less cucked Klein is compared to Kontext for example, now that they know they have China who doesn't really give a fuck about this mentally ill safety shit, look how quikcly they dropped their paradigm
>>
>>107887552
>So they didn't give a shit about Qwen Image and Qwen Image Edit, but Z-image was enough to spook them?
When QiT has beaten Kontext it was a case of a 20b model beating a 12b model so it was seen as normal, but having a 6b model destroying the ass of a 32b model is a really humiliating experience, that really woken up, and there you go you got a nice product at the end, Competition baby!
>>
File: 1749184370112688.png (2.13 MB, 1056x1408)
2.13 MB
2.13 MB PNG
>>107887725
thangks :D
>>
File: sdgsdfwfwg.mp4 (3.79 MB, 720x1280)
3.79 MB
3.79 MB MP4
I forgot about this squish lora, lol.
>>
>>107887992
Klein isn't better than Dev though, it's just smaller
>>
>>107888010
so you think it's equal? that's also impressive you know? a 9b model as good as a 32b model at editing shit
>>
>>107887663
>>107887673
it even got the heavy makeup look
>>
>>107888017
no it's worse at editing than Dev too, by a lot. Dev can take up to 14 inputs also. Flex, Pro, and Max are all even better than that but they're API only obviously.
>>
i desperately need to train klein loras
>>
>>107887713
I got better results from training with V2 Ostris adapter on actual Turbo, than I did with DeTurbo.
>>
File: still_cant_believe_it.png (2.15 MB, 1920x1080)
2.15 MB
2.15 MB PNG
>tfw mogged by suno
still can't believe it
https://suno.com/s/cR36Z8K0aBXpaATE
https://youtu.be/MAwRKDLqv9c
>>
>>107887769
Thanks
>>
>>107887957
The model doesn't seem to be actively poisoned and on the level of original SDXL when it comes to nudity, unless BFL found new tricks poison their models NSFW capability should come soon enough, it serms to be easy to train too
>>
where's the denoise node for klein? it's working fine but the effect is too strong, 0.5 would be perfect
>>
>>107887713
>>107888051
yeah i also use the adapter. around 30 images, 1800-2000 steps, rank 16, and manually crop all the training data to make sure i capture what i want to replicate. i also make sure to save the lora at a multiplicative value of the image count. So if i have 30 images, i save every 90 steps. My worry is that saving at say 100 steps, the last epoch will only have trained on 10 images, instead of the full 30.

i caption with this system prompt:
>Write a long description of this image. refer to the person as 'female'. do not describe any features she cannot change like her physique, face, skin-color, breast size, etc.
>Start with describing the quality of the photograph, her facial expression and her hair. then describe what she's wearing and her pose. then describe the background. and lastly describe the lighting.

my results are great.
>>
File: Flux2-Klein_00007_.jpg (899 KB, 1152x1520)
899 KB
899 KB JPG
It does lighting really well
>>
God subgraphs are so retarded. Why can't I disable elements of the subgraph without going into it.
Imagine if you could just automatically disable extra image inputs but just not using them instead of getting an error for leaving them blank.
>>
>>107888172
>he doesn't know of booleans
>>
>>107888129
i assume you use only the 1024 bucket and not 512 or 768?
>>
>>107888180
i have a couple of loras trained on 1024, but i can barely tell a difference. i train a majority on 768, and i feel the results are better.
>>
>>107888184
hmm i see. I tried 512 and the results felt kinda low res desu. but 1024 on the other hand is ultra mega slow. guess i should try 768 as well...
btw which VL do you use for the captions? qwen3? it works pretty good imo
>>
File: 1767776299147405.png (735 KB, 1917x1021)
735 KB
735 KB PNG
>>107888195
I use joycaption with this:

https://github.com/D3voz/joy-caption-beta-one-gui-mod
>>
>>107888172
retard
>>
>>107888184
give lora benchod
>>
>>107888210
oh wait you can run joycaption locally? thought it was api only
>>
>>107888230
sub 80 iq
>>
>>107888212
>>107888175
idiots
>>
I keep getting the same
>KSampler LTXVModel.forward() missing 1 required positional argument: 'attention_mask'.”
When trying to run LTX-2. I have upgraded Comfy, upgraded all custom nodes, et c. I've even tried other sampler nodes.
Is it only an issue with I2V?
>>
>>107888248
uh oh gora is dum dumb
>>
>>107888252
delete comfy its borked and reinstall
>>
File: Flux2-Klein_00019_.jpg (626 KB, 1152x1520)
626 KB
626 KB JPG
>>107888230
Yeah and it runs pretty well on quants too so you dont need that much vram
>>
>>107888242
>>107888262
damn, i just never bothered to check it out and just used qwen vl
>>
>>107888260
they said they updated it already
>>
>>107888274
What do you think JoyCaption uses?
>>
More cool stuff you can do with ACEStep 1.5

https://files.catbox.moe/cwuu7t.mp3
>>
I am looking for getting the "noise_seed" number back, generated by the Ksampler nod, from my locally generated wan2.2 videos.
I could use your help, been looking for a while.
>>
>>107888303
As far as I can tell on the discord, the model exists and is generating, but we can't download the weights. Is that correct?
>>
File: Flux2-Klein_00039_.jpg (532 KB, 1216x832)
532 KB
532 KB JPG
>>
>>107888051
>>107888129
Guess I'll just have to play around with the settings and captions(also done with joycaption so maybe I'll try your system prompt), or maybe crop out some hands or fix them with inpainting, some pics in the dataset have the hands in kinda weird and twisted positions so maybe just that? I felt the lora quality significantly improved with using deturbo, except the occasional flesh lump hand
>>
File: 1751544286252346.png (2.13 MB, 1056x1408)
2.13 MB
2.13 MB PNG
>>107887993
playing with inpainting, kinda annoying to wrangle the model
>>
Any Z-base yet or has BFL won
>>
>>107888210
joycaption sucks quite a bit compared to qwen3vl tho, the only thing it can do more is porn captioning.
>>
>>107888343
The model exists and is still being realigned, plus beta tested with different settings to find optimal ones on the Discord, it's due to release on 27th of January.
>>
how much time do anon spend tuning image details on average?
>>
>>107888385
then give me an uncensored qwen3vl workflow for batch captioning pls
>>
>>107888376
That's a man
>>
>>107888303
Refrain from shilling your model, chud
>>
>>107888361
insane detail for that resolution
>>
>>107888361
could've sworn i saw this same exact image when zit released
>>
>>107887565
>>
>>107888410
if men can look like that who needs women?
>>
>>107888439
Ikr
>>
Are you guys contributing to the Amelia meme threads? (On /pol/ etc.)

Too many imggen noobs doing SAAS garbage, they need real genners showing them how it's done
>>
File: Flein_00028_.png (2.24 MB, 1152x1440)
2.24 MB
2.24 MB PNG
>>
>>107888459
I posted a video in a thread and nobody mentioned it or probably even clicked on the catbox link so fuck them.
>>
File: Flux2-Klein_00082_.jpg (275 KB, 832x1216)
275 KB
275 KB JPG
It has the same "fun" feature as z, it tries to avoid naughty words

>>107888430
possible, prompt was shared at some point

>>107888459
>amelia meme
?
>>
>>107888471
Was is animating someone else's gen and making her say something based? Someone did that for my gen but by the time I woke up and saw it the thread was dead, I quite enjoyed it and would have said something if I could
>>
>>107888482
Was it the one about the pakies?
>>
>>107888459
Average nu-/pol/ users are third worlders and ex-TD rapefugee boomers. And lots of fucking bots and cords constantly shitting it up.
I don't see the point in interacting with there. I miss how it was pre-2016 though.
>>
File: 456544561212.png (55 KB, 271x826)
55 KB
55 KB PNG
>>107888303
Interesting Japanese output
https://files.catbox.moe/7pqlbx.mp3

Here's another gen from the same batch using same lyrics (pic rel) and prompt, shows output diversity. I think first one follows it slightly better and I'm guessing the lyrics are cutoff at the end because the song continues.
https://files.catbox.moe/eyf4or.mp3
>>
>>107888376
>>
>>107888488
Found it, it was this. Yeah Pakis
https://files.catbox.moe/8wsghs.mp4
>>
>>107888507
architect bros... WE WONNED!
>>
>>107888505
How do I request a gen? I have no idea how dick cord works but I'm in the discord.
>>
>>107888505
Damn nice, looking forward to this
>>
File: anger.gif (2.33 MB, 1000x563)
2.33 MB
2.33 MB GIF
>>107888476
Uk gov, probably some city council or whatever, contracted some retards to make educational "game" for high schoolers about encountering dangerous illegal ideology. That alone might have made it a topic of conversation but they put in a racist art hoe with a purple bob and a choker to be the main character's problematic friend who he's supposed to say no to. So she's now the waifu du jour
>>
>>107887680
>>
>>107887643



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.