Discussion of Free and Open Source Diffusion Models
Prev: >>107885702
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2
>WanX
https://github.com/Wan-Video/Wan2.2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb
>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>107887524>>107887529So they didn't give a shit about Qwen Image and Qwen Image Edit, but Z-image was enough to spook them?I guess it makes sense
>>107887547 File not found. >>107887541 >>107887540 I checked CivitAI and there are barely any LTX2 LoRAs... are you trolling?
>>107887537the girl on the left is wearing white gundam armor.
>>107887554are you slow or something
>>107887546seems to work
>>107887559 It's been a week since I bought a 5070 Ti. I downloaded Wan2GP, but people said Comfy was better, so I downloaded that too, and I have no idea how to use it. I've wasted like 100 GB and still don't get it.
Is any local model able to create nice music? I miss Udio's catchy kpop gen abilities https://files.catbox.moe/tij84e.mp3 https://files.catbox.moe/ylh0uh.mp3 https://files.catbox.moe/h18hrp.mp3
>>107887535based collage
>>107887565very conservative for small panties
>>107887565the girls are dressed as hatsune miku.
>>107887568 keep at it, you'll start to learn. also yes, that other anon was trolling
>>107887570 https://github.com/HeartMuLa/heartlib?tab=readme-ov-file This came out yesterday and has k-pop as a tag, but if I'm being honest, it's hard to control and very hit or miss. The clarity itself is pretty good though. Also takes like 3-5 minutes per gen.
>>107887570catchy it is, I don't think anything local can do that yet
>>107887568 anon im gonna help you out of pity, people here are too evil for people like you. Get this workflow: https://civitai.com/models/1824027/wan-22-aio-t2v-i2v-s2v-t2i-mmaudio-4-6-stepsloop-svi-video-extendwanvideowrapper-workflowk3nk and download the nodes and models it tells you to. Also check this for loras: https://civitai.com/user/K3NK/models?sort=Newest You don't need to use the actual workflow if it is too complex, but put it in your comfy so it at least tells you what models and stuff to download.
replace the text "DEUS EX" with "LDG General". replace the man with sunglasses with hatsune miku wearing the same sunglasses. it did this prompt better than qwen edit did, I remember trying this one.
>>107887600
>>107887600damn nigga this is crazy
>>107887570 it makes me irrationally angry that the model behind this quality will never be released
>>107887598 Thanks bro. I've uninstalled and reinstalled ComfyUI 3 times now.
>>107887610sudo is better
>>107887584 It's supposed to be better than udio. Somehow I doubt it, but I'll try it. Also: >Release the HeartMuLa-oss-7B version. Hopefully it'll be good
>>107887619the superadmin model
>>107887612 Use ChatGPT with thinking enabled for the most common questions on how to install Comfy and how it works.
kekchange the black man on the left into a jewish rabbi wearing a yarmulke.
my champignon wife
>>107887626 >It's supposed to be better than udio It's not. All I can say is sometimes the 3B outputs a bop, then goes back to being shit. Also I'm willing to bet a stick of RAM that the 7B never releases.
1girl bros?
>>107887626>>107887584
>>107887636give the black man on the left a baseball cap, white t-shirt, and blue jeans. he is smoking a joint.
why didn't you put this in the collage?
>>107887626It won't beat the mp3 you linked here >>107887570 anon, I know, I tested it, it doesn't sound nearly as good.
>>107887652is she pulling a stallman?
change the location to a sunny beach.
>>107887648The sound of silence.
>>107887648Funny webm
I wonder if our resident frieren porn slopper is happy about new episode
>>107887663who watches that again
>>107887641>>107887653OK, back to waiting for a local competition that can't be destroyed again
>>107887668 I do, I consume ~30 shows per season.
>coma for 3 years>sdxl still the best for 2d goongoddam
damn my gens are coming out gigaslopped today. sad.
>>107887677use chroma
>>107887678 I don't want to sit through chroma's gacha lottery + detailing steps. All my gens are basically 1-shot
We're like a week into this shit.>reported cp and it took over 24hours to get taken downJeet staff was a mistake.
I've trained a ZiT celeb lora (on deTurbo) and while the likeness comes out well, the hands are often chroma tier flesh lumps. Is this a sign of overtraining? Or a dataset problem, should I just crop the images to leave out hands or something? Or does this simply happen because it's trained on dedistilled Turbo instead of base?
>>107887535 I know Flux can't do NSFW. But can Flux2 Klein edit NSFW images, like changing the background or the costume they wear? Image for easy (You)
>>107887674>sdxl + ipadapters>chroma>wan2.1/2.2>maybe qwen edit here and thereim set and don't need any new models, unless much faster versions release without quality loss (IM LOOKING AT YOU CACHEDIT)
What the hell is going on with base? It's not distilled for low steps, but it's still distilled from the larger flux 2 presumably, yet it uses CFG > 1, and yet you are supposed to leave negative prompt empty otherwise it deforms your image... WTF is this?
>>107887680how original
>>107887717which chroma though
Has the slow motion curse of lightx2v been broken yet?
>>107887714I just tried background and it did without changing the naked lady.
>>107887739>slow motion curseYou mean wan? Slow motion was never an issue for light.
>>107887739Isn't there a 3 sampler strategy where you do like 4 steps high model without lora and then high model with lora, and then low model with lora?Never tried myself though.
>>107887674We're at the dawn of a new age though, either Flux klein 4b will dethrone XL, or Z Image if it's ever released. Coomers will be eating good in 2026
>>107887747lightx2v 4step distillation loras are the root cause of the slow motion it's known for
>>107887753lies
>>107887742Also at least the 4b one looks like it sucks for backgrounds. I am just getting slop.
>>107887753Oh. I got my names mixed up. No. Probably.
>>107887713 It can be overtrained, or you aren't taking enough steps. You could try adding a simple i2i pass to your workflow, it can fix hands. I've noticed that backgrounds go messy very easily if you overtrain a z lora. It's also possible that the sweet spot for your specific lora's strength is way lower than 1. You might get strong resemblance at 0.7
>>107887728 whichever works best for you. i like exaggerated realism, so uncanny photorealism, spark preview and chroma1 base; these have a little less body horror (but it IS still there). >>107887739 try... >PainterI2VAdvanced https://github.com/princepainter/ComfyUI-PainterI2Vadvanced >Wan Motion Scale https://github.com/shootthesound/comfyUI-LongLook
>>107887773can you help me
>>107887626 >>107887584 ACEStep 1.5, already discussed in the previous thread, is on its way there. This gen is from the most recent iteration and improvements: https://files.catbox.moe/jc3fgz.mp3 Now, there aren't many kpop gens, but here's one I could find from back in Dec on discord: https://files.catbox.moe/enbzvl.mp3 In terms of potential catchiness ACEStep is already Udio tier, after that it's a matter of good prompts to bring it up to the level of the best Udio gens. Takes more effort or could even take a tune on certain genres, sure, but since it's open source it will always be preferable to a locked down model that you'd have to pay to get more gens from. As for HeartMuLa, I don't think that has the musicality (instrument variety) of ACEStep.
>>107887813ETA on 1.5?
>>107887819 should release around the time z base releases
the people who spam threads on Reddit and cherry-pick the worst Klein gens against the best Z are chinks?With really bad prompts, they force Klein to produce crap (photorealistic etc), while Z can do nothing but be realisticSure, z is better in terms of realism, but small is nowhere near as bad as it's made out to be there.
>>107887830Can you be more respectful?
What are the crème de la crème NSFW wan loras?
>add the naked body in image 1 to the body in image 2okay now we're cooking
>>107887834i dont know
>>107887717>ipadaptersqrd
>>107887833use e-hentai and you'll know what I mean
>>107887853???
>>107887819Unexpectedly found an actual release date
>>107887813 >https://files.catbox.moe/jc3fgz.mp3 Sounds almost ok >https://files.catbox.moe/enbzvl.mp3 Sounds meh for voice, it has that "metallic" low quality and it's clearly AI, they probably didn't train on non-English songs that much. I think udio sounds richer instrumentally and also way less "robotic": https://files.catbox.moe/90f0l7.mp3 https://files.catbox.moe/h2qrop.mp3 https://files.catbox.moe/dm8ang.mp3
>>107887830the ablublu model?
>>107887846https://github.com/cubiq/ComfyUI_IPAdapter_plus
>>107887863in 2 more weeks fellas
>>107887871but what it do
>>107887868fuck that's catchy, any reason they are only 32s?
>>107887863>Literally 2 weeks.You can't make this shit up.
>>107887877Udio v1 limitation
>>107887876sd1.5/sdxl, transfer styles, combine images, read it
>>107887626 >mememarks mememarks also show that GLM Image destroys Z-image turbo, do you also believe that to be the case? keek
prompt literally just THERE'S TOO MANY NIGGERS IN HERE
>>107887868I actually wonder the size of their model, we don't have enough music models to really compare.
>>107887868 I mean, I've heard good Udio songs, so I know what it's capable of, but I don't think you're being objective when you say that ACEStep example is clearly AI but then you link Udio songs that sound low quality. I've got insane Udio songs saved to my drive but I disagree with your assessment here. It should also be noted that the ACEStep examples sound very rich in quality, maybe you can tell the difference with quality speakers or headphones. Not quite Udio tier yet in terms of composition, but certainly already better sound quality (though that's probably because they disabled quality downloads). Here's a decent kpop Udio gen: https://files.catbox.moe/iw5ju4.mp3 Do I think a good ACEStep can do it? Maybe about 90% of it, but not quite there yet with composition. Technically, one thing where Udio really shines is lyrics and adherence to them, e.g. https://files.catbox.moe/pyxtpi.mp3 ACEStep is still not fully coherent with lyrics and that's a concern recognized by the dev, plus something they're still working on, but if you've tried Udio long enough you'd know that it also messes up some songs near the end and you'd essentially have to inpaint (which is coming to ACEStep).
>>107887934 Here's another catchy Udio kpop gen, nice but messes up lyrics somewhere in the center so it's not infallible https://files.catbox.moe/svtbkq.mp3
>>107887751They will do their best to not generate Penis/Vagina/Anus/Nipples bro
>>107887934 To be frank, the Udio niceness is probably just an RLHF tune away with ACEStep. Once we get those sweet weights, if it's missing anything, it's very likely, given the high audio quality, that we'll be able to close the gap with a simple tune on high quality data. ACEStep 1.0 was a meme, we will now have an SD moment for audio, hopefully. There's also that rumored Alibaba model coming, so if they want to give them competition I'm all for it.
>>107887957 depends, look how much less cucked Klein is compared to Kontext for example. now that they know they have China, who doesn't really give a fuck about this mentally ill safety shit, look how quickly they dropped their paradigm
>>107887552 >So they didn't give a shit about Qwen Image and Qwen Image Edit, but Z-image was enough to spook them? When QiT beat Kontext it was a case of a 20b model beating a 12b model, so it was seen as normal, but having a 6b model destroy the ass of a 32b model is a really humiliating experience. That really woke them up, and there you go, you got a nice product at the end. Competition baby!
>>107887725thangks :D
I forgot about this squish lora, lol.
>>107887992Klein isn't better than Dev though, it's just smaller
>>107888010so you think it's equal? that's also impressive you know? a 9b model as good as a 32b model at editing shit
>>107887663>>107887673it even got the heavy makeup look
>>107888017no it's worse at editing than Dev too, by a lot. Dev can take up to 14 inputs also. Flex, Pro, and Max are all even better than that but they're API only obviously.
i desperately need to train klein loras
>>107887713I got better results from training with V2 Ostris adapter on actual Turbo, than I did with DeTurbo.
>tfw mogged by suno still can't believe it https://suno.com/s/cR36Z8K0aBXpaATE https://youtu.be/MAwRKDLqv9c
>>107887769Thanks
>>107887957 The model doesn't seem to be actively poisoned and is on the level of original SDXL when it comes to nudity. Unless BFL found new tricks to poison their models, NSFW capability should come soon enough, it seems to be easy to train too
where's the denoise node for klein? it's working fine but the effect is too strong, 0.5 would be perfect
>>107887713 >>107888051 yeah i also use the adapter. around 30 images, 1800-2000 steps, rank 16, and i manually crop all the training data to make sure i capture what i want to replicate. i also make sure to save the lora at a multiple of the image count. So if i have 30 images, i save every 90 steps. My worry is that when saving at, say, 100 steps, the last epoch will only have trained on 10 images instead of the full 30. i caption with this system prompt: >Write a long description of this image. refer to the person as 'female'. do not describe any features she cannot change like her physique, face, skin-color, breast size, etc. >Start with describing the quality of the photograph, her facial expression and her hair. then describe what she's wearing and her pose. then describe the background. and lastly describe the lighting. my results are great.
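The save-interval arithmetic in that post (30 images, save every 90 steps rather than 100) can be sketched roughly like this. This is only an illustration and assumes batch size 1 with every image seen exactly once per epoch; the function names are made up for the example and are not part of any trainer's API.

```python
def aligned_save_interval(num_images: int, target_interval: int) -> int:
    """Round a desired save interval to a whole number of epochs.

    Assumption (not from any trainer): batch size 1, so one epoch == num_images steps.
    """
    epochs = max(1, round(target_interval / num_images))
    return epochs * num_images


def images_into_partial_epoch(step: int, num_images: int) -> int:
    """Images of the current epoch already seen at `step` (0 means an epoch boundary)."""
    return step % num_images


if __name__ == "__main__":
    n_images, total_steps = 30, 1800
    interval = aligned_save_interval(n_images, target_interval=100)
    checkpoints = list(range(interval, total_steps + 1, interval))
    print("save every", interval, "steps ->", len(checkpoints), "checkpoints")
    print("at step 100 the current epoch has seen",
          images_into_partial_epoch(100, n_images), "of", n_images, "images")
```

With a larger batch size or dataset repeats, the steps-per-epoch figure changes, so the interval would have to be recomputed from whatever the trainer actually counts as a step.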