Discussion of Free and Open Source Diffusion ModelsPrev: >>107880290https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
cooomers eating good, XL era is ending
euler, simple
Qwen3 Image is better than Klein 9b right?Is Klein just a good option for poorer people?
it knows shakira?
>>107883170honestly the klein image edits i have seen here were very good
SHAKIRA SHAKIRAOO BABY YOU WANNA FUCK MY ASSYOU MAKE A WOMAN GO MADSHANIQUATANIQUAFEELING THE SIDES OF MY BODYIMMMMM ON TONIGHT AND MY HIPS DONT LIE AND IM TRYING TO FEEL YOU BOYLETS GOMEXICO
>>107883147>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonuh oh, low thread quality inbound!
>>107883207are you debo or ani
>>107883197I've been using the edit nodes on anime images, and qwen3 still has clearly better results/adherence.
Why was the last one deleted?
>>107883197klein hits a little different
hmm!
Thoughts?
>>107883256green animation is largest
>>107883256white purple and green
Do you guys miss the good old SD1.5 days? Remember when it was a simpler time and everyone was excited about making gens?
>>107883274no
>>107883274if you're not excited by every new improvement its your brain issue
>>107883296Wow, did you have to be this rude to me? Yes, I'm feeling depressed, alright, but there was no need to call me mentally ill.
Blessed thread of frenship
>>107883274>getting stable diffusion to even work on Linux was a total fucking bitch with random python errors >after trouble shooting for hours, your non flagship card that barely had any tensor cores (in my case i used a 1080 ti which had none) would either crash out or struggle to give you a half decent washed out image No. Early sd 1.5 days sucked and I was largely unimpressed, immediately going back to cloud based shit.
>>107883274>This post brought to you by Greg Rutkowski and Alphonse Mucha.
>>107883274I remember the first time trying to get anything to work>windows>amd>some shitty workarounds >something onnx>something converting models>....>like 10 minutes per picture>gpu at 100%I was happy when i had my picture of a pink cocktail on a table at a beach and was fascinated by it kek
>>107883256absolutely based they recommend using llama.cpp over pootorch
>>107883274the sidegrades actually felt impactful. nowadays it's "use this snake oil to get realistic skin!" (no difference). I remember ipadapter, loras and controlnets actually being useful and easy to plug and play. nowadays everything sucks and just plateaued
>>107883388benchod
>>107883364>Greg RutkowskiI remember he was bitching about being used in the dataset and then as soon as he was removed everyone forgot he existed lol. AI was the only reason his generic fantasyslop got recognized.
Hey people using klein edit, where is this node from?
>>107883437https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced
>>107883454real?
>>1078832741.5 was dogshit. the anatomy was catastrophic. It required extensive retouching. luckily, i only discovered this garbage, a few weeks before the release of the XL version, kek
>>107883274no because it didn't make dicks, I want dicks
>>107883501There were a lot of 1.5 models that could make dicks
>>107883546that arm looks extremely huge/weird
>>107883274I had fun making HR Giger interpretations of politicians.Lost most of the gens tho
>>1078832741.5 is unoriginal soul. It's trained with everything and the kitchen sink. You ask these new "good" models for a kodak brownie photograph they don't know anything.
>>107883454
>>107883183Only thing we know is that you are face blind
>>107882616Kleinfags in shambles
>>107883596about time you finally admitted that migu is a girltook you ages
>>107883481
if i pass the SamplerCustomAdvanced an SD3 latent it doubled my image res? does that happen to anyone else?
>>107883251>just numbers in the filenameI now use ComfyUI %date:yyyy-MM-dd hh-mm-ss% %KSamplerAdvanced.noise_seed%
ComfyUI %date:yyyy-MM-dd hh-mm-ss% %KSamplerAdvanced.noise_seed%
>>107883688Nobody asked, though?
>>107883465>>107883655huh? when did i say migu isn't a grill
>>107883686
>>107883274I remember in October 2022 being super excited about genning a 512x768 Injun lmao I thought it was amazing. That was SD 1.4.
>>107883729
>>107883752Just checked the archives and I posted it on /aco/ lmaohttps://desuarchive.org/aco/thread/6779207/#6779754
>>107883560
>>107883612kek'd
>>107883718It's for your edification, sequentialfag>>107883776Jesus
>>107883752oh hey it knows jessica alba?
>>107883246
>>107883785It vaguely knew a bunch of celebs, but this one's not Jessica Alba.
>>107883798
>>107883274i am still quite excited
>>107883752
>>107883274i remember upgrading from 1080ti to 3090 during sd 1.5 days and was like holy shit i can gen 8 shitty 512x512 terrible images at once!
>>107883823
>>107883812interesting
>>107883834Qwen wins again
i will be excited once z image base drops, and then releases a noobai finetune which blows illustrious out of the water
>>107883481i don't have a single XL gen saved that wasn't an edit.i keep loads of 1.5 models, some of which have probably disappeared, because creative weirdos easily fine-tuned 1.5 and then vanished.
>>107883818
Is there any website I can use to generate a 20-30 minute movie, that can do consistent scenes?
>>107883904kek
>>107883891<is the singularity out yet?
>>107883891>Jarvis, create a free website where users can generate a 30 minute movie that makes sense for free
When will I stop getting these demonic captchas and get the normal ones? No wonder even the 4chan xt dev bailed out, this website works againsts its users
wen base
Tried chatgpt image gen the other day for workShit is light years away from local nglLocal is just too unwieldy, you have to fucking study how to do anything with itThat shit was just plug and play and better resultsSo that is very unfortunate
>>107883925>>107883949So what's the best this horseshit AI can do then? 10 sec clips? that may or may not be consistent or have continuity
Mr President, another Z-image commit has hit the tower.https://github.com/kohya-ss/musubi-tuner/pull/843
>>107883999yes. it's only for gooning and memes but the memes kinda suck now and the new model has to have someone spend several hundred thousand dollars to train porn into the base model but it's already fried on release
>>107884010https://github.com/kohya-ss/musubi-tuner/pull/843#issuecomment-3759879680>I will test and merge this as soon as the base weights are released.based kohya not falling for their bullshit, no weight = no merge
>>107883686explain?
>>107884017How many more years until I can make 30 minute movie that is indistinguishable or at least very close in quality to the real thing?
>>107883256This sounds absolutely insane! Is there a paper somewhere where we could listen to samples? Realtime too, the TTS model I've been waiting for. Can we control emotions like in ElevenLabs?
>>107883999Yeah but some people manage. Today I found this guy lmao, makes 2-4 minute erotic films about Wonder Woman getting hypnotised/enslaved/fucked, with clearly several short scenes just stitched together.https://www.deviantart.com/deviant-wonders/gallery/all
saw some solid faceswaps with flux2 in the thread before. does it work with the basic workflow or do i need dark magic?
Krea still compares pretty well to both Klein and Z Image IMO. They're usually always pretty similar to each other too I guess because of Qwen as the TE, whereas T5 takes Krea in a bit of a different direction usually on the same prompt.
>>107884029i'm passing in 1152x864 and getting 2304x1728 out.
>>107884053krea looks burned af ngl
>>107884049You need a workflow for this with some special nodes for editing like this https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>107883256Thanks doc
>>107884060ahh nice find, gonna test it later, I'm trying to train a klein lora, diffusion-pipe added support, its going pretty fast
>>107884060
>>107884047Looks stupid and inconsistent, I only saw one clip since all others need to login. 1/10 would not watch and would not bother with this horseshit.
>>107883919>>107884003one thing that I noticed about klein is that it tends to give very manly hands to women
>>107884065cheers, will have a look at it
>>107883164
>>107884089ahh that explains it. good find anon. good luck with training, i found it was almost as easy as zimage. are you doing i2i or t2i?
>>107884089>diffusion-pipeugh hate that one. so clunky to use and uses pointless deepspeed like everyone has fifty gpus
>>107883147Klein is better at edits than NBP. But it understands only a fraction of it, how did they do it?
>>107884061Yeah it's a bit contrasty. It retains more fine detail when upscaling then the other two though.
>>107883842hm. it did not become real
>>107884121thanks for the comparisons anon.Looks like qwen is typically better but with the tradeoff that it can change things beyond what's been asked for
>>107884151>how did they do it?getting humiliated by Alibaba and Z-image turbo does that to you, when your ego has been hurt you only want to prove everyone wrong so you work as hard as you can
>>107883842 >>107883823BTW I'm glad that prompt still works.Some SD1.5 prompts did not fare quite as well on flux2 klein.
I like that models tend to be more and more unified, for example they managed to make Flux 2 Klein good at both edititing and as a text2image model, that's how it should be, a model that can do it all at the same time
>>107884170gasper
>>107884022>based kohya not falling for their bullshit, no weight = no mergeMerging takes on second, only thing that takes time is reviewing, he can review at his leisure and merge the second Base drops
>>107884222>Merging takes on secondhe needs to test the base model and see if the script works on it before approving and merging though
>>107884229didn't ask
>>107884170are you trying his latest radiance models?https://huggingface.co/lodestones/Zeta-Chroma/blob/main/zeta-chroma-x0-pixel-proto.safetensors
>>107884232
>>107884239puto
Hey bros what's the good news
>>107884129t2i, thats only supported for now>>107884146Is not that bad once you got it set it up
>>107884251i got fired
>>107884251I got hired because some fag got fired lmao
>>107884251main thing rn: flux2 klein is out>>107884233checking some old SD1.4/1.5 prompts on flux2 kein and current radiancehaven't tested zeta radiance yet
so what is the use of flux-2-klein-base-9b over distilled?
>>107884270training (actually no, the licence is shit lul)
https://www.reddit.com/r/StableDiffusion/comments/1qdl0dd/ltx2_vs_wan_22_the_anime_series/Absolute cinema
>>107884233nta is there an actual workflow for this?
>>107884282lol at the hunyuan grave and jensen laughing like an evil maniac
>>107884273Still good for training loras, only full finetuners care about licenses.
>>107884282kj boss and potato 3060 gpu and many other good references.fucking saved.
>>107884166A Qwen Edit with the realism of the 2512 T2I model would be WAY better than Klein (but still slower). I hope they do one, regardless.
so where Z-Base
Why the fuck do you guys hate Pony so much?
>>107884315I hope Z-image edit will have the realism of Z-image turbo, this shit would be goated
whats the verdict on klein? i get body horror with t2i and the edits make the images slightly pinker/orange
>>107884344>the edits make the images slightly pinker/orangethat's because they're still using a VAE in the year of our lord 2026, they should go pixel mode on edit models ffs
damn. klein actually gives me better results than qwen when inpainting.
Why didn't any of you tell me about this user on civitAI?https://civitai.com/user/Synthdark8Absolute god-tier anime style loras.
>>107884340he's a cuck that removes artist styles on his models, fuck that horse fucker
5090 but i got 2TB disk space and running out. i'm like the guy who parks his Porsche in front of a trailer park home
>>107884372>shovelware shit
>>107884344It still needs more training to replace SDXL for anime, but it's actually quite good. I can see people using it as an alternative to ZIT for various subjects.Shame only 4b has an open license.
>>107884253I trust it even less for cumfart cocksucking. their formats suck and he won't make anything better than gguf imatrix
kill ani
>>107884344It's best in class for a 9b model but isn't the best overall. It's fast, can handle edits on large images and is solid as an upscaler. For t2i use the distilled, it's nothing special but it's fast.For edits use the base model, it's slower but much higher quality
>>107884408no
>change to high definitionalright it's good
>>107884415>For edits use the base model, it's slower but much higher qualityreally? do you have some comparison pictures between a distilled edit and a base edit?
>>107884423benchod
>>107884427prompt was >add colour, keep detailed sketch style
>>107884415>For edits use the base modelhow many steps? honestly think the distill results are great already
>>107884441damn, base managed to keep the sovl wheras the distilled version slopified it
>>107884267Too soon
ACEStep 1.5 is apparently still in the oven and improvements are still being made bros, we'll be eating goodhttps://files.catbox.moe/jc3fgz.mp3(7 min song with coherence and great audio/vocals, insane)https://files.catbox.moe/6pmrsy.mp3Also ACEStep dev has this to say about HeartMuLa>It still sounds very impressive, the lyrics alignment is spot-on, and the details are incredibly realistic.>However, the limitations are as follows: our 2B model supports 50 languages, which does cause a certain degree of capacity issues. Additionally, in terms of inference speed, the time it takes for this model to generate one song is enough for us to generate 200 songs.
>>107884478These gens are from recent improvements that have been made
>>107884478>https://files.catbox.moe/6pmrsy.mp3the guitar sounds so fake, looks like a cheap VSL from the 00s lmao
>>107884478doesn't feel quite right to me yet but it is progressing
>>107884478pretty cool! would love to have a local model music that doesn't suck
>>107884478Wow I had given up on ACEStep since 1.5 has been the next on their roadmap since last summer and nothing has happened
>>107884421yes but what about Mr catjak?
>>107884526Which part? The instruments on that first one sound really good, vocals mostly okay. No idea what 8k, 9k and 10k mean btw. If those atre training steps then the first one is the one with 10k steps, bottom is 8k.
>all custom nodes are updated with no errors>finally get x0 to run>finishes gen>100% static outputOk, cool. Still not updating comfy though.
>>107884584>Still not updating comfy though.are you sure you won't update comfy once Z-image base will be released?
Great model, and the output has enough variation to keep genning.
>>107884535>1.5 has been the next on their roadmap since last summer and nothing has happenedYou can try out 1.5 on their Discord and the dev there has been sharing updates/samples ever since it's been pretraining phase (even when you couldn't hear anything coherent out of it).
>>107884605tmws
>>107884610>5 hp noobs trying to reach for the master sword
>>107884441note Base is DEFINITELY giga-worse than Distilled in like, a lot of cases though. It might depend on the prompt / style.
>>107884605I dont care about z. I just want to gen and the constant downloading, tinkering, then deleting every new model every other week isn't fun.
>>107884605are people really expecting it to be better than Turbo?
>>107884441base is too noisy and distilled is too slopped, now hear me out, what if we merge them together and find the sweet spot
>>107884668I'm waiting for Z-image edit personally, there's a big chance it's gonna be even better than Klein
i guess i will join the fun. Will a dual GPU setup work 5070ti + 5060ti work for 32gb of vram or do i have to spend 3x as much for a 5090 for 32gb vram? I already have 128gb in my system already.
>>107884676>Will a dual GPU setup work 5070ti + 5060ti work for 32gb of vramit will, with that nodehttps://github.com/pollockjj/ComfyUI-MultiGPU
>>107884610There's a weird criss cross line pattern. It's most evident in the guy's hair but I think it's on the entire image.
>>107884703Yeah, there's a few weird things which can happen with klein depending on the res, model size (4b or 9b), model type (base or distilled), number of steps, sampler etcMakes it hard to assess quality
>>107884730I would say that it's a bit inconsistent, sometimes you can get a good quality image, and sometimes you have pure slop, I think it was undertrained a bit, maybe they've rushed it so that they can get some hype before Z-image base and Z-image edit destroys every competition once and for all
>>107884669there might actually be some benefit to that DESU, for both T2I and editing
>>107884618Cool, here's hoping it's closing in on a release
from what I've gathered ltx i2v is kinda fucked is that right? just wait and hope for them to fix it with an updated model?
>>107884777I think it's already fixed. I was struggling to get more than 10 seconds of video before but now I can go to 20+ without a problem. Didn't change much to my old workflow either other than add the new vae
>>107884668It will be better for training, the big question is if the loras will work well with Z-Image Turbo or if you will have to gen on Base as well, who knows, maybe there will be a new Turbo better aligned with the Base models.Either way, for large finetuning, Z-Image Base will be what NSFW etc trainers will use both for the license and the quality of the model(s) despite only being 6B (which makes training much faster).
>>107884777correct, there are issues with i2v (you can sorta fix it by genning at 48fps) and audio.Fixes for both, along with better portrait mode coming 'soon'
Was making a character for some chatbot cardI busted my gpu before I got to generate it If anyone is kind enough I'd appreciate something with the followingIllustrious nsfw as the checkpoint. Sexualized but not nsfw No Lora512 width1024 heightQipao,middle length hair, wine color hair, cowboy shot, big or huge breasts, fully clothed, wide hips, serious expression, portraitNegative tag nsfw, cleavageWas gonna make it myself but GPU died 3 days ago.
>>107884802>>>/r/
>>107884802There's free image generators online you know, and I'm not talking about Sora or Gemini.
>>107884802Which gpu and how did it die?
>>107884791it's not about the length it's more about all sorts of video issues and quirks with i2v which make it barely usable>>107884800I tried 48 fps but it didn't seem that much better, and after updating nodes and comfy it stopped giving coherent outputs with 48 and instead gave me a stuttery mess of a video for some reason>Fixes for both, along with better portrait mode coming 'soon'sounds good
>>107884441This is on distill with "keep everything else exactly the same, especially the grainy texture" and I also described the colors. The scheduler is the main culprit, I used shift=1 and simple.
You can make extremely long ltxv videos with this: https://github.com/RandomInternetPreson/ComfyUI_LTX-2_VRAM_Memory_Management
>>107884857what wf are you using? neither the default or the bigstation one let me adjust shift or choose simple as the sampler
>>107884798blush is a bit much but still awesome>>107884838>>107884844awesome genssome anons bitch about it but sexy asians always keep me coming back
>>107884911https://files.catbox.moe/cjrc0u.json