Discussion of Free and Open Source Diffusion ModelsPrev: >>107849812https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>107851707>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy is low effort trolling in the OP?
>>107851707>my 1girls didnt make itim sneething
>>107851961yet another trollbake by out resident schizo
https://xcancel.com/ostrisai/status/2011065036387881410That's the reason there's so few ltx2 loras so far? They were all waiting for ostris to make it work?
>>107851966troll baker has shit taste. the collage doesn't mean anything anymore
>>107851966all the 1 girls in op are made by the same guy exluding the anime
>>107851973too big, too slow
>>107851961It makes you seethe, welcome back
>>107851977hmmm noticing bros...
>>107851985>too slowwhat? it's faster than Wan 2.2
>>107851994forgot to add too shit
>>107851985its smaller than ltx
>>107851977Nope, I made the 1girl doing the presentation and the jeets worshipping AI
>>107852011sorry should have excluded trans girls too
>>107852003lumina?
>>107851999>too shityou lost Chang, it's the kike era nowhttps://files.catbox.moe/oc0un3.mp4
>>107852032illustrious, uguu
>>107852034>tin can ear rapeyeah no. what did modelscope cook up today?
>>107851765>>107851869>>107851912>>>/wsg/6071886
why are so many ltx videos fried to fuck?
>>107851976>>107851961>>107851967You have a thread, how about you give anons a reason to post in it beebo?
>>107852066jewish culture
>>107852066brown hands
>>107851973it's 19 billion parameters senpai
>>107852067you don't have a job unfortunately
>wan getting no more good loras>no signs of a new wan model past 2.2>ltx2 apparently getting a 2.1 updatewansisters...i-i dont feel so good...
>>107852061>only sits properly on fp>by noclipping
what a sloppy model. it's cool it does audio but the quality is unacceptable
>>107852066classifier free guidance was a mistake
>>107852087you still need wan to animate clips to feed to ltx2 or else you get slideshows
>>107852083What does that have to do with maintaining a image diffusion thread that doesn't have the things you hate in it?
>>107852087That's a good thing though, imagine if we manage to bake goon with ltx, free audio
>>107852061kek
>>107852105how did they do that with so much precision?
We’re probably at the point where wan copers should be mocked.
>>107851955Replying to someone in the older thread regarding LTX loras, I've seen a NSFW one made from a really good 2.2 lora maker and it was horrible. Don't think LTX is going to have legs simply because of this.
>>107852101will you fuck off already drama faggot?
>>107852122stop samefagging
I'll do sone anti Ani gens again when I get home
>>107852122I'm not looking to argue, why are you posting in a thread where there is another general image gen thread that doesn't have the things you dislike?/sdg/ is active, what's wrong with that thread?
>use wan to goon>will continue to use wan to goon until nsfw loras and further optimization develops for ltx2thanks for beta testing!
>>107852139why are you never home
>>107852145I have a job
>>107852143vramlet
>>107852139why? they all suck. how about make good gens when you get home
>>107852116I think it responds to training parameters very differently to wan so expecting a good output from someone who trains wan is a mistake. You’ll need a few training runs at a few different settings to see what works.
>>107852147yeah sure you do lol
>>107852122Hey, you still here? I wanted to ask you a question.
>>107852142maybe you should go there instead :)
>>107852152>whyIt's fun
>>107852142>/sdg/ is active, what's wrong with that thread?that thread is a schizo containment thread, which clearly doesn't work judging by the posts here
>>107852163what's fun about shitting up the thread with low quality images and drama?
>>107852099true>>107852104not really, the quality and fidelity of ltx2 is hot plastic garbage, pure steaming vaseline smeared trash. unless they fix the quality in version 2.1, i have no reason to use it
>>107852113this is a great bit
>>107852178vinesauce if he robot girl
>>107852162I'm fine with the thread>>107852167The ward was placed to keep it out, but since you dislike the ward the anon must like the creatures.We're off topic and it feels like I'm talking to a vampire begging me to let it in with a face full of blood
>>107852175>hot plastic garbage, pure steaming vaseline smeared trash.it's way less plastic with the new vae though >>107850554
>>107852169i wonder if we've been domesticated like dogs
>>107852174Your reaction
There are people who saw ltxv on the first day and completely made up their mind about the model and went back to gooning to wan. These are the ones that will be left below
>>107852196oh anon
>>107852196it's not the least sensible approach to wait out until workflows are figured out, memory issues with comfyui fixed, loras trained, etchell event might be worth it to wait for LTX 2.1 given the i2v quality issues at the moment
>>107852196they are not that different from the ones who saw AI fingers and to this day think that's what AI is capable off
>>107852061fuck should I just use full weight? the difference is astronomical
>>107852230on this note, where can I download full weight split, i dont want to redownload vae/clip embeds
>>107852196if it was a 14b and didn't have those powerpoint censorship bullshit then yeah it would've been a no brainer, it's obviously superior to wan 2.2 but I think it's too big to be succeful enough, I hope I'm wrong though I'm having a lot of fun with that model
>>107852200ultra plastic
>>107852232*vomits*
>>107852230>fuck should I just use full weight?theorically, you could go for DFloat11, it's a 100% lossless quanthttps://github.com/mingyi456/ComfyUI-DFloat11-Extended
>>107852196>see ltx2 videos every day created from various setups, workflows, etc spammed everywhere>still prefer waneveryone is too focused on speed and audio, means dickall if the visuals are bad
>>107852232fixed
>>107852247
>>107852265this but the opposite
>>107852066foundational models are expected to be a little undercooked when they release. ltx is clearly fried so they fucked up the post training pretty badly and there is no going back. hopefully they see that for the 2.1 release but I have a feeling they are going to fuck it up again. this is something you can't fix with fine-tuning or loras
https://huggingface.co/profpeng/ltx2-bjnoice
>>107852265you have texture problems, try euler simple 5 shift
>>107852271>t. 3dcgi pornsloppa illushitmix enjoyer
>>107852272i mean they trained it on mr. bean cartoon post-credit scenes and bollywood vhs rips, so it's amazing it can even do anything at all
>>107852265>>107852276 (me)could also be your resolution (wild guess), try 1 million pixels as well
>>107852265the skin in you gen looks like noisy
>>107852061interesting comparison, thanks
>>107852265fixed
>>107852297why is her belly so fat?
>>107852368
>>107852368lordosis
>>107852368*angry tranny sounds*
>>107852346
>>107852368good pussy needs shelter
>>107849729So you guys won't legally be able to spend an intimate evening with your 1girl? Damn... as if being British wasn't bad enough already!
After 23 tries almost decent result. Not impressed with LTX2.Prompt: "A dark, cinematic movie clip. A powerful woman sits motionless on an ornate throne. Low, dramatic lighting. The camera begins a slow, deliberate push-in toward her. She calmly lifts a sword from her lap, gripping it firmly by the handle. Her eyes lock onto the camera—cold, furious, unblinking. In one sharp motion, she extends her arm and points the tip of the sword straight toward the lens. The camera rush halts abruptly, stopping just before the blade’s tip. She holds the pose, staring into the camera with raw anger and quiet menace. She says "Ambar na-môr" in Sindarin. Windy ambiance sound."
>>107852440closest thing we got to veo
https://github.com/Comfy-Org/ComfyUI/pull/11831>Support the siglip 2 naflex model as a clip vision model.Z-image base omni is using siglip 2
>>107852435think that has to do with people stupidly making deep fakes of real people and spreading it online. its a retarded law because people are still going to do it anyway and there's way too many tools out in the wild to enforce anything.
have the comfy memory leaks been fixed by now?
>>107849729it's shit like this that makes me so grateful to be american
>>107851707I love the panties the tie-chick has. Black, seamless, shiny, sorta plastic.Amazing that the best aesthetics can be generated now.
>>107852488Scaled to 0.15 megapixels? I'm uncomfy suspecting this might happen again.
>>107852495it can leak faster than ever before!
>>107852496You made the right decision in declaring your independence 250 years ago. Those bong fucks are crazy
>>107852346*pukes*
>>107852217hmmm, yeah I think I'll wait a bit before adding it to my goonstack
>>107852488peak copiumi like it
>>107852500yeah, looks like they're using the same method as Qwen Image Edit, prepare to see some random zoom in again...
>>107852488>Z-omni-turbo nigslip 2We are so back
>>107852346fixed again, youre welcome
>>107852440try to do the same thing with wan 2.2 and see how much of an improvement we've actually got
>>107852488Maybe it's for GLM-Image's i2i mode.
>>107852418
>>107852551GLM doesn't use siglip at allhttps://github.com/huggingface/transformers/pull/43100/files
>>107852530i can see you changed your settings
>>107852346soul vs souless >>107852530
>>107852592make her skin dark
>>107852496https://en.wikipedia.org/wiki/TAKE_IT_DOWN_Act
>>107852562I am not gonna go through all that and just take your word for it.
>>107852603nah. its miss fortune if your curious.https://civitai.com/models/2250628/miss-fortune-league-of-legends-sdxl-lora-or-13-outfits-illustrious
>>107852622did I stutter?
>>107852530
>>107852563shift 6 to 5
>>107852274>404What culture is this?
>>107852592corrected
QRD on siglip nipslip? New snake oil?
>>107852682schizobabble
Dang that qwen multiple angle lora looks crazy. Does it work on distill good enough?
No base general
>>107852701prompt?
>>107852720
>>107852720here
grub
yum
>>107852682Z-image base omni uses 2 encoders, SigLIP2 and DINOv3.https://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668and it's for the I2L (image2lora) thing that was already been used on Qwen Imagehttps://huggingface.co/blog/kelseye/qwen-image-i2l
>>107852805Wow, very nice! feels so real omg it migu but with big boobies! Love your posts anon please post more! :]
I have a feeling the powerpoint shit happens because of this node, I bypassed it and I got less of that shit
>>107852805how did a single slopmaxxed sona mog the last 10k mikunigger posts?
>>107852539It almost got the picking up of the sword right the first try. Q5 + rife. Camera shaking is probably just a seed thing. Less cinematic for sure.
>>107852830Yeah but the idea of that node is to resize to make the compression from LTXVPreprocess less visible.You're just trading worst image quality vs more movement.I don't have any powerpoint issue so I use it.
>>107852830the workflows they provided for comfy are such a shitshow
>>107852863why do we have to compress the image anyway? it's weird that the model needs something jpg compressed to work well
>>107852722It's a reaction image, retard
>>107852274>no example images>no example prompts>no trigger wordsyeah this isn't going to work. need a better spot for hosting loras.
>>107852813the model is called mergesteinUncanny_uncannyAPlus
>>107852936Revolting
>>107852936Your gens are mesmerizing. Do not listen to the schizo hater... your gens really have something special about them
>>107852720Can't wait for retards to mald over the fact that base is actively worse than Turbo for immediate inference because they don't understand the point of it
>>107852682Seems like another tokenizer for vision image data.Think of clip_vision_h if you remember that from any workflows.Probably better since it's a lot newer.
>>107852616more like Take My Pants Down Act
Could sdxl have nigslip
>>107852969pls give z danbooru finetune :((
>>107852900...and? gimme the prompt
>>107852881It adds noise/"grip" so the model thinks it's a movie still and not a screenshot, so it "knows" it needs to be animated.
1/3
>>107853016Wan doesn't need to do that to work, I think those jews need to learn something or two from the chinks
Anons, I might just be bad at prompting but I'm really struggling to get Wan 2.2 T2V (Q6_K_M + lightx2v loras, 6 steps split, CFG 1.0) to give me actually uniformly light blonde hair. Either I get wigs that are almost white, or I get dirty blondes with dark roots. Doesn't seem to work with any of the following prompts: "blonde, light blonde, very light blonde, uniformly blonde, completely blonde, naturally blonde, ... "This anyone else's experience or is my setup fucked in some way?
2/3
3/3
>>107853034Use Q8 or bf16/fp16.Don't use quantized text encoder unless very desperate.Don't use tag1,tag2,tag3 style prompts with (um)t5, use natural language.
>>107852274It works but it's nowhere near as good as the wan ones.
>>107852954:)
>>107852831those were tests!!!! TESTS!!!!
>>107852830use the anti still lora, fixed it for me
>>107853096ugly abomi
>>107853070Thanks for the tips, I already tried to use the Q8 model, but it was too much for my RAM to handle, so I had to go back to Q6. Can it really have that big of an impact?I'm not using any quantized text encoder, I'm using umlt5-xxl-enc-bf16.I'm not actually prompting with only tags, I was just giving examples of what I have tried to put as descriptors for the hair. Prompt is generally natural text:"Medium shot. A girl with <COLOR DESCRIPTION> hair is looking at the camera smiling. She waves to the camera."
>>107853142qwenvl is better for encoding
>>107852936stunning. is this even real >_<? xDyou are the best genner in this general..your amazing attention to detail..6 figures..butt chin..such amazing tastes that you even made all the girls have the same beautiful face and plastic texture..god you are just such an artisttell me your secrets ai god
Why the fuck is civitaiarchive.com so fucking slow? It takes minutes for searches to show up.
>>107853029
the tech feels so stagnant now. uis are getting worse. people are being greedy with workflows. cumfart is just going to get sold to Nvidia. what is there to look forward to anymore? a shitty video model that does sound too but everything it gens is worse video quality than wan and the worst audio I've heard generated from a model? what the fuck happened?
>>107853157No way qwenvl works as a text encoder for Wan, isn't that only for prompt extension/rewrite?
>>107853116link?
>>107853202go ask in /sdg/ or trooncord ani why do you insist on trying to talk to an entire community that hates you?
>>107853179millions of jeets downloading bob and vagene lora pls understand saar
>>107853220take your meds schizo
>>107853230the pharmacist called. Mr catjak didn't pick up the prescription :'(
>>107853230do you realize you've said this line to every anon itt?
>>107853229not having the infrastructure of civitai while having comparable traffic
>>107853229Nuke India
>>107853241I didn't know everyone was sleeping over at catjak's place. Imagine the smell
>>107853171kek
>>107853251Hey by the way, I wanted to ask you something. Do you have uhhh, Batman and The Shadow? Issue number four?
can't talk about how much we hate comfyui. Nope! Not while the schizo is in the thread. Just can't happen! Every complainer must be ani and it triggers the baker schizo retard into shitting up the thread. Can't do it anons. We have to mind the schizo freaking out every single thread 24/7 just to placate the diaper autist. He could seriously harm himself if that keeps up and we have to keep the schizo safe from mean old ani! Yep, all ani's fault he has to stay here and shit all over everything. Schizo did nothing wrong!
uh oh meltie
hello I just started using ComfyUI portable and it seems like there are 2 ways you can do it: numpy v1 or numpy v2Which version should I use? It came with v2 and all these node packs I try to install keep saying Import Failed. Do I have to redownload a different ComfyUI version to downgrade or can I fix this one?
Meltdown hours again, this will take several threads until he sleeps
>>107853142Ask Grok to paraphrase until it works?You can try weights like (blond:1.5)You can also try NAG with hair colors/shades you don't want to see.>>107853204Yes it's a bot/troll
>>107853296>>107853306It's the horned schizo ignore him, he's willing to use the other one to torment the thread. The dev is more angry and targeted in his seething
>lolcow dev melting again
>>107853306he never sleeps
>>107853315Which schizo is this schizo?
>>107853330The first rentry retard not the dev retard. Dev is more violent in his post.
>>107853330Ani
>>107852936fixed
>>107853338I think the schizos have mindbroken you, ranposter ;o This is definitely Ani.
>>107853202a basic gradio ui gets the job done. the problem is lack of commitment to maintaining them and updating them to support newer models. impressive how strong wan2gp is still going and deepbeepmeep works to update it. Haoming02 is a smugly faggot and nuked most the features from forge and reforge beacuse of "bloat" with forge neo.
>>107853342nice slopface
lol at the bogeyman posting. maybe the problem is the guys spending a quarter of the thread accusing anon of being x schizo
>>107853096fixed
>>107853350fixed
Any custom nodes for evening out audio loudness when the original audio is quiet and generated one is significantly louder? KJ's audio normalization node somewhat works but it's not perfect https://files.catbox.moe/vo6j40.mp4
fixed
>>107853350you can keep using your illu shitmixes with old forge retard, but that wont fix your shit taste
>>107853383yes!!! we need more shitty ltx gens in catboxes!!!! please shit out more!!!!
>>107853393ill take 100 garbage plastic ltx gens over another single plastic illustrious jeetmix gen
>>107853377yeah... it did a worse job at keeping his voice lol
>>107853399I personally don't see a difference in the thread improving
>>107853311I didn't think weights worked with t5 in comfy, maybe I should give it a try.Feels crazy that I have to NAG something like this, have no one else in /ldg/ genned people with naturally blonde hair with Wan?
finally tried the video extension stuff for cooming with LTX-2 and it's not just a meme, I was wrong to question reddit. 1. Generate 3-5 seconds for start of motion you want with wan 2.2.2. Use video as input to LTX-2 with prompt specifying the sounds and dialog you want and extend it to like 10-15 seconds or whatever your GPU can handle.
>>107853034I like to use "*-blonde" (* = platinum, honey, strawberry, etc). I do that for a lot of colors now that I think about it... I'm sure the hyphen probably doesn't matter, but I've never tested very thoroughly. You could also try prompting for a Nordic ethnicity, they're all blonde.
is there an issue on using loras for qwen edit 2509 on 2511
>>107853498yeah the issue is that they were trained on 2509, not 2511. could still work though
>>107853350look at those fucked up paws. you need chroma asap
>>107853371man chroma looks like shit
>>107853465zimage moment
>>107853549tranny moment
>>107853291this is what zit generated with that as a prompt
>>107853493Thanks, I'll test these as well. I can get the tips of the hair to look blonde, but as seen in your image, the roots are always dark.
>>107853531chroma is a hit or miss in terms of quality and prompt adherence. i hope the new sparks model turns out better.
Im a honest personI like AI, but everyone here who contributes nothing but their slop should die miserably of cancer.
>>107853545>chroma
>>107853647benchod
>>107853545Chroma Flash at CFG 1 / 8 steps without negatives is unironically better in terms of basic output quality than regular Chroma by a lot. Less versatile though
>>107853342>fixedyou made it worse with that noise on the skin. Learn how to use z-image.
>>107853606I am not hopeful. I think you need at least 5 digit something images, not 3k, perhaps even more ideally, and far longer training run than a single 4090 running for a few days to thoroughly unfuck it, assuming that it's even possible, given how brittle the model is after the schizo training it went through.
>>107853464>no examples>no workflow>reddit mentionedyup, that's a snake oil
sdxl forever and ever