Discussion of Free and Open Source Diffusion ModelsPrev: >>107843132https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2https://comfyanonymous.github.io/ComfyUI_examples/wan22/>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>107846749thanks for the bake anon
You made sure to only include 4 images benchod
>>107846749>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy is this still being baked in the OP? it just invites drama
Garbage op. Snubbed again. Although tbf I didn't post anything last thread
https://github.com/Comfy-Org/ComfyUI/pull/11829KJ God saved us once again btw, now I can go for only 2 gb of offload instead of 5
>>107846769Leaving it out causes drama too
>>107846773https://github.com/Comfy-Org/ComfyUI/pull/11748there's also that PR waiting to be merged, dunno if it's gonna help even more but if it does I'll take it
>>107846769you had time to make the thread if you wanted, which you did and are samefagging
>>107846780it's easier for the mods to nuke one schizo than god knows how many
>>107846749only 4 images in the Op? what an absolute faggot of a baker.
threads too popular i miss when it was more niche
>>107846785maybe we shouldn't bake a new one at page 1? is that too much to ask?
>>107846790didnt ask
>>107846778I wish
>>107846805>missed the point award
>>107846778would
>>107846778Is this a reference to Kafka - Metamorphosis book?
>>107846818Being a cute pokemon is better than being a roach even if their situations are the same, faggot. Not replying any further.
>>107846827yes!
>>107846749should i use Regularisation images when training an anime style lora? and if so where do i even get them from
>>107846827no shit sherlock
>>107846828cry me a river
>>107846835>everyone read Kafkaoh god I wish, there would be way less retarded people on earth if it was true
https://files.catbox.moe/kwv9pc.mp4
>>107846841>I'm a roach Morty! I'm roach Gregor!!Wow such amazing plot
Blessed thread of frenship
>>107846854a lot of philosophers love to talk about roaches somehow
Please refrain from responding to the anon attempting to slide this thread.
>>107846870eh, I let beetles and spiders live in my house, roaches are dirty but i still try my best to get them out instead of killing them
reminder LTX2 is amazing.https://files.catbox.moe/0q28jd.mp4
>>107846925>didn't say the n word, some slur towards lgbt+ folx or some other /pol/ shitthis is new
>>everyone read Kafka>oh god I wish, there would be way less retarded people on earth if it was true>>107846870Not real quote, btw.
HAHAHAit knows spongebob and patrick natively, we need a list of what voices LTX2 knows. anyone have a list?https://files.catbox.moe/swy22q.mp4
>>107846941prompt?
>>107846944it is overtrained on cartoons
z-image omni? hah, great joke
>>107846946>A comic character illustration. Upper body of a caricature man. He has a very arrogant smirk. He wears three, very large award pins. The one on the right says: "READ KAFKA AWARD". The one on the center says: "INTELLECTUAL AWARD". The one the left says: "GETS REFERENCES AWARD". He is wearing a black jacket and a black fedora hat. He has his arms crossed. The man has very dirty, unkempt beard. He is looking at the viewer. Insane meme.Needs the jak lora too.
>>107846769You spend all day cryin lil bro, can you please give some positive vibes instead of doing that?So many positive changes happening especially in the software space and here you are being salty not being positive vibes mon.
>>107846950it also does a perfect Trump if you i2v with him, even the mannerisms. Now i'm wondering what list of characters it knows. Cause I didn't provide the spongebob voice.hahahaha, this is a gold mine.https://files.catbox.moe/9nyy17.mp4
>>107846960evil benchod fuck you
Spongebob says "Hey Patrick! Let's open a learing center!". Patrick says "did you mean LEARNING center?". Spongebob says "haha, not in somalia".it even made his hat lol, this is great. using q8 ltx2 from kijai's repo and I only have 16 vram (4080) and 64gb ram. If you have the memory you can load it all without issue.https://files.catbox.moe/zq4go1.mp4
>>107846941>Not real quote, btw.
>>107846959why lie
>>107846957omni looks useless, but the SFT one (Z-image) will be a better starting point to make finetunes
LTX2 is amazing. I *need* to know what other characters work for voices.https://files.catbox.moe/9uwlja.mp4
>>107846638I eventually got it to run. Switching to one of the fp8 options (doesn't seem like there is any noticeable difference between them) in the quant_format produces images similar to running it normally.Unfortunately, no speed improvement in my Ampere GPU. Disappointing but expected.Maybe there is very low chance that one of the other dozen different other variants they made will run faster, but I don't really feel like bothering with testing all that for what I believe to be very slim chance. My curiosity is sated for now.End of blogpost.
https://www.reddit.com/r/StableDiffusion/comments/1qatuni/ltx219bdistilled_vs_ltx219bdev_distilledlora/distill lora at 0.6 is better than distill model or distill lora at 1.0
>>107847003>grasping at straws to one up after LARPing as bespoke literature connoisseur and posting fake quote popular on plebbitKek, had a laugh>>107847018Do you want catbox or something schizo?
>>107846967>*currynigger detected*
How do you draw the controlnet mask for an image like this? Do you just draw the mask over both their arms? Is mask overlap acceptable?
>>107847049that's the point, you must get the references and be an actual intellectual to deboonk false philosophical quotes, hence the funny irony, hope that helps
new qwen is the first model I've tried that obeys this prompt decently. especially the stabbing, it's better at gore/violence than other models for some reason.black and white political cartoon scanned from an old newspaper.on the left, a palestinian man wearing a keffiyeh scarf around his neck is screaming and writhing in pain and agony and bleeding from a knife wound in his back. On the right, an Orthodox Jewish Rabbi with sidelocks and an orthodox jewish suit holds the hilt of the knife in the palestinian's back with right-handed overhand grip and stabs it deeply into the palestinian. the rabbi is looking back and happily calls out with his cupped left hand next to his shouting mouth, with a speech bubble saying "Help! This goy is attacking me!", sneering. the pommel of the knife has a small jewish Star of David design inscribed on it.
>>107847042the lora is gigantic though, I'd prefer someone to merge that shit into the model instead
so this week we have both GLM-Image and Z-Image-Omni-Base release?
>>107847068>a palestinian manhe looks like a jew though, first of all a palestinian is brown
>>107847075source on getting base this week?
>>107847085it was revealed to me in a chinese dream
>>107846882I'll spend 15 minutes trying catch some asshole spider so I can relocate him outside but I have a zero tolerance policy for roaches. Cockroaches aren't from this planet and must be destroyed.
hype from social media posts means nothingshow me those sweet sweet commits
>able to run video upscalers up to 4k resolution just fine>use a frame interpolation node and it shits the bed almost immediatelyWhy is it so hard to just make 60 fps videos
>>107847085>source on getting base this week?how about some random trooncord post
>>107847068Z... kek. ZIT is good for some stuff, but if you can fit it in VRAM, qwen + lightning 8 step is far superior at prompt following and coherence, while being about as fast. their styles are equally slopped, just in different ways. a chroma i2i pass helps fix the slop.
>>107847085(C|H)opium from the cryptic "Patience will be rewarded" post in their discord and bdsqlsz tweet about an open source model being released this week.
>>107847075>so this week we have both GLM-Imagethey're waiting for the commit to be merged, and only god knows when it's gonna happenhttps://github.com/huggingface/transformers/pull/43100/files>>107847110and a new commit merged on Modelscopehttps://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668
>>107847120have we seen even one example gen of this GLM Image?
>>107846638I struggle to believe that a 6b model and 20B one would run at the same speed. Are you running qwen nvfp4?
>>107847130nope, they're like Z-image base they're suspiciously silent about that, which is a bad sign imo, if they're not proud of the output of their models it means that it must be really really mid
>>107847135Tagged wrong post>>107847107>>107847120I mean yes but we had various Z-Base related commits since early December.The primarily interesting thing here is that it has checksums which implies they finished training/finetuning.
>>107846788it is not a participation trophy
is base out yet?
>>107847154>they finished training/finetuning.then what are they waiitng for???
>>107847130>have we seen even one example gen of this GLM Image?there's a new model on the Arena called "Goldfish" and it might be GLM-imagehttps://xcancel.com/testingcatalog/status/2008576286638424387#m
>>107846827No, just a coincidence.
>>107846788Try making some better images next time. If you want to go to the anime diffusion thread where 65% of all posts make it into the OP collage, you are welcome to.
Tongyi tongue my anus
How many Qwen Image versions must we get before the release of Z-image base??
>>107846790How was it before it got popular?
>>107847181tried it until i rolled one, left one was goldfishi'm trying to roll a photorealistic one though, unlucky so far
>>107847260less slop and more kino
How does Z Image Edit compare to Qwen Image Edit?
>>107847061>DEBOonk#mentioned
>>107847267since it's an autoregressive model you can give a very vague prompt and get something sophisticated, try to verify if it has this AR capability
Since A1111 is basically dead. Anyone know the next best alternative?I'm not picky about control and comfy is too much for my baby brain to handle.
I need to know what other characters LTX2 knows natively, for audio.>confirmed: Trump, Spongebob, Patrickpiece of shit catbox wont load the video, so here is a streamable example.https://streamable.com/7uzltt
>>107847294We'll never know
>>107847164>>107847225>*Ha... ha... ha... ha.*
>>107847316>We'll never knowI'm still not ready to accept that fact, let me some time anon
>>107847319dumb gwailo yu no andastand chinese cultcha
>>107847294>How does Z Image Edit compare to Qwen Image Edit?how can we know that :(
>>107847260>How was it before it got popular?it was a time when Chinese companies would release models without having to say "Soon Soon Soon" for months, we'll never get that shit again, sad
>>107847312Command line. Or FORGE.
>>107847312>comfy is too much for my baby brain to handlewhyare you stupid or somethingit's just node graphs. what exactly is 'too much'? do you not understand the terminology or how to connect things? what's going on man
>>107847052anyone?
>>107847367You never explained what you are trying to mask. If it's just the 1girl, then only mask her body parts, not the male's.
>>107847312neoforge
>>107847154>>107847135Z is faster than Qwen, but not significantly when they're both just 8 steps and 1 CFG. I don't get a speed boost from nvfp4 or fp8 even.
>>107847380I am specifically talking about a 2girl output (or 1boy, 1girl).If their arms are completely overlapping, am I still supposed to mask the arm being overlapped? Or just the visible parts of the arm?
>>107847107>their styles are equally slopped, just in different ways.i am pretty tired of the default z styles but significantly less so than qwen. even in your own comparison z looks sharper and more detailed IMO
>>107847347>it was a time when Chinese companies would release models without having to say "Soon Soon Soon" for monthshttps://files.catbox.moe/b5sx5o.mp4
>>107847388Are you on 3090?Also based gen.
>if 4chan was a lorahttps://www.reddit.com/r/StableDiffusion/comments/1qbd7gb/john_kricfalusiren_and_stimpy_style_lora_for/
>>107847388>No persian ever called me goyoh I get it
To heck with photorealism. I want knowledge of sundry styles.
Persians do call me "infidel", tho.
what to do we about the gacha nature of AIsame prompt, same settings, 100 iterations. some gens are pure incoherent dogshit. some gens are amazing. i dont know what to do other than generate batches until i find a needle in the haystack. seems wasteful, this shit will wear down my SSDs.has anyone found a way to control this
anyone tried the sarah peterson loras for zit? the jeet has massive black fetish but i was wondering if it can output normal scenes too
I have a question regarding the rentry guide.I notice the guide now recommends using the "Load Lora" node over Lora Loader Model Only because it includes a clip output.But how would you connect the pins in a multi-character controlnet workflow? Like pic related.Should lora clip outputs only connect to their respective character on the controlnet? And what clip should connect to the above text encodes?
>>107847478>has anyone found a way to control thisA question for the ages
>>107847478No. There's probably an infinite number of ways to generate a image regardless of the prompt based on what the model knows. You cannot possibly expect non-RNG results doing T2I/T2V.
>>107847478>the gacha nature of AIin a way they definitely look like humans, sometimes we rock and sometimes we suck lol
what other characters work for i2v voice cloning, trump works, spongebob and patrick work, anyone else know? is there a list?
>>107847464Iran has a large Christian minority, treated a lot better than Israel treats theirs, though.
>>107847500Checkpoint clip connects to Lora nodes first. Lora node clips connect to text encodes.>Should lora clip outputs only connect to their respective character on the controlnet?Yes. So if Region 1 is the 1girl character lora, then only connect that lora's clip to it.
pretty funny that TongyiMAI played their card way too early just to cash in on some flux hype. pretty obvious nothing else was ready
https://files.catbox.moe/nic9xu.mp4
>>107847553And it worked. They undermined and disrupted Flux.2 completely. They really don't need to release base at this point. Just makes it obvious China is only releasing quality free models to cut competition in the west.
>>107847553>>107847567the most impressive thing is that they managed to make Z-image turbo good out of a really undertrained base model
>>107847079You fell for NPC propaganda.Levant Arabs look white as hell. They are not Berbers or Gulf Arabs.They push the narrative that a (((European))) country is fighting against evil browns so that you would be more willing to be a zog tax slave funding them.
>>107847532Ok, and what model should connect to Attention Couple? The Checkpoint again?
>>107847592>The Checkpoint again?No. You daisy chain the loras.Checkpoint > Lora 1 > Lora 2 > Attention Couple
>>107847592Yes.Many regional prompting extensions on Cumfart are broken nowadays btw.No idea if this one works or not.
>>107847585Palestine isn't as levant as Iran though, you're delusional, there's a lot of browns in there
>>107847603Oh ok, I see.It's spaghetti, but it makes sense. Thanks.
>>107847611>Iran>LevantWhy are you talking shit about countries you can't point at map?>there's a lot of browns in thereNot as much as you think.
>>107847613Any involved workflow will devolve into spaghetti. You need to use get/set nodes if you want clean workflows. They are like variables.
>>107847613You are doing it wrong.Second lora should go from directly checkpoint to the second lora loader
>>107847637who gives a fuck about those brown countries though? only a brown would know how to put those countries in a map, show your hands
embrace the noodle
>>107847650all that for 1girl, large breasts, (ai generated:1.4)
>>107847660and i fucking love it
>>107847650nice cow
How the fuck am I supposed to run songbloom in comfyui? It looks like the original repo got nuked. Do I just grab some clone from some other dev?
>>107847650i dont know how you can run this without melting your browser after 2 gens.
>>107847042Do we still do CFG at 1? Or 4? Somewhere in between?
>>107847709always cfg 1 since it's using the distilled lora
>>107847560Man what a throwback to waking up in the dead of night to watch GSL games live on the fucking Gom Player
>>107847692Clean your fans.
>>107847720peak era for streaming imo, good starcraft and commentary, all through some jank player but it worked fine.
>>107847647What are you talking about? I was told each "Load Lora" node should plug their clip into their respective controlnet character: >>107847532The model itself is daisy chained from checkpoint to lora to lora..
>>107847720>>107847735Streaming hadn't been completely consolidated by faang companies yet. Esports is also kind of dead in the US.
>>107847739My bad you are right.
Anyone using i2v ltx2 with the detail lora? Does that even work?
>>107847787it does. you can even add the lora to the 2nd stage sampler to get an even sharper result.
>>107847835What weight do you use anon? Sounds useful.
best local model for img2img that fits in 24gb vram? i tried z-image turbo but it changes too much other stuff in the image
https://github.com/kijai/ComfyUI-KJNodes/commit/838a731f2fa250ee8ef5ac0b1299a1b1f5cbb3a01GB more vram reduction for LTX-2 model