Discussion of Free and Open Source Diffusion ModelsPrev: >>108018763https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>108022550bad fagollage
>>108022550>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonew. gross schizobabble in the op again. schizo is gonna do some mental gymnastics now
Hi guys, can someone help me?
>>108022550Alternative kino collage
>>108022586no
>>108022589proper bake templateDiscussion of Free and Open Source Diffusion ModelsPrev: >>108018763 #https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg
https://files.catbox.moe/jf9k7g.png
https://files.catbox.moe/sdryqa.png
>>108022632do you have non comfy metadata?
Blessed thread of frenship
>>108022656No I don't, sorry. The prompt is one of those "war-and-peace" multi paragraph ones so I can't paste them here. You can save the png and open as text file; you should be able to get what you needhttps://files.catbox.moe/tljubc.png
>>108022674neoforge has a metadata viewer. dunno why comfy doesn't have one, it's been years. it's just annoying copying values
>>108022632nice
>>108022696true>>108022715thankshttps://files.catbox.moe/4od2ay.png
>>108022685ai slop
friendly reminder you cannot use z-image loras with z-image turbo.
https://files.catbox.moe/92d0yi.png
https://files.catbox.moe/xm25fx.png
https://files.catbox.moe/vry28w.png
Imagine wan but with klein quality: super fast, multiple inputs, easily trainable, incredible vae.
>>108022793Let's all imagine this guy's mom's pussy
Can anybody please tell me what I am doing wrong? I am getting the infamous "You do not have CLIP state dict!" error and I don't understand which file I am missing.
whats the latest face detailer meta?
>>108022632>>108022674>>108022754>>108022765>>108022782Kino is back on the menu
> win7> forge> ff> 2026
https://files.catbox.moe/g5prdi.png>>108022793one can dream
>>108022813>literal retard unable to use UI made for retardsLMAOwhy do you have 2 vaes?why is the diff model in the vae?
>>108022602>>108022583Lol suffer tr*ni or de*o
>>108022826i made this image
>>108022828huh?
>>108022821thanks fren :)https://files.catbox.moe/phlg81.png
>>108022813Search the github repository and google for "You do not have CLIP state dict!", there will be multiple other people who had the same problem and the possible causes/solutions, if you can't do that you can also ask a LLM such as Grok, chatgpt, gemini or copilot on how to search issues on the github repository of the webui you are using
>>108022836erm acksually z-image's algorimth made it, you just prompted ithttps://files.catbox.moe/t8foa7.png
>>108022825based
https://files.catbox.moe/lwd2kp.png
>>108022840You heard meSuffer
>>108022813Out of pure curiosity what card do you have
>>108022898what?
https://files.catbox.moe/4os6p6.png
https://files.catbox.moe/xd9ld6.png
Why is SaaS adware included in the OP for local models?
>>108022923are you mad your uncle stopped diddling you cuz you got older and ugly or something?
https://files.catbox.moe/8981vu.png>>108022923because money dear boy
https://files.catbox.moe/amyore.png
>>108022933Damn, new models could be good for baking lightning + diffusion maps.
>>108022932that explains a lot about catjak. if you author the rentry you should put that in
https://files.catbox.moe/1t97ag.png
I'm trying out LoKr instead of LoRA for the first time. do you load these things the same way you load loras in comfy?
>>108022957Why do you give catjak so much power?
>>108023026does lokr even work in comfy?
>>108023026Is LoKr short for low key retarded?
https://files.catbox.moe/763vi5.png>>108022956FOSS models are this close to being production-ready; I think that the cloud/SaaS models are already there. If the pace of dev continues, we could see FOSS catching up in 6 months or a year
>>108023026It's only better than lora when it's used in the same model it was trained on btw (and you should use 99999 dim to trigger full matrix and control the size by the factor), lokr transfers terribly
>>108023032wdym? he's the thread lolcow. he just chose this thread to graze in and shit his pampers. sometimes it's funny, sometimes it's annoying. sometimes he pretends to be other anons like what you are doing right now. all we know for sure is he is a failure.
>>108023056According to you and your "friends" he's been able to>destroy /sdg/>get anons to migrate to /ldg/>delete post that goes against his narrative>be present 24/7>control the OP for the majority of the threads>ruin ani's project>frame ani as debo as the thread schizosYou make him sound like some autistic schizo god.
>>108023082yfw that anon IS catjack and does all those things
a message from ani in /adt/:>>108022262
How the fuck do I prompt for pov shit? Camera, cameraman, camera man, ain't working.
>>108023087The thread is more deader than usual because the OP was vandalizedDev schizo also tried to claim upscaling destroys artist styles and that's objectively false and a serious skill issue. I find it funny he ignored the post pointing that out to him.Did he say that because his frontend can't do upscaling?
>>108023090He's running off to share that beer with an underage prostitute.
>>108023087*yawn*
>>108023119>Dev schizo also tried to claim upscaling destroys artist styleshe isn't wrong. the style is always more sloped than the input. realism seems to hold up better nowadays
>>108023090What the fuck, now it's working. Rng I guess..
>>108023156prompt thief
>>108022910RTX 3060 12GB
>>108023151>Being wrongPlease stop
>>108023156Did you try "cohesium"?
>>108023090 >>108023156there's 8 billion people on this flat earth and you hollow brain fuckers keep posting the same slop.
>>108023208show us you are right. I am really interested in what you have learned about it
>>108023026>LoKrThat's for people that deep-fry their datasets. Doing 15k steps and using only that final, extra crispy output, completely ignoring any point where it might have converged earlier.
>>108023026yes>>108023034yes>>108023035no
>open reference klein workflow>try to make sense of it>2-level nested subgraphs>to hide picrelFUCK this shit.
>>108023119>Did he say that because his frontend can't do upscaling?Since he didn't post proof and oldfags know it's not true, he's probably hoping newfrens take him at face value. >>108023228The original claim that upscaling "destroys styles" has yet to be proven.
>>108023243damn, how do you get so much detail? or is this zit?
>>108023240>>108023240tf this bih look like da grinch
>>108023240mentally ill
So yeah, I can 100% confirm that ZIB-trained loras are WORSE used on ZIT than ZIT-trained loras used on ZIT. However, ZIB-trained loras used on ZIB itself are pretty fine, assuming you use the right negative. Picrel is the exact same Rubi Rose dataset, 120 pics, Gemini 3 Pro captioned, trained at the best possible quality on both ZIT and ZIB (1024x1024, full BF16 models, no quantization during training) with the same settings.ZIB on ZIT is the least close to what she actually looks like, whereas ZIT on ZIT and ZIB on ZIB are both quite believable in terms of facial likeness given the dataset did have her with a ton of different hairstyles and different lighting conditions and stuff.
>>108023295Are you using AI Toolkit for training? Does the cope of upping ZiB LoRA to 2 strength on ZiT work?
>>108023295>So yeah, I can 100% confirm that ZIB-trained loras are WORSE used on ZIT than ZIT-trained loras used on ZITyeah i have drawn the same conclusion. did you bump up the strength? it helps a little with likeness but also looks more borked. its a bit of bummer because genning takes ages with zib....>assuming you use the right negativeanything out of the ordinary or do you mean stuff like "low quality", "deformed"?
>>108023295I guess impatientlets stuck with ZiT will have to cope with using sub optimal versions of my kinosovl LoRAs.
Is there a decent workflow anyone has been using for video to video?
is there a comfy custom node that takes in any input and returns the filename? so vae, image, model, -> node -> filename ?
It is confirmed: z-image is another chinkshit failbake. Flux Klein won
>>108023257that was zit, this is base, i have to update my naming still
>>108023295>100% confirmed >single test erm....
>>108023352also looks pretty good. are you using default base and default workflow or is some magic involved?
:3
>>108023247What's exactly wrong with it? This is how subgraphs should be used - to hide stuff that you set up once and never touch again
>>108022865I did but I can't find. Search engines are so shit these days that they only give me unrelated results. And AIs tell me to check if I have the necessary files, without telling me which files are necessary. That's why I asked here, you guys know your stuff, I thought you would notice my error right-away from my screenshot and point it out.
>>108023257looks like ZIT to me.
>>108023403>first google result>"You do not have CLIP state dict!">https://www.reddit.com/r/StableDiffusion/comments/1exhalk/anyone_know_why_i_get_assertionerror_you_do_not/>>108023361don't tell him, kek>>108023368just different scheduler/sampler settings from r*ddit and an optional face detail but i only turn that on if it's losing details>https://files.catbox.moe/bxd5v8.png
>>108023313I used Musubi. And no, turning it up to 2 in any of the three setups fries the absolute fuck out of the image (as you'd expect), anyone who that works for at all just didn't train their lora correctly, in means the lora is just woefully undertrained in general.
Anyone having trouble using Adetailer with forge neo? Just doesn't seem to work.
>>108023361I've tested others and am testing more but the conclusion to be clear is that ZIB simply is not in fact the direct parent model of ZIT, not that training is the problem. Again as I said the "ZIB on ZIB" output looks fine for what it is in terms of the likeness (and I was using a negative for that one as ZIB generally needs).
>>108023274More like; "The QT that brought Sexmas!"
>>108023430thanks for the catbox.when i do close-ups like yours, i also get a lot of detail. but as soon as the person is farther away, faces usually start to look quite plastic.
https://files.catbox.moe/nq6pfc.png
thoughts?
>use wildcards to generate various characters in various locations>if either the character or the background is good, save it for later>use edit models to swap the good characters onto the good backgrounds>relight, inpaint, make changesI'm having fun over here.
>>108023451>the conclusion to be clear is that ZIB simply is not in fact the direct parent model of ZIToh yeah that was obvious from the getgo. you could just look at their original examples in the paper and see it. also the fact that they sat on ZiB for two months.
Greta/Kirk lora merge?
>>108023499nice
https://files.catbox.moe/ftqvm6.mp4Are there any current new workflows that give longer vids with Wan 2.1? like 10 sec and i'd be happy. only have 12GB VRAM though
>>108023352I gave the ZIT one to Gemini and prompted Klein 9B Distilled with the output, 4 steps
>>108023322nothing weird for the negative, specifically it was `worst quality, low quality, bad quality, very displeasing, lowres, unfinished, pixelated, low resolution, jpeg artifacts, disfigured, deformed, fused, conjoined, disproportionate, missing eye, closed eyes, cross-eyed, lazy eye, asymmetrical irises, bad hands, missing finger, extra digits, broken, crushed, melted, illustration, cartoon, anime, drawing, painting, watercolor, 2d, 3d, cgi, render, simple background, blurry, sketch, ugly`for the ZIB gens
>>108023472yeah face detail is for that exact scenario>>108023543yeah i know a lot of folks are miserable in these threads but i can get great output from klein 9b, zit/ base, i can't even choose which one to use for my daily driver, they all take loras great too and train fast as shit.
>>108023570sdxl slop
>>108023570>i can't even choose which one to use for my daily driver, they all take loras great too and train fast as shit.in the mists of trolling anon often forgets how incredible it is we even have more than a single model competing for the top
https://huggingface.co/circlestone-labs/Animababe wake up, new SOTA uncensored anime model
>>108023663>this model constitutes a "Derivative Model" of Cosmos-Predict2-2B-Text2ImageUnironically DOA
>>108023663>Built on NVIDIA CosmosDOA
Tell me about Flux Klein. I see it's 64 GB in size. Does it run on 24GB cards? does it run well? is it worth it?
>>1080236631girl standing holding a sign benchmark SOTA
What's the latest snake oil?
>Tag order>[quality/meta/year/safety tags] [1girl/1boy/1other etc] [character] [series] [artist] [general tags]lmfao
>>108023663i don't mind anime models but how is it SOTA?
Is there an alternative to WanAnimate?
>>108023663I saw Comfy add the support a while ago, wondered what it was. Seems like it has to be fast at 2B + 0.6B encoder. Gonna give it a shot now.
>>108023678it's not 64GB, wtf do you mean. The 9B base and distilled Klein are like 18GB, the 4GB base and distilled Klein are like 7GB.
>>108023671It must be an architectural mod though, the original Cosmos 2 used T5-XXL for the TE
>>108022550Mod is eeez rawleee
>>108023717In short:Architecture modifications allow very good training efficiency, so it is trained on a gorillion epochs. It rivals the best SDXL tunes in terms of breadth of knowledge, while being a DiT-based model that also supports natural language. It particularly excels at more niche concepts and complex prompts.I would encourage you to just try the model yourself. It's small, natively supported in comfy, and runs on a potato. I could cherrypick images that make it look good but you can do that with any model.
>>108023761>natively supported in comfyI want it in neoforge and anistudio instead. comfy is malware
>>108023754interesting. I knew there was subversion going on but i didn't think it was this embedded
So is there a simple client thats similar to lmstudio for image generation where I can upload an image and ask a model to generate an image output? "Remove watermark" etc?
>>108023761If you have it running, give a few examples of more complex prompts, like multiple characters interacting, etc..
>>108023747I see one of the models - flux 2 dev - is 64 GB. Naturally, I feel this is probably the superior model. Bigger is better right?But I guess you have a point. I don't have 64GB vram so I'll have to do with one of these other models. This is why I ask.So I'm guessing I probably want to go for 9B right?
>>108023761just feels like nothing is using the optimizations yet. like sure, use a small dataset first to test it out but then you're supposed to use a giant dataset after but it seems like we are just getting the small dataset prototypes
>>108023769killed the>it's an anime websitelie foreverstolen fromMini Modu@MinModulation
>>108023761>>108023663Post examples pleaseWhoever posted the repo with one image is a fucking retard
>>108023799too hairy
>>108023747Sorry, I guess I'm being retarded dev is obviously not klein
>>108023796ok, some random prompts from the validation set
>>108023768surely the developer of anistudio is not a nocoder and is able to implement it without relying on a 3rd party library... oh wait
>>108023799this poster is a pedophile btw
>>108023663This model is horny as fuck, can't post images here.
just realized this significantly improves FPS when moving around in comfyits a bit ugly though
>>108023690Regardless of your initial model selection, do a second pass with Klein image edit to fix possible issues etc.
>>108023844
>>108023856
>>108023854So klein is good at picking up styles?
>>108023663that example image on the page REEKS of qwen image, dont tell me this is a distill of it
>>108023867
>>108023870I wish. I'd still be using the edit model if it did.
>>108023876
>>108023883
what is this dogshit model? why is it fucking up in every example image?
>>108023894not perfect yet at a bunch of characters at once but it's getting there
>>108023788yeah Flux.2 Dev is their mainline big chungus open weights model and uses a big Mistral TE, the Kleins are unrelated and use smaller Qwen TEs
>>108023877is it at least fast?
>>108023871it uses Qwen VAE and Qwen 3 0.6B TE, but it appears to be an arch mod of Cosmos 2 overall
>>108023844>>108023883>>108023905These are pretty good actually.
>>108023877i've had good results doing style transfer with Klein
>>108023912Conclude your own tests. don't believe the schizo who replies to other people's posts.
>>108023916yeah it looks at least as good as what I can get out of NetaYume V4.0, gonna try it now
>>108023914what im saying is was this trained on booru or on synthetic garbage
>>108023912Yes.>>108023920We have different standards for "good".
>>108023927it was trained on 5 million booru images and 800k non-anime artistic images
Seems good, looking forward for the final versionhttps://files.catbox.moe/yfbt5c.jpg
>>108023940weirdo
does zim/zit have problems with belly buttons? they often look deformed
>>108023939ok good i will try it
>>108023940>catbox down for mefuck this gayass earth
>>108023922you're right. there are a lot of people with questionable motives itt
>>108023949For me too, it didn't properly uploaded the first time I tried, now it won't load the image for me either.
>>108023949it always does this shit. it's garbage
>>108023960>it always does this shitit doesn't tho
>>108023969it just did thougheverbeit
>>108023956try not uploading gcn next time