Discussion of Free and Open Source Diffusion ModelsPrev: >>107877194https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Blessed thread of friendship
fun stuff, even the 4 step 9b distill model works well for klein. I still like qwen edit 2511 but now I have to see which does what better. two edits, worked fine.also you can do face swaps now, bfl learned a lesson I guess.
>2025: flux 2 dev is bad and a skipped model, bfl is so done>2026: flux 2 klein is great, china is so doneexplain
In some aspects Klein is better than zimage but worse in others. Still a surprisingly good model regardless.
>>107878590Also nobody's to say, since the audio can also vary in quality, that there aren't simply seed based failgens, but obviously gens superior to Suno v4.5 and Udio if you simply vary the seed or improve the settings. That's the advantage of local after all, and the devs are running experiments to find best settings for everything, so once we do get ACEStep 1.5, it will be the real deal.
>>107878598flux 2 was great, it was just too big for 99% of people so they all cried sour grapes.
>>107878598all part of the plan
>>107878607>flux 2 was greatprove it with some images
>update comfyui>gen previews broken againneat
>>107878598>explainlooks like BFL has an ego and finally decided to work harder to please us and shut some mouths, I like it :^)
>>107878616plus they released a base model, which they've never done before (?)
>>107878615How do you fix this?
>>107878598don't worry, Alibaba will come back to put others company back to their place, which is not first>>107878615>>107878638add the --preview-method taesd flag
>>107878598Chinks released ZIT and BTFO'd Flux2 bloat, leading their hand forcing them to release decent small models in retaliation, demonstrating that competition is good for innovation and the end consumer
>>107878598it was a trap for chinese so they release prematurely and lose reputation>>107878424big boobs saar
>>107878646>competition is good for innovation and the end consumerthis, you only give your best if you have a worthy rival in front of you
>>107878645
>>107878646yeahif they don't release z image base then it'll be flooded with klein loras and it'll be too late
>>107878607>muh sour grapes Models of this size are always dead on arrival because no one will finetune them
nice edit, transposed my wf to klein, was painless
>when you run out of fentI have to say, pretty clean inpainting for klein. but now we have a template.
comfy bros, we wonned!!!
>>107878746vl_megapixels = 1it should've been at 0, a value > 0 is only for QiE
>>107878746how do you use q8 with the base klein workflow? default only allows non gguf .safetensors files
>>107878726the anime girl in image 2 is stepping on the black man lying on the floor with her right boot in image 1. keep the appearance of the black man on the ground the same.
>>107878753meh, does it really matter?
>>107878771you use the gguf loaderhttps://github.com/city96/ComfyUI-GGUF
is this 'put her in a bikini' at home?we've probably had this for a while, but kontext sucks and I never got qwedit to work, so this is all new to me
>>107878781cant link it with this default one, might need a new workflow for q8 I guess
does hires fix work for qwen image on forge neo? i keep getting this error TypeError: Cannot handle this data type: (1, 1, 1, 896), |u1Cannot handle this data type: (1, 1, 1, 896), |u1
>>107878800tbf qwen can also do it
>>107878777the anime girl in image 2 is stepping on the black man lying on the floor with her left boot in image 1, and is holding a green leek vegetable with her right hand. keep the appearance of the black man on the ground the same.kino
>>107878803https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.jsonhere's a workflow without the Subgraph AIDS, replace the default loader by the gguf loader
>>107878820thanks, and yeah I hate the subgraph stuff i'd rather have it all in the open not nested.
any lora trainer supports klein yet? i wanna try training some shit
Klein bros, wtf is this?
>>107878841weird error, did you update comfyui?
same input image for caption, but diffused in kleinlmao>>107878280
>>107878820anon i love you so much thank you
>>107878856when asked to make the source image photorealisticlole. qwen is still a bit uncanny but not this bad looking from what I recall
>>107878868you're welcome :3
>>107878810>light modehow do you even see this shit, it looks like a flashbang in the thumbnail
Decided to try that music model someone linked in the last threadHere's the output using the example lyrics and tags after the first test run.https://voca.ro/1hMV9WTu7RY7https://github.com/HeartMuLa/heartlib?tab=readme-ov-fileGonna try some more whimsical shit next.
>>107878889yo that's kind of a bopi have shelf speakers hooked to my pc and the base at the beginning was crazy
>>107878886lmao'd
>>107878889It supports Japanese! Nice! Thanks for the link.
>>107878875uwu :3333
https://voca.ro/1b34wdfSy0wWWas supposed to be upbeat. But sounds sad like the last one. Hmm
>>107878871original bikinize'd. im off to lunch now, all in all, good model.
small price to pay for not using rife interp and sneedvr2
>>107878915lmao god that's depressing.
Can you use Flux2Klein on Forge Classic Neo?
https://xcancel.com/bdsqlsz/status/2012047511566107012#mOh NOW they're about to release it, WHAT A COINCIDENCE
>>107878849I did from the manager but turns out I needed to do this from the update folder.Now I'm getting something different.
>>107878889>>107878915jesas that's pretty good, it kinda falls apart in some places but stillwe're really approaching the age of dead internet theory at an insane pace...
>>107878956based Hans forcing Chang's hand
>>107878956Based actually, BFL can pound sand I don't care how good their models are. I just hope Z-Image-(Omni-)Base meets expectation and gets people excited to finetune.
>>107878966turn off animated previews
>>107878956disappeared :smiley_face: haha
>>107878889>>107878915What vram/ram requirements?
1 day worth of training ltxv, its gonna be crazy:https://files.catbox.moe/iibwa3.mp4https://files.catbox.moe/ykmdq3.mp4https://files.catbox.moe/0zm6rf.mp4https://files.catbox.moe/qcpd33.mp4https://files.catbox.moe/r70qg1.mp4https://files.catbox.moe/qac25e.mp4
>>107878992>surprise!https://youtu.be/2tWHvQQMkLE?t=7
>>107878984I just set it to none but the problem remains
>>107878997Sat at around 20 on a 3090 for the 3b model but that's just their basic inference script.
>>107878998Looks like shit
>>107878956the damage has done
>>107879004are you using the right text encoder from HERE:https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>107878966your clip is probably wrong how is it lookin
>>107879012its 14000 steps of training on T2V on a model that knew nothing of nsfw. It looks incredible for how early the training it compared to wan2.2
>>107879004try disabling this too for good measure if it's onotherwise no idea
>>107879020It's furryshit. Take all the steps, doesnt matter.
>>107879029why would he need to disable previews for Klein? they work just fine for me. it looks like some issue with his text encoder
>>107879036you will be able to use it for humans as well. It also works for I2V
>>107879012It's not bad considering the time input. Not even a furfag, I just think it's an impressive proof on concept of the thing training well on something it can't do otherwise.
>>107878915That sounds ACEStep 1.0 at best.Meanwhile here is just a random ACEStep 1.5 gen I came acrosshttps://files.catbox.moe/hm2stn.mp3
>>107878956https://xcancel.com/bdsqlsz/status/2012022892461244705#m>Z-image in the final testing phase, although it's not z-video, but there will be a basic version z-tuner, contains all training codes from pretrain sft to rl and distillation.>z-videowhat is he talking about???
>>107879070I think he is just being Chinese "its not something like video to be excited about"
>>107879070>but there will be a basic version z-tuner, contains all training codes from pretrain sft to rl and distillation.this is good though cause it seems those are needed to unslop the base
>>107879039NTA, but VHS's video previews use their own hacky system and hijack image previews as well.
>>107879017>>107879018ah yeah you guys are right I was on the wrong text encoder, now it works, thanks!>>107879029Where is this?
>>107879070>>107879098this is actually really interesting, I thought they would keep the RL script to themselves (since it's the secret sauce), based, now let's see lodestone ignore all of that and go for another one of his schizo mumbo jumbo and we'll go for another 6 month cope ride until everything gets broken :(
>>107878992where is this posted?did you just lie on the internet? D:
>>107879124https://xcancel.com/ModelScope2022/status/2012055794020409361#m
>>107879116lodestone is gonna merge Klein-Base with Z-Base, remove the VAE, randomly delete half the parameters and then train the Klein-Z-Chroma-Radiance-Furry-Edition-FrankensteinV2 we've all been waiting for
>>107879065>Meanwhile here is just a random ACEStep 1.5 gen I came acrossThat's very nice but neither your nor I have ace step so kindly piss off until you do.
Also Ace step was worse than HeartMuLa. That's not to say it's great, but Ace step was pretty bad considering.
>>107879148We will soon.
>>107878998infinite daddy daughter princess blowjobs with cute squeals and gurgling bubbling noises locally by 2029 and it'll be the furries that got us there just like they did with imagegen >>107879020>>107879036don't listen to him furry anon he doesn't understand the value of technological progress and the potential to save the anuses of children and dogs all around the world as a result of the substitution effect >>107879082>>107879070No zvideo is a thing they're working on, its been mentioned before, but that doesn't mean they're releasing it, it's literally just a thing they're testing
what is used here to combine two images?
>>107879190flux 2 klein
>>107879194I think he's asking about the image concatenate node but yeah.
>>107879190>>107878820
>>107879177>zvideo is a thing they're working on, its been mentioned beforesource??>that doesn't mean they're releasing itoh maaan :(
You guys said klein was good?
>>107878950Nobody answered my question....
>>107879244its ok
>>107879247because nobody is using forge
>>107879244GROK, PUT THE GIRL IN TOP LEFT IN A BIKINI PLS
>>107879247maybe start using the big boys tool instead of retarded subhuman children toys?
>>107879247as a forgesissie you'll have to wait two more weeks for support while comfychads feast
Can't believe he never won an oscar...
Klein?More like Kucked. "replace just the face of image 1 with the face in image 2 and maintain the hair and clothing of image 1. change the woman in image1 to lift her skirt with her hands above her waist. the woman in image1 has visible panties."
it seems like bfl allowed nudity this time. It can do boobs better than z image. Same detailless crotches though
>>107879310lmao this looks like a shitty photoshop
>>107879310Flux Kek 2
ty gguf workflow anon, things seem to be working pretty well, wanted to try q8 klein edit cause q8 is closer to full quality and the model isn't large at all.
It might be fun for some memes, but ye nah, deleting.
Looks like I can run Klein 4b and 9b-fp8 on my iGPU if I make the text encoder unload once it's done. The only weird thing is after the run is finished, I get like a solid minute or more of 100% disk usage by "System" that seems longer than any disk usage during the actual gen. I can only assume a bunch of OS stuff was swapped to disk and is being loaded back into RAM, but it's still not something I've seen with other models, e.g. Z-Image. Hopefully we still get those other Z-models sometime.
>>107879310obviously it's far from perfect, now I'm waiting for Z-image edit to raise the bar even higher lul
>>107879310Skill issue. Literally for face swap the best method is a single input image, then passing in the image you want edited as a latent and lower the denoise.
>>107879337use 8 steps instead of the default 4, they are not nearly enough
>>107879337would
>>107879206>>107879194workflow?
>>107879337but it is fun, and not a large model.
>>107879337>>107879310alternatives to flux 2 klein then?
>>107879352>>107878820
>>107879357big flux 2
>>107879355go for billy mitchell and karl jobst lol
>>107879361thanks
>>107879371kek this model is good2 edits: one for the cop face, one for fent man.replace the face of the black man lying face down on the floor in image 1 with the man in image 2.
>>107879244>>107879310>>107879337china please
>>107879394lmao, nice
>>107879244cox is so hot
>>107879394man looks fucking photoshopped lmao, bad.
>>107879341Also, the seed value seems stuck for some reason. (And I'm not on Nodes 2.0.) It's not on Fixed mode, but I can't get it to change when I click Run again. Weird.
>>107879394diff billy, also a bit nicer kek
https://voca.ro/185Dz7rIIPthAn example of the music model doing Japanese. Not exactly the genre I pictured. But it mostly got the reading right.
give the asian man on the left black skin. add a Netflix logo above the text "RUSH HOUR".
>>107879425https://voca.ro/11VGjmohRUXOOh damn.