Discussion of Free and Open Source Diffusion ModelsPrev: >>107875932https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>107877185>even little details which is impressive (flux 2 klein 9b distilled)yeah, for the details and slop I'd say it's between Z-image turbo and Qwen Image edit 25/12, it's not the best, but it's good enough to not be bothered by it
>>107877194Tanks 4 bake
https://www.youtube.com/watch?v=2OrOufa3eocYou know klein is a big deal when a channel with 600k subscribers is talking about it lol
so..will chroma klein happen?
Wake me when it has more loras than ZiT.
>>107877251workin on it
>>107877261vram status?
so multi image, do you reference the nodes as image 1 or image 2? for klein edit
>>107877194nice collage
replace the face of the girl in image 1 with the face of the girl in image 2. change the boots of the girl in image 1 to the boots of the girl in image 2.nikke anis is now teto:
>>10787726520/24GB so far so we're not dead. gotta do bf16 it looks like
>>107877266yes >>107876256
>>107877281>20/24GB so far so we're not dead.its over
@redditors do NOT steal images from here without properly crediting or you will face jail time>- anonymous hacker 4chan
The legend of Migu, Ocarina of time
>>107877271Nice image
remove the long grey hair of the girl in image 1. replace the face of the girl in image 1 with the face of the girl in image 1.
>>107877293The legend of Costanza lul
the girl in image 1 is sitting beside the girl in image 2 on a bench.
>>107877315did you prompt low poly? thats really good kek
>>107877324yeah I had to push it a bit, or else it would make it too realistic>Replace the character from image 1 by the character of image 2 while keeping the same low poly 3d artistic style of image 1
>>107877266>nodeswhat?
replace the man in the middle in image 1 with a small pixel art version of the girl in image 2.
lol chinese trolls are legit trying to slide klien, not even a joke. One of them posted a "body horror" pic on twitter saying it was from it and people are saying it has body horror without proof which is easily disproven. https://www.reddit.com/r/StableDiffusion/comments/1qe76fc/comment/nzvqh0z/
>>107877339good luck finishing that game with such a big hitbox kek
>>107877343bro her right hand...
remake of one of my old Flux Krea gens
>zit and flux 2 dev came out>zit better!>klein came out>klein better!wat
>>107877363those fuckers managed to make klein better than flux 2 dev, competition is good, competition is healthy, it forces companies to work harder
>>107877371>klein better than dev?
change the clothes of the anime girl in image 1 to the clothes of the anime girl in image 2, with the same black panties.teto + fubuki:
>>107877381Ikr
>>107877385nice
>>107877385face swap on a BFL model? the fuck is going on
I need a klein workflow I lagged behind, too many snippets on what to run it with
>>107877301
>>107877397they were so desperate of gaining relevancy again they decided to stop cucking their model, just imagine that lol
>>107877404is that the klein equivalent of "make it realistic"? nice
Upscaling is pretty good too
5Head
>>107877388wdym?klein shills are delusional because it's inferior to dev and thus inferior to zit. I hope you agree with that
the man in image 1 is holding a magazine with a picture of the girl in image 2 on the cover. the title of the magazine is "TETO". keep the appearance of the man in image 1 the same.
>>107877366Bacon? OwO
>>107877424Dev isn't worse than ZiT lmao, it's just chungus so people can't run it
>>107877362>>107877409Does it know artists? Pic rel "Hatsune Miku as surreal screaming cube of flesh by Francis Bacon">>107877428damn right
>>107877408Yes. From what I can tell so far, it tends to keep proportions better than Qwen Image Edit 2511 with A2R LoRA but Flux slops faces and skin often.
>>107877361NTA but here's a better one with Klein
>>107877427make the image in the style of an 8-bit nintendo game.neat
>>107877442nice, I'll steal it. I cba myself
>>107877444the man appears as a low polygon model:
>>107877438Is that your own lora? Don't wanna check Civitai because that site is cancer srry.
Finally an edit model that's fast AND decent. Eat shit Z image.
>>107877436fact, and the bigger the model is, the better it is, that's why HunyuanImage 3.0 is the best model, because it's the biggest, that's it
"the woman is looking at the camera and keeps her expression when the camera zooms out to reveal the woman wearing a bikini bra and panties on a brown leather sofa and she is holding a plain white box with the text "base" on it in her lap as an happy obese caucasian nerd with ragged facial hair receding hairline and greasy skin with nerdy clothing sits down in the sofa next to the woman. the woman looks at the man with disgust and quickly runs away out of frame escaping in fear as the man sighs in despair and is sad."Wan 2.2 left, ltx2 right.Kek. Also, why is wan 2.2 so good at male nipples..?
>>107877438>Change the person to Hatsune Miku. Change the image to a painting by Francis Bacon
>>107877449steal it for what lol? I just replied to someone in the Reddit thread with it also, not the same person who made the original grass pic but someone else.
>>107877399go for that onehttps://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
13s for a gen 9b distill, even faster than qwen edit + 4 step lightning lora.
>>107877466last time someone posted this image I did a comparison that showed both of those outputs are weird, because in mine the Zit guys were NOT all asian (no matter how many times I ran it) and the composition was far more similar between the two models.
>>107877436cope
>>107877483(samefag) also it's the most chinkmaxxed engrish prompt too lmao, I just had to point that out
finally the future is here
unusable at Q8? Can I not swap in the gguf in place of the fp8 safetensor from the default template?
change the face of the man in image 1 to the face of the man in image 2.so bfl learned to uncuck their models huh?
>>107877489cope with what, that an obvious Giga ESL did a shitty comparison with a shitty prompt and results that I couldn't really reproduce on either Z Image or Flux 2 Dev even when I copied their broken grammar verbatim?
you're not allowed to finetune 9b so what's the point
>>107877501are you saying this is FP8 versus GGUF Q8 with no other changes? that's weird if so
swap the face of the man in image 1 with the face of the man in image 2.prompt still works
>>107877514Post passport, chang.
>>107877458Yeah, uploading it soon>>107877473Not even close. I hope it's has a good base/support for lora.
replace the face of the man in image 1 with the face of the man in image 2.forsenSmug + sam altman
>>107877514yeah you are, you just can't sell it or host it as a SAAS unless you pay them
Sam Asmongold:so clearly there is no "no face swaps allowed" bs in the model
>>107877542kek
>>107877353how did you make your image?
>>107877501>unusable at Q8?wait, the image on the right is with klein at Q8? dude that sucks
kekreplace the man with glasses in image 1 with the asian man on the left in image 2 who is wearing a business suit.
>>107877556replace the dog with this
>>107877542>clearly there is no "no face swaps allowed" bs in the modelso far I haven't reached any moment where the model decided to not do anything, like on Kontext dev and its censorship layers
>>107877477>https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.jsonhere's how it looks like
>>107877554yeah it shouldn't be worse than FP8, that makes no sense
Why is infinitetalk changing anything but the face..?
>>107877560replace the dog on the left in image 1 with the dog skeleton in image 2. Change the blue neon text saying "HASAN" to "SHOCK". The man wearing glasses is pointing in the air with one finger, which is emitting electricity.kek, what a model, I need to get the point to be just right but still.
should I use nvfp4 or q4?
>>107877470I think you can wrangle sequential actions in ltx if you spell it out like you're explaining to a toddler, and use the word 'then' a lot.>then the man sits down>then the woman turns to look right>then the womans expression changes to disgust>then the woman stands upor so
>>107877579if 5000 series nvfp4 is 4x faster
>>107877578also note the sign text swap, that's flawless and better than qwen edit did it.here we go:
>remove the yellow filter. Make the colors normal looking.
>>107877571I gave up because I cant ever get ' model-00001-of-00005.safetensors' to work for qwen 3 8b
>>107877590kek, poor kaya
>>107877584I think it's funny how it included the little chub on his lower stomach
>>107877547Was Flux2 Klein 9B image edit of an Illustrious image with prompt as "Change image 1 to a photorealistic style." I posted a different one a couple of days ago using that one Qwen Edit LoRA you linked so thanks for that. Klein slops faces and skin 80% of the time though so might be better once someone trains a LoRA.
>>107877618you have to download this, never go for split safetensorshttps://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
the man with glasses is holding up a magazine titled "new ways to abuse your dog", with a picture of the dog on the left below the title. the man is grinning.
>cue the floyd/hasan/miku spam
>The current topic test anon is testing the same 4 or 5 subjects for the next three days again.Ugh. I don't hate that you test things. I hate that you use the same things over and over again.
>>107877643I have to compare. No worries I will test diff stuff. but a -> b testing initially cause im used to qwen
we need a "vae-less" edit modelcan't deal with the shift
Question for all non-Flux fanboys who tried klein:Is it worth downloading or is klein cope at best?
>>107877657Just anything but Hasan, CIA agent, Ryan Gosling, or Miku.
>>107877675It's good.
>>107877675It's a really good edit model, much better than qwen but not better than Z in T2I so yeah definitely get it. Year of the small models baby fuck bloat
>>107877675for image alone it's worse than Z-image turbo, but as an edit model it's pretty great, probably the best thing we got locally
>>107877675it can edit stuff good and quick. You need it.
>>107877689>not better than Z in T2I>>107877506>cope with whatI'm confused
"the subject is sitting at an outdoor patio, sipping a coffee.">>107877554>>107877518>>107877501i figured it out; i'm retarded and was using a base gguf in a distill workflow
>>107877675I'm just glad we got an edit model that can actually compare to Nano Banana
>>107877721So much so I think Z, especially base might be better with artist.
fuck I'm already annoyed by the blur, and there's no NAG to save the day (yet)
>>107877715there's no gains in using goofs when they're created from the fp8, what is this retardness?
>>107877571thanks for your help, Im glad, I think I quit and dont have to be here anymore, was trying a simple head swap test
>>107877748>there's no gains in using goofs when they're created from the fp8what makes you believe they're created from fp8? the bf16 file exists they made their gguf from thathttps://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>107877715where are the ggufs for klein? wanna try q8 out of curiosity
>>107877750show a screen of your workflow, you probably messed something up, and did you update comfyui?
>>107877755I mean for the unet retard
>>107877757write "flux klein gguf" on huggingface? >>107877763this is the unet brown subhuman
>>107877761Thank you lol, I will migrate out of comfy desktop and retry the clone strategy, gonna take a wide break anyway.
>>107877763are you retarded or something? they gave us the unet on bf16, it never happened that a company gave us unet at fp8, are you fucking retarded or what?https://huggingface.co/black-forest-labs/FLUX.2-klein-9B/blob/main/flux-2-klein-9b.safetensors
How do I load a video here to use as a mask? And why does loading this mask as an image make me OOM when I barely use a third of my vram without it?
fuck tongyifuck alibabafuck tencentfuck chinafuck lodestonesall hail bfl
replace the clothes of the anime girl a white crop top, blue jeans, and white sneakers. keep her head unchanged.
>>107877782>fuck tongyi>fuck alibaba>fuck tencent>fuck china>fuck lodestonesAlibaba and Tongyi are basically the same thing at this point. And especially fuck lodestones.
>>107877770>>107877778im fucktarted, somehow thought that BFL only released the fp8
>>107877793>Alibaba and Tongyi are basically the same thingwell yeah since Tongyi belongs to Alibaba lol
but is it better than kontext?
>>107877800definitely better than Kontext
>>107877800Unfortunately no..too bad
>>107877793tongyi is zimagealibaba is tongyi and qwen and whatever (including wan 2.5, never forget)
replace the clothes of the anime girl with a white bikini, with the text "2" on one breast and "B" on the other breast. keep her head unchanged.