Discussion of Free and Open Source Diffusion Models

Prev: >>107875932

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>107877185
>even little details which is impressive (flux 2 klein 9b distilled)
yeah, for the details and slop I'd say it's between Z-image turbo and Qwen Image edit 25/12, it's not the best, but it's good enough to not be bothered by it
>>107877194Tanks 4 bake
https://www.youtube.com/watch?v=2OrOufa3eoc
You know klein is a big deal when a channel with 600k subscribers is talking about it lol
so..will chroma klein happen?
Wake me when it has more loras than ZiT.
>>107877251workin on it
>>107877261
vram status?
so multi image, do you reference the nodes as image 1 or image 2? for klein edit
>>107877194nice collage
replace the face of the girl in image 1 with the face of the girl in image 2. change the boots of the girl in image 1 to the boots of the girl in image 2.
nikke anis is now teto:
>>107877265
20/24GB so far so we're not dead. gotta do bf16 it looks like
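For a rough sense of where a figure like that comes from, weight memory can be estimated as parameter count times bytes per parameter. This is only a back-of-envelope sketch: the 9e9 parameter count and the bytes-per-weight values are assumptions, the 4-bit entry ignores the scale data real q4/nvfp4 files carry, and nothing here accounts for activations, the text encoder, the VAE, or training state.

```python
# Back-of-envelope weight memory. All figures are assumptions, not measurements.
BYTES_PER_PARAM = {
    "bf16": 2.0,
    "fp8": 1.0,
    "4bit": 0.5,  # nominal; real q4/nvfp4 files add per-block scale data
}


def weight_gib(n_params: float, dtype: str) -> float:
    """Return the approximate size of the weights alone, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        # 9e9 is an assumed parameter count for a "9B" model.
        print(f"{dtype}: {weight_gib(9e9, dtype):.1f} GiB")
```

Under these assumptions the bf16 weights alone already take most of a 24GB card, which fits the tight headroom reported above.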
>>107877266
yes >>107876256
>>107877281
>20/24GB so far so we're not dead.
its over
@redditors do NOT steal images from here without properly crediting or you will face jail time
>- anonymous hacker 4chan
The legend of Migu, Ocarina of time
>>107877271Nice image
remove the long grey hair of the girl in image 1. replace the face of the girl in image 1 with the face of the girl in image 1.
>>107877293The legend of Costanza lul
the girl in image 1 is sitting beside the girl in image 2 on a bench.
>>107877315did you prompt low poly? thats really good kek
>>107877324
yeah I had to push it a bit, or else it would make it too realistic
>Replace the character from image 1 by the character of image 2 while keeping the same low poly 3d artistic style of image 1
>>107877266
>nodes
what?
replace the man in the middle in image 1 with a small pixel art version of the girl in image 2.
lol chinese trolls are legit trying to slide klein, not even a joke. One of them posted a "body horror" pic on twitter saying it was from it, and people are saying it has body horror without proof, which is easily disproven.
https://www.reddit.com/r/StableDiffusion/comments/1qe76fc/comment/nzvqh0z/
>>107877339good luck finishing that game with such a big hitbox kek
>>107877343bro her right hand...
remake of one of my old Flux Krea gens
>zit and flux 2 dev came out
>zit better!
>klein came out
>klein better!
wat
>>107877363those fuckers managed to make klein better than flux 2 dev, competition is good, competition is healthy, it forces companies to work harder
>>107877371>klein better than dev?
change the clothes of the anime girl in image 1 to the clothes of the anime girl in image 2, with the same black panties.
teto + fubuki:
>>107877381Ikr
>>107877385nice
>>107877385face swap on a BFL model? the fuck is going on
I need a klein workflow I lagged behind, too many snippets on what to run it with
>>107877301
>>107877397
they were so desperate to regain relevance that they decided to stop cucking their model, just imagine that lol
>>107877404is that the klein equivalent of "make it realistic"? nice
Upscaling is pretty good too
5Head
>>107877388
wdym?
klein shills are delusional because it's inferior to dev and thus inferior to zit. I hope you agree with that
the man in image 1 is holding a magazine with a picture of the girl in image 2 on the cover. the title of the magazine is "TETO". keep the appearance of the man in image 1 the same.
>>107877366Bacon? OwO
>>107877424
Dev isn't worse than ZiT lmao, it's just chungus so people can't run it
>>107877362
>>107877409
Does it know artists? Pic rel "Hatsune Miku as surreal screaming cube of flesh by Francis Bacon"
>>107877428
damn right
>>107877408
Yes. From what I can tell so far, it tends to keep proportions better than Qwen Image Edit 2511 with A2R LoRA but Flux slops faces and skin often.
>>107877361NTA but here's a better one with Klein
>>107877427
make the image in the style of an 8-bit nintendo game.
neat
>>107877442nice, I'll steal it. I cba myself
>>107877444the man appears as a low polygon model:
>>107877438Is that your own lora? Don't wanna check Civitai because that site is cancer srry.
Finally an edit model that's fast AND decent. Eat shit Z image.
>>107877436fact, and the bigger the model is, the better it is, that's why HunyuanImage 3.0 is the best model, because it's the biggest, that's it
"the woman is looking at the camera and keeps her expression when the camera zooms out to reveal the woman wearing a bikini bra and panties on a brown leather sofa and she is holding a plain white box with the text "base" on it in her lap as an happy obese caucasian nerd with ragged facial hair receding hairline and greasy skin with nerdy clothing sits down in the sofa next to the woman. the woman looks at the man with disgust and quickly runs away out of frame escaping in fear as the man sighs in despair and is sad."Wan 2.2 left, ltx2 right.Kek. Also, why is wan 2.2 so good at male nipples..?
>>107877438
>Change the person to Hatsune Miku. Change the image to a painting by Francis Bacon
>>107877449steal it for what lol? I just replied to someone in the Reddit thread with it also, not the same person who made the original grass pic but someone else.
>>107877399
go for that one
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
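Before loading a shared workflow, it can help to list which node types it uses so missing custom nodes are obvious up front. A minimal sketch, assuming the two common ComfyUI JSON layouts (a UI export with a top-level "nodes" list whose entries carry a "type", and an API export keyed by node id with a "class_type"). The helper names are hypothetical and the linked file's actual contents are not inspected here.

```python
import json
from collections import Counter


def node_types(workflow: dict) -> Counter:
    """Count node types in a workflow dict (UI export or API export)."""
    if isinstance(workflow.get("nodes"), list):
        # Assumed UI export layout: {"nodes": [{"type": ...}, ...]}
        return Counter(n.get("type", "?") for n in workflow["nodes"])
    # Assumed API export layout: {"<id>": {"class_type": ...}, ...}
    return Counter(
        v["class_type"]
        for v in workflow.values()
        if isinstance(v, dict) and "class_type" in v
    )


def load_workflow(path: str) -> dict:
    """Read a workflow JSON file from disk."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)
```

Usage would be something like `node_types(load_workflow("workflow_Flux2_Klein_9b.json"))` on a locally saved copy, then comparing the names against the nodes already installed.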
13s for a gen 9b distill, even faster than qwen edit + 4 step lightning lora.
>>107877466last time someone posted this image I did a comparison that showed both of those outputs are weird, because in mine the Zit guys were NOT all asian (no matter how many times I ran it) and the composition was far more similar between the two models.
>>107877436cope
>>107877483(samefag) also it's the most chinkmaxxed engrish prompt too lmao, I just had to point that out
finally the future is here
unusable at Q8? Can I not swap in the gguf in place of the fp8 safetensor from the default template?
change the face of the man in image 1 to the face of the man in image 2.
so bfl learned to uncuck their models huh?
>>107877489cope with what, that an obvious Giga ESL did a shitty comparison with a shitty prompt and results that I couldn't really reproduce on either Z Image or Flux 2 Dev even when I copied their broken grammar verbatim?
you're not allowed to finetune 9b so what's the point
>>107877501
are you saying this is FP8 versus GGUF Q8 with no other changes? that's weird if so
swap the face of the man in image 1 with the face of the man in image 2.
prompt still works
>>107877514Post passport, chang.
>>107877458
Yeah, uploading it soon
>>107877473
Not even close. I hope it has a good base/support for lora.
replace the face of the man in image 1 with the face of the man in image 2.
forsenSmug + sam altman
>>107877514
yeah you are, you just can't sell it or host it as a SAAS unless you pay them
Sam Asmongold:
so clearly there is no "no face swaps allowed" bs in the model
>>107877542kek
>>107877353how did you make your image?
>>107877501
>unusable at Q8?
wait, the image on the right is with klein at Q8? dude that sucks
kek
replace the man with glasses in image 1 with the asian man on the left in image 2 who is wearing a business suit.
>>107877556replace the dog with this
>>107877542
>clearly there is no "no face swaps allowed" bs in the model
so far I haven't reached any moment where the model decided to not do anything, like on Kontext dev and its censorship layers
>>107877477
>https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
here's what it looks like
>>107877554
yeah it shouldn't be worse than FP8, that makes no sense
Why is infinitetalk changing anything but the face..?
>>107877560
replace the dog on the left in image 1 with the dog skeleton in image 2. Change the blue neon text saying "HASAN" to "SHOCK". The man wearing glasses is pointing in the air with one finger, which is emitting electricity.
kek, what a model, I need to get the point to be just right but still.
should I use nvfp4 or q4?
>>107877470
I think you can wrangle sequential actions in ltx if you spell it out like you're explaining to a toddler, and use the word 'then' a lot.
>then the man sits down
>then the woman turns to look right
>then the woman's expression changes to disgust
>then the woman stands up
or so
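The "then" pattern above is easy to template. A minimal sketch of a helper that joins a list of actions into a single prompt with each step prefixed by "then"; the function name is hypothetical, and whether LTX actually follows prompts built this way is the poster's suggestion, not something verified here.

```python
def sequential_prompt(opening: str, actions: list[str]) -> str:
    """Join actions into one prompt, prefixing every step with 'then'."""
    steps = [
        f"then {a.strip().rstrip('.')}."  # normalise trailing periods
        for a in actions
        if a.strip()  # skip empty entries
    ]
    return " ".join([opening.strip()] + steps)


if __name__ == "__main__":
    print(
        sequential_prompt(
            "a man and a woman are on a sofa.",
            [
                "the man sits down",
                "the woman turns to look right",
                "the woman's expression changes to disgust",
                "the woman stands up",
            ],
        )
    )
```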
>>107877579
if 5000 series, nvfp4 is 4x faster
>>107877578
also note the sign text swap, that's flawless and better than qwen edit did it.
here we go:
>remove the yellow filter. Make the colors normal looking.
>>107877571
I gave up because I can't ever get 'model-00001-of-00005.safetensors' to work for qwen 3 8b
>>107877590kek, poor kaya
>>107877584I think it's funny how it included the little chub on his lower stomach
>>107877547
Was Flux2 Klein 9B image edit of an Illustrious image with prompt as "Change image 1 to a photorealistic style." I posted a different one a couple of days ago using that one Qwen Edit LoRA you linked so thanks for that. Klein slops faces and skin 80% of the time though so might be better once someone trains a LoRA.
>>107877618
you have to download this, never go for split safetensors
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
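Split checkpoints follow the familiar `name-00001-of-00005.safetensors` naming, so they can be spotted by filename before wiring one into a single-file loader. A minimal sketch using only the standard library; the helper name and the single-file example filename are hypothetical.

```python
import re

# Matches shard names such as "model-00001-of-00005.safetensors".
SHARD_RE = re.compile(r"^.+-(\d{5})-of-(\d{5})\.safetensors$")


def is_sharded(filename: str) -> bool:
    """Return True if the filename looks like one shard of a split checkpoint."""
    return SHARD_RE.match(filename) is not None


if __name__ == "__main__":
    # The second filename is a made-up example of a single-file encoder.
    for name in ("model-00001-of-00005.safetensors", "text_encoder_single.safetensors"):
        print(name, "->", "sharded" if is_sharded(name) else "single file")
```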
the man with glasses is holding up a magazine titled "new ways to abuse your dog", with a picture of the dog on the left below the title. the man is grinning.
>cue the floyd/hasan/miku spam
>The current topic test anon is testing the same 4 or 5 subjects for the next three days again.Ugh. I don't hate that you test things. I hate that you use the same things over and over again.
>>107877643
I have to compare. No worries I will test diff stuff. but a -> b testing initially cause im used to qwen
we need a "vae-less" edit model
can't deal with the shift
Question for all non-Flux fanboys who tried klein:
Is it worth downloading or is klein cope at best?
>>107877657Just anything but Hasan, CIA agent, Ryan Gosling, or Miku.
>>107877675It's good.
>>107877675
It's a really good edit model, much better than qwen but not better than Z in T2I so yeah definitely get it. Year of the small models baby fuck bloat
>>107877675
for image alone it's worse than Z-image turbo, but as an edit model it's pretty great, probably the best thing we got locally
>>107877675
it can edit stuff good and quick. You need it.
>>107877689
>not better than Z in T2I
>>107877506
>cope with what
I'm confused
"the subject is sitting at an outdoor patio, sipping a coffee.">>107877554>>107877518>>107877501i figured it out; i'm retarded and was using a base gguf in a distill workflow
>>107877675I'm just glad we got an edit model that can actually compare to Nano Banana
>>107877721So much so I think Z, especially base might be better with artist.
fuck I'm already annoyed by the blur, and there's no NAG to save the day (yet)
>>107877715there's no gains in using goofs when they're created from the fp8, what is this retardness?
>>107877571thanks for your help, Im glad, I think I quit and dont have to be here anymore, was trying a simple head swap test
>>107877748
>there's no gains in using goofs when they're created from the fp8
what makes you believe they're created from fp8? the bf16 file exists, they made their gguf from that
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>107877715
where are the ggufs for klein? wanna try q8 out of curiosity
>>107877750
show a screen of your workflow, you probably messed something up, and did you update comfyui?
>>107877755I mean for the unet retard
>>107877757write "flux klein gguf" on huggingface? >>107877763this is the unet brown subhuman
>>107877761Thank you lol, I will migrate out of comfy desktop and retry the clone strategy, gonna take a wide break anyway.
>>107877763are you retarded or something? they gave us the unet on bf16, it never happened that a company gave us unet at fp8, are you fucking retarded or what?https://huggingface.co/black-forest-labs/FLUX.2-klein-9B/blob/main/flux-2-klein-9b.safetensors
How do I load a video here to use as a mask? And why does loading this mask as an image make me OOM when I barely use a third of my vram without it?
fuck tongyi
fuck alibaba
fuck tencent
fuck china
fuck lodestones
all hail bfl
replace the clothes of the anime girl a white crop top, blue jeans, and white sneakers. keep her head unchanged.
>>107877782
>fuck tongyi
>fuck alibaba
>fuck tencent
>fuck china
>fuck lodestones
Alibaba and Tongyi are basically the same thing at this point. And especially fuck lodestones.
>>107877770>>107877778im fucktarted, somehow thought that BFL only released the fp8
>>107877793
>Alibaba and Tongyi are basically the same thing
well yeah since Tongyi belongs to Alibaba lol
but is it better than kontext?
>>107877800
definitely better than Kontext
>>107877800
Unfortunately no.. too bad
>>107877793
tongyi is zimage
alibaba is tongyi and qwen and whatever (including wan 2.5, never forget)
replace the clothes of the anime girl with a white bikini, with the text "2" on one breast and "B" on the other breast. keep her head unchanged.
>>107877689I'm very tired of zit look, so better than zit in t2i, too. Until I get tired of klein's own look. Damn that easily trainable human perception.
>>107877812Please try large mesh fishnet thighhighs.
>>107877833a fine choice, works
Cleans images nicely (removed the two rectangles on the left), the whole 'resizes your image slightly lmao' thing is annoying though
>>107877825
It's not really the look that's the problem, you will get wonky mutant anatomy and shitty hands every now and then (it's not a huge issue though imo). My guess is bfl finally realized and tried to unfuck the censored dataset they used for flux 2, but it wasn't enough.
Should I get the base or distilled version of klein?
>>107877242yes, and it will be a melty undertrained mess just like the rest of his failbakes as he splits his attention between 10 different projects and doesn't give a single one enough time to bake.
>Grok is completely cucked
Help me set up a local gen anons. It's so fucking complicated
>>107877870
distilled, base isn't meant to make images >>107876098
>>107877870
I found base did a better job of preserving likeness in edits, but distilled was easier to prompt for raw image outputs.
>>107877875just watch a youtube tutorial bro
the anime girl in image 1 is using the pose of the girl in image 2.
neat
is today the day I finally make a huggingface account
>>107877889Needs a second pass to fix hands, though.
>>107877875Download A1111 and SD 1.5 to get started.
>>107877873I would much rather nutbutter Klein, in fact. No architectural changes, just a finetune on a sane dataset.
Would flux.klein be an acceptable basis for illustration/noob?
>>107877896Why?
>>107877851
Thanks. Does it support image2 and image3 like qwen edit?
Good at colourising, doesn't kill any details
>>107877932
yes >>107877285
>>107877934
Her uniform changed color.
>>107877925
Lodestone already blamed the license and will finetune the much worse 4B model, and I assume so will the other finetuners, so if you want a shitty lumina level update to it yeah I guess...
>>107877941yeah so did the girl's hair. I'll try again with the base model
Kleiniggas, using the edit, are you able to fix mismatched lighting between subject & bg?
the girl is wearing a business suit and top hat.
>>107877959
show a screen of your workflow, we can't really help you if we don't see anything
>>107877960make a pixel art sprite of the character.
>>107877963
Nah i mean whether the new model is able to fix lighting in older pics. Sometimes 1girl has obvious studio lighting on the subject while the background looks meh, I was wondering if Klein could fix it
>>107877941
>>107877958
damn, that's way better
>>107877875Ask Grok
>>107877959
I don't know about klein but dev can fix light and shadow. just tell it to make the subject and background have coherent lighting
make a pixel art sprite of the characters.
ff13-2, neat
>>107877987
>ff13-2
such an underrated ff, and its soundtrack is absolutely amazing
https://www.youtube.com/watch?v=zRYgA3dNwCE
>>107877952
He should simply train 9b for himself and store it on a server with the admin password: 123456.
The 9b model and its derivatives may not be used:
“To create non-consensual intimate images or illegal pornographic content.”
Illegal pornography is prohibited, so legal pornography is okay?
>>107877963I'm really shocked how well this shit works lmao
>>107877981
damn, so we should go for base for edits?
the anime girl in image 2 is standing behind the blue hair anime girl in image 1.
also got a neat upscale on the original image
please care about z image
>>107877888
Link ?
>>107877902
Even i know that's severely outdated anon. I'm pretty sure the current gen is capable of doing like "Grok, make her wear a bikini" now
>>107877984
Lol Grok is cucked anon, you can't edit anything that has exposed skin now, both Anime and Real
>>107878031this time with one less hand
>>107878042
>Im pretty sure the current gen capable of doing like "Grok, make her wear a bikini" now
you appeared at the right time because the best edit local model got released today and it's called Flux 2 Klein >>107878041
>>107878041
prompt? are you saying to keep it in the style of the other image or no?
>https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite/issues/610>this still hasnt been fixedkosinfag fucking FIX your shit I want my high quality wan previous you piece of SHIT
>>107878050Link ?
>>107877194DELETE THAT OFF TOPIC SHIT FROM THE OP ALREADY FFS
haha, this is a pretty good one. misato but teto:
replace the girl in image1 with the girl in image2, wearing the same clothes.
>>107878053
>prompt?
>Replace the man on the right by Hatsune Miku, replace "Yes, I'm ready to go" by "Miku? What are you doing here?"
>are you saying to keep it in the style of the other image or no?
it depends, sometimes you have to help it a bit, for example for that one >>107878011 I went for this
>Add to the right side of image 1 a 3d render of the female character from image 2, she is facing left
>>107878054Sounds like something somebody needs to vibecode and fix
>>107878058
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
>>107878076
somebody gotta DO SOMETHING
and that one
is not me
REEEEEEEE
Plugging in a mask into infinitetalk makes me oom on a fucking 5090 with full offload of the models unless I go down to 480p.
Then the mask doesn't even fucking work, it's still changing the gun entirely. The mask is just the face as a video.
Is there anything else that lipsyncs as good as infinitetalk and is capable of masking?
Teto? pff, for me, it's Rin.
>>107878091
>somebody gotta DO SOMETHING
ahah it do be like that mr stancil
https://youtu.be/lq_dM0y86pQ?t=405
>>107878071
>>107878076i might try unironically throwing this to gemini to see if it can fix it, but I thought our resident fag (bigstation) might have already tried it. didnt you faggo?
>no hype for ltxv2why
>>107878143That's all that gets posted now lol, haven't seen wan in a while
>>107878143
Basically impossible to deliver the videos here in a way anon will bother to even look.
I still use it and am genning stuff right now. I just can't be assed to share it here because the board won't allow sound.
>>107878099all right you asked for it
>>107878156
>I just can't be assed to share it here because the board won't allow sound.
that's why wsg exists >>>/wsg/6072442
>>107878165That place is pretty dead
>>107878150look at this thread again and repeat what you said
>>107878165I do share my stuff there. I just... don't think it's worth calling attention to because it's all kind of experimental. >>>/wsg/6073567
>>107878172yeah :[, they don't allow images so it's a huge deal breaker, why can't we have both?? sad
>>107878157What did you use for this? It looks pretty good
>>107878182
Flux 2 Klein, it got released today
try using 8 steps with the distill workflow, 4 is already super fast as is.
>>107878181
The most active thread on nearly every board is the AI thread. It makes no sense for there not to be an AI board at this point.
>>107878191
I think I'll just use distill for image gen and base for edit, both excel respectively in that regard. I don't see a point in settling for sub-par editing when I know the base will do a better job.
>>107878190neat thanks
>>107877194
>Maintain Thread Quality
https://rentry.org/teto
>>107878204>links to a ponyfag pagelole
I've found latest Qwen to be good with edits too, in particular, IDs.
klein can remove clothes, but the flux tit remains.if we had attention manipulation we could solve it.
meet my wives
>>107878220
just do a detail pass with chromer (lol)
>Prompt executed in 12.07 seconds
Fuck me that's quick
>>107878228Huh? Where have I seen this before?
>>107878097
Have you tried with LTX? I've never done video masking before
Klein ist gut, nicht wahr?
>>107878246
Someone was pissing and moaning about some stupid bullshit about pic rel a few threads back, I uploaded the image to chatgpt and asked for a detailed description of the image so I could try recreating it in future models
>>107878255Lumiiiii :3:3:3:3:3;3:3
>>107878192because other board will have no traffic
>>107878275is this just klein edit on manga panels?
>>107878278hmmph
>mfw forgot the autoposter on
whoops
>>107878286
no these are my z-image gens I had cooked, im downloading klein right now to test.
also these are done through captioning, not editing.
Schizoprompts with K9B-distilled, will try the same prompts with base next
is zit obsolete?
use case?
>>107878293
>>107878280Reminded me of M. Bison's dolls. I take it that image is based off the KPop Demon Hunter film?
>>107878293its a good model sir, happily retarded. bless its tender heart.
>>107878301nightrein dlc
>>107878288>>107878255are you the real lumi from /sdg/? what do we have to do to make you stay here and not go back to those filthy anons.
>>107878301
>>107878303nope, its about a fetish ecchi anime that is airing right now (mato seihei no slave)
which text encoder is compatible with qwen for forge neo?
>>107878305forgot to take off your name, retard.
>>107878311ONE MILLION DOLLARS!!
>>107878299
>is zit obsolete?
absolutely not, it's still the best text to image model for realism, Klein is the best at editing, those are different use cases
>>107878316>mato seihei no slaveThanks.
>>107878319or did i?
expressions were certainly censored on klein lol, sexual expressions turn into either default resting face or anguish
>>107878299zit is good. very good. klein pretty neat tho
klein can't lynch niggers.it's ogre
>>107878337how terrible for you!
is there any reason to use dev over klein?
>>107878320>>107878328>>107878336Lumi stay here! , you don't need Debo!. We can give you all the head pats you want! :3
>>107878351prove it