[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107875932

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107877185
>even little details which is impressive (flux 2 klein 9b distilled)
yeah, for the details and slop I'd say it's between Z-image turbo and Qwen Image edit 25/12, it's not the best, but it's good enough to not be bothered by it
>>
>>107877194
Tanks 4 bake
>>
https://www.youtube.com/watch?v=2OrOufa3eoc
You know klein is a big deal when a channel with 600k subscribers is talking about it lol
>>
so..
will chroma klein happen?
>>
File: perseus.png (1.57 MB, 1504x992)
1.57 MB
1.57 MB PNG
>>
Wake me when it has more loras than ZiT.
>>
File: Untitled.jpg (21 KB, 536x63)
21 KB
21 KB JPG
>>107877251
workin on it
>>
>>107877261
vram status?
>>
so multi image, do you reference the nodes as image 1 or image 2? for klein edit
>>
File: 1764742084499274.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>107877194
nice collage
>>
File: 1756733306383426.png (1.39 MB, 848x1216)
1.39 MB
1.39 MB PNG
replace the face of the girl in image 1 with the face of the girl in image 2. change the boots of the girl in image 1 to the boots of the girl in image 2.

nikke anis is now teto:
>>
>>107877265
20/24GB so far so we're not dead. gotta do bf16 it looks like
>>
>>107877266
yes >>107876256
>>
File: 1740014178691796.jpg (127 KB, 1061x1567)
127 KB
127 KB JPG
>>107877281
>20/24GB so far so we're not dead.
its over
>>
@redditors do NOT steal images from here without properly crediting or you will face jail time
>
- anonymous hacker 4chan
>>
File: Klein 9b.png (2.31 MB, 2880x720)
2.31 MB
2.31 MB PNG
The legend of Migu, Ocarina of time
>>
>>107877271
Nice image
>>
File: 1766228252348093.png (799 KB, 864x1184)
799 KB
799 KB PNG
remove the long grey hair of the girl in image 1. replace the face of the girl in image 1 with the face of the girl in image 1.
>>
File: 1761145782543578.png (2.17 MB, 2014x720)
2.17 MB
2.17 MB PNG
>>107877293
The legend of Costanza lul
>>
File: 1759302035293978.png (1016 KB, 864x1184)
1016 KB
1016 KB PNG
the girl in image 1 is sitting beside the girl in image 2 on a bench.
>>
>>107877315
did you prompt low poly? thats really good kek
>>
>>107877324
yeah I had to push it a bit, or else it would make it too realistic
>Replace the character from image 1 by the character of image 2 while keeping the same low poly 3d artistic style of image 1
>>
>>107877266
>nodes
what?
>>
File: 1753196832678305.png (1.21 MB, 1360x768)
1.21 MB
1.21 MB PNG
replace the man in the middle in image 1 with a small pixel art version of the girl in image 2.
>>
File: 4165456.jpg (564 KB, 1024x768)
564 KB
564 KB JPG
lol chinese trolls are legit trying to slide klien, not even a joke. One of them posted a "body horror" pic on twitter saying it was from it and people are saying it has body horror without proof which is easily disproven. https://www.reddit.com/r/StableDiffusion/comments/1qe76fc/comment/nzvqh0z/
>>
>>107877339
good luck finishing that game with such a big hitbox kek
>>
File: 1761400804422287.png (3.32 MB, 1152x2048)
3.32 MB
3.32 MB PNG
>>
>>107877343
bro her right hand...
>>
remake of one of my old Flux Krea gens
>>
>zit and flux 2 dev came out
>zit better!

>klein came out
>klein better!

wat
>>
File: img_00008_.jpg (1.24 MB, 1520x1728)
1.24 MB
1.24 MB JPG
>>
>>107877363
those fuckers managed to make klein better than flux 2 dev, competition is good, competition is healthy, it forces companies to work harder
>>
>>107877371
>klein better than dev
?
>>
File: 002.jpg (903 KB, 2025x1759)
903 KB
903 KB JPG
>>
File: 1756462340492174.png (1.01 MB, 640x1632)
1.01 MB
1.01 MB PNG
change the clothes of the anime girl in image 1 to the clothes of the anime girl in image 2, with the same black panties.

teto + fubuki:
>>
>>107877381
Ikr
>>
>>107877385
nice
>>
>>107877385
face swap on a BFL model? the fuck is going on
>>
I need a klein workflow I lagged behind, too many snippets on what to run it with
>>
File: 1757393687274944.png (911 KB, 864x1184)
911 KB
911 KB PNG
>>107877301
>>
>>107877397
they were so desperate of gaining relevancy again they decided to stop cucking their model, just imagine that lol
>>
>>107877404
is that the klein equivalent of "make it realistic"? nice
>>
Upscaling is pretty good too
>>
File: file.png (504 KB, 500x563)
504 KB
504 KB PNG
>>
5Head
>>
>>107877388
wdym?
klein shills are delusional because it's inferior to dev and thus inferior to zit. I hope you agree with that
>>
File: 1755999217497592.png (1.15 MB, 928x1104)
1.15 MB
1.15 MB PNG
the man in image 1 is holding a magazine with a picture of the girl in image 2 on the cover. the title of the magazine is "TETO". keep the appearance of the man in image 1 the same.
>>
>>107877366
Bacon? OwO
>>
>>107877424
Dev isn't worse than ZiT lmao, it's just chungus so people can't run it
>>
File: Flux2K9b.jpg (137 KB, 1024x1024)
137 KB
137 KB JPG
>>
File: img_00017_.jpg (1.01 MB, 1520x1728)
1.01 MB
1.01 MB JPG
>>107877362
>>107877409
Does it know artists? Pic rel "Hatsune Miku as surreal screaming cube of flesh by Francis Bacon"

>>107877428
damn right
>>
>>107877408
Yes. From what I can tell so far, it tends to keep proportions better than Qwen Image Edit 2511 with A2R LoRA but Flux slops faces and skin often.
>>
>>107877361
NTA but here's a better one with Klein
>>
File: 1754655108446374.png (989 KB, 928x1104)
989 KB
989 KB PNG
>>107877427
make the image in the style of an 8-bit nintendo game.

neat
>>
>>107877442
nice, I'll steal it. I cba myself
>>
File: 1749901957823316.png (1018 KB, 928x1104)
1018 KB
1018 KB PNG
>>107877444
the man appears as a low polygon model:
>>
>>107877438
Is that your own lora? Don't wanna check Civitai because that site is cancer srry.
>>
Finally an edit model that's fast AND decent. Eat shit Z image.
>>
File: 1752470613795787.jpg (457 KB, 2371x1434)
457 KB
457 KB JPG
>>107877436
fact, and the bigger the model is, the better it is, that's why HunyuanImage 3.0 is the best model, because it's the biggest, that's it
>>
File: Untitledsdfsdf-1.mp4 (3.18 MB, 1400x540)
3.18 MB
3.18 MB MP4
"the woman is looking at the camera and keeps her expression when the camera zooms out to reveal the woman wearing a bikini bra and panties on a brown leather sofa and she is holding a plain white box with the text "base" on it in her lap as an happy obese caucasian nerd with ragged facial hair receding hairline and greasy skin with nerdy clothing sits down in the sofa next to the woman. the woman looks at the man with disgust and quickly runs away out of frame escaping in fear as the man sighs in despair and is sad."

Wan 2.2 left, ltx2 right.

Kek. Also, why is wan 2.2 so good at male nipples..?
>>
File: dui.jpg (1.27 MB, 1536x2050)
1.27 MB
1.27 MB JPG
>>107877438
>Change the person to Hatsune Miku. Change the image to a painting by Francis Bacon
>>
>>107877449
steal it for what lol? I just replied to someone in the Reddit thread with it also, not the same person who made the original grass pic but someone else.
>>
>>107877399
go for that one
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
File: 1745521210936325.png (1.26 MB, 1024x1008)
1.26 MB
1.26 MB PNG
13s for a gen 9b distill, even faster than qwen edit + 4 step lightning lora.
>>
>>107877466
last time someone posted this image I did a comparison that showed both of those outputs are weird, because in mine the Zit guys were NOT all asian (no matter how many times I ran it) and the composition was far more similar between the two models.
>>
>>107877436
cope
>>
>>107877483
(samefag) also it's the most chinkmaxxed engrish prompt too lmao, I just had to point that out
>>
File: 1766020022321242.png (1.19 MB, 864x1200)
1.19 MB
1.19 MB PNG
finally the future is here
>>
unusable at Q8? Can I not swap in the gguf in place of the fp8 safetensor from the default template?
>>
File: 1760908528784079.png (1.24 MB, 1248x832)
1.24 MB
1.24 MB PNG
change the face of the man in image 1 to the face of the man in image 2.

so bfl learned to uncuck their models huh?
>>
>>107877489
cope with what, that an obvious Giga ESL did a shitty comparison with a shitty prompt and results that I couldn't really reproduce on either Z Image or Flux 2 Dev even when I copied their broken grammar verbatim?
>>
you're not allowed to finetune 9b so what's the point
>>
File: file.png (2.87 MB, 1664x1232)
2.87 MB
2.87 MB PNG
>>
>>107877501
are you saying this is FP8 versus GGUF Q8 with no other changes? that's weird if so
>>
File: 1756564286979478.png (1.23 MB, 1248x832)
1.23 MB
1.23 MB PNG
swap the face of the man in image 1 with the face of the man in image 2.

prompt still works
>>
>>107877514
Post passport, chang.
>>
File: img_00028_.jpg (1.45 MB, 1520x1728)
1.45 MB
1.45 MB JPG
>>107877458
Yeah, uploading it soon

>>107877473
Not even close. I hope it's has a good base/support for lora.
>>
File: 1752567322717506.png (1.21 MB, 1248x832)
1.21 MB
1.21 MB PNG
replace the face of the man in image 1 with the face of the man in image 2.

forsenSmug + sam altman
>>
>>107877514
yeah you are, you just can't sell it or host it as a SAAS unless you pay them
>>
File: 1750974781125858.png (1.21 MB, 1248x832)
1.21 MB
1.21 MB PNG
Sam Asmongold:

so clearly there is no "no face swaps allowed" bs in the model
>>
>>107877542
kek
>>
File: 00006-1978590719.png (2.53 MB, 1824x1248)
2.53 MB
2.53 MB PNG
>>107877353
how did you make your image?
>>
>>107877501
>unusable at Q8?
wait, the image on the right is with klein at Q8? dude that sucks
>>
File: 1751826796069711.png (1.29 MB, 1360x752)
1.29 MB
1.29 MB PNG
kek

replace the man with glasses in image 1 with the asian man on the left in image 2 who is wearing a business suit.
>>
File: 1765414859451256.png (1.93 MB, 1000x1000)
1.93 MB
1.93 MB PNG
>>107877556
replace the dog with this
>>
>>107877542
>clearly there is no "no face swaps allowed" bs in the model
so far I haven't reached any moment where the model decided to not do anything, like on Kontext dev and its censorship layers
>>
File: 1765517397129313.jpg (413 KB, 3465x1428)
413 KB
413 KB JPG
>>107877477
>https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
here's how it looks like
>>
>>107877554
yeah it shouldn't be worse than FP8, that makes no sense
>>
Why is infinitetalk changing anything but the face..?
>>
File: 1762868876474394.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
File: 1742737773393815.png (1.35 MB, 1360x752)
1.35 MB
1.35 MB PNG
>>107877560
replace the dog on the left in image 1 with the dog skeleton in image 2. Change the blue neon text saying "HASAN" to "SHOCK". The man wearing glasses is pointing in the air with one finger, which is emitting electricity.

kek, what a model, I need to get the point to be just right but still.
>>
should I use nvfp4 or q4?
>>
>>107877470
I think you can wrangle sequential actions in ltx if you spell it out like you're explaining to a toddler, and use the word 'then' a lot.
>then the man sits down
>then the woman turns to look right
>then the womans expression changes to disgust
>then the woman stands up
or so
>>
File: file.png (2.74 MB, 1520x1377)
2.74 MB
2.74 MB PNG
>>
>>107877579
if 5000 series nvfp4 is 4x faster
>>
File: 1763191949798390.png (1.45 MB, 1360x752)
1.45 MB
1.45 MB PNG
>>107877578
also note the sign text swap, that's flawless and better than qwen edit did it.

here we go:
>>
File: 1762484969731237.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>remove the yellow filter. Make the colors normal looking.
>>
>>107877571
I gave up because I cant ever get ' model-00001-of-00005.safetensors' to work for qwen 3 8b
>>
>>107877590
kek, poor kaya
>>
>>107877584
I think it's funny how it included the little chub on his lower stomach
>>
>>107877547
Was Flux2 Klein 9B image edit of an Illustrious image with prompt as "Change image 1 to a photorealistic style." I posted a different one a couple of days ago using that one Qwen Edit LoRA you linked so thanks for that. Klein slops faces and skin 80% of the time though so might be better once someone trains a LoRA.
>>
>>107877618
you have to download this, never go for split safetensors
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
File: 1761184013954676.png (1.17 MB, 1360x752)
1.17 MB
1.17 MB PNG
the man with glasses is holding up a magazine titled "new ways to abuse your dog", with a picture of the dog on the left below the title. the man is grinning.
>>
>cue the floyd/hasan/miku spam
>>
>The current topic test anon is testing the same 4 or 5 subjects for the next three days again.

Ugh. I don't hate that you test things. I hate that you use the same things over and over again.
>>
File: 1751462850369489.png (1.31 MB, 1360x752)
1.31 MB
1.31 MB PNG
>>107877643
I have to compare. No worries I will test diff stuff. but a -> b testing initially cause im used to qwen
>>
File: file.png (496 KB, 627x687)
496 KB
496 KB PNG
we need a "vae-less" edit model
can't deal with the shift
>>
File: Migudayoo.png (2.28 MB, 1652x848)
2.28 MB
2.28 MB PNG
>>
Question for all non-Flux fanboys who tried klein:
Is it worth downloading or is klein cope at best?
>>
>>107877657
Just anything but Hasan, CIA agent, Ryan Gosling, or Miku.
>>
>>107877675
It's good.
>>
>>107877675
It's a really good edit model, much better than qwen but not better than Z in T2I so yeah definitely get it. Year of the small models baby fuck bloat
>>
File: 1751462714768423.png (2.47 MB, 1888x848)
2.47 MB
2.47 MB PNG
>>107877675
for image alone it's worse than Z-image turbo, but as an edit model it's pretty great, probably the best thing we got locally
>>
File: 1752466698161938.png (1.02 MB, 554x1416)
1.02 MB
1.02 MB PNG
>>107877675
it can edit stuff good and quick. You need it.
>>
>>107877689
>not better than Z in T2I
>>107877506
>cope with what

I'm confused
>>
"the subject is sitting at an outdoor patio, sipping a coffee."

>>107877554
>>107877518
>>107877501
i figured it out; i'm retarded and was using a base gguf in a distill workflow
>>
>>107877675
I'm just glad we got an edit model that can actually compare to Nano Banana
>>
>>107877721
So much so I think Z, especially base might be better with artist.
>>
File: 1741470022589906.png (2.77 MB, 2720x768)
2.77 MB
2.77 MB PNG
fuck I'm already annoyed by the blur, and there's no NAG to save the day (yet)
>>
File: 1753862168092908.png (1.61 MB, 735x1844)
1.61 MB
1.61 MB PNG
>>
>>107877715
there's no gains in using goofs when they're created from the fp8, what is this retardness?
>>
File: file.png (64 KB, 1421x159)
64 KB
64 KB PNG
>>107877571
thanks for your help,

Im glad, I think I quit and dont have to be here anymore, was trying a simple head swap test
>>
>>107877748
>there's no gains in using goofs when they're created from the fp8
what makes you believe they're created from fp8? the bf16 file exists they made their gguf from that
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
>>107877715
where are the ggufs for klein? wanna try q8 out of curiosity
>>
>>107877750
show a screen of your workflow, you probably messed something up, and did you update comfyui?
>>
>>107877755
I mean for the unet retard
>>
>>107877757
write "flux klein gguf" on huggingface?
>>107877763
this is the unet brown subhuman
>>
>>107877761
Thank you lol, I will migrate out of comfy desktop and retry the clone strategy, gonna take a wide break anyway.
>>
>>107877763
are you retarded or something? they gave us the unet on bf16, it never happened that a company gave us unet at fp8, are you fucking retarded or what?
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B/blob/main/flux-2-klein-9b.safetensors
>>
How do I load a video here to use as a mask? And why does loading this mask as an image make me OOM when I barely use a third of my vram without it?
>>
fuck tongyi
fuck alibaba
fuck tencent
fuck china
fuck lodestones
all hail bfl
>>
File: 1760535352616942.png (691 KB, 736x1392)
691 KB
691 KB PNG
replace the clothes of the anime girl a white crop top, blue jeans, and white sneakers. keep her head unchanged.
>>
>>107877782
>fuck tongyi
>fuck alibaba
>fuck tencent
>fuck china
>fuck lodestones

Alibaba and Tongyi are basically the same thing at this point. And especially fuck lodestones.
>>
>>107877770
>>107877778
im fucktarted, somehow thought that BFL only released the fp8
>>
>>107877793
>Alibaba and Tongyi are basically the same thing
well yeah since Tongyi belongs to Alibaba lol
>>
but is it better than kontext?
>>
>>107877800
definitely better than Kontext
>>
>>107877800
Unfortunately no..too bad
>>
>>107877793
tongyi is zimage
alibaba is tongyi and qwen and whatever (including wan 2.5, never forget)
>>
File: 1750870999889641.png (647 KB, 736x1392)
647 KB
647 KB PNG
replace the clothes of the anime girl with a white bikini, with the text "2" on one breast and "B" on the other breast. keep her head unchanged.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.