[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107875932

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107877185
>even little details which is impressive (flux 2 klein 9b distilled)
yeah, for the details and slop I'd say it's between Z-image turbo and Qwen Image edit 25/12, it's not the best, but it's good enough to not be bothered by it
>>
>>107877194
Tanks 4 bake
>>
https://www.youtube.com/watch?v=2OrOufa3eoc
You know klein is a big deal when a channel with 600k subscribers is talking about it lol
>>
so..
will chroma klein happen?
>>
File: perseus.png (1.57 MB, 1504x992)
1.57 MB
1.57 MB PNG
>>
Wake me when it has more loras than ZiT.
>>
File: Untitled.jpg (21 KB, 536x63)
21 KB
21 KB JPG
>>107877251
workin on it
>>
>>107877261
vram status?
>>
so multi image, do you reference the nodes as image 1 or image 2? for klein edit
>>
File: 1764742084499274.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>107877194
nice collage
>>
File: 1756733306383426.png (1.39 MB, 848x1216)
1.39 MB
1.39 MB PNG
replace the face of the girl in image 1 with the face of the girl in image 2. change the boots of the girl in image 1 to the boots of the girl in image 2.

nikke anis is now teto:
>>
>>107877265
20/24GB so far so we're not dead. gotta do bf16 it looks like
>>
>>107877266
yes >>107876256
>>
File: 1740014178691796.jpg (127 KB, 1061x1567)
127 KB
127 KB JPG
>>107877281
>20/24GB so far so we're not dead.
its over
>>
@redditors do NOT steal images from here without properly crediting or you will face jail time
>
- anonymous hacker 4chan
>>
File: Klein 9b.png (2.31 MB, 2880x720)
2.31 MB
2.31 MB PNG
The legend of Migu, Ocarina of time
>>
>>107877271
Nice image
>>
File: 1766228252348093.png (799 KB, 864x1184)
799 KB
799 KB PNG
remove the long grey hair of the girl in image 1. replace the face of the girl in image 1 with the face of the girl in image 1.
>>
File: 1761145782543578.png (2.17 MB, 2014x720)
2.17 MB
2.17 MB PNG
>>107877293
The legend of Costanza lul
>>
File: 1759302035293978.png (1016 KB, 864x1184)
1016 KB
1016 KB PNG
the girl in image 1 is sitting beside the girl in image 2 on a bench.
>>
>>107877315
did you prompt low poly? thats really good kek
>>
>>107877324
yeah I had to push it a bit, or else it would make it too realistic
>Replace the character from image 1 by the character of image 2 while keeping the same low poly 3d artistic style of image 1
>>
>>107877266
>nodes
what?
>>
File: 1753196832678305.png (1.21 MB, 1360x768)
1.21 MB
1.21 MB PNG
replace the man in the middle in image 1 with a small pixel art version of the girl in image 2.
>>
File: 4165456.jpg (564 KB, 1024x768)
564 KB
564 KB JPG
lol chinese trolls are legit trying to slide klien, not even a joke. One of them posted a "body horror" pic on twitter saying it was from it and people are saying it has body horror without proof which is easily disproven. https://www.reddit.com/r/StableDiffusion/comments/1qe76fc/comment/nzvqh0z/
>>
>>107877339
good luck finishing that game with such a big hitbox kek
>>
File: 1761400804422287.png (3.32 MB, 1152x2048)
3.32 MB
3.32 MB PNG
>>
>>107877343
bro her right hand...
>>
remake of one of my old Flux Krea gens
>>
>zit and flux 2 dev came out
>zit better!

>klein came out
>klein better!

wat
>>
File: img_00008_.jpg (1.24 MB, 1520x1728)
1.24 MB
1.24 MB JPG
>>
>>107877363
those fuckers managed to make klein better than flux 2 dev, competition is good, competition is healthy, it forces companies to work harder
>>
>>107877371
>klein better than dev
?
>>
File: 002.jpg (903 KB, 2025x1759)
903 KB
903 KB JPG
>>
File: 1756462340492174.png (1.01 MB, 640x1632)
1.01 MB
1.01 MB PNG
change the clothes of the anime girl in image 1 to the clothes of the anime girl in image 2, with the same black panties.

teto + fubuki:
>>
>>107877381
Ikr
>>
>>107877385
nice
>>
>>107877385
face swap on a BFL model? the fuck is going on
>>
I need a klein workflow I lagged behind, too many snippets on what to run it with
>>
File: 1757393687274944.png (911 KB, 864x1184)
911 KB
911 KB PNG
>>107877301
>>
>>107877397
they were so desperate of gaining relevancy again they decided to stop cucking their model, just imagine that lol
>>
>>107877404
is that the klein equivalent of "make it realistic"? nice
>>
Upscaling is pretty good too
>>
File: file.png (504 KB, 500x563)
504 KB
504 KB PNG
>>
5Head
>>
>>107877388
wdym?
klein shills are delusional because it's inferior to dev and thus inferior to zit. I hope you agree with that
>>
File: 1755999217497592.png (1.15 MB, 928x1104)
1.15 MB
1.15 MB PNG
the man in image 1 is holding a magazine with a picture of the girl in image 2 on the cover. the title of the magazine is "TETO". keep the appearance of the man in image 1 the same.
>>
>>107877366
Bacon? OwO
>>
>>107877424
Dev isn't worse than ZiT lmao, it's just chungus so people can't run it
>>
File: Flux2K9b.jpg (137 KB, 1024x1024)
137 KB
137 KB JPG
>>
File: img_00017_.jpg (1.01 MB, 1520x1728)
1.01 MB
1.01 MB JPG
>>107877362
>>107877409
Does it know artists? Pic rel "Hatsune Miku as surreal screaming cube of flesh by Francis Bacon"

>>107877428
damn right
>>
>>107877408
Yes. From what I can tell so far, it tends to keep proportions better than Qwen Image Edit 2511 with A2R LoRA but Flux slops faces and skin often.
>>
>>107877361
NTA but here's a better one with Klein
>>
File: 1754655108446374.png (989 KB, 928x1104)
989 KB
989 KB PNG
>>107877427
make the image in the style of an 8-bit nintendo game.

neat
>>
>>107877442
nice, I'll steal it. I cba myself
>>
File: 1749901957823316.png (1018 KB, 928x1104)
1018 KB
1018 KB PNG
>>107877444
the man appears as a low polygon model:
>>
>>107877438
Is that your own lora? Don't wanna check Civitai because that site is cancer srry.
>>
Finally an edit model that's fast AND decent. Eat shit Z image.
>>
File: 1752470613795787.jpg (457 KB, 2371x1434)
457 KB
457 KB JPG
>>107877436
fact, and the bigger the model is, the better it is, that's why HunyuanImage 3.0 is the best model, because it's the biggest, that's it
>>
File: Untitledsdfsdf-1.mp4 (3.18 MB, 1400x540)
3.18 MB
3.18 MB MP4
"the woman is looking at the camera and keeps her expression when the camera zooms out to reveal the woman wearing a bikini bra and panties on a brown leather sofa and she is holding a plain white box with the text "base" on it in her lap as an happy obese caucasian nerd with ragged facial hair receding hairline and greasy skin with nerdy clothing sits down in the sofa next to the woman. the woman looks at the man with disgust and quickly runs away out of frame escaping in fear as the man sighs in despair and is sad."

Wan 2.2 left, ltx2 right.

Kek. Also, why is wan 2.2 so good at male nipples..?
>>
File: dui.jpg (1.27 MB, 1536x2050)
1.27 MB
1.27 MB JPG
>>107877438
>Change the person to Hatsune Miku. Change the image to a painting by Francis Bacon
>>
>>107877449
steal it for what lol? I just replied to someone in the Reddit thread with it also, not the same person who made the original grass pic but someone else.
>>
>>107877399
go for that one
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
File: 1745521210936325.png (1.26 MB, 1024x1008)
1.26 MB
1.26 MB PNG
13s for a gen 9b distill, even faster than qwen edit + 4 step lightning lora.
>>
>>107877466
last time someone posted this image I did a comparison that showed both of those outputs are weird, because in mine the Zit guys were NOT all asian (no matter how many times I ran it) and the composition was far more similar between the two models.
>>
>>107877436
cope
>>
>>107877483
(samefag) also it's the most chinkmaxxed engrish prompt too lmao, I just had to point that out
>>
File: 1766020022321242.png (1.19 MB, 864x1200)
1.19 MB
1.19 MB PNG
finally the future is here
>>
unusable at Q8? Can I not swap in the gguf in place of the fp8 safetensor from the default template?
>>
File: 1760908528784079.png (1.24 MB, 1248x832)
1.24 MB
1.24 MB PNG
change the face of the man in image 1 to the face of the man in image 2.

so bfl learned to uncuck their models huh?
>>
>>107877489
cope with what, that an obvious Giga ESL did a shitty comparison with a shitty prompt and results that I couldn't really reproduce on either Z Image or Flux 2 Dev even when I copied their broken grammar verbatim?
>>
you're not allowed to finetune 9b so what's the point
>>
File: file.png (2.87 MB, 1664x1232)
2.87 MB
2.87 MB PNG
>>
>>107877501
are you saying this is FP8 versus GGUF Q8 with no other changes? that's weird if so
>>
File: 1756564286979478.png (1.23 MB, 1248x832)
1.23 MB
1.23 MB PNG
swap the face of the man in image 1 with the face of the man in image 2.

prompt still works
>>
>>107877514
Post passport, chang.
>>
File: img_00028_.jpg (1.45 MB, 1520x1728)
1.45 MB
1.45 MB JPG
>>107877458
Yeah, uploading it soon

>>107877473
Not even close. I hope it's has a good base/support for lora.
>>
File: 1752567322717506.png (1.21 MB, 1248x832)
1.21 MB
1.21 MB PNG
replace the face of the man in image 1 with the face of the man in image 2.

forsenSmug + sam altman
>>
>>107877514
yeah you are, you just can't sell it or host it as a SAAS unless you pay them
>>
File: 1750974781125858.png (1.21 MB, 1248x832)
1.21 MB
1.21 MB PNG
Sam Asmongold:

so clearly there is no "no face swaps allowed" bs in the model
>>
>>107877542
kek
>>
File: 00006-1978590719.png (2.53 MB, 1824x1248)
2.53 MB
2.53 MB PNG
>>107877353
how did you make your image?
>>
>>107877501
>unusable at Q8?
wait, the image on the right is with klein at Q8? dude that sucks
>>
File: 1751826796069711.png (1.29 MB, 1360x752)
1.29 MB
1.29 MB PNG
kek

replace the man with glasses in image 1 with the asian man on the left in image 2 who is wearing a business suit.
>>
File: 1765414859451256.png (1.93 MB, 1000x1000)
1.93 MB
1.93 MB PNG
>>107877556
replace the dog with this
>>
>>107877542
>clearly there is no "no face swaps allowed" bs in the model
so far I haven't reached any moment where the model decided to not do anything, like on Kontext dev and its censorship layers
>>
File: 1765517397129313.jpg (413 KB, 3465x1428)
413 KB
413 KB JPG
>>107877477
>https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
here's how it looks like
>>
>>107877554
yeah it shouldn't be worse than FP8, that makes no sense
>>
Why is infinitetalk changing anything but the face..?
>>
File: 1762868876474394.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
File: 1742737773393815.png (1.35 MB, 1360x752)
1.35 MB
1.35 MB PNG
>>107877560
replace the dog on the left in image 1 with the dog skeleton in image 2. Change the blue neon text saying "HASAN" to "SHOCK". The man wearing glasses is pointing in the air with one finger, which is emitting electricity.

kek, what a model, I need to get the point to be just right but still.
>>
should I use nvfp4 or q4?
>>
>>107877470
I think you can wrangle sequential actions in ltx if you spell it out like you're explaining to a toddler, and use the word 'then' a lot.
>then the man sits down
>then the woman turns to look right
>then the womans expression changes to disgust
>then the woman stands up
or so
>>
File: file.png (2.74 MB, 1520x1377)
2.74 MB
2.74 MB PNG
>>
>>107877579
if 5000 series nvfp4 is 4x faster
>>
File: 1763191949798390.png (1.45 MB, 1360x752)
1.45 MB
1.45 MB PNG
>>107877578
also note the sign text swap, that's flawless and better than qwen edit did it.

here we go:
>>
File: 1762484969731237.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>remove the yellow filter. Make the colors normal looking.
>>
>>107877571
I gave up because I cant ever get ' model-00001-of-00005.safetensors' to work for qwen 3 8b
>>
>>107877590
kek, poor kaya
>>
>>107877584
I think it's funny how it included the little chub on his lower stomach
>>
>>107877547
Was Flux2 Klein 9B image edit of an Illustrious image with prompt as "Change image 1 to a photorealistic style." I posted a different one a couple of days ago using that one Qwen Edit LoRA you linked so thanks for that. Klein slops faces and skin 80% of the time though so might be better once someone trains a LoRA.
>>
>>107877618
you have to download this, never go for split safetensors
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
File: 1761184013954676.png (1.17 MB, 1360x752)
1.17 MB
1.17 MB PNG
the man with glasses is holding up a magazine titled "new ways to abuse your dog", with a picture of the dog on the left below the title. the man is grinning.
>>
>cue the floyd/hasan/miku spam
>>
>The current topic test anon is testing the same 4 or 5 subjects for the next three days again.

Ugh. I don't hate that you test things. I hate that you use the same things over and over again.
>>
File: 1751462850369489.png (1.31 MB, 1360x752)
1.31 MB
1.31 MB PNG
>>107877643
I have to compare. No worries I will test diff stuff. but a -> b testing initially cause im used to qwen
>>
File: file.png (496 KB, 627x687)
496 KB
496 KB PNG
we need a "vae-less" edit model
can't deal with the shift
>>
File: Migudayoo.png (2.28 MB, 1652x848)
2.28 MB
2.28 MB PNG
>>
Question for all non-Flux fanboys who tried klein:
Is it worth downloading or is klein cope at best?
>>
>>107877657
Just anything but Hasan, CIA agent, Ryan Gosling, or Miku.
>>
>>107877675
It's good.
>>
>>107877675
It's a really good edit model, much better than qwen but not better than Z in T2I so yeah definitely get it. Year of the small models baby fuck bloat
>>
File: 1751462714768423.png (2.47 MB, 1888x848)
2.47 MB
2.47 MB PNG
>>107877675
for image alone it's worse than Z-image turbo, but as an edit model it's pretty great, probably the best thing we got locally
>>
File: 1752466698161938.png (1.02 MB, 554x1416)
1.02 MB
1.02 MB PNG
>>107877675
it can edit stuff good and quick. You need it.
>>
>>107877689
>not better than Z in T2I
>>107877506
>cope with what

I'm confused
>>
"the subject is sitting at an outdoor patio, sipping a coffee."

>>107877554
>>107877518
>>107877501
i figured it out; i'm retarded and was using a base gguf in a distill workflow
>>
>>107877675
I'm just glad we got an edit model that can actually compare to Nano Banana
>>
>>107877721
So much so I think Z, especially base might be better with artist.
>>
File: 1741470022589906.png (2.77 MB, 2720x768)
2.77 MB
2.77 MB PNG
fuck I'm already annoyed by the blur, and there's no NAG to save the day (yet)
>>
File: 1753862168092908.png (1.61 MB, 735x1844)
1.61 MB
1.61 MB PNG
>>
>>107877715
there's no gains in using goofs when they're created from the fp8, what is this retardness?
>>
File: file.png (64 KB, 1421x159)
64 KB
64 KB PNG
>>107877571
thanks for your help,

Im glad, I think I quit and dont have to be here anymore, was trying a simple head swap test
>>
>>107877748
>there's no gains in using goofs when they're created from the fp8
what makes you believe they're created from fp8? the bf16 file exists they made their gguf from that
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
>>107877715
where are the ggufs for klein? wanna try q8 out of curiosity
>>
>>107877750
show a screen of your workflow, you probably messed something up, and did you update comfyui?
>>
>>107877755
I mean for the unet retard
>>
>>107877757
write "flux klein gguf" on huggingface?
>>107877763
this is the unet brown subhuman
>>
>>107877761
Thank you lol, I will migrate out of comfy desktop and retry the clone strategy, gonna take a wide break anyway.
>>
>>107877763
are you retarded or something? they gave us the unet on bf16, it never happened that a company gave us unet at fp8, are you fucking retarded or what?
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B/blob/main/flux-2-klein-9b.safetensors
>>
How do I load a video here to use as a mask? And why does loading this mask as an image make me OOM when I barely use a third of my vram without it?
>>
fuck tongyi
fuck alibaba
fuck tencent
fuck china
fuck lodestones
all hail bfl
>>
File: 1760535352616942.png (691 KB, 736x1392)
691 KB
691 KB PNG
replace the clothes of the anime girl a white crop top, blue jeans, and white sneakers. keep her head unchanged.
>>
>>107877782
>fuck tongyi
>fuck alibaba
>fuck tencent
>fuck china
>fuck lodestones

Alibaba and Tongyi are basically the same thing at this point. And especially fuck lodestones.
>>
>>107877770
>>107877778
im fucktarted, somehow thought that BFL only released the fp8
>>
>>107877793
>Alibaba and Tongyi are basically the same thing
well yeah since Tongyi belongs to Alibaba lol
>>
but is it better than kontext?
>>
>>107877800
definitely better than Kontext
>>
>>107877800
Unfortunately no..too bad
>>
>>107877793
tongyi is zimage
alibaba is tongyi and qwen and whatever (including wan 2.5, never forget)
>>
File: 1750870999889641.png (647 KB, 736x1392)
647 KB
647 KB PNG
replace the clothes of the anime girl with a white bikini, with the text "2" on one breast and "B" on the other breast. keep her head unchanged.
>>
>>107877689
I'm very tired of zit look, so better than zit in t2i, too. Until I get tired of klein's own look. Damn that easily trainable human perception.
>>
>>107877812
Please try large mesh fishnet thighhighs.
>>
File: 1748902869098274.png (753 KB, 736x1392)
753 KB
753 KB PNG
>>107877833
a fine choice, works
>>
File: file.png (863 KB, 1016x494)
863 KB
863 KB PNG
Cleans images nicely (removed the two rectangles on the left), the whole 'resizes your image slightly lmao' thing is annoying though
>>
>>107877825
It's not really the look that's the problem, you will get wonky mutants anatomy and shitty hands every now and then (It's not a huge issue though imo). My guess is bfl finally realized and tried to unfuck their censored dataset they used for flux 2 but wasn't enough.
>>
Should I get the base or distilled version of klein?
>>
>>107877242
yes, and it will be a melty undertrained mess just like the rest of his failbakes as he splits his attention between 10 different projects and doesn't give a single one enough time to bake.
>>
>Grok is completely cucked
Help me set up a local gen anons. Its so fucking complicated
>>
>>107877870
distilled, base isn't meant to make images >>107876098
>>
>>107877870
I found base did a better job of preserving likeness in edits, but distilled was easier to prompt for raw image outputs.
>>
>>107877875
just watch a youtube tutorial bro
>>
File: 1744299939385663.png (1.14 MB, 736x1392)
1.14 MB
1.14 MB PNG
the anime girl in image 1 is using the pose of the girl in image 2.

neat
>>
is today the day I finally make a huggingface account
>>
>>107877889
Needs a second pass to fix hands, though.
>>
>>107877875
Download A1111 and SD 1.5 to get started.
>>
>>107877873
I would much rather nutbutter Klein, in fact. No architectural changes, just a finetune on a sane dataset.
>>
Would flux.klein be an acceptable basis for illustration/noob?
>>
>>107877896
Why?
>>
>>107877851
Thanks. Does it support image2 and image3 like qwen edit?
>>
File: Flux2-Concat_00001_.png (2.43 MB, 2240x1120)
2.43 MB
2.43 MB PNG
Good at colourising, doesn't kill any details
>>
>>107877932
yes >>107877285
>>
>>107877934
Her unform changed color.
>>
>>107877925
Lodestone already blamed the license and will finetune the very worse 4B mode and I assume so will the other finetunersl so if you want a shitty lumina level update to it yeah I guess...
>>
File: Flux2-Concat_00003_.jpg (1.58 MB, 2147x1344)
1.58 MB
1.58 MB JPG
>>107877941
yeah so did the girl's hair.
I'll try again with the base model
>>
Kleiniggas, using the edit, are you able to fix mismatched lighting between subject & bg?
>>
File: 1737071836647719.png (1 MB, 1040x992)
1 MB
1 MB PNG
the girl is wearing a business suit and top hat.
>>
File: Klein 9b.png (1.93 MB, 1784x768)
1.93 MB
1.93 MB PNG
>>107877959
show a screen of your workflow we can't really help you if we don't see anything
>>
File: 1756832248373122.png (723 KB, 1040x992)
723 KB
723 KB PNG
>>107877960
make a pixel art sprite of the character.
>>
>>107877963
Nah i mean whether the new model is able to fix lighting in older pics. Sometimes 1girl has obvious studio lighting on the subject while the background looks meh, I was wondering if Klein could fix it
>>
File: Flux2-Concat_00004_.jpg (1.87 MB, 2147x1344)
1.87 MB
1.87 MB JPG
>>107877941
>>107877958
damn, that's way better
>>
>>107877875
Ask Grok
>>
>>107877959
I don't know about klein but dev can fix light and shadow. just tell it to make the subject and background have coherent lighting
>>
File: 1740612170821039.png (917 KB, 1360x768)
917 KB
917 KB PNG
make a pixel art sprite of the characters.

ff13-2, neat
>>
>>107877987
>ff13-2
such an underrated ff, and its soundtrack is absolutely amazing
https://www.youtube.com/watch?v=zRYgA3dNwCE
>>
>>107877952
He should simply train 9b for himself and store it on a server with the admin password: 123456.

The 9b model and its derivatives may not be used:
“To create non-consensual intimate images or illegal pornographic content.”

Illegal pornography is prohibited, so legal pornography is okay?
>>
File: Klein 9b.png (2.24 MB, 2040x768)
2.24 MB
2.24 MB PNG
>>107877963
I'm really shocked how well this shit works lmao
>>
>>107877981
damn, so we should go for base for edits?
>>
File: 1764210953190048.png (1.24 MB, 1168x880)
1.24 MB
1.24 MB PNG
the anime girl in image 2 is standing behind the blue hair anime girl in image 1.

also got a neat upscale on the original image
>>
please care about z image
>>
File: Klein 9b.png (3.23 MB, 2720x768)
3.23 MB
3.23 MB PNG
>>
>>107877888
Link ?
>>107877902
Even i know thats severly outdated anon. Im pretty sure the current gen capable of doing like "Grok, make her wear a bikini" now
>>107877984
Lol Grok is cucked anon, you cant edit anything that has exposed skin now, both Anime and Real
>>
File: 1746234990646597.png (1.24 MB, 1168x880)
1.24 MB
1.24 MB PNG
>>107878031
this time with one less hand
>>
>>107878042
>Im pretty sure the current gen capable of doing like "Grok, make her wear a bikini" now
you appeared at the right time because the best edit local model got released today and it's called Flux 2 Klein >>107878041
>>
>>107878041
prompt? are you saying to keep it in the style of the other image or no?
>>
>https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite/issues/610
>this still hasnt been fixed
kosinfag fucking FIX your shit I want my high quality wan previous you piece of SHIT
>>
>>107878050
Link ?
>>
>>107877194
DELETE THAT OFF TOPIC SHIT FROM THE OP ALREADY FFS
>>
File: 1755149205292144.png (1.4 MB, 1360x768)
1.4 MB
1.4 MB PNG
haha, this is a pretty good one. misato but teto:

replace the girl in image1 with the girl in image2, wearing the same clothes.
>>
>>107878053
>prompt?
>Replace the man on the right by Hatsune Miku, replace "Yes, I'm ready to go" by "Miku? What are you doing here?"

>are you saying to keep it in the style of the other image or no?
it depends, sometimes you have to help it a bit for example for that one >>107878011
I went for this
>Add to the right side of image 1 a 3d render of the female character from image 2, she is facing left
>>
>>107878054
Sounds like something somebody needs to vibecode and fix
>>
>>107878058
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
>>
>>107878076
somebody gotta DO SOMETHING
and that one
is not me
>>
REEEEEEEE

Plugging in a mask into infinitetalk makes me oom on a fucking 5090 with full offload of the models unless I go down to 480p.

Then the mask doesn't even fucking work, it's still changing the gun entirely. The mask is just the face as a video.

Is there anything else that lipsyncs as good as infinitetalk and is capable of masking?
>>
Teto? pff, for me, it's Rin.
>>
>>107878091
>somebody gotta DO SOMETHING
ahah it do be like that mr stancil
https://youtu.be/lq_dM0y86pQ?t=405
>>
File: 1763591289631978.png (1.4 MB, 1360x768)
1.4 MB
1.4 MB PNG
>>107878071
>>
>>107878076
i might try unironically throwing this to gemini to see if it can fix it, but I thought our resident fag (bigstation) might have already tried it. didnt you faggo?
>>
>no hype for ltxv2
why
>>
>>107878143
That's all that gets posted now lol, haven't seen wan in a while
>>
>>107878143
Basically impossible to deliver the videos here in a way anon will bother to even look.
I still use it and am genning stuff right now. I just can't be assed to share it here because the board won't allow sound.
>>
File: 1756291752215179.png (1.9 MB, 1744x768)
1.9 MB
1.9 MB PNG
>>107878099
all right you asked for it
>>
>>107878156
>I just can't be assed to share it here because the board won't allow sound.
that's why wsg exists >>>/wsg/6072442
>>
>>107878165
That place is pretty dead
>>
>>107878150
look at this thread again and repeat what you said
>>
>>107878165
I do share my stuff there. I just... don't think it's worth calling attention to because it's all kind of experimental.

>>>/wsg/6073567
>>
>>107878172
yeah :[, they don't allow images so it's a huge deal breaker, why can't we have both?? sad
>>
>>107878157
What did you use for this? It looks pretty good
>>
File: 1758750835746784.jpg (2.59 MB, 1248x1824)
2.59 MB
2.59 MB JPG
>>
File: 1756332272937960.png (2.37 MB, 1216x1216)
2.37 MB
2.37 MB PNG
>>
>>107878182
Flux 2 Klein, it got released today
>>
try using 8 steps with the distill workflow, 4 is already super fast as is.
>>
>>107878181
The most active thread on nearly every board is the AI thread. It makes no sense for there not to be an AI board at this point.
>>
File: 1760963263557103.png (1.88 MB, 1184x1280)
1.88 MB
1.88 MB PNG
>>
>>107878191
I think I'll just use distill for image gen and base for edit, both excel respectively in that regard. I don't see a point in settling for sub-par editing when I know the base will do a better job.
>>
>>107878190
neat thanks
>>
File: Flux2-Image_00019_.png (1.05 MB, 1200x672)
1.05 MB
1.05 MB PNG
>>
>>107877194
>Maintain Thread Quality
https://rentry.org/teto
>>
File: 1751248207822338.png (2.34 MB, 1152x1312)
2.34 MB
2.34 MB PNG
>>107878204
>links to a ponyfag page
lole
>>
I've found latest Qwen to be good with edits too, in particular, IDs.
>>
File: 1762517761892570.png (1.89 MB, 1504x1024)
1.89 MB
1.89 MB PNG
>>
klein can remove clothes, but the flux tit remains.

if we had attention manipulation we could solve it.
>>
File: 1764273472703745.png (1.98 MB, 1280x1184)
1.98 MB
1.98 MB PNG
meet my wives
>>107878220
just do a detail pass with chromer (lol)
>>
File: Flux2-Klein-T2I_00001_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>Prompt executed in 12.07 seconds
Fuck me that's quick
>>
>>107878228
Huh? Where have I seen this before?
>>
>>107878097
Have you tried with LTX? I've never done video masking before
>>
Klein ist gut, nicht wahr?
>>
File: 1756436197551932.png (1.79 MB, 1056x1408)
1.79 MB
1.79 MB PNG
>>
>>107878246
Someone was pissing and moaning about some stupid bullshit about pic rel a few threads back, I uploaded the image to chatgpt and asked for a detailed description of the image so I could try recreating it in future models
>>
File: 1761041411704337.png (2.06 MB, 1152x1312)
2.06 MB
2.06 MB PNG
>>
File: 1757171561038541.png (1.9 MB, 1536x992)
1.9 MB
1.9 MB PNG
>>
>>
File: 1739716692479000.png (1.91 MB, 1632x928)
1.91 MB
1.91 MB PNG
>>
>>107878255
Lumiiiii :3:3:3:3:3;3:3
>>
File: 1741009722179263.png (2.05 MB, 1632x928)
2.05 MB
2.05 MB PNG
>>
>>107878192
because other board will have no traffic
>>
>>107878275
is this just klein edit on manga panels?
>>
File: 1740604359641347.png (2.14 MB, 1568x960)
2.14 MB
2.14 MB PNG
>>
>>107878278
hmmph
>>
>mfw forgot the autoposter on
whoops
>>107878286
no these are my z-image gens I had cooked, im downloading klein right now to test.
also these are done through captioning, not editing.
>>
File: Flux2-Klein-T2I_00012_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
Schizoprompts with K9B-distilled, will try the same prompts with base next
>>
is zit obsolete?
use case?
>>
File: Flux2-Klein-T2I_00018_.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>107878293
>>
>>107878280
Reminded me of M. Bison's dolls. I take it that image is based off the KPop Demon Hunter film?
>>
>>107878293
its a good model sir, happily retarded. bless its tender heart.
>>
>>107878301
nightrein dlc
>>
>>107878288
>>107878255
are you the real lumi from /sdg/? what do we have to do to make you stay here and not go back to those filthy anons.
>>
File: Flux2-Klein-T2I_00027_.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>107878301
>>
>>107878303
nope, its about a fetish ecchi anime that is airing right now (mato seihei no slave)
>>
which text encoder is compatible with qwen for forge neo?
>>
>>107878305
forgot to take off your name, retard.
>>
>>107878311
ONE MILLION DOLLARS!!
>>
>>107878299
>is zit obsolete?
absolutely not, it's still the best text to image model for realism, Klein is the best at editing, which is different use cases
>>
>>107878316
>mato seihei no slave
Thanks.
>>
>>107878319
or did i?
>>
expressions were certainly censored on klein lol, sexual expressions turn into either default resting face or anguish
>>
>>107878299
zit is good. very good. klein pretty neat tho
>>
klein can't lynch niggers.

it's ogre
>>
>>107878337
how terrible for you!
>>
is there any reason to use dev over klein?
>>
>>107878320
>>107878328
>>107878336
Lumi stay here! , you don't need Debo!. We can give you all the head pats you want! :3
>>
>>107878351
prove it



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.