[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107916419

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: SOON.gif (1.86 MB, 498x274)
1.86 MB
1.86 MB GIF
https://github.com/Comfy-Org/ComfyUI/pull/11979
>>
File: 1756932967628290.png (2.4 MB, 1680x848)
2.4 MB
2.4 MB PNG
>A woman with long blonde hair drinking coffee on a bar, Use the style from the reference images
it's not perfect but it's pretty good, Klein is really a solid model >>107918826
>>
File: 1744927011473182.png (21 KB, 628x192)
21 KB
21 KB PNG
>>107918862
XD FIRST!! CHUNGUS
>>
File: 1743627299064627.png (2.83 MB, 1573x880)
2.83 MB
2.83 MB PNG
>>107918880
meh, I wish edit models were trained more on style transfer, this shit has potential
>>
seems inpaint isn't working as intended with Flux Klein, it modifies the image outside the masked area.

Anything that can be done?
>>
Blessed thread of frenship
>>
>>107918862
>>107918368
>I just need a bdsqlsz tweet now and ill calm down
https://xcancel.com/bdsqlsz/status/2013524951492682079#m
there you go anon kek
>>
>>107918862
>blueballs.gif
>>
File: 1754237140535209.png (677 KB, 2114x1043)
677 KB
677 KB PNG
https://xcancel.com/bdsqlsz/status/2013501904404258924#m
>Qwen Image Turbo
come on dude, just give up on that piece of crap lmao
>>
File: 1754453887588088.png (3.39 MB, 1536x2040)
3.39 MB
3.39 MB PNG
>>107918851
Too brainlet to use ComfyUI so ill just stick with Illustrious. Feels like a waste on my RTX 5070ti but thats what my brain capable of
>>
>>107918971
>How do you add three reference images?
take my workflow anon https://files.catbox.moe/c6qpph.json
>>
>>107918994
you're actually smart for avoiding the worst interface ever conceived. enjoy prompting anon, using nothing but constant frustration about its bugs and bloat
>>
>>107918994
What are you trying to do with comfy. I used to think I was too brainlet, too. But it’s easy to find good workflows.
>>
File: 1739767429280052.jpg (3.86 MB, 1536x2040)
3.86 MB
3.86 MB JPG
>>107919011
>>107919012

I heard ComfyUI is more optimized for Poor GPUs. But i absolutely no idea how to use it. I hate nodes so much. I hate that in Blender, I hate that in ComfyUI. I have no idea why """They""" enforce Nodes so much
>>
>>107919027
don't bother. comfy is garbage.
>I have no idea why """They""" enforce Nodes so much
mental illness
>>
>>107918994
Just use the templates. You'll eventually figure it out. That said comfy isn't all that great.
>>
>>107919027
>>107919043
Comfy is the only way to experience new models at they drop or sometimes at all.
>>
Is there any proper desktop program for this shit is it all python website garbage?
>>
File: Klein 9b.png (2.44 MB, 1678x1024)
2.44 MB
2.44 MB PNG
Jesus Fucking Christ
>>
>>107919057
sdcpp uis are getting traction, look into that. poothon has more time invested in it but people are finally figuring out that it's a dead end
>>
>>107918994
Comfy as unironic UI is completely unusable. Use comfy + krita plugin or just vibe code the UI you want using comfy via API
>>
File: ComfyUI_00038_.mp4 (1.62 MB, 656x448)
1.62 MB
1.62 MB MP4
Enlo ladies and gentle men.
How can i solve "gray" frames and flickers in my wan 2.2 loop.
Video very related.
>>
>>107919085
>Enlo
try the anime diffusion general
>>
File: flicker.jpg (583 KB, 2554x1210)
583 KB
583 KB JPG
>>107919085
The workflow.
>>
>>107919086
Anything else?
>>
>>107919089
>those pain in the ass looking nodes
who were thought this was a good idea.
>>
>>107919064
it's cursed but i am impressed it got the outfit right, this is the 9b one right?
>>
>>107918994
>>107919027
use templates there is a small learning curve but it's quite easy to use. imo one of the best advantages is the automation you can do with it. just make sure to have enough RAM if you plan on switching between multiple models.
>>
>>107918862
>>107918974
so sick of this shit, don't even care anymore
>>
Can SWARMUI use Flux 2 Klein ?
>>
>>107919002
thanks, anon
>>
>>107919103
yeah it's the 9b model
>>
>>107919064
I FORGOT ME CRACK COCAINE GROMIT
>>
>>107919127
>>107919151
>>107919163
epic samefagging bro
no wonder your frontend is shit and no one uses it
>>
>>107919127
>i'm nooooticing
what I'm noooticing is that ComfyUi has 100k stars on Github, a man with an average IQ would go to the conclusions it has enough fans spread everywhere, including /ldg/, but it seems like you were unable to go to that conclusion, curious
>>
>>107919170
>>107919177
epic samefagging comfy-anon
>>
I don't care about this shit but I have stirring the pot in this argument..heh heh heh fight my little puppets, fight
>>
Z-Base status?
>>
>>107919195
this
>>
>>107919195
It's me I am one of those schizos, it's nice to get noticed
>>
>>107919217
5 comfy credits and 10 bucks a month is a lot in his third world shithole anon
>>
>>107919259
>google gemini
>we should all go use it
local diffusion?
>>
File: 1756140582497318.png (1.73 MB, 1280x1280)
1.73 MB
1.73 MB PNG
what happened?
>>
any flux klein nsfw loras yet
>>
>>107918954
Replace the reference conditioning subgraph with an inpaint model conditioning node, and connect the image+mask to that.
>>
File: AAAA.gif (2.78 MB, 304x210)
2.78 MB
2.78 MB GIF
>>107918862
How many fucking commits must we see before finally getting this fucking model???
>>
>>107919300
there are
https://civitai.com/models/2322539/nude-legs-spreading-klein-9b?modelVersionId=2612784
>>
File: Klein 9b.png (1.98 MB, 1803x784)
1.98 MB
1.98 MB PNG
>>107919338
>>
>>107919354
garbage
>>
File: Klein 9b.png (2.87 MB, 2592x800)
2.87 MB
2.87 MB PNG
>>107919354
>>
>>107919377
Now that's useful. There is no bigger grifters than this "image hosting" sites.
>>
>>107919395
facts, fuck them
>>
File: ComfyUI_00042_.mp4 (1.64 MB, 704x512)
1.64 MB
1.64 MB MP4
I am still looking to avoid the last frames shift in my wan 2.2 loop.
halp.
>>
>>107918851
Pic in the middle is terrible, with a katana with a sabre tip, and the girl resting the edge on her shoulder.
>>
>>107919449
/g/ technology & slop, you should go to /ic/ artwork & manmade slop
>>
>>107918988
wow, such amazing seed variance
>>
>>107919511
first qwen?
>>
>update arch
>comfy is now fucked
and I was having so much fun
>>
Does anybody else have this weird issue with LTX I2V where it seems to kinda ignore the positive prompt for the video part and instead "narrates" the prompt in audio instead?
>>
>>107918994
>>107919027
dodged a bullet. comfy is unusable garbage for midwits
>>
>>107919560
I've had lots of issues but not that one
>>
>>107919560
I have had that one. It takes the description of the video as a voice prompt.

Not sure how to completely avoid it. I think it's just gatcha.
>>
File: 1740176481171208.png (1.59 MB, 1360x768)
1.59 MB
1.59 MB PNG
https://www.youtube.com/watch?v=UdDc0W7GpZM
>>
>>107919560
I have had that issue. It went away strangely on its own using a different image reference. If youre schizo like me, Id slightly modify the reference image and try it again.
>>
>>107919585
I don't think there's much competition for comfy if you need to be able to build custom automated workflows.
>>
>>107919616
I'm using a video as an input. I'll try to fuck around with the prompt some more and see if that helps. I kinda suspect that the input frames from the video fuck with the LTX encoder in some way, since it can't influence those video frames it just narrates audio over them and then when it takes over for the video generation it just continues with the narration.
>>
>>107919673
>I'm using a video as an input.

Mouth movements and sound are basically joined at the hip with this model. Is there someone in the video talking? If someone is talking and there's no audio cue to go off, I imagine it's just going to eat the prompt as the script.
>>
Missed a couple shitstains
>107919043
>107919057
>107919070
>107919182
>107919222
>107919370
>107919585
>>
>>107919686
There's someone NOT talking and then they're supposed to talk afterwards. And I think that's fucking with the model. It sees somebody not talking, it reads "she says:" somewhere in the prompt, and it goes GUESS I'LL DO IT MYSELF. So I need to try and steer it to not talk in the first second or so.
>>
my comfy is completely fucked
arch upgraded to Python 3.14.2 and now nothing works.
Can I just delete the venv and reinstall everything?
>>
>>107919712
i think your whole install might be fucked now anon
reinstalling python and comfy from scratch might help but if i were you i'd never use cumfy ever again. it's only gonna get worse
>>
>>107919712
isn't the whole point of the venv to prevent that sort of thing from happening?
>>
>>107919711
In my experience it's a fairly rare issue though. Did you try just changing the seed?
>>
Alright, my impression of Flux 2 Klein after messing around with it for a couple of days: if you're willing to spend a bit of time, you can do virtually anything with this model. This statement will obviously age poorly as we'll likely have Nano Banana Pro level stuff local within 2-3 years at this rate and F2K will look like a pathetic joke, but my point is that by throwing in enough reference images, generating 5-10 for the best result, feeding that back in and making further changes, there's a ridiculous amount you can do even on the base model which obviously has a bunch of anatomy/sexual terms fucked with in training.
>referenced faces are inaccurate except from perfectly front on
I've seen this said; I think it's partly correct but if you give it a few rerolls on a "swap X character for Y character" prompt you'll get something of good quality, and you can then fix it up further by feeding the result back in and asking for exactly the same replacement again.
>still a bit too stupid for more complex composition requests
again, half a dozen rerolls and you'll get something that's broadly correct with a couple of bonus hands or flyaway legs here and there. feed it back in and you can fix them quite easily. it takes quite a few feedback loops before the image starts to really degenerate akin to 4o turning things into sepia piss, and its changes don't even seem to be directionally consistent so sometimes just balance out despite many loops.
>anatomy/sex bad
despite the workarounds above, yeah this is still the really difficult thing. it'll do alright tits and basic innie pussy but terrible dicks/balls, no sex, i spent like 30 minutes trying to re-add a ballsack to an otherwise good 69 gen that had disappeared the scrotum and it was unfixable through any means. hype for finetune
>>
>>107919727
python had plenty of good ideas but every single one turned into disaster
>>
>>107918974
I KNEEL XI
I KNEEEEEEL
>>
>>107919729
Yeah I'm changing the prompt and the seed is randomized, it seems to be better now but still a bit fucky. But that's LTX I guess.
>>
>>107918974
Just two weeks. Two more weeks.
>>
>>107919712
>arch upgraded to Python 3.14.2
well then rolback retard. skill issue
>>
>>107919762
hes too retarded to figure out how to pin python versions
>>
>>107919762
wtf? How do I do that?
>>
File: file.png (306 KB, 597x643)
306 KB
306 KB PNG
our darling bdsqlsz is shilling Flux Klein now
chinese bros whats our response??
>>
>>107919762
python issue*
>>
>>107919759
Some irrelevant avatartroon on the verge of suicide
>>
>>107919774
everyone is tired of finding workarounds to every lame fucktarded python design issue
>>
>>107919777
blessed trips of truth
>>
>He ban evaded then immediately proceeded to insult and samefag like a lunatic.
>>
>>107919776
This dude is basically Chinese Furk and nobody here has the balls to admit it due to sunk cost fallacy.
>>
>>107919800
when the bdsqlsz guy tweets, I cream my pants
sorry not sorry!
>>
>>107919796
who?
>>
>>107919777
>>107919789
which is why peaople are all flocking to some 33 star dogshit imgui wrapper, right?
god, you're such a worthless subhuman retard
>>
WHy does CivitAI have Klein 9b and Klein 9b base as separate categories? arent you supposed to train loras on Klein 9b base no matter what? Loras trained on base work with 1 cfg distil just fine
>>
File: Klein 9b.png (937 KB, 1328x784)
937 KB
937 KB PNG
>>107919776
desu that's a really risky move, he could unironically lose his inside status by shilling other models than China's model
>>
>>107919812
see >>107919782
>>
>>107919817
its managed by indians
>>
Can onetrainer train klein 4b loras yet?
>>
>>107919813
>can't even discuss glaring python issues without some schizo screaming about 33 stars
seek help
>>
>>107919849
your suicide will be celebrated by your family
>>
>>107919849
issues with package management and versioning (btw its the same with every other language) have been resolved for a while now in python.
>>
cumfart shills are really something else
>>
>>107919849
>>107919875
no one is EVER gonna use your 33 star garbage, you subhuman faggot
>>
>>107919864
>btw its the same with every other language
Read about content-addressing and the Unison programming language. Solves everything in the most elegant way possible. For realistic use cases, nix pretty much solves dependency management. Now if only it was statically typed.
>>
>>107919828
yeah
>>
File: file.png (2 KB, 214x93)
2 KB
2 KB PNG
For some reason the VAE decode node at the end of my LTX workflow completely ignores the --reserve-vram argument and just maxes out my card. Anybody know what's up with that? It still finishes the task, it just makes my PC stutter uncomfortably and I don't like it.
>>
>>107919928
go for vae decode (tiled)
>>
File: sjkvrkjkehqe1.jpg (240 KB, 1179x1616)
240 KB
240 KB JPG
Can we talk about the big pink elephant in this room? WD tagger, the tool every SDXL user uses to tag and utilize, hasn't been updated in over a year and a half. How can we expect SDXL to improve if we continue with an outdated tag recognition model?
>>
>>107919828
it's still work in progress
https://github.com/Nerogar/OneTrainer/pull/1261
>>
>>107919935
Oh shit I totally forgot that existed. Thanks.
>>
File: Klein 9b.png (1.93 MB, 1588x768)
1.93 MB
1.93 MB PNG
>>
Cumfart UI confirmed day 1 support for ace step 1.5.

LoRA training confirmed working too and extremely quick.
>>
>>107919957
In an real life situation, someone would have hit you so hard in the face, the wall next to you would have hit you back, before getting thrown out like a dirty rag.
>>
File: Klein 9b.png (2.12 MB, 1814x768)
2.12 MB
2.12 MB PNG
>>
>>107919992
what it feels like posting in this thread when mods are nuking all the schizoposts
>>
>>107919987
we got a tough guy here fellas
>>
>>107920000
kek, it do be like that
>>
Guess which is which.
>>
File: 1766087813045019.png (1.14 MB, 816x1264)
1.14 MB
1.14 MB PNG
the anime girl in image1 is in the style of the anime girl in image2.

image1: miku, image2: 2b

these models are pretty great for concept art or coming up with ideas, even if you are creative it's still neat that you can plug 2 images in and get a hybrid/idea.
>>
>>107919817
Good question
>>
File: 1757731184218324.png (1.14 MB, 736x1408)
1.14 MB
1.14 MB PNG
>>107920020
and with the images swapped:
>>
>>107920031
?
>>
>>107920003
We have an angry little shit here, guys.
>>
File: Klein 9b.png (2.21 MB, 1608x880)
2.21 MB
2.21 MB PNG
>>
File: 1764695355234580.png (1.19 MB, 816x1264)
1.19 MB
1.19 MB PNG
>>107920034
miku + rei fusion:
>>
>>107920015
left is ultra platic, so my guess i qwen, right is klein then? I wonder if these same edits at Q4 in flux2 would produce better results
>>
>>107920054
prompt? nice mikudayo
>>
>>107920070
>Remplace the monkey by the character from image 2, keep the pose the same

>nice mikudayo
thanks
>>
>>107920043
>no u
>>
when is someone going to make a klein sdxl finetune?
>>
File: 1763308106423419.png (1.55 MB, 832x1248)
1.55 MB
1.55 MB PNG
replace the red hair girl in image1 with the girl in image2.

nothing against kasumi but just testing. actually a pretty tricky edit but it got it right without clipping other stuff. klein is pretty neat.
>>
>>107920102
ai slop
>>
File: Klein 9b.png (1.92 MB, 1439x880)
1.92 MB
1.92 MB PNG
>>
ComfyUI execution error: No operator found for `memory_efficient_attention_forward` with inputs:
query : shape=(1, 1999, 16, 64) (torch.float32)
key : shape=(1, 1999, 16, 64) (torch.float32)
value : shape=(1, 1999, 16, 64) (torch.float32)
attn_bias : <class 'NoneType'>
p : 0.0
`fa3F@0.0.0` is not supported because:
xFormers wasn't build with CUDA support
dtype=torch.float32 (supported: {torch.bfloat16, torch.float16})
operator wasn't built - see `python -m xformers.info` for more info
`fa2F@0.0.0` is not supported because:
xFormers wasn't build with CUDA support
dtype=torch.float32 (supported: {torch.bfloat16, torch.float16})
operator wasn't built - see `python -m xformers.info` for more info
`cutlassF-pt` is not supported because:
xFormers wasn't build with CUDA support

Thanks Cumfy
>>
>>107920134
proof?
>>
>wasn't build
tfw brown terminal
>>
File: soom.png (1.58 MB, 832x1216)
1.58 MB
1.58 MB PNG
>>107920102
>nothing against kasumi but just testing
based
>>
yeah bro hold on my cup i need to spam another 30 edits, i'll soon begin with yeah you guessed it, pepe the meme frog at the beach!
>>
File: ComfyUI_00894_.png (1.7 MB, 1088x1088)
1.7 MB
1.7 MB PNG
>>107920066
Correct.

Oh yeah, I saw some comparisons where the 4b was outperforming the 9b. But something must be wrong.
>>
>the reddit tranny that steals images from here is fuming at AI's existence
kek
>>
File: file.png (1.79 MB, 1488x832)
1.79 MB
1.79 MB PNG
can you guys start reporting the single word reply flooder? he's more annoying than the schizos
>>
File: 7769.png (1.39 MB, 768x944)
1.39 MB
1.39 MB PNG
>something something not related to local diffusion
>>
>>107920189
kek, keep seething loser
>>
>>107920134
why is cumfart like this? what a failure, why do people put up with this piece of shit?
>>
>>107920234
You know you do this because of low dopamine levels, right? There are meds for that. You should see a doc
>>
File: 1759498844542888.png (1.08 MB, 912x1136)
1.08 MB
1.08 MB PNG
the character in image1 is dressed as the character in image2, and has the same hairstyle as the character in image2.

tetodayo:
>>
File: 1767370817076891.png (1.17 MB, 912x1136)
1.17 MB
1.17 MB PNG
>>107920253
ok there we go, but with original hairstyle but colored.
>>
>>107920134
>>107920234
Anustudiolickers are so transexual, subhuman and retarded
>>
File: 1742365390621701.png (2.24 MB, 2040x768)
2.24 MB
2.24 MB PNG
>>107920253
>>107920264
kawaii!
>>
>>107920272
are you using the distil version or base? 4 steps? seems to be fine with 4 so far
>>
>>107920272
can you edit a pick of a woman giving birth and replace the baby coming out with an anime girl
>>
>>107920278
9b distill at 10 steps, it looks less slopped with more steps imo
>>
File: Flux2-Klein_00205_.png (2.03 MB, 1360x912)
2.03 MB
2.03 MB PNG
>>107920189
>>
>>107920284
can you prove that?
>>
>>107920284
ok, yeah I did some text gens and it was much nicer with 8.
>>
File: 1752057071566400.png (1.1 MB, 1216x848)
1.1 MB
1.1 MB PNG
replace the character on the right in image1 with the character in image2, keep the pose the same.
>>
>hatsune miku testing
>hatsune teto testing
we are so back
>>
File: 1739957757696505.png (1.23 MB, 1216x848)
1.23 MB
1.23 MB PNG
add the character in image2 behind the two characters in image1, keep the pose of the character in image2 the same.

neat how the hair is on diff layers without messing stuff up
>>
>>107920318
kek
>>
File: 1758602922846326.png (988 KB, 1264x816)
988 KB
988 KB PNG
give the man black skin like a black man. replace the cigar with a white joint. give him an afro.

elon musk if he was from north africa:
>>
File: 1766698131804716.png (1.82 MB, 1880x752)
1.82 MB
1.82 MB PNG
>>107920189
>>107920318
based
>>
File: 1759195957798848.png (1.07 MB, 1264x816)
1.07 MB
1.07 MB PNG
>>107920380
give the man asian skin like a asian man. the man is holding chopsticks in the air. give him a fumanchu moustache.

elon ma
>>
why ***troons are so angry?
>>
File: 1754101501519799.png (1.64 MB, 1200x864)
1.64 MB
1.64 MB PNG
replace the asian man on the right in image1 with the anime girl in image2 holding a brown rifle. keep the pose unchanged.
>>
File: 1742074314001900.mp4 (3.73 MB, 1152x768)
3.73 MB
3.73 MB MP4
>>107920318
>>
What's the most up to date, most model supporting NAG node?
As a side note, is there any that support klein yet?
>>
>>107920408
yeah
>>
File: 1752983835520712.png (2.19 MB, 1714x832)
2.19 MB
2.19 MB PNG
>>107920387
>elon ma
even though Klein is an impressive model, its face consistency isn't that great with multiple people, let's hope Z-image edit will be better on that
>>
>>107920408
>What's the most up to date, most model supporting NAG node?
https://github.com/scottmudge/ComfyUI-NAG
>>107920408
>As a side note, is there any that support klein yet?
not yet
>>
>>107919193
>he trusted comfy
>he trusted tonguey
>he trusted random anime man
>>
File: 1739758973481142.png (31 KB, 718x281)
31 KB
31 KB PNG
!! HEADS UP !!
https://huggingface.co/Comfy-Org/z_image_omni_base
https://huggingface.co/Comfy-Org/z_image_omni_base
https://huggingface.co/Comfy-Org/z_image_omni_base
>>
File: 1748622814311446.png (1.68 MB, 1200x864)
1.68 MB
1.68 MB PNG
>>107920404
replace the asian man on the right in image1 with the anime girl in image2 holding a brown hunting rifle with one hand, and holding a cigarette with the other. keep the pose unchanged.

that's a bit better.
>>
>>107919193
>Z-Base status?
Just 2 more commits bro! >>107918862
>>
File: 1753240951525889.png (466 KB, 720x720)
466 KB
466 KB PNG
>>107920448
you sneaky son of a bitch I really believed it
>>
File: 1768842690889500.png (30 KB, 914x547)
30 KB
30 KB PNG
>>107920448
ebin
>>
>>107920431
i suspect Z isn't going to beat klein on all fronts, it will be a case of both being good for something else
like Z for anatomy and text rendering and Klein for seed variance, non-photorealistic stuff etc
>>
File: 1747486179133040.png (1.7 MB, 1200x864)
1.7 MB
1.7 MB PNG
>>107920452
replace the asian man on the right in image1 with the anime girl in image2 pointing a brown hunting rifle to the right and firing it. she has a cigarette in her mouth.

rooftop mikus:
>>
>>107920494
she looks badass as fuck there
>>
>>107919937
Weren't there a tagger made by dudes making a finetune? Noob or something else.
>>
File: 1749863208790452.png (1.77 MB, 1200x864)
1.77 MB
1.77 MB PNG
>>107920494
stay away from my store!
>>
File: 1767615096818417.png (1.55 MB, 848x1216)
1.55 MB
1.55 MB PNG
>>
File: file.png (23 KB, 606x358)
23 KB
23 KB PNG
another day of chang wakes up, merges in a one line update to README.md, goes to sleep, two more weeks for Z base
>>
>>107920515
i forgot
>>
File: 1762132406045281.png (122 KB, 1857x907)
122 KB
122 KB PNG
>>107920563
>their previous commit was 2 weeks ago
ROFL
https://youtu.be/9v-33jcEDk4?t=23
>>
>>107920605
You mom forgot to close the trash bin your fetus was thrown in. That's why you escaped.
>>
which do anon prefer in comfyui? node 2.0 or the old one?
>>
>>107920655
proof?
>>
>>107920663
>thing that doesn't work with most custom nodes
vs
>thing that works
gee anon, not sure
>>
>>107920675
Look up your ass.
>>
>>107920687
how?
>>
>>107920698
Ask your mom.
>>
>>107920719
no
>>
>>107920724
Yes.
>>
File: 1751209007812334.png (2.6 MB, 1536x992)
2.6 MB
2.6 MB PNG
hey gwailo, base soon ror
>>
>>107920752
image thief
>>
why isnt my gen in the collage thats it im going to take a shit on this general for the next 8 hours
>>
>>107920824
corroboration?
>>
File: fk9b_00004.png (1.68 MB, 960x1440)
1.68 MB
1.68 MB PNG
>>107920824
gen better?
>>
>>107919935
But it produces horrible color mismatches where the tiles get stitched
>>
>>107920833
>>107920855
hold on let me take my meds
>>
File: 1768830847517499.png (187 KB, 876x591)
187 KB
187 KB PNG
>mfw fags still think Z-base will get released
>>
>>107920663
old one obviously. most of the main custom nodes work with 2.0, but i still hate how big it feels. the peformance is also just objectively worse somehow, which defeats the purpose of it existing
>>
>>107920909
Has anyone that's not a subhuman retard weighted in?
>>
File: fk9b_00130.png (1.95 MB, 960x1440)
1.95 MB
1.95 MB PNG
>>107920909
this is incredible bait
>>
>>107920909
Ani should sit down on a cactus
>>
>>107920909
No one cares about your dogshit opinions. Anonymous imageboard. This is namefagging with extra steps. Do you think you're an authority here or something? Retard.
>>
>>107920909
>the scizo on the subway said
good bait
>>
Is Klein 9B actually better at editing than Qwen 2511?
>>
>>107920909
>base loras won't work on turbo, which sucks
(You)
>>
>>107920909
>pavlov ringing the bell
>>
>>107920909
fuck it heres another you
>>
>>107920938
>>107920954
>>107920967
>>107920994
How does it feel to have to samefag to make it seems like people don't find you absolutely insufferable?
>>
File: 6.png (2.65 MB, 1920x1072)
2.65 MB
2.65 MB PNG
for me its american culture
>>
>>107921051
He looks pretty happy doe
>>
Does anyone know the best alternative for removing/changing clothes now that Grok is unavailable ? Is Flux.2 Klein a good solution ?
>>
>>107921081
your imagination
>>
>>107920917
they will likely remove the old node ui just like they removed the old mask editor
>>
>>107921081
Wait until someone makes a proper nsfw finetune of klein 4b. It has a permissive license + small enough to be actually trainable without costing a fortune.
>>
>>107921081
>removing clothes
kontext already has a lora for that
>>
>>107921159
>who
wrong question
>>
>>107921172
NTA but like an actually good one?
>>
>>107921081
>Grok is unavailable
Did it get pozzed?
>>
>>107921196
usable. better than qwen edit because kontext does keep all other things unchanged unlike qwen edit which often shifts the scene
>>
>kontext
>>
I am sickened by how many people came to this thread looking for Grok alternatives.
>>
>>107921223
sar pls
>>
>>107919300
>any flux klein nsfw loras yet
Yeah, it learns quick. concept test lora: https://litter.catbox.moe/k9nbudxex5f7oc6j.jpg
>>
>>107921211
Which one do you refer to? I am seeing a few loras that fit the description on civarchive.
>>
>>107921237
insane, do post
>>
File: 1767395735006272.png (46 KB, 377x151)
46 KB
46 KB PNG
average Grok alternative searcher
>>
>>107921241
I downloaded from pastebin a long time ago and I forgot the link. but I guess people put it on hf
>>
when will we get an ACTUAL next gen video model? Wan is unbelievably shit, and LTX-2 is a joke. Not even asking for sora2 level
>>
>>107921266
Alright, I am not going to get worked up too much over it.
We will likely get something better in klein or z-(omni )base/edit in the following months anyway.
>>
File: sez.png (2.2 MB, 1344x768)
2.2 MB
2.2 MB PNG
alright where are the prompt chads at? i tried throwing a "serial experiments lain in the art style of invader zim" prompt into the free online image gen ones but it's not quite understanding.
i just want a nice 16:9 image.
>>
>>107921298
might need to use a lain reference image
>>
>>107921289
exactly. people already built the dataset themselves, so it's just a matter of time these loras get released
>>
bro where the fuck is base did we seriously get chinese cultured again?
>>
>>107921341
Normally Chinese people have to work hard to Chinese culture other Chinese people. But when it comes to you guys, it's like shooting fish in a barrel.
>>
>>107921341
We are getting american cultured and the base will be free for a 15$/month fee.
>>
File: ComfyUI_04017_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>107921237
How's training, any unusual settings to follow? Which trainer and model?
>>
File: Jeff Lightyear.png (1.29 MB, 832x1248)
1.29 MB
1.29 MB PNG
>>
>>107921400
based
>>
File: MasterJeff.png (1.43 MB, 832x1248)
1.43 MB
1.43 MB PNG
>>107921416
the flux klein shit is funny af
I'm surprised the Big Fucking Losers didnt fuck it up somehow.
>>
>>107921341
Model is probably sorta shit so they're last minute trying to fix it
>>
>>107921424
/r/ing snake
>>
>>107921428
That's a good point actually considering the hype it generated.
I have still to understand what's so special about "base", no one answered yet.
>>
>>107921369
>free for a 15$/month fee
no tip? you aren't required to clap when it generates your images?
>>
>>107921448
"base" implies it's a full base model without distillation or other fuckery that'd impede any lora training, finetuning, etc. much more malleable and more raw than something with RL like z-turbo
>>
>>107921448
better lora training, more variance
>>
>>107921428
>>107921428
Can you distill a good model from a shit base? How do we explain ZIT?
>>
>>107921463
>what is distilling
>>
File: 1739853817779811.jpg (506 KB, 1680x1688)
506 KB
506 KB JPG
>>107918974
thanks doc
>>
>32GB RAM
>16GB VRAM
>2 monitors
>3 browser windows with 15 tabs each
>reading 4chan threads
>watching youtube
>running WAN video gens in the background
life is great honestly
>>
>>107921424
>didnt fuck it up somehow
If Z didn't release, they would have. Would have only been a barely usable shitty ad for their API without the competition putting heavy pressure on them.
>>
>>107921463
ZIT was post-trained and RLHF'd on portraits and other aesthetically chosen images with an overall smaller dataset. A smaller more focused dataset allows the model to better generalize on the data and thus perform better than a full base model would in that domain.
>>
>>107921477
>>32GB RAM
>>running WAN video gens in the background
Yeah, calling the cap on things going that smooth, especially with the shit going on the background.
>>
>>107921477
>32GB RAM
that's below the recommended amount of 64GB.

>16GB VRAM
that's barely enough to do WAN gens. you probably have to use a really shitty quant or supplement with ram, which means slow as shit gens.

your life is comparatively shit. meanwhile

>128GB RAM
>24GB VRAM
>1x 50 inch curved monitor @ 144hz
>only 1 browser open for optimization
>reading 4chan threads
>playing a game
>running WAN video gens in the background
now THIS life is GREAT.
>>
>>107921510
>whole model + te not sitting in vram comfortably
pathetic tbqh
>>
more nsfw 9b testing: https://litter.catbox.moe/hkpym4jp0eugp5r4.jpg

>>107921254
needs more training, I'm not happy with it. Edit works really well, but there's too many anatomy errors.

>>107921391
OneTrainer, training on 9b base. Had to spend whole day tardwrangling these cursed base settings
>>
>>107921519
my car broke and I decided to get a 2025 pickup truck. sadly the RTX 6000 PRO will have to wait.
>>
just get a dgx spark if you're low on vram
it's actually quite good
>>
File: 1768319563617455.gif (1.31 MB, 480x360)
1.31 MB
1.31 MB GIF
>>107919168
>>
>>107921522
Can you share info on training parameters?
Dataset size, steps, learning rate, rank, trainer you are using etc?
Which GPU are you training on and how long does it take?
>>
>>107919511
that issue was more prominent with the first qwen image model than with 2512.
>>
>>107921545
stop posting bad advice. dgx spark is not for inference.
>>
>>107921548
too many questions try again
>>
>>107921554
>Dataset size
>steps
>Which GPU are you training on
>an how long does it take?
The essential ones.
>>
>>107921567
These are literally irrelevant. You want LR and shit retard
>>
File: file.png (231 KB, 1007x1303)
231 KB
231 KB PNG
>>107921497
>>107921510
It's running. Takes about 8mins per gen (no upscale) but I think that's acceptable and it's decent quality.
>playing a game
>running WAN video gens in the background
Yeah I'd love to be able to do that but I'm happy with what I've got.
>>
>>107921585
>node 2.0
>>
>>107921585
thankfully I kept my old gpu when I upgraded. imo, anyone that gens constantly must have dual gpus if you want your gaming life back. or just a dedicated ai machine.
>>
File: 1766447540523059.png (2.09 MB, 1120x1344)
2.09 MB
2.09 MB PNG
1GIRL BROS
>>
File: 1750667771911308.png (2.05 MB, 960x1568)
2.05 MB
2.05 MB PNG
>>
>>107921510
No, you want separate machine for wan running 24/7
>>
File: 1764374057478228.png (1.92 MB, 1152x1312)
1.92 MB
1.92 MB PNG
>>
>>107921610
>1GIRL
are you sure about that?
>>
File: 1751703510458256.png (1.9 MB, 1152x1312)
1.9 MB
1.9 MB PNG
>>
>>107921610
sir your wings are on wrong
>>
>>107921610
Are you sure that is how wings work?
>>
>>107921615
That's too expensive now with current ram/ssd prices.
>>
>>107921619
>>107921626
>>107921627
damn I noticed now the fuck up lol, lemme gen this THOT again
>>
File: hylas and the migudayos.jpg (1.48 MB, 1584x976)
1.48 MB
1.48 MB JPG
>>
>>107919733
after playing with the klein 9b for 3 days, its currently the closest image model we have compared to nanobanana pro at the moment on the image editing side of things. i like klein way more than z image and i have feeling its way better than z image base. I wonder what is the chinks gameplan is on trying to outperform Klein.
>>
>>107921641
>I wonder what is the chinks gameplan is on trying to outperform Klein.
Pretend it never happened.
>>
File: Flux2-Klein_00274_.png (1.5 MB, 896x1152)
1.5 MB
1.5 MB PNG
>>
>>107921641
They didn't plan to even release base because they thought BFL was out of the game
>>
File: hylas and the migudayos2.jpg (1.12 MB, 1584x976)
1.12 MB
1.12 MB JPG
>>107921639
>>
File: 1739214126325618.png (2.36 MB, 1120x1344)
2.36 MB
2.36 MB PNG
alright fixed it up a bit
>>
>>107921641
woah bro thanks for yet another ultra slop 3d shitgen
>>
>>107921673
sir the wings go on the back
>>
The only time yogaposters have ever actually been correct about a model's inability to do it well was with the original SD3.
>>
>>107921679
uhmmm its arm wings chuddie
>>
>>107921682
can it do handstands?
>>
>>107921679
only some harpies have wings on the back. the most common depiction is winged arms.
>>
>>107921691
fuck off
>>
>>107921691
arsehole
>>
File: file.png (36 KB, 800x519)
36 KB
36 KB PNG
>>107920866
try other values
>>
>>107921691
>no rentries
>no fagollage
yeah not migrating here sorry, shant post
>>
>I baked another garbage OP with an irrelevant garbage image instead of collage. Damn they must have felt fucking OWNED now! Tro-lo-lo-lol! Haha, I am a comedic genius!
>>
I think I ask this every time a edit model comes out but;
if you merge images into a single big image, that's the same as feeding two images through the reference node thingy, right?
>>
can we get a real bake not a trollabke pls?
>>
>>107921691
a grown ass man btw
>>
>>107921719
more or less yes but theyll be at lower res (comfy's node resizes the input to 1MP for klein specifically)
>>
>>107921738
i meant one with a collage and rentries
>>
>>107921641
I really like your style
>>107921677
>3d
Hey, there's a whole dedicated general just for you to yell into the void if you want, you schizo
>>
File: 657495434.png (2.87 MB, 1280x1536)
2.87 MB
2.87 MB PNG
btfo'd lmao
>>
based mods nuked the trollbake. now someone make a real one before the schizo does
>>
>>107921691
>>107921676
Where? Why?
>>
>>107921825
kek?
>>
>>107921834
>>107921834
new

>>107921825
retard
>>
bad day to be a proxycuck
>>
oh fuck we doublebaked
>>
>>107921833
>>107921838
stop bullying????
>>
>>107921081
you can do that with wan2.2 or 2.5 with a simple prompt. There loras for that with qwen image edit models but its janky and the details are very poor.
>>
>>107921852
deserved for being hasty.
baking is serious business
>>
>>107921613
awesome
>>
Training Klein-9B is so fucking easy holy shit.

No captions for anything (just the trigger keyword), 40 images in the dataset, 2000 steps and shit's on fire yo.

This is the next SDXL.
>>
>>107922175
>No captions for anything (just the trigger keyword)
ew
>>
>>107919047
it isnt? been usnig it for a while after forge and it works quite well for me. automating a image getting made to it getting upscaled then detailed is pretty nice. what other atlernatives is there to local genning anyway? to my knowledge only comfy/SD/FORGE exist
>>
>>107919027
I thought the same anon, and am the kind of guy who needs to read a sentence 5 times to understand it and just google each complex word to understand. It took me over, I think, at least 3-5 days to understand it? Just took the workflow of a fellow anon and started reverse engineering how it works. why it works and why it's there. If I remove this node and gen, and this happens, I know its job. If I do this and this happens now, I know that. It's just trial and error to reverse engineer a workflow and understand that workflow. I am an iqlet, so I can't innovate or understand new nodes. But after I understand everything in an already-made workflow, I can add a little new stuff (most of those new stuff is just new nodes/ideas from other workflows lmao)
>>
>>107920134
i rarely rlly ever have proplems on comfy. but in genral when i have any issue in anything using code or the like i just use antigravity and let it solve my sht. it was pretty helpful also when i tried to train lorras but it kept crashing. so it helped me make it work stabely. so my point is. try Google Antigravity it gives free cluade opus 4.5 for some reason
>>
>>107921597
am a iqlet who just uses comfy sense i need to tunnel vison to do anything. so what is 2.0? been browsing the thread and realised that theres even version of nodes eixsting.
>>
>>107920133
distilled or full?
>>
>>107920079
It's funny if this ESL shit actually works better.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.