[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

prev: >>107974443

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SD.Next: https://github.com/vladmandic/sdnext


>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Wan 2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Check Metadata of an Image: https://sprites.neocities.org/metadata/viewer
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
Ace Step 1.5 just released
>www.modelscope.com/ace-step-15.pt
Also ZiT base just released
>www.modelscope.com/zit-omni-base.pt
>>
>>107978081
the zit link doesnt load for me
>>
>>107978081
GET HYPED FOR HECKIN BASED CHINA!!!!
>>
>>107978081
I'm waiting for them to release in their hf
https://huggingface.co/ACE-Step
>>
>>107978099
anon if base released we'd have 20 ldg threads by now come on.
>>
>>107978124
But they haven't really released it yet have they? Neither link loads, but maybe it does if you're in China
>>
>>107978081
>>107978124
For Zimage, actual link to follow :
https://huggingface.co/Tongyi-MAI

Soon hopefully.
>>
>>107978132
I didn't check the link, but it ending in pt makes no sense for the website so it's bs, follow my link and the one here >>107978135 they'll release there first
>>
>>107978058
Thank you for baking this thread, anon
>>107978077
Thank you for blessing this thread, anon
>>
File: 1764154198908689.jpg (491 KB, 1824x2288)
491 KB
491 KB JPG
>>
>he even included the boring sd1.5 tier close up portraits
this specific faggollage is more like a participation trophy baka my head
>>
>>107978149
Klein? If so which upscaler did you go wtih, looks great anon
>>
>>
zib is out
>>
>>107978184
Klein can generate large images natively. There's no need to upscale.
>>
>>107978197
can you make her give birth?
>>
>>107978184
Klein 9B with two passes, second at 4MP, nothing special. It's such a great model, still baffled that BFL even released it without a billion "safety" features ruining it.
>>
>>107978242
Proof?
>>
>>107978253
yeah
>>
>>107978250
Hey man can you post workflow to confirm that?
>>
>>107978248
This. Any orientation works as well.
>>
For Klein being so great, I sure don't see that many klein images posted. Threads seem deader than ever. When ZIT was released we were getting 1hour threads for a month.
>>
>>107978277
it's 1AM, there's onlty 2 gens in this thread
>>
>>107978253
check under your foreskin
>>
>>107978058
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
why did you include those in the OP? wtf is wrong with you?
>>
>>107978277
Klein came out essentially a year ago in AI time lol anon was posting about it a bunch the day it released
>>
>>107978129
this is my news outlet
>>107978174
maybe my collage was ironic
>>107978297
because i missed you
>>
>>107978297
he's a maintainer of low thread quality
>>
>>107978288
8PM on west coast and 5:30PM on west coast.
>>
>>107978302
kys schizo nigger
>>
>>107978309
excellent logic. next you will say time zones only exist in canada
>>
>>107978301
anon it was only released about a week ago. threads were still being flooded with ZIT images during that time. In fact I see more people talking about Z Image Base than Klein 9B.

I think most people just don't care to deal with anything related to Flux
>>
>>107978318
majority posting comes from americans
everything else is irrelevant.
>>
>>107978297
Kill ani
>>
is ther still no way to download klein base without doxxing yourself to BFL
>>
>>107978329
yes create an account just for that
>>
>>107978302
based
>>
>>107978302
I don't like that you made a shitty collage
>>
>>107978347
You never make collages tho
>>
>>107978302
what's up with the slop cumfart image? It wasn't even posted last thread
>>
>>107978248
you can upscale old gens with it though. I'm using an anime lora to upscale my illus gens
>>
>>107978321
> but the majority does something else
the majority is also only of average intelligence

I bet you can make a connection between spamming AI slop, intelligence, and herd behavior in humans.
>>
>>107978248
I'm upscaling my old pornstash with it and it's glorious
>>
>cries at the baker
uh oh melty
>>
>>107978358
who the fuck cares retard lmao
>>
>>107978371
cumragui is the most midwits application and most people use that
>>
>>107978385
me, retard, that's why I asked
>>
File: bs33rff.webm (806 KB, 1024x1920)
806 KB
806 KB WEBM
>>107978149
OMGSISA
>>
File: ed5-1959231412.png (180 KB, 680x1112)
180 KB
180 KB PNG
>>107978058
Please include >>>/g/adt/ we are your cute little sister, remember?
>>
>>107978081
Wait, ace step is out?
>>
>>107978423
do you see any gens itt?
>>
>>107978347
i think its a good collage
>>
>>107978437
no because it's a music model
>>
>>107978438
gee I wonder why you narcissistic opportunist
>>
File: file.png (73 KB, 2246x429)
73 KB
73 KB PNG
>>107978423
So you see it in their official hf repo anon?
>>
>>107978438
too old
>>107978302
basado
>>
File: 1756406465457348.jpg (664 KB, 1712x2736)
664 KB
664 KB JPG
>>
that's a very sus pic
>>
>>107978451
yeah
>>
>>107978457
too many Asians on zit release. I am tired of the bugs
>>
10am, still no models
china culture in its purest form
>>
>>107978461
but why add the schizo rentries?
>>
>>107978463
Nope, it's klein 9b, it can also do "kpop idol" like images.
>>
>>107978467
its already out if you know what you are doing
>>
>>107978470
doesn't matter. they were spammed
>>
>>107978470
> no jpg artifacts on the eyelashes, no zit gen

so simple
>>
>>107978484
>jpg artifacts on the eyelashes
is that a zit issue?
>>
>>107978484
yes but Asians were spammed
>>
>>107978500
good thing I like asians
>>
>>107978469
threads dull without you <3
>>
HOW DO I STOP KLEIN FOR MAKING HAIRY WOMEN WITH SO MUCH PEACH FUZZ HOLY SHIT
>>
>>107978515
so you decided to roll the dice and see if the thread slides?
>>
>>107978527
choose one of the hundreds of chinese plastic models as a refiner
>>
File: 1768333893045369.png (350 KB, 1292x1017)
350 KB
350 KB PNG
What generator made this? I haven't seen this prompt format before.
>>
So the big fine tuners are all excited about Klein, Z would have to deliver something really nice otherwise there will be a community split.
The big fine tunes for Klein, the millions of slop Loras for Z.
>>
File: file.png (30 KB, 803x72)
30 KB
30 KB PNG
>>107978543
>>
>>107978544
No, there is also a difference in license.
>>
File: a wild hippo appears.png (751 KB, 768x768)
751 KB
751 KB PNG
CPU text-encode with ZIT seems to have gotten faster at some point. A shortish prompt takes like 20 seconds now if the text model's still in RAM, instead of over a minute back when ZIT first came out. (A long prompt can still take like 50s though.) On my iGPU laptop, it can be quicker to leave the text encoder on CPU and the sampler on GPU, rather than having them take turns on the GPU and hitting SSD swap in the process. (It only works if the image is small enough to do VAE decode without borrowing too much RAM though, otherwise the text encoder gets kicked out again. Tiled VAE doesn't seem to get around this, unfortunately.)
>>
>>107978515
This image is hot.
>>
>>107978529
>>
>>107978552
Thank you, one impervious to the TLDR. I guess NAI changed the format up.
>>
>>107978558
enlighten me, dear anon
im retarded
>>
Good morning everyone.

I am sorry to inform you that prior to the scheduled release of omni and base, alibaba has requested that we delay the release of base, and cancel release of omni, due to perceived risks.

Have a nice day.
>>
>>107978566
>CPU text-encode
why? are you that poor?
>>
>>107978544
western slopbakers will bang their head against the wall trying (and failing) to finetune 4b because they're terrified of BFL. china won't touch klein at all because it's western goyware and they don't want to admit z-image betrayed them
>>
File: Flux2-Klein_00001_.png (1.01 MB, 768x768)
1.01 MB
1.01 MB PNG
>>107978566
Strangely, CPU text-encode with Klein 4B distilled takes a lot longer (like 2-3 minutes), even though it's also using Qwen 3-4B like ZIT. And there's a higher rate of fuckups in the images, plus the person just doesn't look as good. (ZIT having too little variety is frustratingly real though.)
>>
>>107978584
it's not very cool of you to attract drama
>>
>>107978058
thank you for baking real
>Check Metadata of an Image: https://sprites.neocities.org/metadata/viewer
some random aicg link snuck in there with all those no rentry troll bakes looks like
>>
>>107978595
Oh yes, anon, thank you for ur brain slop

The exact same team that released SDXL 1.0 with a stricter license (CreativeML Open RAIL) will enforce Flux Klein legally with a less restrictive license (Apache 2.0)
>>
i dunno i think ZIT looks better than klein
>>
>>107978589
Zimage base license will be likely more permissive (apache 2.0) than the one for Klein base 9B that has its own license.
And while klein 4B base is also apache 2.0, they give themselves insane overreach by officially stating they'd watch anything released and strike it if they find it "unsafe" (they can shut anything down by politely asking civitai or hf).
Meanwhile Tongyi-MAI isn't there to moralfag and that shows on their hf page : no long text about safety or monitoring or whatever, they release their stuff, and it's up to people to respect their own local laws.
So my guess is that if zimage base is a good training base, it'll be used over klein.
Most likely scenario is that you will see loras for both, and finetunes for zimage base.
All of these models will be step ups from sdxl anyway, so overall it's good progression either way.
>>
>>107978277
There's only so much to be done with an edit model and only a single anon claimed 4B was good enough
>>
Jamming it to random ACEStep 1.5 songs before release
https://files.catbox.moe/cbo7c6.mp3
https://files.catbox.moe/567tjd.mp3
https://files.catbox.moe/klw8a6.mp3
https://files.catbox.moe/2t4h82.mp3

Local is so back
>>
>>107978584
great, now niggerjak is back. we are supposed to bully him until she kills herself and you gave her hope. fuck you
>>
>>107978645
it's good to have competition (and choice) again
I feel like bfl wouldn't have given a shit about releasing klein without zit
>>
>>107978544
>So the big fine tuners are all excited about Klein
Like who ? Lodestones trains on everything, even finetuning ZIT as we speak.
>>
>>107978655
did you gen anything good?
>>
>>107978592
Everyone is poor right now with current prices. But in my case, I went for an AMD iGPU laptop before knowing too much about genning requirements, and have been working with what I got. I'm glad I went for 32GB instead of 24 because that made stuff minimally viable, but 32 was the max, and they're soldered. Slots would've been nice, but the sale was good.
>>
>>107978639
only the 4b has apache, and the only reason it has apache is because it's untrainable. this has been the case for every bfl model so far, i don't know why fluxkeks still delude themselves into thinking anything will come from this garbage
>>
Local would be in an infinitely better place if the money to train had been given to me instead of these furfags
>>
>>107978655
it's suno at home, but not yet udio, it lacks that smooth non robotic voice
>>
>>107978678
>AMD iGPU laptop
>the sale was good.
sounds like a shitty deal to me
pro tip: you should be doing sdxl only
>>
>>107978671
4b costs less than 6b, flux vae2 is significantly better than vae1.

People don't shit money. Want to bet I'm right? I can do simple multiplication.
>>
>>107978692
With current market state, no actual good deal as in 6 months ago will happen until supply catches up in 2027 for ram, ssd and gpus.
>>
>>
>>107978677
Not my gens, just random gens from the preview. The quality is what Udio/Suno would give you, or better.
>>
>>107978680
So far, the Chinese have delivered nothing but a distilled model that they have frozen on a realism dataset with RL, including jpg artifacts.
I bet I can get something better out of flux klein 4B if I throw 24,000 hours of h100 at it to burn in a small realism dataset.
>>
>>107978655
Can ACEStep do SegaGenesis/SNES music?
>>
>>107978686
>smooth non robotic voice

Not sure what you're talking about. The voices don't sound robotic at all (just synthetic, but that was in the prompt for most of them I'm sure).
>>
>>107978686
have you heard pop music from the past 2 decades? it's all autotuned
>>
>>107978739
I dunno how to explain it, that's the main difference from udio vs suno to me, and clearly this model examples were always more like suno than the naturalness of udio

>>107978748
I don't mean autotune at all
>>
File: 1174950800.jpg (737 KB, 1821x1120)
737 KB
737 KB JPG
>>
File: yadayada.jpg (343 KB, 598x1596)
343 KB
343 KB JPG
>>107978645
>moralfag
BFL are truly the specialists of that, aren't they
>>
>>107978692
The giant text encoders on newer models does bite. If ComfyUI could run text-encoding on the NPU it would help a lot, since that uses the shared RAM. Not sure if they'll get around to it, though.
>>
File: 85.jpg (726 KB, 2295x896)
726 KB
726 KB JPG
>>
>>107978685
imagine if comfyorg spent their money training an epic local-first model instead of spending it on saas
>>
>>107978738
I think you'll need a tune for things like that as the model is only 2B
>>
Yeah so I'm going to have to find a replacement for comfyui. None of the nodes that handle arrays work anymore. Everything is broken.
>>
>>
Why aren't people mad that all prompting wf are being destroyed by comfyui?
>>
>>107978766
Which makes their release of klein even more nonsensical, I wonder if there is some kind of internal political shift within them.
>>
>>107978778
Imagine if they had a shuffle array node so you could pick words or phrases.
>>
>>107978816
it's cascading
>>107978822
you mean wildcards?
>>
>>107978766
>acceptable use policy
Their lawyers need to google "what is a contract?"
>>
>>
>>107978827
I don't think wildcards support picking.

say you have a list:
king, knight, queen, donkey.

[pick 1] punches [pick 2]

You don't want king punches king ever, as an example. There's only one king. This is called "picking". (or combinations)

I tried two thingies in the manager. they both don't work, for different reasons. jinja2 has a shuffle function, but comfyui can't allow it for security reasons, apparently. And jovimetrix has a totally broken array function. (it's quite sad)
>>
>>107978755
ACEStep 1.0 truly sounded robotic, but 1.5 sounds much more natural in comparison. I don't think the voice quality has caught up to Udio yet in quality, but it's certainly not robotic like it used to be. Maybe you mean it sounds very autotuned. Either way, that is the kind of stuff you might either be able to prompt away with negs or tune a LoRA against. The sound quality in ACEStep 1.5 surpasses what Udio lets you stream, vocals sound richer and instrumentals much better.
>>
>>107978851
nested wildcards would work
you'd have to make a script that generates it though
>>
>>107978698
So who are these big time finetuners excited for Klein ?

You just pulled it out of your ass. And 4b vs 6b is nothing you will pick which model to finetune over.
>>
Kinda wish I had more ram. inference is like 20% of the total time of my workflow.
>>
>>107978873
man zibase is so good
>>
File: z-image_00038_.png (552 KB, 768x768)
552 KB
552 KB PNG
Oh, so ZIT CAN generate a glove that stops at the wrist. It was super stubborn about making them cover the forearms when I was trying >>107978566, even when I kept modifying the prompt to try and stop it. Sometimes the sample view would show them starting on the hands at step 1, then literally growing up the arms step-by-step as the gen progressed.
>>
>>107978873
why? you just need to randomize the array and pick from the array. This is extremely basic functionality.

why would you need to nest anything?

You don't even have a random function?

btw the reason this is like this is that comfyui is for sd type models, not modern ones where the only relevant paramenter is strength. order matters, and repetition matters too.
>>
>>107978902
they did some black magic with hands. in the previews, hands sometimes turn black for no reason.
>>
>>107978902
https://i.4cdn.org/wsg/1768840483811046.mp4
>>
>>107978903
i was offering a workaround
>>
>>107978912
don't make me POOP on ur MOM
>>
>>107978894
I was in a second hand electronics shop yesterday that buys hardware for cash, and their ram display shelf was pure ddr4 8gb sticks that they're seling for £40 each. That's what I paid for a 32gb stick last year
you snooze you looze
>>
File: tenor.gif (2.29 MB, 278x202)
2.29 MB
2.29 MB GIF
>>107978907
>>
File: such variety.png (1018 KB, 1536x768)
1018 KB
1018 KB PNG
>>107978902
>gloves, half-gloves, and fingerless gloves all come out the same
Dammit ZIT. Is it because I asked for a slim feminine hand?
>>
>>107978832
ai slop
>>
How the FUCK do I offload to RAM on linux? I just want to run wan 2.2 bro...
>>
>>107978944
i dont remember man, im sorry
>>
File: 64545554456645.png (42 KB, 1139x215)
42 KB
42 KB PNG
Turns out ACEStep is not coming out today. Apparently it's not ready, and no word on when it's gonna be available after all. I'm not usually a Chinese culture memer, but one of their investors probably went "wait a sec, your model is so good, reevaluate, you're realizing THAT" so it's 50/50 we might be fucked.
>>
>>107978149
I love casual elf maidens
>>
>>107978982
i think it will come out when zib comes out
>>
>>107978939
just ask klein to clean it up
>>
File: 1739476082733001.mp4 (3.79 MB, 1280x640)
3.79 MB
3.79 MB MP4
>>107978936
>>
>>107978957
--novram

learn how to llm just a tad ok
>>
>>107978993
thanks poopman
>>
>>107978982
LOCAL LOSES AGAIN
the humiliation ritual continues
base is next
>>
>>107978990
https://knowyourmeme.com/photos/1449412-piper-perri-surrounded

bet you can't one shot this with any local model.

well, non-local won't allow panties? idk, never tried lmao
>>
>>107978991
Why does ai suck at weapons so much?
>>
>>107978982
ok
i don't think that many people are interested in acestep anyway. it's way too small/limited.
>>
>>107978995
no problem, I like to enhance race relations between us whites and your kind.
>>
File: 1768848057421689.mp4 (3.68 MB, 1088x768)
3.68 MB
3.68 MB MP4
>>107978926
>>
why does base sd1.4 -ify faces under certain conditions?
>>
>>107979003
we 'preciate it goy ;)
>>
>>107979004
lxt talking is so weird. I think they trained it using synthetic hyper muscular 3d models.
>>
File: 1746091680412051.mp4 (3.8 MB, 704x1152)
3.8 MB
3.8 MB MP4
>>107978873
>>
>>107978982
but today is day 1
should they have got started on that on day -14 when they announced that day 1 will be day 1?
I know chinese culture is an enigma to us, but surely time is the same for everybody?
>>
File: 1745895885305787.png (81 KB, 740x886)
81 KB
81 KB PNG
comfyxisters... our respose???
>>
>>107979001
Small, but in no way is 1.5 limited. There's unlimited potential there, local's only unironic chance at a comeback from a s
Suno/Udio if you combine all the things possible with it (good seeds, LoRAs, audio inpainting, remixing, etc...)
>>
>>107979029
i asked grok about the post and it said its fake news
>>
>pull
>get pozzed by fennec cock
>>
>>107979029
>look inside
>op made his comfyui available online and someone exploited weak coded nodes to hijack it
yawn
>>
File: file.png (167 KB, 2641x676)
167 KB
167 KB PNG
>>107979029
Directly reachable from the internet, what a retard.
https://www.reddit.com/r/comfyui/comments/1qn4w1j/i_think_my_comfyui_has_been_compromised_check_in/
>>
File: 1748198045476051.mp4 (3.66 MB, 704x1152)
3.66 MB
3.66 MB MP4
>>107978832
>>107979019
>>
File: 1738335405962189.mp4 (3.26 MB, 704x1152)
3.26 MB
3.26 MB MP4
>>107978810
>>
File: z-image_00046_.png (597 KB, 768x768)
597 KB
597 KB PNG
>>107978936
>a short leather glove that leaves the base of the palm uncovered
Still no dice.

>>107978991
Nifty, albeit sloppy.
>>
File: 1752920490280192.png (1023 KB, 704x1152)
1023 KB
1023 KB PNG
ltx face
>>
File: 1745136451301087.mp4 (3.77 MB, 1472x576)
3.77 MB
3.77 MB MP4
>>107978771
>>
>>107979057
i can't help but notice
>>
>>
>>107979065
It can be way worse. his settings are really well tuned lol.
>>107979051
I missed your vids. catbox?
>>
File: 5455645645654.png (17 KB, 1236x72)
17 KB
17 KB PNG
Chinese need to wise up to relying on Comfy for further open source releases.
>>
>>107979104
I'm not gonna use comfy, so can I have it now?
it's only fair right
>>
>>107979109
Usually if it's a good release that forces Comfy to make it a priority, and unofficial nodes come out beforehand anyways. But if they can just take their time they can stall.
>>
https://github.com/Comfy-Org/ComfyUI/pull/12102
https://github.com/Comfy-Org/ComfyUI/pull/12102
https://github.com/Comfy-Org/ComfyUI/pull/12102

WORKFLOW FOR Z-IMAGE-BASE UPDATED IN TEMPLATES
>>
>>107978982
Seems stupid to release a new model at the same day Z-Image Base is supposed to release, even if it targets a different use (image vs audio)

So likely a smart move
>>
File: 2311323121231.png (38 KB, 1278x208)
38 KB
38 KB PNG
>>107979104
>>107978982
Wait so Comfy doesn't have the weights yet kek. But the guy made it sound like they did and were actively working on it. Yeah, that is looking bad for local...
>>
>>107978982
the only reason they release the models is for publicity since they're basically free, so it makes sense they want to maximize their reach
>>
File: 1739651070174495.mp4 (3.58 MB, 704x1152)
3.58 MB
3.58 MB MP4
>>107979091
>I missed your vids.
:3
Didn't save metadata for the previous ones, you can have for this one
https://files.catbox.moe/g0ga7y.png
>>
File: ComfyUI_03695_.png (1.58 MB, 1456x992)
1.58 MB
1.58 MB PNG
>>
>>107979162
wtf are they waiting for
>>
>>107979162
>We can't release until comfy has implemented it
>I can't implement until they release
ah shit, paradox
>>
>>107979168
>pubes
*vomits*
>>
>>107979154
>comfywikifag, who doesnt even know how to use the software, gets base before me
fuck this gay earth
>>
>>107979176
I prefer clean shaved or nothing too, but at least this isn't amazonian forest bad.
>>
>>107978738
No, most of the time you would get gibberish because it's very bad with instrumentals. It's worse than SD1.5 in terms of prompt adherence.
>>
>>107979176
>*vomits*
You must be 18 to post here
>>107979166
Specs to gen?
>>
>>107979183
if you are blonde you should at least have blonde pubes
>>
>>107979166
damn. how do you know what i look like
>>107979190
she is wearing a wig and contacts.
>>
File: radiance_x32.jpg (239 KB, 1280x1280)
239 KB
239 KB JPG
>>107979162
good reaction by comfy tho
>>
>>107979190
Good piece of evidence to know if hair color is natural or not.
>>
>>107979154
I'm sure there's a few more hopemmits that can be squeezed out before release
>>
>>107979190
Asians can't be blonde or blue eyed so the model gets that perfectly right..
>>
>>
>>107979188
5090 and 9950x3d for file processing, but that's not really needed. I'm sure any CPU can do that, just a bit slower. Also, I'm still leaving a vram margin of 10-20% while running, so I can still use my pc, watch videos, and stuff, so if you fully commit, you could run with a slightly worse GPU.
>>
>>107979154
Not omni or edit?
>>
>>107979168
she looks like she fucks human men
>>
File: 1756851483385951.mp4 (3.71 MB, 1152x704)
3.71 MB
3.71 MB MP4
>>107978757
>>
>>107979218
I'm noticing a pattern with your videos
>>
>>107979169
Furkan
>>
>>107979222
Go on
>>
File: 1741210879807004.mp4 (3.69 MB, 576x1536)
3.69 MB
3.69 MB MP4
>>107978766
>>
>>107979203
I wish I had better specs. Was going to upgrade this summer
>>
File: bx7d.png (1.74 MB, 1024x1536)
1.74 MB
1.74 MB PNG
>>
>>107979166
cheers!
>>
>>107979237
:( check if you can run Q4, it is worse yes but can still make funny looking stuff
>>
File: 1742998518515802.mp4 (3.75 MB, 1088x768)
3.75 MB
3.75 MB MP4
>>107979168
>>
File: 1761065497104633.mp4 (3.41 MB, 1024x544)
3.41 MB
3.41 MB MP4
>>107979226
>>
>>
>>107979252
Should have her pouring water on her self
>>
>>107979166
Did you find NAG makes a big difference?
>>
File: 1753152734743614.jpg (768 KB, 1520x3104)
768 KB
768 KB JPG
>>
>>107979266
Very minimal difference, but enough that I keep it. Putting it higher makes a lot more, and imo makes the content better, but seems to destroy frame flow.
>>
File: bx7d8.png (1.34 MB, 992x1328)
1.34 MB
1.34 MB PNG
>>
>>107979258
kek
>>
>>107979252
cool youre back
>>
>>107979259
>z-image-experimental
experimental?
>>
File: 1753704863294947.mp4 (3.17 MB, 1152x704)
3.17 MB
3.17 MB MP4
>>107978584
>>
File: 1740517404614996.mp4 (3.76 MB, 896x896)
3.76 MB
3.76 MB MP4
>>107979277

>>107978566
>>
>>107979271
I've been struggling with LTX morphing my faces. but I'm wondering if it might just be that my input image face has slightly off proportions so it's fighting with what LTX wants a face to be.
>>
god, how the fuck comfyui gets worse with every update
>>
File: 1768197186546682.mp4 (2.87 MB, 704x1152)
2.87 MB
2.87 MB MP4
>>107979318
yeah nag might help with that, give it a shot

>>107979199
>>
>>107979328
They hired some retard that keeps changing things for no other reason that to seem like they're working.

Also this person clearly doesn't actually use Comfy in any major capacity, since they're clueless as to what a efficient UI is.
>>
>>107979340
can you stop being disrespectful to me?
>>
>>107979344
No
>>
Is memory management and model loading completely fucked on the newest comfy version for anyone else? Klein offloads a bunch of shit to RAM even though I can fit the full FP8 unet and it OOMs within 2-3 gens.
>>
File: ComfyUI_07731.png (3.33 MB, 1280x2048)
3.33 MB
3.33 MB PNG
>>107979154
'Bout time! Curious to know if it can use more than one LoRA without getting destroyed.

>>107979328
They like to change things under the hood all the time now. They don't seem to be building towards anything in particular, but they're furiously changing the names of calls/functions constantly.
>>
>>107979353
>he pulled
>>
>>107979361
I wanted to try Klein but I should have known better.
>>
>>107979353
comfyui is managed by fucking troglodytes
>>
>>107979367
and here comes the racism
>>
>>107979372
which race would that imply?
>>
>>107978645
ZIT was a fun model, but I don't think it will hold a candle to Klein 4B/9B in realism.
>>
>>107979374
>just dox yourself
>>
>>107979353
I know I have an issue where I have to restart comfy if I've used another workflow when switching to klein otherwise it'll offload all VRAM every gen.

try setting --reserve-vram 1 if you haven't.
Also this https://github.com/Windecay/ComfyUI_Dynamic-RAMCache
Fixed my OOM issues with LTX. similar problem to you. comfy would just try to cram everything in ram and get OOM killed instead of just making space.
>>
>>107979360
I knew a girl who looked like her, and now she's fat.
>>
>>107979162
I saw this on this discord and immediately was confused.
Then I remembered Chinese culture.
>>
>>107979383
i dont think you did
>>
File: 1765108973909946.mp4 (3.69 MB, 640x1280)
3.69 MB
3.69 MB MP4
>>107978410
Wrong prompt
>>
File: 1740842557672818.mp4 (3.04 MB, 1280x640)
3.04 MB
3.04 MB MP4
>>107979258
>>
>>107979378
Even ZIT is better than Klein 4B/9B in realism, so Z-Image Base will obviously be better as well. Klein can't even handle decent anatomy, people are back to 'hiding hands' when using it.

Klein has one good feature, the editing which is very capable for its size, the rest is meh.
>>
>>107979417
>Even ZIT is better than Klein 4B/9B in realism

Zoom in on the images/gen 2K. It's not.
>>
>>107979048
>installing nodes with claude code
WHY? How retarded do you have to be to not be able to git clone?
>>
>>107979435
>do my homework for me
no
>>
File: file.png (1.52 MB, 832x1216)
1.52 MB
1.52 MB PNG
Upping this lora:
https://civitai.com/models/2327401/m4crom4sti4-huge-natural-breasts-flux-2-klein-k3nk

https://files.catbox.moe/1q6ygo.safetensors
https://files.catbox.moe/am7v6k.png
>>
>>107979382
I used --reserve-vram 1 and it worked until I tried loading a lora, then it started using up until my system RAM until the server process crashed. I tried --highvram and it wasn't using system ram but it said I ran out of GPU memory and crashed. I'll try the custom nodes you mentioned and see if that works.

Am I just fucking crazy and klein really doesn't fit? I have 2x24 GB GPUs and I'm offloading the fp8 text encoder and VAE to one and the fp8 unet to another. I thought that would be enough to fully load everything.
>>
File: lol.png (64 KB, 327x280)
64 KB
64 KB PNG
>>107979439
>WHY? How retarded do you have to be to not be able to git clone?
>>
>>107979451
disgusting, those are so stretched out they barely look like aeorolas
>>
>>107979454
>Am I just fucking crazy and klein really doesn't fit?
I have a 3090 and I can fit the text encoder+klein no problem. definitely something going on in your setup.
>>
>>107979331
NAG seems to help a lot. thanks!
>>
>>107979462
Fuck you for lying like that I have a 4080 and you want to claim a 3090 can do it? Fuck you
>>
>>107979435
I don't need to zoom in to see that it has better realism, fix you eyes.
>>
File: 1759653890504589.mp4 (3.73 MB, 768x1152)
3.73 MB
3.73 MB MP4
>>107979466
np :)

>>107979239
>>
>>107979367
There's literally no way to perform a "pick". ie you select some term from a pool of terms and place it in your text and then you pick another (but the previously picked items are gone from the pool).

reee

why should I need to use python, that makes the wf unsafe to distribute!!!!! UNACCEPTABLE
>>
>>107979389
I did. She would know me as "the brother of ___"
>>
>>107979451
based ty
>>
>>107979462
I have an older version of comfy still installed and it's working fine. I'll try a fresh install and see if that fixes it, maybe my current install is fucked.
>>
File: 1764452972220544.mp4 (2.06 MB, 832x1024)
2.06 MB
2.06 MB MP4
>>107978149
>>
File: 1753538775694646.mp4 (3.79 MB, 704x1152)
3.79 MB
3.79 MB MP4
>>107979083
>>
>>107979448
Do you have eyes? Fucking up hands occasionally is not a worse offense than fucking up and slopping every detail.
>>
File: 1751115534879061.png (2.08 MB, 1072x1440)
2.08 MB
2.08 MB PNG
>>
>>107979497
you type a lot and provide little in terms of material evidence
>>
File: Capture.png (472 KB, 686x650)
472 KB
472 KB PNG
>>107979500
fat pig
>>
File: ygd4.webm (2.58 MB, 1024x1920)
2.58 MB
2.58 MB WEBM
>>107979390
>>107979476
One day I will be fit to lick your shoes, sire.
>>107979451
>Upping this lora
king
>>
File: 1738485959556796.mp4 (3.73 MB, 704x1152)
3.73 MB
3.73 MB MP4
>>107978412
>>
>>107979503
So you don't have eyes...
>>
File: 1739773765475690.png (2.13 MB, 1072x1440)
2.13 MB
2.13 MB PNG
>>
>>107979527
can you make a choco goddess with yellow hair and eyelashes
>>
File: ComfyUI_07499.png (3.32 MB, 1280x2048)
3.32 MB
3.32 MB PNG
>>107979383
Sounds like you missed out!

>>107979477
Why not just create a Mad Libs style LLM System Prompt to enhance an input prompt?
>>
File: 1766842057668746.png (2.09 MB, 1072x1440)
2.09 MB
2.09 MB PNG
>>107979510
how about this one
>>107979532
whats in it for me?
>>
File: 1765518547511048.mp4 (3.79 MB, 768x1088)
3.79 MB
3.79 MB MP4
>>107979500
>>
>>107979543
>whats in it for me?
i'll take a pic of my hand covered in coom
>>
>>107979503
>>107979526
Anyways, when the Edit model comes out (if it does come out) it'll be plain as day, immediately obvious how slopped Z truly is, and it'll be like "I told you so".
>>
File: 1761921171300319.mp4 (3.64 MB, 768x1088)
3.64 MB
3.64 MB MP4
>>107979527
>>
After reading up on ace step being delayed and now finding out cumfart doesn’t even have the weights and code im really fucking confused.
>>
1girl love
>>
>>107979564
zib culture
>>
>>107978566
>not even 1mp
poorfags GTFO
>>
>>107979569
there is reserch showing going above 768 results in worse images
>>
>>107979575
circa 2019
>>
>>107979533
>Sounds like you missed out!
women never talk to me.

>>107979533
sure. that's the "fix" for every piece of shit software, vibe prompt your way around the crap.

I am going to try a couple of nodes and see if they actually WORK unlike the other ones that are broken and only used to work (there is even a warning not to updoot on one of the pages).

I'll be trying res4lyf textshuffle (it takes a seed).

and then
>Feed that into String Selector (from Impact Pack) or Text Pick Line by Index (from YANC or ComfyUI-Extra-Samplers)
if any of those still exist / work.

btw, why doesn't comfyui have some kind of ontology system or whatever? I've been looking towards prompting an llm to understand prolog data. I never learned prolog, but the way it structures data is very user editable. It was originally meant for sort of ai / sql-like programming. But often you need to communicate a kind of structure YOU want, not what it was trained on. Takes like 1 million tokens to finally explain yourself.
>>
>>107979575
hi time traveler
>>
File: IMG_0351.jpg (118 KB, 1164x966)
118 KB
118 KB JPG
>>
File: 1769175574377328.png (28 KB, 250x250)
28 KB
28 KB PNG
Linux followup: I somehow got wangp running wan 2.2 I2V at int8 on 8GB of VRAM. Basically just threw darts at a board until the commands worked together. 81 frames at 512x resolution and no OOM! Very impressive at the int8 file size. Currently using... ONLY 4 FUCKING GB OF VRAM??? Will see how far I can push this.
>>
>add the devil to a gen
>random people start showing up in the gens
>>
>>107979546
>>107979562
kekd
>>
>>107979564
It is Chinese culture anon. Not uncommon for them to lie through their teeth about all these releases while the higher ups are the ones who can release or not release the model on a whim (and what reason would they have to give us a model like this... Only if a competitor (of which there are currently none) comes around and forces their hand. Alibaba is the only one left that could do that, but unless they were also baiting we may never get one. As for ACEStep, there's still a good chance that we get it, but expect the license to be cucked.
>>
File: screenshot.1769492876.jpg (143 KB, 893x367)
143 KB
143 KB JPG
ZIT is still above Klein, even their 9B model. It's objectively better. The anon pretending Z "won't hold a candle to Klein" is fucking retarded.
>>
File: 1756813878356213.png (2.22 MB, 1072x1440)
2.22 MB
2.22 MB PNG
>>
>>107979602
Music models tend to be overreaching with their cucked licenses as opposed to image models. When these are cucked, the outputs may not be used for commercial purposes (though they can't enforce that legally), but that's exactly how restrictive they tend to be.
>>
>>107979603
>benchmark
Who unironically posts benchmarks to win arguments? Have you seen some of the retarded shit that “tops benchmarks”?
>>
>>107979603
klein is an edit model, apples and oranges
>>
The worst shit Disney did was create the idea that the "main character" can be someone in a service industry. "I'm the main character (grin) *holds out hand* I'm a chimney sweep!"
>>
Okay
>>
>>107979546
>>107979562
sovl
>>
>>107979603
soon it'll be mostly about the finetunes again anyhow unless they actually increment on the base models every half year or so
>>
File: 01284.png (1.7 MB, 1144x912)
1.7 MB
1.7 MB PNG
>>
File: 1769194694972966.png (512 KB, 980x728)
512 KB
512 KB PNG
Linux followup 2: The ceiling is 720x720 at 81 frames for 7.3/8gb VRAM usage. I've got shit spilling into my swap file lol. Running wan 2.2 at int8 on my dogshit hardware feels criminal TBDESU!
>>
>>107979623
Nobody tunes image models anymore. It's all shitmixes.
>>
>>107979592
nobody uses any of that because the latest commit is broken
>>
>>107979642
ok dude no one gives a fuck
>>
File: 1753699990873183.png (1.23 MB, 1081x778)
1.23 MB
1.23 MB PNG
>>107979592
fixed
>>
>>107979593
>>107979642
Cool gens
>>
>>107979643
too expensive and honestly a waste of time. imagine spending $300k+ and half a year to finetune only for a newer better model to be released before you're even finished finetuning.
>>
>>107979593
prompt?
>>
>>107979656
brown
>>
>>107979659
You are confusing training with tuning.
>>
>>107979665
>proud of being retarded
lol lmao even
>>
migrate
>>107979673
>>107979673
>>107979673
>>
>>107979667
No I'm not. Fine tuning, aka, using a base model like SDXL or Flux to make Ilustrious/Chroma is expensive.
>>
File: 2026-01-27.png (45 KB, 609x440)
45 KB
45 KB PNG
>>107979166
I can't find some missing nodes there.
Are some from github that isn't in the manager?
>>107979675
No proper bake.
>>
File: Flux2-Klein_00002_.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>107979569
Bigger is possible, but then the VAE decode step kicks the text encoder out of RAM, and bringing it back takes longer. So I do smaller for testing out prompts. It's nice that ZIT's output quality is relatively consistent across different sizes compared to SDXL.

Have a janky Klein one.
>>
>migrate
Nah, bake a better bread
>>
>>107979683
hyppolita is alcide's cumrag, she's also poor man's saber. you have shit taste
>>
>>107979658
>>107979661
The prompt is to go to /jp/ and steal reaction images lol. I'm just a filthy tourist who thought any linux fags here might find my wan 2.2 endeavor interesting.
>>107979648
The local AI thread isn't for local AI????????
>>
>>107979680
https://github.com/Setmaster/comfyui-llamacpp
https://github.com/Setmaster/comfyui-utilities7
>>
>>107979695
the irony of what you're saying after literally saying you're posting reaction images from /jp/ and then crying MUH LOCAL DIFFUSION
kill yourself
>>
>>107979696
Thanks babe
>>
>>107979603
Rather than mememarks, I use my eyes to evaluate models anon. Here's my argument for you. Texture, details, skin tone, natural lighting. These are very important to a photograph, in particular photorealism, and it's somewhere Klein is significantly ahead of every model, including NBP. I refer to the texture that is captured in raw high resolution images. You'd be able to tell if you work with real cameras and stock photos. ZIT still airbrushes skin, the texture does not look as realistic, and photos look more like renders. Flux 1 dev has a similar issue, and so does Schnell. You might object that Z realism tunes look a bit more realistic, and while that is true, it still lacks texture. One way to describe is is that when you zoom in to a Flux 2 Klein image, it's not just what is being depicted is accurate, you can see every fiber of a girl's hair, and also marks on her skin that would appear on a real photograph. Whereas if you do the same on a Z image, you lose all these details and it looks airbrushed. Such simply is the power of the Flux.2 VAE, as well as a much more refined dataset from guys at BFL.
>>
Fresh

>>107979726
>>107979726
>>107979726
>>107979726
>>
>>107979696
[object Object] not found. Okay.
>>
File: bbs-zit-2026-01-27_00034_.jpg (2.41 MB, 2304x1296)
2.41 MB
2.41 MB JPG
>>
>>107979702
What do you anons even do here besides shitflinging, this general is so unbelievably bad backreading
>>
>>107979738
Can you show the node giving that?
>>
File: 2026-01-27-2.png (21 KB, 633x268)
21 KB
21 KB PNG
>>107979744
I really wish I could. Can't find any red lined node either
>>
>>107979743
we do actually have some high level shitposters the worst one is trying at all costs to promote his UI and takes every occasion to falseflag/FUD the shit out of comfy, calls it saas, says it phones home (it doesnt) so you see most of the thread him replying to himself and trying to bait anons in shitting up the thread. he's unemployed so he has all the free time in this world.
>>
>>107979743
i'll take "figments of anons imagination for $300 alex"
>>
>>107979743
>What do you anons even do here besides shitflinging,
Discuss local diffusion. How else do you think anon got so good at prompting kino?
>>
>>107979753
Do you have any extra data on the console maybe? Maybe try deleting nodes and re-adding them back.
>>
>>107979753
open dev console in browser and see what kind of errors it's throwing.
>>
>>107979776
I only get this so it is a little confused. Did a pull on Comfy as well assuming it would work with latest.
got prompt
invalid prompt: {'type': 'invalid_prompt', 'message': 'Cannot execute because a node is missing the class_type property.', 'details': "Node ID '#5189:5245'", 'extra_info': {}}
>>
File: bbs-zit-2026-01-27_00043_.jpg (2.25 MB, 2304x1296)
2.25 MB
2.25 MB JPG
>>107979787
corrupted node install probably.
here's the missing list i get. i have next to no custom_nodes at all:

VAELoaderKJ
PathchSageAttentionKJ
CM_FloatToInt
StartLlamaCppRouter
StopLlamaCppServer
LlamaCppPromptOutput
UnetLoaderGGUF
LlamaCppAdvPrompt
SomethingToString
Seed (rgthree)
ConstrainVideo
ComposeVideo
LTX2_NAG
YANC.MultilineString
LTXVSpatioTemporalTiledVAEDecode -in subgraph 'Samplers'
ImpactExecutionOrderController - in subgraph 'Samplers'
>>
>>107979761
My condolences. And not to support schizoposting or anything, but running obscure chinese workflows in comfy IS very dangerous, same goes with any other inferencing software, assuming you're not running containerized. Default install is safe tho.
>>
>>107979787
Try these, one might work:
https://files.catbox.moe/sp2a9m.json
https://files.catbox.moe/2ib3xc.json
>>
Reminder fresh

>>107979726
>>107979726
>>107979726
>>107979726
>>
>>107979787
also the problem is in a subgraph from what I could see, so you could try exploding it and replacing the nodes as well
>>
File: bbs-zit-2026-01-27_00050_.jpg (2.74 MB, 2304x1296)
2.74 MB
2.74 MB JPG
>>107979811
relating them to node packs:
ComfyUI-KJNodes
ComfyMath
ComfyUI-GGUF
ComfyUI-LlamaCpp
rgthree-comfy
ComfyUI-ConstrainResolution
ComfyUI-LTXVideo
ComfyUI-YANC
ComfyUI-Impact-Pack
>>
>>107979821
>>107979849
I'll just do a fresh custom node install and try again. Got the same error on the workflows there as well
>>
>>107979849
and >>107979696
>>
>>107979868
Me again, thanks for all the help it is appreciate I hope to get it sorted tomorrow
>>
>>107979881
good luck friend
>>
>>107979670
People are indeed very proud of still using euler only, its grim



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.