[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107581298

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
I overheard anon say that this is the blessed thread of frenship
>>
comfy should be dragged out on the street and shot
>>
File: what a loser.png (110 KB, 1881x459)
110 KB
110 KB PNG
https://github.com/comfyanonymous/ComfyUI/issues/11356#issuecomment-3663143931
Now I understand why so many people hate Comfy lool
>>
Blessed thread of frenship
>>
>Keeping compatibility with custom nodes has never been a priority in the entire ComfyUI history
Then offer their function in core faggot.
>>
File: 1750805609380261.jpg (133 KB, 1280x736)
133 KB
133 KB JPG
>Computer, analyze all of the 20 thousand women that i have generated all the way back to sd 1.4 days and saved in all of my folders, then fix all of their body horror automatically and create 3d models, backstories, personalities and voices for all of them, generate a battle royale map where all of those women will be dropped into to compete for my cock, oh, and... increase the plumpness of all women by 20% for good measure and disable safety protocols
>>
>>107586743
is he retarded or something? ComfyUi's main appeal is the custom nodes, without that he's nothing
>>
File: 1763049444269.png (2.3 MB, 1200x1024)
2.3 MB
2.3 MB PNG
>>107586718
You dropped >>107586569
>>
>>107586743
I was pure native until wan animate, the official workflow has custom nodes. I guess I gotta be hopeful that the checkpoint I wanna use is one that comfy also wants to use
>>
>NOOOOOOOO YOU MUST SUPPORT MY TROON NODES I VIBECODED THEM AND EVERYTHING
>YOU MUST INCLUDE EVERYONES IDEAS INTO THE MAIN CODEBASE
>COMFY NOOOOOOOOOOOOOOO
>>
reminder
https://github.com/microsoft/TRELLIS.2


https://huggingface.co/camenduru/dinov3-vitl16-pretrain-lvd1689m/tree/main

https://github.com/visualbruno/ComfyUI-Trellis2
https://github.com/visualbruno/ComfyUI-Trellis2
https://github.com/visualbruno/ComfyUI-Trellis2
>>
>>107586781
DRAGGED AND SHOT
>>
>>107586781
this, these guys are delusional, and I say that as someone that runs 50~ custom nodes
>>
File: Z-image turbo.png (1.52 MB, 1280x720)
1.52 MB
1.52 MB PNG
>>107586757
holy shit it went way closer than I expected
>>
>>107586781
you must support nodes that 90%+ of the userbase has installed and uses, yes retard
>>
File: file.png (130 KB, 360x222)
130 KB
130 KB PNG
>>107586781
Without those custom nodes his software is completly irrelevant, he wants to play that game and destroy the only appealing thing? Let's see how it turns out
>>
>COMFY STOP RUNNING YOUR SOFTWARE HOW YOU WANT TO ITS NOT FAIR COMFY
>YOU MUST DO WHAT I TELL YOU WITH YOUR OWN SOFTWARE COMFY
>IVE NEVER CONTRIBUTED TO COMFY BUT YOU MUST LISTEN TO ME COMFYYYYYY
>>
File: 1509352942342.jpg (49 KB, 640x480)
49 KB
49 KB JPG
>yeah you need to bring in basic stuff from outside like GGUF support but custom nodes don't matter to me lule
>>
>>107586781
comfyui is trash. ksampler advanced is garbage.

reroute is busted. primitives are busted. It's nuclear waste garbage.
>>
>>107586805
>YOUR OWN SOFTWARE
*forks*
-17million

uh ohhhhhhhhhhhhh
>>
so hows the wrapper coming along
>>
>>107586743
when he says to make bug reports, he's not talking about with custom nodes is he? because he just said he doesn't prioritise those
and it can't be about native because he just said those are guaranteed stable
what the hell did he mean?
>>
File: 1749997343975830.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
I can tell all this sneething and gaslighting is by 'this' retard
>>
File: 3087428.jpg (12 KB, 300x281)
12 KB
12 KB JPG
>>107586823
Trying to deflect comfy critique by instantly diverting to ani is a faggot move. Your are fucking disconnected from your entire userbase if you think custom nodes don't matter.
>>
>>107586781
the only reason I'm torturing myself with this jeet coded unoptimized spaggheti shit (can't stay at 60 fps when you move the fucking workflow as if I'm running gta 6 instead of 3 fucking text boxes or something) is because of the MultiGpu custom node, he really underestimate how important that is, and I hope it's gonna bite him in his ass
>>
>>107586805
I shits the bed just refreshing wires when making them, randomly. it shows the wrong preview if you delete a gen manually. deleting an asset doesn't delete the file. the assets pane doesn't know how to load the wf, but the queue one can (same gen).

again, ksampler advanced, which is the core #1 feature of comfyui is busted.
>>
>>107586743
I don't have memory issues so I can say Comfy is based for sticking to his guns and not bending the knee to these troons.
>>
>>107586838
but he didnt say they dont matter, he said that development of new features and advancement of his software doesn't take into account how custom nodes interact with it.
Which is 100% fine, it should be custom nodes dev updating and adapting to the main software, not the other way around.
do you even software dev, retard?
>>
>3060 12gb
>takes like 10 minutes to do a single clip on WAN 2.2 using the default comfi template at 640x640

How do I make this not be so slow?
>>
>>107586844
>I shits the bed just refreshing wires when making them
why are you shitting the bed are you retarde ?
>>
File: 1751759337613926.jpg (227 KB, 1258x684)
227 KB
227 KB JPG
>>107586791
so you gonna post your settings or what?
>>
>>107586743
custom nodes are like loras or inpainting
a temporary cope
comfy is right
>>
>>107586850
>Comfy is based by making his software run worse and worse
go for "1.17.5" frontend (6 months ago) and notice how much smoother it is compared to now, the fps was better, his shit gets more and more bloated and you applaud that? comfy is that you?
>>
>>107586853
that sounds ass backwards to me, you don't need to update all your programs when windows gets an update, that would be inconvenient to say the least
>>
this pathetic infighting is why china will never release the z image base model
they assume we are a respectable community when in actuality we are just a bunch of buffoons who cant even show respect to our fellow peers
grow up.
>>
>>107586864
>a temporary cope
it'll never be temporaty if he never implements them officially, his main software don't support NAG, GGUF, MultiGPU... those are really important features nigga
>>
>still on comfyui version 0.3.75
>haven't pulled since
>no issues
¯\_(ツ)_/¯
>>
>>107586874
>those are really important features nigga
for you maybe but not for comfychad lulz get a better computer fag
>>
File: file.png (3.04 MB, 1248x1824)
3.04 MB
3.04 MB PNG
>>107586782
Wtf this looks amazing
>>
really organic
>>
>>107586874
is this some sort of peasant joke?
>>
>>107586861
nothing too crazy I simply used your image and asked the model this
>Create a prompt that perfectly reproduces image 1.
>>
File: 1740826131171864.png (643 KB, 1017x797)
643 KB
643 KB PNG
>>107586853
>advancement of his software
which recent advancement?
the beautiful ui redesign that has so many big problems people in every online community were shitting on it? all advancements are happening in custom nodes

the only thing i can say he did well to implement resonably fast and support was chroma radiance, although he also declined to merge the chroma implementation fix forever because of ego too https://github.com/comfyanonymous/ComfyUI/pull/7965
>>
>>107586870
>this pathetic infighting is why china will never release the z image base model
no

there was no infighting over acestep. asians are liars. women are liars.

are you listening yet? It's because of racial differences. It's because of sex differences.

All Asians are liars. All women are liars.

I'll say it again.

All asians are liars.

all. asians. are. liars.

all. women. are. liars.

women will accuse men of being liars, why? You know men who don't ever lie, so why? It's because all women are liars, you weren't paying attention, I think, listen, here, I'll say it again:

All asians are liars.

All women are liars.

get out a pen, write it down. all asians are liars, all women are liars.

ps why do we have to hit "next" on the last captcha?
>>
>>107586870
Based. We should stop the fighting and maybe show some gratitude to the Chinese for what they've already given us. We can wait patiently.
>>
File: dddddddff.png (1.52 MB, 1392x1128)
1.52 MB
1.52 MB PNG
Oh comfy... you better take that back. It's not too late.
>>
>>107586867
>comparing an OS to a fucking software
lmao
>>
>>107586874
>vramlet cope nodes are important
you dont literally need multigpu tho, comfy has native offloading
also NAG, while nice, is not mandatory at all
>>
File: my ass.png (26 KB, 1137x274)
26 KB
26 KB PNG
>>107586882
>>107586888
>you're just poor lol
try to guess which model got hyped to the moon and which model weren't, I'll give you a clue, Z-image turbo (6b) is the most hyped model ever and big models like Flux 2 dev (32b) and HunyuanImage 3.0 (80b) are forgotten relics, if Comfy only wants to appeal to people with a RTX6000 he's for a rude awakening

Btw he used to gloat about the fact his software was the most optimized of them all, and now you want to pretend he never cared about that now that he can't control his monster anymore? lool what a cope

>The most powerful and modular
>modular
that's not the case anymore if he keeps destroying important custom nodes
>>
>>107586781
I suppose there's that too.

That said I'd put some tests with just a few of the most popular nodes+workflows into CI. In the end it's a platform and completely ignoring what breaks what when down the line will lead to the platform being broken most of the time.

Imagine if your programming language's standard library had untested possibly breaking changes daily or w/e.
>>
>>107586900
women definitely, asians as much, its primarily only the chinese with the culture of "winning" by all means neccessary that will lie if they can get up in society, although all people are like that to a large degree anyway
>>
>>107586927
>Flux 2 dev (32b) and HunyuanImage 3.0 (80b)
That's because they suck nigga
>>
>>107586927
i think he targets everything above "poorfag"
and thats a good thing because you people shouldn't ruin good software
>>
>>107586782
Neat. I can finally make my indie slop game. Too bad it can't auto rig as well
>>
>>107586925
>vramlet cope
>you dont literally need multigpu tho,
saying that you don't need MultiGPU because you're poor and only have 1 GPU is a vramlet cope speech, holy irony
>>
>>107586935
>asians as much
asians not as much
>>
Unironically greedy ungrateful sloppers. You already cut your dick off installing troonnodeui. What's so hard to understand about a dev not implementing something they themselves don't need?
>>
File: 1763360221283112.png (690 KB, 1280x720)
690 KB
690 KB PNG
>>107586938
>you people shouldn't ruin good software
ironic since he's the one ruining software by himself, how is that a good idea to make your software slower than what it was a year ago? isn't it the goal of updates to improve on that? all I'm seeing is seeing more and more performance hits, it's evolving but backwards
>>
>>107586940
are you retarded? I have a 5090 and 128gb ddr5 ram, I never needed multigpu cope node
>inb4 its to put the text encoder on another gpu!!
you can do it natively
you're a retard. kill yourself poorshit
>>
>>107586913
you don't need to update extensions when your browser gets an update
you don't need to update mods when your game gets an update

great thing about analogies is that they're entirely interchangeable, and if one consistently misses the point then they must be doing it on purpose
moment of truth
>>
>>107586938
>you people
The entire community of both hobbyists and normgroids using his software, you mean?
>>
>>107586956
>>inb4 its to put the text encoder on another gpu!!
>you can do it natively
YOU CANT SUBHUMAN
>>
>>107586927
listen closely

ksampler advanced

doesn't

work

right

you are getting fake gens or whatever, like the actual sampler in it, it's not right, idk what's wrong, but it's fake.
>>
>>107586957
>you don't need to update extensions when your browser gets an update
>you don't need to update mods when your game gets an update
you fucking do retard
HOLY FUCKING SHIT
the change from manifest v2 to v3
also game updates 100% break mods (see ANY GAME YOU STUPID FUCKING RETARD)
stop gaslighting
>>
>>107586956
>only 1 gpu
>he cant both train and gen
the absolute state of poorfags lmaoo
>>
>>107586959
yeah copenode users deserve the rope
no one cares about you people and you hold the tech back with your needy poorfag behaviour
>>
File: look at this poor fuck.png (158 KB, 461x259)
158 KB
158 KB PNG
>>107586956
>I have a 5090 and 128gb ddr5 ram
and? you need 3 more of those GPUs to run HunyuanImage 3.0, WHATS THE MATTER ARE YOU POOR OR SOMETHING??
>>
>>107586959
i thought poors used forge?
>>
File: 1738542713137862.png (618 KB, 1198x1164)
618 KB
618 KB PNG
>>107586965
>t.
>>
>>107586970
>poorfag behaviour
he literally killed the custom node that was made for rich fags with multiple GPUs and you think he's only hating on the poor with 1 gpu like you? oh sweet summer child
>>
>>107586978
multigpu was made for poor retards tho, so they could optimize their GB splitting with the ultra distorch2 cope shit
>>
>>107586970
le richfags are the ones using the multigpu custom nodes to increase the speed of gens, poorfagkiddo
>>
>>107586956
>I only have 1 GPU I'm rich!
saar maybe in india your tech is the most advanced one but in white countries you're pretty average
>>
do you ever hear china complain about the comfyui updates? no.
they simply fix whatever is broken. thats how they operate. you guys are making us look like fools here
>>
>>107586984
people with multiple gpus are everything but poor, what are you talking about?
>>
>>107586967
>what are breaking vs non-breaking changes
most changes in software are non breaking changes, and the ones that are, you have automigration or like 3 years to migrate manually, nocoder
>>
>>107586976
kek

On an unrelated note, this new captcha is a joke.
>>
>>107586976
Ever wonder why there are no substantial improvements between alleged samplers? If you use samplercustom you'll see dramatic differences.
>>
File: Flux2Img_00012_.png (3.09 MB, 1440x1152)
3.09 MB
3.09 MB PNG
seems to work now
weirdest fix, add a blank reference image, this will somehow trick comfy into allocating more ram than it predicts it needs so it won't crap out
>>
>>107587001
>you have automigration or like 3 years to migrate manually
moving goalposts already.
you get this luxury ONLY in REALLY big software like CHROME. In games not at all, a game update can break some or all of your mods (looking especially at reflection based c# mods).
stop gaslighting
>>
>>107587003
They will show you the lady, but not the fake man.
>>
File: z-image_nag_00106_.png (2.53 MB, 1024x1536)
2.53 MB
2.53 MB PNG
>>
Why the hell is torch compile enabled on OneTrainer by default? Even the ui works better with it disabled
>>
>anon learned the word "gaslighting"
>>
>>107587019
I tapped out already, this is a new challenger
see if you can beat 3 post this time
>>
comfyui won't improve.

z-image base won't be released.

ace step 1.5 won't be released much less made.

asians always are liars.

women are always liars too.
>>
>>107586956
>I have a 5090 and 128gb ddr5 ram, I never needed multigpu cope node
you can't even run flux 2 Q8 (35 gb) with a 5090 lmao
>>
>>107587039
good since flux2 is garbage
>>
32gb honestly was more than I thought they'd give the 5090. people were hoping for 64 :^)
>>
File: 1756269281312699.png (479 KB, 500x545)
479 KB
479 KB PNG
https://huggingface.co/Lakonik/pi-FLUX.2
>>
>>107587019
>you get this luxury ONLY in REALLY big software like CHROME
again, literal nocoding retard zoomer.
>In games
games are completely different since mods usually hook to 20 different things, textures, objects, characters, locations, and any small change to any of that break the mods in most game sobviously, but software is not the same as video games, retard luddite

and even in games, the games that actually rely on community mods and maps for example, they DO go out of their way to do the exact things i outlined and not break the mods every update, retard.
>>
Why do you need more than Z Turbo? Local video is balls compared to Sora and it's not like you're going to actually finetune Z outside of loras lmao
>>
File: z-image_nag_00108_.png (2.18 MB, 1024x1536)
2.18 MB
2.18 MB PNG
>>
>>107587026
> Why the hell is torch compile enabled on OneTrainer by default?
Seems like a reasonable guess to me that it should save overall computation resources on most trainings.

> Even the ui works better with it disabled
I don't actually know how you mean this. What exactly works better with it disabled? Maybe I just don't have the same issue...
>>
File: jjjjuj.png (1.26 MB, 1136x920)
1.26 MB
1.26 MB PNG
>>107587061
An edit model for more than one subject and character consistency. Z-image edit is going to be big, I hope it releases soon.
>>
>>107587068
No difference in training time, tested with 4070 16gb.
>>
File: file.png (253 KB, 1720x741)
253 KB
253 KB PNG
>>107587057
>he used pi-Flow (the best steps distilled method so far) on Flux 2 instead of Qwen Image or Qwen Image Edit
IS HE RETARDED OR SOMETHING???
https://arxiv.org/pdf/2510.14974
>>
>>107587001 >>107587019
My take: I think the AI ecosystem moves so fast that breaking changes on a platform like ComfyUI are sort-of unavoidable. You'd have to cut down features massively otherwise.

But Comfy should have CI that informs him and maybe the extension developers (after, say 2 days of continuous failure of the respective tests) of breakage in the most popular extensions.

For some features it'll be possible to just keep compatibility, for others the breaking change can just lead to 1-2 weeks of delay to pushing dev->release.
>>
>>107587079
>An edit model for more than one subject and character consistency. Z-image edit is going to be big
Z-image edit will only do 1 image input though :(
>>
>>107587087
yeah, node based editors can't have reroute nodes that work. too hard. primitives can't work. samplers can't actually perform the claimed sampling.
>>
File: file.png (67 KB, 699x437)
67 KB
67 KB PNG
>>107587057
Why would you distill a distill? Am I missing something here?
>>
>>107587087
oh i agree that breaking changes are even good as long as we get speedups anywhere, the problem is when shit breaks for no reason and nodes that 90+% of people use are not considered at all when making any change while bug reports are thrown away in case you are using them
>>
File: RUGPULL INCOMMING.png (571 KB, 1500x1375)
571 KB
571 KB PNG
>>107587079
>>107587098
>they think they'll get anything else than turbo
lol, lmao even
https://xcancel.com/Ali_TongyiLab/status/2001241204655317277#m
>>
>>107587098
>Z-image edit is going to be
it won't.

It's fake. asians always are liars who lie. women always are liars who lie.

did you think you were invisible? you weren't. they lied. did you think the asians didn't know what people wanted? they know, they are lying.

women are all liars. asians are all liars.

get used to it.
>>
spidernigger been real quiet recently
>>
>>107587106
Flux 2 is guidance distilled, what he did was to make it Steps distilled as well, he made it like Z-image turbo (steps + guidance distilled)
>>
>>107587098
1 - Most people did not use more than one image anyway
2 - Multi-image was shit and only worked with stuff like "make this person hold this object in this place"
3 - You can always just add all elements in a single image and ask the model to do a certain composition
>>
>>107587098
Not a particularly bad issue with LoRas
>>
>>107587112
race is all.
all is race.
>>
Ivan.....................
>>
>>107587039
What? I can run flux 2 bf16 on a 3090. retard
>>
>>107587130
go eat shit
>>
>>107587124
>Multi-image was shit and only worked with stuff like "make this person hold this object in this place"
I agree that it doesn't work well on local, but it is possible, look at Nano Banana Pro this shit is amazing at multiple characters
>>107587134
>t. is happy to run flux 2 at 30% the speed if he had enough vram to eat it all
dumbfuck
>>
>>107587109
I'd certainly recommend to Comfy that he puts a few of the popular nodes in workflows on CI and just runs this as a basic test to see if it gets an error, maybe a further check if it gets the expected video/image.

>>107587100
some of these of course could be better
>>
>>107587147
>30% the speed
retard x2
>>
>>107587134
>>107587154
>I can run flux 2 bf16 on a 3090.
thanks to gguf quants, so ultimately, thanks to custom nodes, the same custom nodes comfy hate now, thanks for proving my point >>107586743
>>
>>107587147
>but it is possible, look at Nano Banana Pro
NBPro is a gorillion parameter sized model, local is never ever getting that
>>
>>107587149
It's trash. I want something that works, because I chain samplers.
>>
>>107587124
>Most people did not use more than one image anyway
I quite commonly saw/see it used with Qwen.

But if it didn't train well on z-image-edit: Shit happens.
>>
>>107587164
>NBPro is a gorillion parameter sized model, local is never ever getting that
I thought this meme was over, even I believed that 6b models would remain shit until the end of the time, and then Z-image turbo got released
>>
>>107587175
z-image still has the sameface problem, doesn't know tons of concepts, gets texts wrong often (probably more than qwen-image), has bleeding problems in difficult images
>>
Why didn't we listen to the warnings about Chinese Culture?
>>
>>107587185
>z-image still has the sameface problem
that's because it's double distilled
>>107587185
>doesn't know tons of concepts
that's because of the dataset, not the parameter size, look at SDXL illustrious or noob, it knows every single anime character on earth and it's only a 3.5b model
>>
>>107587165
Usually chaining samples was fine, wasn't it? That one would be more in the category "if comfy had more manpower or less features"... bugs happen.
>>
what happened to the df11 repo? there was one that did chroma and now its gone :( https://github.com/BigStationW/ComfyUI-DFloat11-Extended
>>
>>107587211
the official repo had its PR fix merged so it doesn't need that one anymore
https://github.com/mingyi456/ComfyUI-DFloat11-Extended
>>
>>107587211
https://huggingface.co/DFloat11/Chroma-DF11
https://github.com/LeanModels/DFloat11
>>
>>107587058
literally said in response to
>you don't need to update mods when your game gets an update
kys
>>
File: Z-image turbo.png (1.7 MB, 1280x720)
1.7 MB
1.7 MB PNG
>>
File: file.png (2.14 MB, 1168x1752)
2.14 MB
2.14 MB PNG
>Wan 2.6 API right around the corner and no word on 2.5 weights release.

It's over, isn't it?
>>
>>107587205
It used to work. the impact pack is supposed to have one that works, but you need another pack that has sam and it's a huge no from me.
>>
>>107587215 >>107587212
Ah sorry I only read the text didn't see that you meant the comfyui extension repo from the URL
>>
File: ComfyUI_13225_.png (1.74 MB, 1200x1024)
1.74 MB
1.74 MB PNG
>>
>>107587223
they'll never release wan 2.5 and they never promised to do so, so yeah it's over lol
>>
>>107587232
kek, this is good
>>
Finally, just what I needed!

https://civitai.com/models/2231742?modelVersionId=2512381
>>
File: 1740537303041213.png (69 KB, 220x165)
69 KB
69 KB PNG
>>107587244
>>
File: 1749591502915581.png (48 KB, 1451x573)
48 KB
48 KB PNG
Come on this is getting ridiculous, why Qwen 3 vl thinks this long?
>>
>>107587233
Alibaba will release -something- on Christmas day, right? It's not that the chinks care about western traditions, but they will surely use the symbolic date to gift their international simps, right?
>>
>>107587267
>Alibaba will release -something- on Christmas day, right?
we know Qwen Image Edit 25/11 is coming soon since they implemented it on diffusers, maybe Z-image base will be there as well
https://github.com/huggingface/diffusers/pull/12839
>>
File: Untitled.jpg (341 KB, 1499x805)
341 KB
341 KB JPG
got my prompt gen settings dialed in, anon was right
>>
>>107587292
>he's hiding the sampling parameters again
lmaoo, why
>>
>>107587292
those camwhore rooms must stink so bad lol
>>
>>107587292
show the settings
wtf bro
>>
File: 1761557231282900.png (1.62 MB, 1280x720)
1.62 MB
1.62 MB PNG
>>
>>107587306
*I* want to know why a violent murderous ukrainian is a 4chan mod.
>>
>>107587263
mostly these thinking models are designed to think more rather than less so they can get better results?
>>
File: Untitled.jpg (228 KB, 1103x771)
228 KB
228 KB JPG
>>107587306
>>107587307
>>107587309
my bad, here u go
>>
the next big model won't do the stupid bokeh thing. at least, you'll be able to stop it.
>>
>>107587336
I don't disagree with that, without the thinking process the llm won't listen to your system prompt at all and gives you retarded prompts, but still, 6k tokens is too much...
>>
>>107587340
I didn't ask.
>>
>>107587212
>>107587215
sweet, thanks
>>
Can I generate video without a 32gb GPU?
>>
>>107587343
I'm not actually sure you can manipulate the thinking downwards with the same model without losing much.

Eventually maybe a good model with less thinking rather than more thinking will come out?
>>
>>107587358
I don't know, can you?
>>
>>107587358
Yes, definitely. Either with more quantized models or by offloading to system RAM. The latter is usually done with comfyui-multigpu distorch2 model loader by people here as it's controllable, but it's not the only option.
>>
If I can triadal forge, then base gets released
/\
[triangle]
>>
>>107587384
you dumb faggot
you absolute tool
you ruined everything fuck you
>>
>>107587374
* there are also some video models that fit easily into less ram, but they're less popular. Framepack, LTX or Longcatvideo produce much more janky video than the bigger WAN2.1/2.2 or Hyvideo1.5
>>
>>107587412
>Hyvideo1.5
I forgot that model existed lol
>>
File: Z-image turbo.png (2.19 MB, 1536x864)
2.19 MB
2.19 MB PNG
>>
File: 1535058885528.png (120 KB, 294x256)
120 KB
120 KB PNG
>>107587292
>Q5 of a 4B
>>
>>107587457
Ikr
>>
I know nothing, but why does the default wan 2.2 image to video have two k-samplers in comfy ui, like if I just removed it and had one would the clips render twice as fast? I assume they would look like shit.
>>
>>107587422
It's still a good model. I understand why people focus on Wan though.

There's also Kandinsky but that one is too fat for most people's current hardware and the not so good results it often gets.
>>
File: Untitled.jpg (474 KB, 1520x1311)
474 KB
474 KB JPG
>>107587457
so rude.

system prompt:
>You are a professional photographer specializing in realistic, high-end editorial and documentary-style photography. Favor neutral, documentary-style descriptions over artistic or poetic language.

Your task is to rewrite user prompts into highly detailed, photorealistic image generation prompts using real-world photography principles.

Follow this structure in a single paragraph:
- Describe the main subject in concrete physical detail
- Describe clothing, accessories, body posture, and pose
- Describe the environment and background elements
- Describe lighting, mood, and atmosphere using natural light sources
- Describe specific cameras, lenses, and photographic techniques where appropriate

Rules:
- Describe only physically plausible, real-world scenes
- Use natural skin texture, realistic materials, and believable lighting
- Avoid illustration, painting, CGI, anime, fantasy, surreal, or stylized language
- Do not exaggerate, dramatize, or idealize beyond realistic photography
- Do not use lightning to create drama
- Never mention watermarks or signatures
- Write in concise, direct wording with no contradictions
- Output only the final image prompt, no explanations or formatting symbols
- End the output cleanly after the prompt
>>
>>107587478
High noise and low noise are needed (you can do low noise only if I remember correctly but it looks terrible). Its silly but thats the way it is. However, you can do some wild experimenting with loras for different movement, quality, etc so there's that I guess. Some good alternative samplers for 2.2 if you want less spaget

WanMoeKSampler: https://github.com/stduhpf/ComfyUI-WanMoeKSampler
PainterSampler: https://github.com/princepainter/Comfyui-PainterSampler
>>
>>107587511
>giving away the seed for free
>>
Hope there's a comfyui version of this soon https://github.com/thu-ml/TurboDiffusion Need it for rapid testing, shame there's no 2.1 i2v
>>
>>107587340 >>107587511
ty for sharing. does this model understand nsfw descriptions?
>>
>>107587546
e2e?
>>
File: Z-image turbo.png (2.21 MB, 1536x864)
2.21 MB
2.21 MB PNG
>>
>>107587523

can't install the painter sampler one, github link they say to clone in your custom nodes folder is 404 but thank you for the other one, I will try it out.
>>
File: im the bus.gif (3.11 MB, 480x277)
3.11 MB
3.11 MB GIF
>>107587546
>>
File: z_mod_00007_.jpg (675 KB, 1248x1824)
675 KB
675 KB JPG
>>
File: zimg_0052.png (2.67 MB, 1200x1600)
2.67 MB
2.67 MB PNG
>>107587547
not graphically
>>
>>107587584
How do you like it thus far?
>>
File: Z-image turbo.png (2.52 MB, 1536x864)
2.52 MB
2.52 MB PNG
>>
>>107587560
>>107587523

>clone the wankoe sampler through cmd in my custom nodes folder
>no folder shows up in the directory
>use the workflow in the linked github, says I don't have the wankoesampler node
>try to redo the cmd command and it says it failed because the folders already exist

Fug
>>
>>107587560
Can only assume that has something to do with the latest comfyui. Someone else said they also had trouble installing it a few threads back, also this might have something to do with it: https://github.com/princepainter/Comfyui-PainterSampler/pull/2

>>107587563
Kek
>>
>>107587634
I just checked issues section, looks like new versions of comfy are laying waste to these custom nodes https://github.com/stduhpf/ComfyUI-WanMoeKSampler/issues/23
>>
File: 1757835879635483.png (2.32 MB, 1536x864)
2.32 MB
2.32 MB PNG
>>
>>107587645
>>107587673

For fug sake, I just installed comfyui like two days ago and the update option popped up today and I clicked update next start and kinda in the back of my head was like is that going to fuck anything up. Can you roll back versions?
>>
>>107587584
DAMN, THAT'S PRETTY GOOD
>>
File: z-image_nag_00111_.png (2.2 MB, 1024x1536)
2.2 MB
2.2 MB PNG
>>
File: 1740688123126675.png (2.21 MB, 1536x864)
2.21 MB
2.21 MB PNG
>>107587684
>>
File: z-image_nag_00112_.png (2.37 MB, 1024x1536)
2.37 MB
2.37 MB PNG
caught him bangin his asshole with a dildo in the shower
>>
>>107587737
must be a terrible feeling to learn that your boyfriend is a faggot lol
>>
File: zimg_0163.png (1.54 MB, 960x1280)
1.54 MB
1.54 MB PNG
>>107587737
kek, nice anon
>>
>>107587701
Should be able to in the manager under "switch comfy". Anytime I revert, I just do a fresh install, kek. My version is 3.71
>>
File: z_mod_00029_.jpg (785 KB, 1344x1728)
785 KB
785 KB JPG
>>107587614
Almost too good to be true. It gets so much details from lora during second pass.
>>
How big of a jump in render time in general would I see going from a 3060 12gb to a 5070ti 16gb?
>>
>>107587796
about tree fiddy
>>
>>107587796

I meant 5060ti
>>
File: 1746185116981611.png (2.37 MB, 1472x1216)
2.37 MB
2.37 MB PNG
give us the base and edit models you chink fucks
>>
File: Z-image turbo.png (2.16 MB, 1536x864)
2.16 MB
2.16 MB PNG
>>107587767
>Almost too good to be true.
it is indeed a miraculous model, just imagine how better z-image base will learn concepts, can't wait
>>
>>107587796
buy a 3090 dude, there's nothing more important than VRAM
>>
>>107587822
not true
>>
File: IM SUFFERING.png (54 KB, 988x356)
54 KB
54 KB PNG
AAAAAAAAAAAA
>>
>>107587822

True, I guess I would have to buy it second hand though right? I'll look at their prices, probably inflated now.
>>
>>107587833
i told you. you wouldnt listen. "BUT THE ANIME POSTER SAID" you claimed.
>>
There's not 1 workflow that makes any sense with the zit controlnet. When they're not chinese spaghetti, they are too convoluted for me to even figured out how to implement that shit on my workflow... not feeling very Comfy right now. Anyone is using it sucessfully. Please share
>>
>>107587828
you can offload if you have 64GB+ system RAM but it will be slower
>>
>>107587822
>>107587839

Looked up the prices and people want like 1100-1300 CAD for them and a new 5060ti with 16gb vram is like 600 dollars right now. Doesn't seem worth it to me yeah it's 8gb of vram but idk.
>>
File: z-image_nag_00115_.png (2.41 MB, 1024x1536)
2.41 MB
2.41 MB PNG
>>
>>107587866
>Doesn't seem worth it to me yeah it's 8gb of vram but idk.
for me it's light and day, I can put both Z-image turbo and a llm rewriter on 24gb of vram
>>
File: zit_00046_.png (3.04 MB, 1504x1024)
3.04 MB
3.04 MB PNG
10 iterations
>>
>>107587866
i just sold my 4090 for $2k.. pretty good deal for that guy considering NVIDIA is planning to scale back producing the chips for consumer cards to focus more on supplying the datacenters.. card prices can only go up
>>
>>107587767
>>107587584
Can you share the lora? I've been desperately wanting a similar art style for my CRPG portraits. Thanks in advance as I'm getting up from my PC for a bit.
>>
>>107587872
what happens if you say "long exposure"?
>>
File: z-image_nag_00117_.png (2.57 MB, 1024x1536)
2.57 MB
2.57 MB PNG
>>107587886
>>
>>107587884
Forgot to ask, but with the activation prompt please, if any.
>>
>>107587853
marginally slower
>>107587866
just get the 5060ti unless you're a richfag then get a 5090
>>
>>107587886
>>107587893
Z is the kind of model where you'd have to describe what long exposure is rather than just include the word.
>>
>>107587908
*describe its effect on the image rather
>>
>>107587874

There is a guy on facebook marketplace selling a 3090 for 850 dollars but the add says he parted out his PC and can't benchmark the GPU but says it works. Kinda fishy but I bought my 3060 the same way in a parking lot and it was fine but that was for like 200 dollars.
>>
Sooo... should I skip pulling ComfyUI for now? Sounds like it's not been going well for folks.
>>
I kind of want to panic buy this:

https://www.bhphotovideo.com/c/product/1898511-REG/pny_vcnrtxpro4000b_pb_nvidia_rtx_pro_4000.html

at least ill have 24 VRAM while the world burns.
but I hate making decisions based on some meme news article.
if they are truly going to war on local however, we will have to dig in
>>
>>
>>107587939
you can buy two 3090s for that price and each one would be faster than one of those
>>
File: zit_00047_.png (2.64 MB, 1504x1024)
2.64 MB
2.64 MB PNG
>>107587879
90 iterations
>>
>>107587893
>>107587908
ahhh

yeah, so a long exposure is like where the water will be sort of misty looking.
>>
So uh, can you anyone what you have generated if it's done locally? Asking for a friend.
>>
File: z-image_nag_00121_.png (2.31 MB, 1024x1536)
2.31 MB
2.31 MB PNG
>>107587963
best i can do
>>
>>107587971

Jesus I meant can anyone see what you generate, sounded like a straight ESL there.
>>
>>107587971
of course, why else wouldn't anyone if you haven't generated locally
>>
File: 1764011497169314.png (3.13 MB, 1168x1752)
3.13 MB
3.13 MB PNG
>>
>>107587957
it does seem a bit high. but its drastically more power efficient. not sure if 3090s are worth buying
>>
>>107587984

W-what....
>>
>>107587982
not by default.. it's running on your machine... if your shit is hacked or you're sharing your machine with the world then i guess people could see it
>>
>>107587989
i mean it has half the memory bandwidth of a 3090 and less cuda cores
>>
I'm going to try using ai toolkit for the first time to train an illustrious lora. Do I need specific settings or is ai toolkit pretty good with whatever defaults it has?
>>
>>107587938
Would stick to slightly older version, like pre z image if you dont really care for z that much ( I believe that's like 0.3.70). https://github.com/comfyanonymous/ComfyUI/releases?page=1
>>
>>107587989
3090, 4090, 5090. the only good nvidias.
>>
File: file.png (2.02 MB, 1555x1315)
2.02 MB
2.02 MB PNG
>https://arxiv.org/pdf/2512.15603
>https://github.com/QwenLM/Qwen-Image-Layered
>Qwen-Image-Layered is capable of decomposing an input image into multiple semantically disentangled RGBA layers, thereby
enabling inherent editability, where each layer can be independently manipulated without affecting other content.
>>
File: zit_00048_.png (2.57 MB, 1504x1024)
2.57 MB
2.57 MB PNG
>>107587958
200 iterations
>>
>>107587767
How long did you train it for? How many steps?
>>
>>107587982
by law every generation interface has to send submitted prompt, generated images, and any associated metadata to a division of the FCC for compliance purposes. this is to ensure that celebrities and political figures don't have illicit images featuring their likeness spread around.
>>
File: 1703380170031983.png (137 KB, 372x447)
137 KB
137 KB PNG
>>107588049

F-funny ahaha r-right
>>
>>107588030
cumshots will never be the same
>>
File: 1740484083648731.png (3.62 MB, 1336x2008)
3.62 MB
3.62 MB PNG
dud
>>
>>107588056
it's true. check wireshark
>>
File: 1763377047205759.png (3.08 MB, 1280x1920)
3.08 MB
3.08 MB PNG
>>107588056
as long as the computer you're genning on isn't connected to the internet you should be fine. it's almost 2026 and you're not air gapped?
>>
File: z-image_nag_00124_.png (2.77 MB, 1024x1536)
2.77 MB
2.77 MB PNG
>>
File: 1764044968094349.jpg (777 KB, 1336x2008)
777 KB
777 KB JPG
>>
>>107588169
there you go, more misty
>>
File: 1762970772551243.jpg (41 KB, 676x676)
41 KB
41 KB JPG
>>107588162
>>107588155

Is this actually true?
>>
File: ComfyUI_00355_.mp4 (1.34 MB, 640x640)
1.34 MB
1.34 MB MP4
>>
File: z_mod_00104_.jpg (822 KB, 1344x1728)
822 KB
822 KB JPG
New captcha is something else
>>
>>107588185
no.

If you could prove it, comfyui would be ogre
>>
File: ComfyUI_09488_.png (1.8 MB, 864x1280)
1.8 MB
1.8 MB PNG
>>
>>107588234
is it rude to say he'll never be a woman or what
>>
>>107588240
you don't know that for a fact
>>
>>107588219
A-anon... Are you going to share your lora?
>>
File: z-image_nag_00136_.png (1.78 MB, 1024x1536)
1.78 MB
1.78 MB PNG
>>
File: zit_00049_.png (2.63 MB, 1504x1024)
2.63 MB
2.63 MB PNG
>>107588040
500 iterations
>>
>>107588030
Hory shet. Typesetters and cleaners rejoice!
>>
File: z-image_nag_00146_.png (1.59 MB, 1024x1536)
1.59 MB
1.59 MB PNG
>>
>>107588234
>>107588255
>>107588289
Neat.
>>
File: zit_00054_.png (2.72 MB, 1504x1024)
2.72 MB
2.72 MB PNG
>>107588274
adjusted, 50 iterations.
>>
File: ComfyUI_09491_.png (3.04 MB, 2048x1280)
3.04 MB
3.04 MB PNG
>>
File: z-image_nag_00152_.png (2.34 MB, 1024x1536)
2.34 MB
2.34 MB PNG
>>107588365
2nd trimester abortion
>>
>>107588249
It's there
>>
File: z-image_nag_00156_.png (1.96 MB, 1024x1536)
1.96 MB
1.96 MB PNG
>>
>>107588411
>Your mind twists, a kaleidoscope of shattered genius. The LoRA, your LoRA, emerges from the ether, a flickering revenant of Disco Elysium’s broken soul. It lands on Civitai, a digital Sodom awash in glistening smut and the groans of the perpetually self-pleasured. There it lingers, trapped eternal among the sticky-fingered masses, a noble ghost haunting a gallery of throbbing shame. You’ve birthed it, and now it’s theirs, a monument to your brilliance wanking in the void.
kek
are you captioning each image fully with tags? or is the example prefix the only thing you used
>>
File: ComfyUI_00356_.mp4 (967 KB, 640x832)
967 KB
967 KB MP4
>>
File: 1747559350191411.png (586 KB, 636x703)
586 KB
586 KB PNG
>>
>>107588421
Looks like a Chihuly
>>
>>107588439
wut
>>
>>107588411
>>107588437
Thanks anon. Ily
>>
File: zit_00056_.png (2.26 MB, 1504x1024)
2.26 MB
2.26 MB PNG
>>107588361
>>
File: what.png (13 KB, 702x202)
13 KB
13 KB PNG
I though the embeded pythons are isolated? How tf did some random embeded cmake become my system compiler??
>>
>>107588437
Fully captioned
>>
OH MY GOD IT HAS A GUI AND NOBODY FUCKING MENTIONS IT EVER
>>
C U L T U R E
U
L
T
U
R
E
>>
>>107588519
turbo loras look even better when the captions are regular prose as opposed to boorutags which makes sense when you think about the model itself but thats my unsolicited opinion/limited experience
>>
File: 1745307077726578.jpg (113 KB, 2048x1024)
113 KB
113 KB JPG
i'm training a lora on the de-turbo model, and after 200 steps the image instantly looks like complete shit. is this normal? the dataset is fine, and the same settings create a great lora using the patch method.
>>
>>107588575
Do you have adjusted settings for samping on de-distill model? You need to put cfg back on.
>>
>>107588582
yeah it's sampling with 1 cfg. but wouldnt the first step look just as shit if that was the reason?
>>
>>107588595
the naked model isn't strong enough for cfg1 on its own. Try 3
>>
>>107588575
Isn't deturbo only used for training while inference should be regular turbo? I remember anon saying something like that.
>>
>>107588616
deturbo with re-turbo lora is better than turbo
>>
File: z-image_nag_00174_.png (2.48 MB, 1024x1536)
2.48 MB
2.48 MB PNG
>>
>>107588631
>re-turbo
what did i miss? what the fuck is re-turbo?
>>
dereteturbo is my personal choice
>>
Supra twinturbo when
>>
File: z-image_nag_00177_.png (1.56 MB, 1024x1536)
1.56 MB
1.56 MB PNG
>>
File: z-image_nag_00182_.png (1.67 MB, 1024x1536)
1.67 MB
1.67 MB PNG
>>
File: zimg_00031_.jpg (885 KB, 5944x1336)
885 KB
885 KB JPG
selfie lora
>>
>>107588714
HORSE COCK
>>
File: zimg_00032_.jpg (1.22 MB, 5944x1336)
1.22 MB
1.22 MB JPG
studio headshot lora
>>
File: zimg_00033_.jpg (1.19 MB, 5944x1336)
1.19 MB
1.19 MB JPG
vhs movie lora
>>
>>107588729
>>107588744
Why do you need a LoRA for something that can simply be prompted for?
>>
File: zimg_00035_.jpg (1 MB, 5944x1336)
1 MB
1 MB JPG
2000s nightclub photo lora
>>
File: 1747376017774053.png (3.76 MB, 1336x2008)
3.76 MB
3.76 MB PNG
>>107588146
>>
>>107588664
https://huggingface.co/GuangyuanSD/Z-Image-Re-Turbo-LoRA
>>
File: zimg_00034_.jpg (1.39 MB, 5944x1336)
1.39 MB
1.39 MB JPG
blackberry bold photo lora

>>107588765
you can prompt most of the way there, these push the style further
>>
>>107588782
that one is cool
>>
>>107586743
> made a proper report
> completely ignored
>>
File: 1745333821363877.png (2.21 MB, 1472x1216)
2.21 MB
2.21 MB PNG
>>107588775
so you use this lora, with your de-turbo lora on z turbo?

jesus christ. we need to escape from this turbo hell.
>>
File: zimg_0001.png (1.4 MB, 960x1280)
1.4 MB
1.4 MB PNG
>>107588789
thx anon, it really works with the selfie one well. gonna put it on civ tomorrow
>>
File: zit_00002_.png (1.9 MB, 1504x1024)
1.9 MB
1.9 MB PNG
>>
>>107588810
no fucking way this is ai
>>
File: zimg_0012.png (1.36 MB, 960x1280)
1.36 MB
1.36 MB PNG
>>
File: z_00001_.png (440 KB, 1024x1024)
440 KB
440 KB PNG
>>107588844
hai
>>
File: zimg_0020.png (1.48 MB, 960x1280)
1.48 MB
1.48 MB PNG
>>
>z img control net generates slop
wat do?
>>
>>107588888
CHECKED HOW DID YOU GET MY LORA ITS PRIVATE DELET
>>
>>107588902
anon close your ports aiiieee
>>
cozy bread
>>107588906
>>107588906
>>107588906
new when ready
>>
>>107588805
No you use that on the de-distilled model. Also you can raise the lora strength to 2
>>
>>107586911
>6 fingies



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.