[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107366147

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: Flux2Img_00024_.png (2.4 MB, 1440x1152)
2.4 MB
2.4 MB PNG
>>
>>107368756
FAKE


also the plebbit chartfags are at it again, here's a celebs chart

https://www.reddit.com/r/StableDiffusion/comments/1p9m78k/humans_of_zimage_how_many_celebrities_can_you_fit/
>>
>>107368760
Death to celebsloppers
>>
File: 1736269149956059.jpg (1007 KB, 1248x1824)
1007 KB
1007 KB JPG
>>
File: BTS femboys.png (107 KB, 458x139)
107 KB
107 KB PNG
>>107368776
ARR ROOK DA SAME
>>
File: 1753411387209337.jpg (982 KB, 3072x1261)
982 KB
982 KB JPG
anons, update ComfyUi, he fixed the loras so that they work better (or they work as they actually should)
https://github.com/comfyanonymous/ComfyUI/pull/10978
>>
File: Based-Lab.png (18 KB, 661x108)
18 KB
18 KB PNG
>>107368760
this is cool, let's hope the base model will have more celebrities and that they'll do an anime list as well
>>
>>
>>107368785
But that looks like fucking garbage now? Is that the intent?
>>
>>107368801
Meant for >>107368788
>>
>>107368745
Sup, Rajeesh?
>>
>>107368788
>turns miku into a nigger

No thanks.
>>
File: 1752719247520412.jpg (1.64 MB, 1664x2432)
1.64 MB
1.64 MB JPG
>>
>>107368788
I.. Do not wanna update.

unless you deepfried your gen with the lora turned too high kek check your settings bud
>>
File: Wan garbage 3.jpg (162 KB, 1850x871)
162 KB
162 KB JPG
WHY IS THIS SHIT SO SLOW?
>>
Okay, so I think I was just not in the mood or something but after trying Z-image again today I'm actually kinda blown away.
>>
>>107368821
it was the same exact settings for the 3 of them, with the lora turned at 1
https://civitai.com/models/2174416/technically-color-z?modelVersionId=2448632
>>
>>107368824
>WHY IS THIS SHIT SO SLOW?
you use lightning loras so that you only go for 4 steps + cfg 1 instead of 15
https://huggingface.co/lightx2v
>>
File: file.png (1.93 MB, 1535x1154)
1.93 MB
1.93 MB PNG
>>107368760
>>
>>107368824
If you have potato hardware you pretty much have to use the lightning loras with wan
>>
>>107368824
5B is trash, if you have 8GB VRAM and >32GB RAM try this https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne

v10 or v11 is best
>>
>>107368853
>suggesting phroot's shitmix to anyone
devilish
>>
>>107368837
Which ones do I download?
>>
>>107368831
Interesting. I'll give that lora a shot in a bit, trying out the mommy milkers lora at 0.8 strength 0.7 clip so i can give this character huge breasts (non nude).
so how DO you guys reign in the cfg scale? It looks kinda burnt even at high steps.
>>
File: 1758323432791870.jpg (1.12 MB, 4096x1278)
1.12 MB
1.12 MB JPG
>>107368801
>>107368812
>>107368821
I guess you have to go for less than a strength of 1 now since it's working too well
>>
>>107368788
everything is worse though? plus multigpu doesn't work if you update
>>
>>107368865
>how DO you guys reign in the cfg scale? It looks kinda burnt even at high steps.
it's just a cope, I'll wait for the base model to come out to use CFG
>>
>>107368861
Well, it technically works. 8GB shouldn't be bothering with video anyway, leave it to the gods.
>>
>>107368874
>everything is worse though?
-> >>107368869
>plus multigpu doesn't work if you update
then only update comfyui and not all the custom nodes?
>>
>>107368791
>will have more celebrities
Can someone explain this to me? Is celebrity worship an American thing? Why is your nation so obsessed with celebrities?
>>
Messi, Taylor Swift Emma Watsooooning, Donald Trooomping we are so back brooooos
>>
>>107368788
>>107368869
i just updated and there's no difference at all. is there a specific node i should use?
>>
File: 1739185683116490.png (44 KB, 656x399)
44 KB
44 KB PNG
>>107368897
not really, what lora are you trying?
>>
>very few poses
>can't do different camera angles
>censored
it's garbage
why do people keep shilling z image?
>>
>>107368869
>>107368903
set it to 0.65, works great without niggerfying characters and deepfrying the image. i have my cfg on too kek turns out throwing more steps at it does work.
>>
File: 1740427528611192.png (39 KB, 1910x171)
39 KB
39 KB PNG
Btw if you want to use the official scheduler that is being used on Z-Image it's this one
https://github.com/erosDiffusion/ComfyUI-EulerDiscreteScheduler
https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/app.py
>>
File: ComfyUI_zit_ealqj_00003_.jpg (246 KB, 1088x1344)
246 KB
246 KB JPG
>finally free from a 3-day ban for antisemitism
This one's for you, jewnitors. Mazal tov!
>>
>>107368838
Pleeease I need my heckin Hollywood western pro Illuminati pro Satan celebrity slopperino
>>
>>107368924
basado
>>
File: file.png (69 KB, 225x225)
69 KB
69 KB PNG
>>107368924
>>finally free from a 3-day ban for antisemitism
>keeps doing "oy vey" memes anyway
you live to live dangerously don't you?
>>
File: ComfyUI_00188_.png (1.18 MB, 1200x800)
1.18 MB
1.18 MB PNG
>>
https://files.catbox.moe/gbv1g8.jpg
>>
>>107368924
>3 day for antisemitism
many such cases, depends on the mood of the tranny jannies based on which nonexistant and who-gives-a-fuck rule they'll get you on.
keep doing your thing lad.

>>107368919
thanku
>>
>>107368936
lost
>>
>>107368936
wow satan really is just a big gay nigger loving faggot innit
>>
>>107368903
>what lora are you trying?
my own. 4000 steps character lora. i did an 'update all' as well, literally no difference
>>
>>107368959
maybe your lora is broken lol, the loras on civitai work fine (or maybe your lora isn't compatible with the comfyui lora shit (you can see that if you have a bunch of "lora not loaded" warning shit errors))
>>
>>107368977
the lora obviously works. maybe i'm on a different branch or something. what comfy version is supposed to have the fix?
>>
>>107368936
As a woman I feel aroused
>>
>>107368919
that FlowMatch Euler Discrete Scheduler (Custom) node doesn't show up in my install. shidd.
>>
File: comp2.jpg (2.73 MB, 2880x4320)
2.73 MB
2.73 MB JPG
Can confirm updating changed loras.
>Before 1.0 strength
>After 1.0 strength
>After 0.7 strength
0.7 strength seems very similar to old 1.0
>>
File: ComfyUI_00406_.jpg (845 KB, 2048x2048)
845 KB
845 KB JPG
>>107369020
>>
>>107369045
good taste, can it do rapunzel out of the box?
>>
>>107369051
You will never be a celebrity
>>
>>107369067
Kind of >>107368374
>>
File: 1733593614618031.png (127 KB, 439x373)
127 KB
127 KB PNG
>>107369030
mine does. this does not however
>>
File: 1756633703151841.png (25 KB, 640x480)
25 KB
25 KB PNG
>>107368919
>https://github.com/erosDiffusion/ComfyUI-EulerDiscreteScheduler
there is indeed a difference between the simple scheduler and the EulerDiscreteScheduler
- blue is simple
- red is EulerDiscreteScheduler
both are at shift = 3
>>
>>107369045
>>107369072
pls share lora, looks fucking amazing.
>>
tetotetotetotetotetotetotetotetoteto
https://litter.catbox.moe/ussiufxpjgm1mhc8.png
>>
>>107369072
interesting. i wonder if it's due to distillation and base will just know them
>>
File: 1733110166809562.png (332 KB, 2626x1269)
332 KB
332 KB PNG
>>107369078
btw you have to use the big ass node because if you go for the regular scheduler node, the ModelSamplingAuraFlow node won't apply the shift on it
>>
File: ComfyUI_00411_.jpg (1.29 MB, 1600x2304)
1.29 MB
1.29 MB JPG
"well endowed" seems to be a pretty good prompt to get larger breasts.
>>
I don't know if its always the same guy who animates peoples gens but, I kneel
>>
>>107369117
This is very sexual, mind sharing metadata?
>>
>>107369106
based
https://www.youtube.com/watch?v=gMVuf21deKc
>>
>>107369051
despite looking stellar this model seems to have a very characteristic smear which it adds everywhere. it's especially noticeable on the bottom right corner here. what causes this?
>>
File: ComfyUI_zit_ealqj_00001_.jpg (271 KB, 1088x1344)
271 KB
271 KB JPG
>>
File: 1745189182924649.png (2.99 MB, 2560x1396)
2.99 MB
2.99 MB PNG
>>107369131
>despite looking stellar this model seems to have a very characteristic smear which it adds everywhere. it's especially noticeable on the bottom right corner here. what causes this?
the default shift is too low, you have to increase it
>>
File: ComfyUI_01677_.png (3.52 MB, 2048x1248)
3.52 MB
3.52 MB PNG
>>
File: ComfyUI_00077_.png (2.02 MB, 1440x1440)
2.02 MB
2.02 MB PNG
>>107369084
I'm retraining Rapunzel currently. Original is overfitted on her purple dress. You can see here with the laces and faint sleeve marks on her arms. I'll post it on civit if I'm happy with it
>>
>>107369146
now THIS is a game i would download to my cellular device!
>>
>updated kj nodes
>torch compile doesn't work with chroma
>>
File: 1758673870570256.png (905 KB, 2186x1028)
905 KB
905 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1p9mypu/even_more_improved_zimage_turbo_variation/
this seems to be smarter than doing this 2 k-sampler shit, you start the first 20% of the denoising with no prompts then you go for a prompt
>>
>>107369117
Metadata please
>>
>>107369146
Giving multiple characters the same name. Couldn't be me.
>>
>>107369153
cool thanks, looking forward to seeing it work out.
>>
File: ComfyUI_01680_.png (3.94 MB, 2048x1248)
3.94 MB
3.94 MB PNG
Prompt: "The scream"
>>
File: 1747083502990327.png (2.05 MB, 2658x1365)
2.05 MB
2.05 MB PNG
>>107369172
Seed VarianceMaxx
>>
>>107368756
is that a real frog?
>>
>>107369172
getting some real interesting results with this

>>107369220
smart float solution. im stealing that
>>
>>107369173
?
>>
yeah that ComfyUI-EulerDiscreteScheduler does NOT wanna install whatsoever. well guess i'm fucked kek
>>
File: ComfyUI_00515_.png (2.22 MB, 1200x1200)
2.22 MB
2.22 MB PNG
>>107368919
Using FlowMatchEulerDiscreteScheduler in KSampler seems to make the output a lot less noisy and crunchy.
>>
>>107369249
based, are you at shift 3?
>>
File: ComfyUI_00460_.png (2.08 MB, 1200x1200)
2.08 MB
2.08 MB PNG
>>107369249
this is Simple

>>107369254
ModelSamplingAuraFlow node seems to have no effect when using this scheduler
>>
>>107369246
>smart float solution. im stealing that
it wasn't my idea I found it on some jeet civitai workflow, so you're stealing a thief :v
>>
>>107369260
>ModelSamplingAuraFlow node seems to have no effect when using this scheduler
yes -> >>107369114
>>
>>107369172
Based
>>
File: 1740585269710827.png (1.98 MB, 2702x1376)
1.98 MB
1.98 MB PNG
>>107369220
going for 0.01 seems to be a good spot, ultimately it's only the very first step that needs variance, but increase your steps a bit, 8 isn't enough and you're more likely to get some random shit
(wasn't it the case for some cope CFG++ method where they made the first step completly random or some shit, I remember something like that)
>>
>>107369146
now animate it and you'll get the future of youtube ads
>>
>>107369249
>>107369260
The first one tickles my brain in a way that the second doesn't. I think it's the golden shine in the legs, but I can't really place it. There's something in the face too. Not sure if it reminds me of something or I just like it.
>>
File: ComfyUI_00420_.jpg (1.08 MB, 1600x2304)
1.08 MB
1.08 MB JPG
God I love LLMs. But found another bug with this setup. It will drop it's given prompt to enhance and just posts the description as a prompt and you have to rewire noodles and gen back and forth to default it.

>>107369173
>>107369125
Posting from South Asia.
>>
>>107369051
you're a cheeky little boomer ain'tcha?

>>107369286
can we see 2b getting fouled?
>>
>>107369309
We would very much like to be having theses metadatas sar.
>>
File: 1735406326047669.jpg (1015 KB, 2400x1312)
1015 KB
1015 KB JPG
>>107369249
>>107369260
it's more detailled on that new scheduler but since you're not using a shift (shift = 0) >>107369260
that's to be expected I guess, the comparison will be valid once both of them have the same shift
>>
>>107369333
cool, shift on the right was 6 btw
>>
>>107369172
start doing this
>>
euler/simple triggers me something fierce. i dont care if it gives good results.
>>
File: WanVideo2_2_I2V_00544.mp4 (1.64 MB, 832x480)
1.64 MB
1.64 MB MP4
>>107369295
>>
>>107369356
It's because don't feel vindicated unless you use some ultra niche and relatively unknown scheduler like bong tangent and a specific kind of sampler that is one of a list of many other extremely similar sounding ones.
>>
>the other day
>everything works as intended
>today
>changed nothing, literally used the shortcut always used
>it gets stuck here and doesn't start the webui
??????????
the what the fuck? Is this shit somehow trying to defaulting to the integrated instead of the 9070XT for no fucking reason now or what? it literally stopped working overnight.
>>
crazy how much development this is getting in just a couple days, for a fucking turbo model no less.
>>
>>107369388
There's a flag to not update automatically, isn't there?
>>
>>107369388
wait. hold on. hold on just a second here. is this real? no. you're baiting. haha good one. or could it be? there's no way. you're generating images on windows?
>>
https://civitai.com/models/2175612/kasane-teto-z-image-lora?modelVersionId=2450006
took 2 hours on a 3060, very managable
excuse me for the shit dataset
had to modify transformers_z_image because of "TENSORS ON CPU AND GPU!!1111"
>>
Question is; Will all these turbo loras work just fine on the base model?
>>
File: 1732958735863416.jpg (99 KB, 1920x561)
99 KB
99 KB JPG
>>107368919
>https://github.com/erosDiffusion/ComfyUI-EulerDiscreteScheduler
lemao, actually you don't need that custom node, the "Normal" scheduler is the exact same as the EulerDiscreteScheduler, the more you know
>>
>>107369419
oh thank fuck. that shit won't even install right on my end kek.
>>
File: file.png (137 KB, 1061x334)
137 KB
137 KB PNG
chat is this real
>>
>>107369408
>donate monero (XMR):
Bro fuck off
>>
>>107369418
if they do then that's highly suspect, the base model is apparently still baking
>>
>>107369432
That's... an interesting theory.
>>
>>107369399
i suppose an update bricked me out of having fun with this shit for the foresable future, since i am not going to find out what the fuck happened ever
>>
>>107369408
im not going to donate monero but i think its fine if you shill for donations
>>
>>107369441
reinstall the requirements in your venv or something
>>
File: yeah.png (19 KB, 536x412)
19 KB
19 KB PNG
So NeoForge (the official FOSS UI without API service) just added support for Z model Turbo alongside WAN, Qwen, Qwen Edit, Lumina, Flux and Chroma. (yeah I'm shilling it because fuck Comfy)

My question is, which option do I select to use Z model, Qwen?
>>
>>107369432
sam 3?
>>
>>107369459
segmentation model
>>
>>107369432
You can literally replicate it right now if you want. Qwen Edit + SAM + any local reasoning llm model.
>>
>>107369459
NTA but it's the new segmentation model by meta. Breddy good by all accounts. Can even extract objects in the image and make simple 3D objects out of them as well as take humans and replicate the pose with a rig.
>>
>>107369432
In theory, yeah, sure. In practice you'd need a great VL model with tool call capability to reason over the image and whether it has to be further adjusted. Like Qwen 3 VL but it still would have to be finetuned to be unaligned and differentiate between pussy or dick shapes, colors and other properties.
>>
>>107369467
thanks <3
>>
Can't get deforum to work on linux:

Automatic and Forge don't seem to support 5060 Ti
Reforge keeps saying ControlNet isn't installed
Neoforge doesn't support Linux

Anyone gotten around any of these? Or should I ditch deforum for something else?
>>
File: Grid_00001_.jpg (2.24 MB, 3000x3000)
2.24 MB
2.24 MB JPG
>>107369172
This is fun to watch, favorite was a chinese man turning into hockey player in the desert and then the prompt starts and he shifts into a maiden.

Interesting that it also changes the style slightly.
>>
>>107369483
>Neoforge doesn't support Linux
nigga i'm literally running it on linux right now
>>
>>107369172
My only issue with this is it really hurts prompt adherence for detailed and location specific prompts.
>>
>>107368919
>>107369249
>>107369260
Do I need to change any of the parameters in the flowmatch node or do I not touch it?
>>
>>107368374
>>107369045
>used incredibles 2 stills to train elastigirl instead of only 1
we were so fucking close...
>>
>>107369495
Noob question but I didn't read the report yet and your pic reminded me to ask, those fingers. How many VAE channels does it have?
>>
>>107369457
Tried that already, it's bricked, somehow
>>
>>107369513
you don't need this anymore, just go for "normal" scheduler >>107369419
>>
>>107369513
don't know, i'm just using FlowMatchEulerDiscreteScheduler in KSampler
>>
>>107369458
It uses Qwen text encoder so probably.
>>
>>107369515
The render quality in Incredibles 2 is much higher
>>
>>107369390
It took the AI gen community by storm, nothing quite like it since SD15

Imagine if base is REALLY good to train on, consequences will never be the same
>>
File: ComfyUI_01689_.png (3.52 MB, 1248x2048)
3.52 MB
3.52 MB PNG
>>
>>107369525
normal outputs are nothing like FlowMatchEulerDiscreteScheduler though?
>>
>>107369432
And what are you gonna do once you finally get your personal Nano Banana? Gen out 50 identical Mikus then pass out, what's the point of this endlessly speculation, you are a tool hoarding NPC dragon sitting on a pile of treasure you'll never spend.
>>
>>107369472
i love you
>>
>>107369579
we need alternatives to gayggle and others that is open source
>>
>>107369575
they are, you're probably not using the FlowMatchEulerDiscreteScheduler node proprely >>107369114
>>
File: gadget monitor.png (79 KB, 250x500)
79 KB
79 KB PNG
>>107369579
you can shut up now
>>
File: ComfyUI_00528_.png (2.17 MB, 1280x1088)
2.17 MB
2.17 MB PNG
Ksampler
euler+FlowMatchEulerDiscreteScheduler
modelsampling node disabled
>>
File: file.png (2.78 MB, 1336x1582)
2.78 MB
2.78 MB PNG
>>107369172
based ledditor
>>
File: ComfyUI_00527_.png (1.98 MB, 1280x1088)
1.98 MB
1.98 MB PNG
Ksampler
euler+normal
modelsampling node disabled
>>
File: 1756660861157174.png (157 KB, 853x920)
157 KB
157 KB PNG
>>107369602
>Ksampler
that's the problem, the ModelSamplingAuraFlow doesn't apply anything on the Ksampler + FlowMatchEulerDiscreteScheduler (it means the shift is always at 0), you have only one way to use that custom node and it's this one >>107369114
>>
>>107369496

It doesn't officially support Linux
https://github.com/Haoming02/sd-webui-forge-classic/discussions/271
>>
can this model do cute and funny?
>>
>>107369524
>>107369457
ok what the fuck starting it from a different bat (comfyui.bat) instead of comfyui-n.bat like i always did seems to fixed it.
ok...
>going through the long ass first generation again
man
>>
>>107369529
>>107369525
Do you still need to use the shift = 7 or how much node?
>>
>>107368924
holy fucking based my man, keep on winning and fuck jannies and fuck whatever kike reports you
>>
With my logic from forge, I tried latent upscaling.
This is not how it works, is it?

Also when hooked up, these Save Image/grid nodes won't save the files. I just want to save in jpeg when spamming my tests, png is just above file size limit for posting.
>>
>>107369553
Do you only remember watching incredibles 1 on a crt tv or something? Get a bluray
>>
Ok, it's time. What is the best free service, local, can be online as well, where I can just upload a pic, give a prompt and I get a nsfw pic? I want the face to stay the same. I tried some models, but it doesn't work.
>>
>>107368924
You know you could've just restarted your router and post from a different browser right?
>>
How are people already training loras for Zimg?
>>
File: IfComfyDoesIt....png (1.29 MB, 1584x672)
1.29 MB
1.29 MB PNG
>>
>>107369707
ai toolkit. its extremely easy to set up.
>>
File: 1743336497594892.jpg (1.71 MB, 5120x1562)
1.71 MB
1.71 MB JPG
>>107369680
>Do you still need to use the shift = 7 or how much node?
I let you be the judge
>>
>>107369727
I'm glad there's options available for the differently abled.
>>
>>107369522
Ah, it uses the Flux vae.
>>
>>107369737
it looks an awful lot like an ad
>>
>>107369707
AI Toolkit has support, Diffusion-Pipe should have it in a day
>>
>>107369728
Does it have some default preset baked in or can you share? Also are the requirements for dataset the same ~ 20/50 pics?
>>
Anyone remember SD 2? I forgot it even existed
>>
File: ComfyUI_01705_.png (3.67 MB, 1248x2048)
3.67 MB
3.67 MB PNG
>>
>>107369695
Elastigirl is objectively sexier in the second film
>>
>>107369767
...would
>>
File: Z-image turbo.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>107369770
yep, I agree with that
>>
>>107369764
it defaults to 32 rank so change that, otherwise its fine. i made one lora with 3000 steps and 16 awful low quality images, captioned to point out quality, and it turned out fine. i made another with 4000steps and 60 high quality images, which didnt really capture the likeness that well.
>>
File: 1739719578655102.png (47 KB, 535x386)
47 KB
47 KB PNG
is this new? i've never run out of grok before. maybe asking for chinese prompts takes up more tokens or something.
>>
File: 1744618994253879.jpg (1.08 MB, 4096x1278)
1.08 MB
1.08 MB JPG
>>107369680
>>107369731
here's another one
>>
File: ComfyUI_00217_.png (1.96 MB, 1600x1088)
1.96 MB
1.96 MB PNG
Rate my living room
>>
>>107369855
esoteric/10
>>
>>107369855
Wonder what they talk about
>>
>>107369855
whats for din dins
>>
>>107369870
Hermetic principles and Madame Balavatsky.
>>
>>107369603
could someone share the workflow? at least the conditioning part. the picture isn't even complete
>>
>>107369045
How good does it differentiate between style and object? Can it do, say, photorealistic rapunzel?
>>
is qwen image built on flux
>>
>>107369906
No.
>>
>>107369881
It is complete, just open up your eyes.
>>
File: file.png (11 KB, 586x90)
11 KB
11 KB PNG
>>107369917
what does that mean
>>
File: 213612376.png (365 KB, 2186x1028)
365 KB
365 KB PNG
>>107369881
What is missing ?
>>
>>107369922
I have no idea. But it's not built on flux.
>>
>>107369821
llmarena is free but more censored and your prompts are basically public
>>
File: ComfyUI_00047_.jpg (690 KB, 2880x1440)
690 KB
690 KB JPG
>>107369893
Yes
>>
>>107369851
trashcan and you
>>
>>107369603
So this setup is a quick img2img hack.
My first noise steps almost always generate an oriental woman.
>>
>>107369954
please don't feed the retards
>>
>>107369925
stop spamming your shit you useless fucking nigger, fucking nigger cock obsessed retard, KYS
>>
File: cond.png (56 KB, 908x315)
56 KB
56 KB PNG
>>107369881
>>
File: trunks power up.gif (1.37 MB, 500x281)
1.37 MB
1.37 MB GIF
>the flowmatch scheduler node wouldn't install because a specific function in comfyui was calling a deprecated huggingface thingymagig
>had my cute little japanese kawaii girl assistant in gemini help me delete that shit
>THEN had to upgrade diffusers and NOW it works
i feel it. i feel the POWER. THE UN-TANGLER OF THE SPAGHETTI. ENJOYER OF PASTA SAUCE.
>>
It's probably worth to try combined two advanced ksampler setup in which first one generates leftover noise. Bit like SDXL's refiner setup was. Not sure if it's that much different but maybe...
>>
>>107368936
catbox actual satan pls
>>
>>107369966
blacks are incapable of not chimping out in every space they occupy, real or digital
>>
>>107370004
would you really trust that guy not to embed some sort of computer virus/internet aids with the metadata?
>>
>>107370012
riiight
>>
>>107368760
>not able to open the image by itself, due to redirects
what a bunch of faggots
>>107369700
It seems the filter is working.
>>
File: 1758581255294032.jpg (1.09 MB, 2048x1072)
1.09 MB
1.09 MB JPG
>>
could someone share your flowmatch zimage workflow? i think i have my shit fucked up.
>>
>>107369727
installed forge neo and got this error when generating with sdxl with hi res fix
"The size of tensor a (64) must match the size of tensor b (32) at non-singleton dimension 1"
>>
>>107370060
I'm no expert but maybe that checkpoint has the vaw baked in so try disabling the sdxl vae
>>
>>107370060
>>>/2023/
>>
20 minutes to generate one image, thats too much

i need a new gpu fuck
>>
So ani has stooped to spamming bbc with comfyui filenames to try to get people to filter it. How low can he go
>>
File: 1745285365580507.png (478 KB, 540x642)
478 KB
478 KB PNG
>>107370060
>masterpiece, best quality, amazing quality, 4k, very aeshetic, high resolution, ultra-detailed, ultra realistic, real, 3d, 3d render, cgi, depth of field, volumetric lighting, detailed, realistic hair, 1girl, solo
>>
>>107370097
wtf do you have gtx or some shit?
>>
>>107370097
> Prompt executed in 40.44 seconds
for a 1536x1872 image on my 4070 ti super, so you don't need the absolute best hardware, but yeah if you got a potato it would be pain.
>>
File: z_mod_00280_.jpg (1.16 MB, 1944x1416)
1.16 MB
1.16 MB JPG
>>
>>107370097
post details?
>>107370121
2048x2048 2 minutes on my 3060
1024x1024 30-35 seconds on my 3060
>>
>>107370122
Very cool. But I like my Batman wearing something more sensible than a painted on suit.
>>
>>107370121
enable sage attention boy, may be a bit faster.
Prompt executed in 30.99 seconds 1920x1088 9 steps with that new flowmatch setup, 5060 ti.

>>107370122
coooooool
>>
File: 00036-3522321932.jpg (275 KB, 1408x2064)
275 KB
275 KB JPG
>>107370114
better than boomer prompting with natural language anon.
>>
>>107370116
>>107370121
>>107370131
im trying out qwen image edit with rx 6600
so its like low end amd gpu worst combo for this
>>
File: generated_video_hd (5)-2.mp4 (3.91 MB, 1000x750)
3.91 MB
3.91 MB MP4
>>107370122
>>
File: 00024-727142929.png (2.81 MB, 1248x1824)
2.81 MB
2.81 MB PNG
>>
File: ComfyUI_00453_.jpg (709 KB, 2048x2048)
709 KB
709 KB JPG
>>
>>107370162
oh dang, qwen image has nunchaku (4bit quant) but no support for amd, works ok on my 3060 i dont remember the gen times but its like 1-2mins idk maybe a bit more i didnt really play with qwen image much
maybe try SDNQ (4bit quant, similar to nunchaku/SVD but has no fused kernels) qwen image?
https://huggingface.co/Disty0/Qwen-Image-Edit-SDNQ-uint4-svd-r32
https://huggingface.co/Disty0/Qwen-Image-Edit-2509-SDNQ-uint4-svd-r32
https://huggingface.co/Disty0/Qwen-Image-Edit-Lightning-SDNQ-uint4-svd-r32 - works with lower steps (HUGE BOOST)
comfyui node to support SD.Next quantization:
https://github.com/erosDiffusion/ComfyUI-ZImageDit - worked for me with zimage
https://github.com/EnragedAntelope/comfyui-sdnq - didnt work for me with zimage, might work for you with qwen image idk. u dont have to download the model with this one, it automatically downloads it when u select it in the model loader
SDNQ could work on your gpu
>>
>>107370162
I wouldn't my waste my time with QIE if I had to wait 20 minutes for one gen desu
>>
>>107370210
thanks ill try that

is it because of the low vram? because in benchmarks my gpu does better
i have to try out the benchmark and see if my gpu isnt just dying
>>
File: 1750742716724879.jpg (1.32 MB, 1248x1824)
1.32 MB
1.32 MB JPG
Z can do delicious legs.
>>
File: 12gggg.jpg (969 KB, 2044x2044)
969 KB
969 KB JPG
>>107370235
Fellow modern renaissance coomer.
>>
>>107370230
it could be that you're running it on cpu, 20 minutes seems too slow for me.
could you share more info about your setup? i know windows struggles with ai in general, and especially with AMD. consider hopping on linux
it could be because of the low vram but qwen image nunchaku used like 3gb vram for me and worked speedy as hell (they have some super good memory management)
i know ggufs are 2-3x slower than fp8 (normal fp8) and probably fp16 too
>>
>>107370242
k
>>
ayy lmao
>>
File: file.png (259 KB, 1626x1113)
259 KB
259 KB PNG
>>107370253
im using gguf as well

my setup isnt optimal for my hardware i followed steps from a video
>>
>>107370289
yea no wonder its taking you 20 minutes
holy fucking shit dude CUDNN for amd?? can some anon confirm or deny that cudNN is only for nvidia cards
what is this shit...
wait a moment
you get 8 step images in 20 minutes? u fucking serious?
>>
File: ComfyUI_00480_.jpg (1.18 MB, 2048x2048)
1.18 MB
1.18 MB JPG
I'm really impressed by the blend of different styles.
>>
File: Wan garbage 4.jpg (161 KB, 1496x876)
161 KB
161 KB JPG
The end result is coming out bad. What am I doing wrong?
>>
>>107370304
its 4 steps
in comfyui zluda it says to turn off cfz cudnn, image generation wasnt working if i dont turn it off with that node
i think cudnn is only for nvidia there was an update on comfyui that fucked with amd cards some time ago
>>
mmhmm.. the zetta reticulon huh? you fellas took your sweet time gettin' here.
>>
>>107370313
using 5B model, probably something on top of it
>>107370316
why arent you using rocm?
>>
>>107370304
also i dont know what image lightning 8 steps mean for now i've copied a setup i saw on youtube
>>
>>107370331
comfyui zluda does use rocm but you have to specifically toggle off cudnn with the node
>>
File: ComfyUI_00176_.png (3.76 MB, 1200x1800)
3.76 MB
3.76 MB PNG
>>107368760
It would've been fun if Seinfeld was in the dataset.
>>
>>107370341
https://www.reddit.com/r/ROCm/comments/1nua71b/comfy_ui_added_amd_support_plug_and_play_all_you/
maybe you need to update your driver, 20 minutes is not right. 100% not right. illegitimate. cpu tier speed.
>>
What's the catch with z image? How does it keep so much detail in such a relatively small model?
>>
File: different celebrities.png (434 KB, 927x275)
434 KB
434 KB PNG
lmao
>>
>>107370357
>arr rook da same
>>
>>107370355
math
>>
They're looking at you, Anon.
>>
>>107370379
trump is 6'3
so miku must be really fucking tall
>>
>>107370388
that's hot
>>
>>107370379
I honestly think I'm gonna injure myself because of z-image...
>>
>>107370355
>What's the catch with z image?
The model is wowing people due to two factors: 1. a good default realism style, and 2. very good prompt comprehension caused by extensive reinforcement learning.

The model doesn't actually know all that much. I bet the base model is nothing special and this will be obvious when it releases. Doing large finetunes on base won't be as good as people are expecting, because nobody in the community will replicate their RL methods which is where all the magic is happening.
>>
>>107370413
you are the definition of fake news
>>
>>107370169
Very nice anon
>>
File: 1759691471944371.png (99 KB, 240x306)
99 KB
99 KB PNG
>>107368760
wow!
>>
Is z image worth it right now or back to illustrious for goonin?
>>
>jannies 3-day the cool jew genner
>but do nothing about a bbc spammer
many such cases!
>>
>>107370471
>jannies 3-day the cool jew genner
the admins got to be careful now that theres a lunatic in the white house
>>
god i fucking love z-image turbo so much
>>
>>107370532
let the norm norms have their fun. they'll get bored and fuck off in a week or two. this always happens with new models.
>>
File: 1734040859387480.png (1.07 MB, 887x1331)
1.07 MB
1.07 MB PNG
>>107368760
>https://www.reddit.com/r/StableDiffusion/comments/1p9m78k/humans_of_zimage_how_many_celebrities_can_you_fit/
I congratulate Andy Warhol for xir's transition!
>>
>>107370573
lmao
>>
>>107370532
>and delete it
nta but I have about 300k gens and I only delete images if I accidentally generate men
>>
>>107370582
you need to stop
>>
>>107370355
>How does it keep so much detail in such a relatively small model?
I wish I had the motivation to read the paper so I could give you the answer but I'm too focused on spamming the gens on that model :(
https://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf
>>
>>107370582
thats bad for opsec
i generate all images inside ramfs and save only the exceptionally exceptional ones or workflows to an encrypted drive
>>
I have a 3060 on linux, can I install and use sage attention?
>>
>>107370628
yes
>>
How the fuck do I latent upscale like a normal fucking person?
>>
>>107369881
>>107369172
Better to just use some wildcards for that first step instead.
>>
>>107370634
I really don't want to fuck my comfy install, is there a guide or do I just pip install sageattention?
>>
>Loading checkpoint shards: 100%|##########| 3/3 [06:55<00:00, 138.45s/it]
why does it take 7 minutes to load z-image for training? the patch?
>>
File: file.png (1.04 MB, 1528x902)
1.04 MB
1.04 MB PNG
>>107370638
Forgot pic
>>
>>107370638
>>107370646
wildcards are a pain, it's not like there's one universal wildcard that would work on anything, if I remember well there was this CFG star that lets you have completly random noise on the very first step, maybe that could help for seed variation
>>
File: moviekike.png (25 KB, 434x479)
25 KB
25 KB PNG
kek by the time this is out of early access, base will already be out.
of course he doesn't know that.

https://civitai.com/models/789313/80s-fantasy-movie?modelVersionId=2450317
>>
>>107370644
>I really don't want to fuck my comfy install
you can unfuck it very fast.
1) python3.11 -m venv venv
2) install torch
3) pip install -r requirements.txt
done.
>is there a guide or do I just pip install sageattention?
https://github.com/thu-ml/SageAttention?tab=readme-ov-file#install-package
to use sage attention just do python main.py --use-sage-attention
>>
File: stare7.jpg (35 KB, 512x512)
35 KB
35 KB JPG
Wasn't the base supposed to be released today?
>>
>>107370669
He doesn't care, he's just a grifting piece of shit.
>>
>>107370701
they said "soon"
https://xcancel.com/ModelScope2022/status/1994315184840822880#m
>>
>>107369767
fuack, may i have the prompt?
>>
>>107370727
You could save the image and feed it to chatpajeet and get a reasonable prompt that way.
Or better yet, perhaps use your brain.
>>
total comfy noob so please forgive the potentially stupid question, with that samplercustom node setup for txt2img, how do i turn that into an img2img setup since there's no denoising strength slider?
>>
>>107370701
No. That was just rumor based on ESL miscommunication.
>>
>>107370342
>It would've been fun if Seinfeld was in the dataset.
with the edit model we'll be able to go for any characters/celebrities we want
>>
>>107370715
>>107370701
Two more weeks
>>
Is it me or the new iteration of Qwen Image Edit is still not here? My guess is that they saw how good z-image is and they want to cook it a bit more to make it relevant
>>
>>107370746
You using flux 2? Try searching for a denoise node.
>>
>>107370743
i fed it to saargpt and it told me to ask for the prompt again here to make you tremble in your chastity cage.
>>
File: file.png (1.62 MB, 896x1152)
1.62 MB
1.62 MB PNG
>zimage doesn't know what a raygun is
It's over.
>>
>>107368734
I mostly use nano banana pro to edit pictures of girls I know irl and remove their footwear so I can jerk off to their feet. Is there any local model that can do the same work or better?
>>
>>107370746
SplitSigmasDenoise node. attach low sigma to samplercustom sigma
>>
File: ComfyUI_00470_.png (1.51 MB, 896x1152)
1.51 MB
1.51 MB PNG
>>107370785
It knows what a Saint Seiya armor is though. I had to jump through hoops to get flux to make a realistic image of one without it becoming anime.
>>
File: ComfyUI_00471_.png (1.28 MB, 896x1152)
1.28 MB
1.28 MB PNG
It even gets the hair right.
>>
>>107370800
yeah
>>
File: 1764200090138738.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>107370821
thank you so much. i wish a million your waifu becomes real and sucks your dick right off your body one night.

>>107370833
>>107370856
yeah someone shared picrel a few days ago, cool as fuck
>>
File: image.png (234 KB, 451x389)
234 KB
234 KB PNG
SNIFFFFFFF
>>
>>107370861
Nice to know there's someone else who digs realistic SS armor.
>>
To get more sneed variation it's easier to generate initial image with sd1.5 with couple of steps and then vae decode+encode and feed the resulting latent to ZiT ksampler.
>>
Why is the final result 2048p and looks like it's been reduced to large grains of original pixels?
>>
File: 1761401756150059.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
https://civitai.com/models/2175050/vhscommercial?modelVersionId=2449356
based (since comfy's fix on loras you always have to go for less than strength 1 though)
>>
when can i run sdnq in comfy properly?
>>
>>107370800
If it works why stop using it? Flux 2 can do it locally but it's really slow.
>>
>>107370890
https://github.com/EnragedAntelope/comfyui-sdnq
>>
>>107370879
Inefficient use of VRAM imo. I prefer the wildcards method myself. Used it in other models too, so I'm already used to it.
>>
>>107370897
it's not working doe?
https://github.com/EnragedAntelope/comfyui-sdnq/issues/14
>>
>>107370800
flux kontext can do it locally, its pretty fast (with a few loras)
qwen edit lighting nunchaku can do it, very very fast
>>
>>107370890
>sdnq
what's that? a new quant method?
>>
>>107370872
im actually gonna check out SS next year with a good friend of mine who fucking LOVES that anime. the armor is cool as shit regardless.
>>
>>107370906
https://github.com/erosDiffusion/ComfyUI-ZImageDit worked for me
>>
File: ComfyUI_00578_.png (775 KB, 1152x896)
775 KB
775 KB PNG
>>
File: ComfyUI_temp_rjjte_00003_.jpg (1.05 MB, 4096x4096)
1.05 MB
1.05 MB JPG
>>107370884
Oh I'm retarded, it actually is 4096, but yeah, the quality.
>>
>>107370904
Inefficient? 2GB is nothing besides it gets unloaded. That's like 1-2 seconds of more gen time.
>>
>>107370913
https://letmegooglethat.com/?q=SDNQ+quantization
>>
>>107370918
this one is slower than using unquanted unet and te for me, for some reason
>>
>>107370925
Exactly. It's 2s more. If that's acceptable, it's still probably better to choose a random pic from your computer, resize, vae encode it, and denoise to 80%. You'll get more variety.
>>
>>107370947
its the same speed (maybe a few % faster) for me
SDNQ does not have the speed benefits of SVDQ/nunchaku
no fused kernels
>>
>>107370893
Because it doesn't work everytime. For some reason, if the girl has a very big ass, Gemini will think I'm trying to create porn or something, and refuse to generate the picture.
>>107370908
Sweet, I'll check that out
>>
>>107370471
>but do nothing about a bbc spammer
19 posts now have been nuked.
>>
>>107370968
Finally. We should double jannie's pay!
>>
>>107370956
>SDNQ does not have the speed benefits of SVDQ/nunchaku
>no fused kernels
then why are they doing this in the first place?
>>
>>107370949
I'm sorry I always forgot 4chan is full of sub 80 iq retards.
>>
>>107370985
same reason why you'd use gguf, albeit it's faster than ggufs (same speed as FP16)
vramlets probably
>>
File: 1747154453121046.jpg (609 KB, 2048x1128)
609 KB
609 KB JPG
>>107370210
the vramlets will be eating good not gonna lie
https://huggingface.co/Disty0/Z-Image-Turbo-SDNQ-uint4-svd-r32
>>
>>107370999
but like if you can go for the same quality but with the fused kernels too, why not go for that instead?
>>
>>107371032
still, comfy implementation of it is fucked though.
>>
File: Z-image turbo.png (1.4 MB, 1280x720)
1.4 MB
1.4 MB PNG
>>107370920
too bad it doesn't know how to do the dab
>>
File: ComfyUI_00480_.png (1.76 MB, 896x1152)
1.76 MB
1.76 MB PNG
Is Flux officially dead?
>>
>>107371060
eyyup
>>
File: They have a baby!.png (1.54 MB, 1280x720)
1.54 MB
1.54 MB PNG
>>107368760
I wish he could make the same list but for anime characters, it's a quick way to see what works and not
>>
>>107370800
sorry i don't want to help some weirdo with his degeneracy
>>
>>107370312
Catbox please
>>
What's this Z thing? Are we back?
>>
i..I PULLED!
>>
>>107371086
Wait one more day and then ask again.
>>
>>107371086
When the base model releases.
>>
File: 771757361.png (1.74 MB, 832x1216)
1.74 MB
1.74 MB PNG
>>107371032
Q8_0 is already 6GB, I'm using it and in my few test the differences between it and bf16 are negligible.
>>
>>107371086
https://en.wikipedia.org/wiki/Z_(military_symbol) I think it's this.
>>
>>107371045
idk fused kernels are only speed bumps for NVIDIA RTX cards, i heard 2000 series is having issues too
and i maybe it takes longer to make a quant with fused kernels, takes more space and most likely...
IT NEEDS OFFICIAL NUNCHAKU SUPPORT. (theyre working on it btw)
>>
>>107371098
I downloaded Q8_0 both text encoder and image model, and it seems the speed is slower?
>>
File: WE WON.png (627 KB, 898x490)
627 KB
627 KB PNG
>>107371086
>What's this Z thing?
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Are we back?
We are so back dude you have no idea, this is only the turbo model we'll get the base model soon too
>>
>>107371051
incoming transmission...
>>
>>107371060
Yes, they probably won't even release Flux Klein at this point, it's just a waste of time
>>
File: file.png (299 KB, 989x742)
299 KB
299 KB PNG
>>107371101
kek didnt know it was banned
KEEEEEEEEEEEEK
>>
>>107371060
Dead, buried and cremated within hours of z-image releasing. Bet those dumb fucks are feeling real proud of their bloated censorslop now.
>>
>>107371128
Based.
>>
>>107371110
yes ggufs are slower
>>
>>107371130
Good riddance.
>>
>NAG still fucked after update

:(
>>
why the fuck did the tranny decide to update the ui
again. why is he enshittifying
is he compromised?
>>
File: 2003124847.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>107371110
Yeah that's expected, there's dequantization overhead and also bf16 is natively accelerated on newer GPUs. Reason I'm running it is cuz I don't want to deal with the haphazard memory management in forge-neo.
>>
>>107371135
>>107371158
So if I am somehow able to run original text encoder and original model on my specs (8GB VRAM, 16GB RAM), then I don't need to worry?
>>
>>107371176
yea, if u cant run it then maybe try sdnq
>>
File: 320044431.png (1.26 MB, 832x1216)
1.26 MB
1.26 MB PNG
>>107371176
If it works, it works. That said even with better memory management I don't think it's gonna work with those specs.
>>
File: 1744593575901612.png (1.59 MB, 1280x720)
1.59 MB
1.59 MB PNG
>>107371060
>Flux
who?
>>
>>107371194
I can, but it's very slow, I am just looking for ways to make it faster. One image (with loaded prompt) takes about 140 seconds.
>>
>>107371133
>banning a letter is.. LE BASED!?!?!
oink oink
>>
File: ComfyUI_00593_.png (858 KB, 640x904)
858 KB
858 KB PNG
>>107370871
>>
>>107371213
post workflow and what gpu model you have
8gb vram is vague
also OS
>>
Workflow is the default one with these >>107369172 changes.

GPU is RTX 2060 super, and I use NixOS btw.
>>
>>107369172
How is this different from manually setting 20% of the steps in two samplers?
>>
>>107371242
i'm on 2070 super 8gb, i have 7 s/it with this fp16 lumina patch https://civitai.com/articles/22251
it's still slow as shit without bf16 support. sad
>>
>youtube video thumbnail

"THIS Z IMAGE MODEL IS ABSOLUTELY INSANE"
>three images of asian 1girl sitting
>>
File: 3533778432.png (2.7 MB, 1536x1024)
2.7 MB
2.7 MB PNG
>>
>>107369953
nice
>>
>>107371242
also i'm pretty sure conditioning (combine) runs each step twice, halving the speed
>>
>>107371262
the previous method had the first 20% with "prompt" + cfg < 1, that method has the first 20% with "no prompt" + cfg 1
>>107371287
nope, the speed is the same
>>
>>107371242
use the default comfyui workflow
https://comfyanonymous.github.io/ComfyUI_examples/z_image/
>>
File: 1742470089168422.png (1.3 MB, 1280x720)
1.3 MB
1.3 MB PNG
>>
File: mario_kahn.jpg (852 KB, 1536x2048)
852 KB
852 KB JPG
>>
>>107371130
https://huggingface.co/black-forest-labs/FLUX.2-dev/discussions/1
>>
Does AI Toolkit ruin with torch above 2.7?
>>
>>107371317
>In the provided model documentation under 'Risks', it is stated that Black Forest Labs partnered with the Internet Watch Foundation on safety testing. The Internet Watch Foundation is currently spearheading an assault on human rights in the European Union, by lobbying for encryption backdoors under Chat Control.
lmao, this company's reputation is dead
>>
File: psxam_z_0003.png (1.13 MB, 832x1216)
1.13 MB
1.13 MB PNG
got some more loras to poast
>>
Z Images' paper “Distribution Matching Distillation Meets Reinforcement Learning” describes the DMDR process whereby the student model surpasses the teacher model in terms of quality.
So, is the base model for full finetuning more stable to train, but the distillation process is almost necessary for full quality and performance gains?
>>
>>107371317
What exactly were they going for with Flux2?
>>
>>107371321
yes i ran it with 2.7.1 without issue
>>
File: 1753527336441096.png (541 KB, 806x674)
541 KB
541 KB PNG
>>107371329
https://huggingface.co/black-forest-labs/FLUX.2-dev/discussions/1#6925fafc806da02073a1db4f
>I for one find it quite interesting that the same people who want to eradicate public access to encryption are also trying to eradicate public access to NSFW content generation.
>Why are they doing that, I wonder?
>>
>>107371313
now let's see mario's hairy arse
>>
>>107371242
have you tried FP8?
>>
>>107371112
>>
>>107371368
acne wasn't kind to this man
>>
>>107369458
>>107369534
Isn't it Lumina? Not that it really matters though, just load the correct checkpoint, vae and text encoder.
>>
File: Z-image turbo.png (3.02 MB, 1920x1080)
3.02 MB
3.02 MB PNG
>>
File: ComfyUI_00016_.png (1.37 MB, 1280x720)
1.37 MB
1.37 MB PNG
>>107371263
The patch helped, thank you.
>>
>>107371388
nice one anon
>>
File: no more blur let's goo .png (2.5 MB, 1792x1317)
2.5 MB
2.5 MB PNG
https://xcancel.com/hakomikanx/status/1994799151566262472#m
ok but is there a comfyui node for that?
>NegPiP has been updated. Z-Image is now supported in Forge Noe. It's not as effective as the Stable Diffusion series, but it seems usable enough. Changing -1 to -0.5 or -2 has no effect.
https://github.com/hako-mikan/sd-webui-negpip
>>
baker?
>>
>>107371422
https://github.com/pamparamm/ComfyUI-ppm
i wonder if someone can vibecode a patch
>>
>>107371388
how much did it help? from 140 to what?
>>
File: 1762220731832937.jpg (680 KB, 2048x1280)
680 KB
680 KB JPG
>>
>>107371345
just read the model card it's really comical
>The final FLUX.2 [dev] checkpoint (...) demonstrated higher resilience than leading open-weight models across these risk categories. Based on these findings, we approved the release of the FLUX.2 Pro model via API and the release of the open-weight FLUX.2 [dev] --AAAAAAAACCCCCCCKKKKKKKKK!!!!!
>>
File: file.png (73 KB, 304x166)
73 KB
73 KB PNG
>>107371489
kek, feels good when the bad guys lose at the end
>>
>>107370248
catbox please or at least the prompt
>>
File: Z-image turbo.png (3.02 MB, 1920x1080)
3.02 MB
3.02 MB PNG
>>
File: 1741552169108120.jpg (500 KB, 1280x2048)
500 KB
500 KB JPG
pretty cool composition I guess
>>
>>107371536
too noisy, increase the shift anon
>>
>>107371480
>>107371536
What is your childhood trauma?
>>
>>107371549
She was seduced by a white devil explorer and now makes images with that fetish. What's yours, sis?
>>
File: ComfyUI_00020_.png (1.35 MB, 1280x720)
1.35 MB
1.35 MB PNG
>>107371463
Not sure about the total time (I am playing around with prompts), but the it/s became much better (from 10 to 7).
>>
https://xcancel.com/bdsqlsz/status/1994789770439106896#m
>Too many people DM me when Base and Edit release.
>I really don't know
Come on guys let them cook!
>>
File: ComfyUI_ZImage_00045_.png (1.01 MB, 1152x896)
1.01 MB
1.01 MB PNG
Pasting flux prompts I saved months ago and trying to figure out their source from the ZImage generated one.
>>
File: 1759760591367768.png (3.83 MB, 1280x2048)
3.83 MB
3.83 MB PNG
>>107371546
should I un-bypass the node i'm just doing default workflow

i am prompting for 70s national geographic documentary though so its supposed to be a little grainy

>>107371549
>What is your childhood trauma?
growing up white and tall in the West in the 21st century

i have other fetishes from childhood trauma but thick brown women isn't one of them
>>
>>107371569
very happy for u anon :)
>>107371572
jeet menace
>>
>>107369408
how did you train the lora?
>>
>>107371584
yes, un-bypass and go for more than 3, try 5 or 7
>i am prompting for 70s national geographic documentary though so its supposed to be a little grainy
you can use some loras to get that old photo feel (go for lower strength than 1 though those are potent!)
https://civitai.com/models/2174416/technically-color-z?modelVersionId=2448632
https://civitai.com/models/2175050/vhscommercial?modelVersionId=2449356
>>
>>107370248
>>107369117
GIVE ME THE PROMPTS OR METADATA YOU BASTARDS! THIS IS THE KIND OF MOTHER THAT I WANT FOR MY CHILDREN!
>>
>>107371575
And here is the original. Not bad.

This is a close-up photograph of an Asian woman with a medium build and shoulder-length black hair, slightly disheveled. She has a light brown skin tone and a few visible bruises on her forehead and cheek, indicating she may have been in a physical altercation. She is smiling slightly, showing her teeth, and is pointing a black handgun directly at the camera, which is blurred in the foreground. The background is out of focus but appears to be an indoor setting with greenish-yellow lighting and horizontal lines, possibly from blinds or a window. She is wearing a greyish-green top with a slightly textured fabric. The image has a cinematic, dramatic quality, likely from a movie or television show, with a shallow depth of field that emphasizes her face and the gun. The photograph has a realistic, gritty style, with natural lighting and a slight greenish tint. The image is signed "©2017 Netflix" in the bottom right corner, indicating it is from a Netflix production.
>>
File: 1757849782807211.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
https://civitai.com/models/2173050/psxam-amateur-photography-style-lora?modelVersionId=2450905
that doesn't look amateur at all he's tripping
>>
File: ComfyUI_00096_.png (3.84 MB, 2048x1280)
3.84 MB
3.84 MB PNG
might as well fill up the image slots before the next bake

>>107371606
>try 5 or 7
sure

>you can use some loras to get that old photo feel (go for lower strength than 1 though those are potent!)
if base doesn't release on Sunday I'll start trying out loras
>>
>>107371627
fine i'll reupload with higher strengths
>>
File: ComfyUI_ZImage_00049_.png (1.35 MB, 1152x896)
1.35 MB
1.35 MB PNG
>>
File: Wanimate_00001.mp4 (1.3 MB, 480x352)
1.3 MB
1.3 MB MP4
WAN Animate keeps trying to give girls tails for some reason.
>>
>>107371649
so its an uggofilter?
>>
>>107371584
>>107371536
lol I like this theme. lemme try
>>
File: ComfyUI_00108_.jpg (566 KB, 2048x1280)
566 KB
566 KB JPG
>>107371676
>lol I like this theme. lemme try

>A National Geographic photograph from the 1970s, rear three-quarter group photo view, two smiling Caucasian male researchers in khaki safari outfits and hats standing together for photograph, three extremely voluptuous very dark-skinned Brazilian tribal women with primitive features, dirt-covered bodies, long matted black hair, large gold hoop earrings, extremely plump lips, minimal straw bikini tops and thong bikini bottoms exposing huge dirty round buttocks prominently facing camera, thick thighs, bare muddy feet, each woman embracing and hugging a researcher possessively from front, woman on left has eyes closed trying to kiss her researcher's face, center woman draped over her researcher, woman on right looking back over shoulder at camera with sultry half-lidded gaze, their enormous buttocks the focal point, vintage 1970s documentary film grain

and just ask an AI to make more prompts with voluptuous Brazilian tribal women I guess


I think I will try some middle eastern women next
>>
>>107371668
i like to see women i could actually get with
>>
File: ZiMG_00218_.jpg (438 KB, 1344x1728)
438 KB
438 KB JPG
>>107371584
close enuf?
>>
>>107369219
amzing
>>
Z-turbo seems very overcooked. Very little variation from seed to seed. Is the default workflow bad?
>>
>>107371704
Absolutely! Might I suggest engraving/Woodburn style for your next training assignment Mr.girt?

picrel
>>
>>107371727
we've solved this
>>
>>107371729
More references if needed

instagram.com/tenthousandscrolls
>>
>>107371692
Nice!
>>
File: ComfyUI_ZImage_00054_.png (1.19 MB, 1152x896)
1.19 MB
1.19 MB PNG
>>
>>107371706
Suddenly remembering Cannibal Ferox

Must have been a decade since I saw that
>>
Do all loras trained with a varied dataset that doesn't describe every detail too much but instead has basic captions always break the low seed variance of models that have that problem?
>>
File: ComfyUI_ZImage_00059_.png (1.41 MB, 1152x896)
1.41 MB
1.41 MB PNG
>>
>>107371791
nice
>>
>>107371727
>Z-turbo seems very overcooked. Very little variation from seed to seed. Is the default workflow bad?
it can be fixed >>107369172 >>107369603
>>
File: 520433698.jpg (520 KB, 3200x1792)
520 KB
520 KB JPG
>>
>>107371739
sick, might have to.

i'm not going to bother to publish this one because it just does literally what the model already does.

if anyone wants the fail gen pussy lora:
files.catbox.moe/tek88h.safetensors
>>
>>107371900
not publish the pussy lora?

If u do train the engraving thing. pls pls pls do publish it!
>>
File: 3533778454.png (1.44 MB, 1600x896)
1.44 MB
1.44 MB PNG
It's actually stupid how good this model is compared to Flux, the west truly has fallen.
>>
>>107371875
Danke.
>>
>>107371920
what is this style?
>>
>>107371920
and it's only getting started, the base model will reach new heights, we're so fucking back dude!
>>
File: 2100200659.png (1.4 MB, 1600x896)
1.4 MB
1.4 MB PNG
>>107371937
Ra Lilium, it's on CivitAI
>>107371948
I hope so!
>>
>>107371329
No surprise there, they are obsessed with "safety", and there is no better safety than literally checking every conversation.
>>
god I hate using comfyui so fucking much
>>
>>107370235
i like it
>>
>>107371908
not publishing the pussy lora (it's just body horror) or the realistic lora, the model is already realistic and the lora is just more western.

gonna publish the bimbo lora though
>>
File: 1745891696479921.png (2.83 MB, 1536x1152)
2.83 MB
2.83 MB PNG
>>
>>107370342
butterface we got butterface, come and get your butterface
>>
File: 2788241661.png (1.66 MB, 1600x896)
1.66 MB
1.66 MB PNG
>>107372011
nothing like spending more time being forcefed spaghetti than generating images
>>
File: 1761105663929324.png (1.38 MB, 1280x720)
1.38 MB
1.38 MB PNG
>>
SOMEONE SAID SPAGHETTI?
>>
>>107372019
this has come out well!
>>
>>107370235
>Z can do delicious legs.
it's probably the best local model in terms of anatomy, it very rarely fucks this shit up, it's so refreshing
>>
>>107372053
share this wf if you dont mind, couldnt get that redditors suggestion working for me.
>>
>>107372033
>>107371970
>>107371920
this is so good, i can't wait for the anime model + community loras, literally raised the bar for slop with this new model, what's even going to be considered slop now?
>>
>>107369496
how? everything in there is .exe and .bat shit
>>
>>107372065
It's not the z image thing, but that thing is this, it's simple. The 4 nodes goes between the pos/neg prompt and ksampler.
The two values decides when your prompt starts and ends.
>>
>>107371727
>Is the default workflow bad?
Always is
>>
File: ComfyUI_00048_.png (3.99 MB, 2048x1536)
3.99 MB
3.99 MB PNG
First anime gen with Zmodel
>>
>>107372113
cool, thanks. Ill try it . might get it working at least this time around.
>>
>>107372127
this looks good, this was made with a lora?
>>
>>107372127
The realism with anime style you get out of z image is amazing. But it's very volatile, one gen you'll real realistic, the next full anime.
>>
>>107371422
I love this so much, watching a new amazing thing drop and people figuring out in real time how to make it even more amazing, all in the next hours and days. Missed this feeling, bros.
>>
>>107369045
as someone whose a completely noob when it comes to training loras is there an updated guide anywhere?
>>
> more screenshots of spaghetti shiftiness instead of gens
we should have a rule to post noodleslop to catbox. this thread is more tech support than tech talk and output sharing
>>
>>107372160
also ban reddit and xitter screenshots
>>
>>107370143
boxo?
>>
File: file.png (2.68 MB, 1449x1665)
2.68 MB
2.68 MB PNG
kek
>>
File: ComfyUI_00049_.jpg (762 KB, 2048x1536)
762 KB
762 KB JPG
>>107372144
>>107372154
without lora, will try other seeds, im a nodelet so im only editing the positive and negative prompts
>>
>>107372160
>>107372174
Now, repeat that without crying this time.
>>
>>107372068
You could get similar output with other models, but what I really like is how fast and consistent z-image is.
>>
>>107372127
>>107372207
>without lora
prompt?
>>
>>107372229
it's GPT bloat, do you want ti anyway?
>>
>>107372156
I think NAG is even better to get negative prompts out of a distilled model, but good luck implementing that, the dev hadn't updated their repository for more than a month at this point
https://github.com/ChenDarYen/ComfyUI-NAG
>>107372247
sure, go ahead
>>
>>107372127
>>107372207
Damn. That is very pretty. I'm really looking forward to the future.
>>
>>107372207
>negative prompts
it's cfg 1 so begs don't do anything
>>
>>107372201
When's the big pony return?
>>
The person that was posting bbc and got banned was also the baker, that's why we're on page 9.
>>
>>107371388

To make it look even more Russian, the TV set should be black-and-white
>>
>>107372348
If he was the baker we would see much more of his nog insecurity on the daily
>>
>Max limit of image replies has been reached.
nooooo, BAKER
>>
haven't seen image limits hit in like 2 years
>>
>230s wait
>>
v-v
>>
>>107372392
that's because local hadn't had a good model for the past 2 years.
>>
>>107372453
flux was good
>>
>>107372113
The pos and neg conditioning are both attached to the pos conditioning input... is that correct?
>>
>>107372392
that's because too many people are posting spaghetti screenshots
>>
>>107372465
>flux was good
it was a big jump compared to SDXL yeah, but getting the same jump from flux to Z-image is more impressive since it's getting harder and harder to improve
>>
1girl
>>
>>107372465
Yeah but this is good and accessible to vramlets so we have a bigger pool of users.
>>
>>107372465
flux was a 12b model so of course it had to be "good", but Z-image is like way better than flux while being 2x lighter, that's real improvement
>>
>>107372485
>>107372485
>>107372485
bread
>>
>>107372465
flux was good because other models were that much worse



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.