/g/ - Technology


Thread archived.
You cannot reply anymore.




Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107372485

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Comfy
>>
>>107374562
Should
>>
File: Capture.png (2.24 MB, 1024x1328)
https://www.reddit.com/r/StableDiffusion/comments/1p9zqzw/zimage_reimagines_early_nintendo_power_covers/
damn
>>
>>107374574
bite
>>
The anon posting below me is a Comfy shill
>>
>everyone blowing their load early with training Z-Image loras when the Ostris adapter barely works at de-distilling the model
People need to calm down. I'm almost positive all these loras are way worse than they need to be. Wait for the base model, or for someone to train a proper de-distilled version.
>>
Making some images to do a comparison between many of the current and last gen models and it's so fucking annoying that SD3.5 medium doesn't converge until like 100 steps lol
>>
>>107374563
Architecture and dataset >>>>> scale at all costs
>>
>>107374613
based take
>>
>>107374601
>People need to calm down
>Wait for the base model
this
>>107374613
>Architecture and dataset >>>>> scale at all costs
this
>>
File: Wen.png (2.5 MB, 1792x1317)
Wen ComfyUi
https://github.com/hako-mikan/sd-webui-negpip
https://xcancel.com/hakomikanx/status/1994799151566262472
>>
>>107374576
lel
>>
>>107374620
Based
>>
>>107374620
when someone is swindled into doing it for free. comfyui is stolen valor and doesn't do anything themselves anymore
>>
>>107374621
based, that's how you talk to those fuckers
>>
Can Z do the pornographic saars?
>>
>>107374621
Based
>>
Blessed thread of frenship
>>
>>107374635
Go back to black forest you faggot
>>
>>107374620
Can it unblur pussy?
>>
>>107374621
this is a very unsafe comment, bfl should contact the cyber police
>>
>>107374628
>>107374632
>>107374636
https://youtu.be/m8D-LomudKE?t=9
(this is a joke I find the "based" spam funny as well)
>>
>>107374635
It can kind of do nipples, sometimes. And not much else x-rated
>>
File: ComfyUI_00028_.png (329 KB, 512x512)
i made this image on sd1.5, 512x512, 20 steps, 8 cfg with a rx 6600 8gb vram in 80 seconds, about 1.22s per iteration


a pink rose flower inside a white flower pot on a table inside a house


when using qwen image edit i have 200s per iterations
>>
>>107374635
Can do the cunniest
>>
>>107374649
holy kino
>>
>>107374601
>>107374617
>People need to calm down.
you need to kill yourself for trying to tell me what I can and cannot do with my GPU
>>
>>107374649
SOVL
>>
File: funnybanan.jpg (48 KB, 512x512)
>>107374649
ok
>>
I guess we'll see Comfy implement the Base model on his repository before it gets released right? That's how it went for Turbo
>>
>>107374664
>with my GPU
with your actions and words dumb faggot
>>
File: ComfyUI_00029_.png (304 KB, 512x512)
>>107374659
>>107374665
>>107374674

thanks
what i wanted to know is does that performance look correct for this low end gpu? it takes me 20 minutes to make one qwen image
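
A rough sanity check on those numbers, as a sketch only: sampling time is about steps times seconds per iteration. The 20 steps, 1.22 s/it, 80 s total, 200 s/it and 20 minutes are the figures quoted in these posts; treating the rest of the 80 s as fixed overhead (model load, text encode, VAE decode) is an assumption, and the function names are made up for illustration.

```python
def sampling_seconds(steps: int, seconds_per_iteration: float) -> float:
    """Time spent in the sampler loop only (no model load, no VAE decode)."""
    return steps * seconds_per_iteration


def implied_steps(total_seconds: float, seconds_per_iteration: float) -> float:
    """How many iterations fit in a given wall-clock time at a given s/it."""
    return total_seconds / seconds_per_iteration


# SD1.5 post: 20 steps at 1.22 s/it is ~24 s of sampling, so most of the
# reported 80 s would be overhead outside the sampler (assumption).
sd15_sampling = sampling_seconds(20, 1.22)
sd15_overhead = 80 - sd15_sampling

# Qwen Image Edit post: 20 minutes at 200 s/it corresponds to about 6 steps.
qwen_steps = implied_steps(20 * 60, 200)

print(f"SD1.5 sampling: {sd15_sampling:.1f}s, other overhead: {sd15_overhead:.1f}s")
print(f"Qwen steps implied by 20 min at 200 s/it: {qwen_steps:.0f}")
```

So the two reports are at least consistent with each other: 200 s/it over a handful of steps lands at roughly 20 minutes per image.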
>>
>>107374651
That sounds problematic
>>
>>107374675
hope so
>>
>>107374649
>>107374679
Considering you have an AYYMD from 2021 probably
>>
File: 1693252393.png (3.65 MB, 1344x1728)
>>107374601
Just train them again later, no biggie.
>>
File: fdsafdsa.jpg (208 KB, 2152x892)
Reference | Generated.
Used just the Joy Caption Beta to describe it then plugged it into Z.
>>
>flux based porn models are appearing but they're even worse than the SDXL based ones
Is flux a lost cause then? Are we just stuck with SDXL forks until the end of days
>>
File: 1758021891529235.png (3.43 MB, 2048x1280)
>>
>>107374679
it's as good as they come anon, don't let anyone tell you otherwise
>>
>>107374704
Z will save us.
>>
How to Train a Z-Image-Turbo LoRA with AI Toolkit
https://www.youtube.com/watch?v=Kmve1_jiDpQ
>>
>>107374704
Did you not check the thread for the last 3 days?
>>
>>107374621
based based based
>>
File: ComfyUI_00125_.jpg (417 KB, 2048x1280)
>>107374704
>Are we just stuck with SDXL forks until the end of days
People are really just dumping all their retard opinions that will be obsoleted when ZBase comes out tomorrow huh

BDS chink or another twitter chink would have sounded the alarm on a SaaSening like he did for 2.5 by now
>>
File: he's for a surprise.png (1.15 MB, 2000x1000)
>>107374704
>Is flux a lost cause then? Are we just stuck with SDXL forks until the end of days
>>
Lumina walked so Z could runned
>>
>>107374721
No, what's the consensus
>>
File: 1758889738418078.png (1.35 MB, 1280x720)
>>
Loras trained on the base model should work better with turbo than loras trained on turbo, right? I'm looking forward to base but turbo is still nice for the speed
>>
>>107374749
z-image wonned
>>
>>107374756
CFG distill only speeds up 2x right? So your gen time would only double which really isn't that bad for people with sub-10s gen times on turbo already
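
Where the 2x comes from, as a minimal sketch: classifier-free guidance runs the denoiser twice per step (conditional and unconditional) and blends the two predictions, while a CFG-distilled model needs a single pass. The function names here are illustrative and not from any particular UI. This assumes the same step count for both models; if base also needs more steps than turbo, the gap would be larger than 2x.

```python
def cfg_combine(cond: float, uncond: float, scale: float) -> float:
    """Standard CFG blend: push the prediction away from the unconditional one."""
    return uncond + scale * (cond - uncond)


def model_calls(steps: int, uses_cfg: bool) -> int:
    """Denoiser forward passes needed for one image."""
    passes_per_step = 2 if uses_cfg else 1  # cond + uncond vs. a single pass
    return steps * passes_per_step


steps = 9
print("CFG-distilled:", model_calls(steps, uses_cfg=False), "forward passes")
print("with real CFG:", model_calls(steps, uses_cfg=True), "forward passes")
```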
>>
>>107374749
>what's the consensus
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>
>>107374749
Ani wonned
>>
Why is it always the fucking chinese
>>
File: file.png (8 KB, 340x139)
what the frick someones getting 5 it seconds on linux

maybe i should switch to linux...
>>
>>107374775
year of the linux desktop will come about because of image gen AI
>>
>>107374773
Less cucked
>>
But is the z thing good for nsfw or artistic shit?
>>
>>107374773
westerners are retarded and bow down to fucking credit card companies of all people
>>
File: 1759296190286212.png (9 KB, 327x136)
leejet has the base code done before cumfart
>>
File: z-image_00046_.jpg (1.36 MB, 1664x2432)
>>
>>107374757
>>107374763
Are you sure it'll get danbooru forks any time soon though. I don't care about anything else
>>
>>107374794
there's no turbo lol
https://github.com/leejet/stable-diffusion.cpp/commits/z-image/
>>
>>107374800
as soon as they release the base model™ probably
>>
File: ComfyUI_11584_.png (1.53 MB, 1024x1024)
>>
>>107374804
are you blind? the pr is in the image I attached
>>
>>107374775
What model is that for? Nobody is getting 5it/s on Qwen Image with an RX 6600. That's probably SD1.5 or something. But you should switch to Linux anyway.
>>
>>107374794
>>107374819
yeah this is still referring to turbo. i thought for a sec we got a 99% confirmation ZBase is gonna be released locally
>>
>>107374814
>bitch come back with my vespa i don't know you
>>
>>107374821
yes sd1.5
i guess i cant get any faster than 200s/it
>>
Z-Image is too prude. NSFW finetune when?
>>
File: ComfyUI_11585_.png (1.07 MB, 1024x1024)
>>
File: 1758606825560171.png (1.41 MB, 1280x720)
>>107374755
>>
>>107374826
? it almost seems to be referring to both
if z-image from 2 hours ago is turbo why not refer to it as such like before?
im so down bad bros holy fuck
>>
>>107374850
>im so down bad bros holy fuck
same, I can't see myself being this excited for like one week, I'll die of stress lmao
>>
>>107374794
cumfart btfo by C chads
>>
File: z-image_00048_.jpg (1.28 MB, 1664x2432)
>>107374840
same
>>
Why does every Z LoRA on civit look plastic?
>>
>ani ends up getting z base support minutes before comfy
>>
>>107374841
heh
>>
>>107374870
that's fantastic
>>
File: ComfyUI_11592_.png (1.52 MB, 1024x1024)
>>
File: 1754966969149441.jpg (128 KB, 960x960)
Chroma 2 on Z?
>>
>>107374870
Really hope base has better feet, toes often get mangled when it's not a super basic pose
>>
Wishing all SaaScucks a pleasant day
>>
The era of SDXL didn't last long enough. I wanted to play with it for half a decade. My SDXL prompting skills... they're not sneeded anymore
>>
Stone all footfags to dead
>>
>>107374908
Not even a footfag, but a messed up mutant foot can ruin a gen
>>
>>107374896
Chroma Z
>>
ChromaZomes
>>
>>107374889
still z?
>>
File: 1740346816743976.png (1.46 MB, 1280x720)
>>107374841
>>
>>107374930
fucking kek
>>
File: adolonrez.png (1.13 MB, 1216x832)
z on forgeUI wen?
>>
File: z_based.png (138 KB, 1443x244)
>>107374873
>Why does every Z LoRA on civit look plastic?
Welcome to the harsh reality. It was local's fault that we ended up with slop.
>>
>>107374877
C chads rising up. 2026 death of comfyui
>>
File: shifty.jpg (2.13 MB, 2560x1149)
i don't see a big difference with shift node bypassed versus shift 7 on the same prompt but if anons itt say there's a difference it literally doesn't matter to me to use shift 7 to satisfy their autisms


>>107374850
>? it almost seems to be referring to both
no. check lee's comment in the related Github issue
>>
File: z.png (1.17 MB, 896x1152)
>>
>>107374942
it's in neoforge. hard to tell with all the comfyui propaganda in the op
>>
File: FINALLY.png (49 KB, 301x168)
>>107374946
>train only on real images
>gets the most kino model in existence
WAS IT SO HARD????
>>
File: psxcheeks_0015.png (1.66 MB, 896x1152)
wherever that anon is that asked for a woodcut/lino lora, it's coming up tomorrow. i'm going to go have sex with a woman tonight.
>>
>>107374873
most trainers fill their datasets with slop like retards
>>
>>107374960
my sides, thank you chinkies
>>
>>107374976
don't forget to tip
>>
File: 1744190295856235.jpg (1.08 MB, 4096x1278)
>>107374957
you can see it more on close up humans, the skin texture tends to get old on shift = 3
>>
>>107374976
>i'm going to go have sex with a woman tonight
phwoar, take pictures and train on them
>>
File: 1764442420630016.png (1.22 MB, 1584x672)
>>107374942
Ehemmmm
Day 1 support, also
HiresFix
Img2img
Loras
>>
>>107374963
your shilling is very tiresome and cringe
>>
File: ComfyUI_11599_.png (1.39 MB, 1024x1024)
>>107374924
yep
>>
>>107374963
>comfyui propaganda
says the guy spamming this image on every new threads now >>107375001
>>
>>107374969
next, throw out all the weeb slop and we'll have models generating images realer than real life
>>
>>107374957
i used it as a way to get rid of the faux jpeg artifacts when they show up and then i just kept it on all the time
>>
>>107375015
and uis that aren't grifty dogshit either!
>>
>>107374990
are you changing the max shift variable?
>>
>>107374946
this paper will be a giant wake up call for all those slop companies, if they managed to make this insane model at 6b, you can be sure Google is reading this paper like it's the bible, prepare for a series of kino models in the future, this is only the beginning
>>
>>107375027
max shift shouldn't be touched i think.
ModelSamplingAuraFlow = shift value FlowMatch Euler Discrete Scheduler (Custom)
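
For anyone wondering what the shift value actually does, a minimal sketch: flow-matching samplers commonly remap each sigma in [0, 1] with shift * sigma / (1 + (shift - 1) * sigma), which spends more of the schedule at high noise as shift goes up. That this is exactly what the ModelSamplingAuraFlow node does internally is an assumption here, and the linear base schedule and function names are illustrative only.

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Common flow-matching time shift; shift=1.0 leaves sigma unchanged."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list[float]:
    """Linear sigmas from 1.0 down to 0.0, remapped by the shift (illustrative)."""
    base = [1.0 - i / steps for i in range(steps + 1)]
    return [shift_sigma(s, shift) for s in base]


for shift in (1.0, 3.0, 7.0):
    sigmas = shifted_schedule(8, shift)
    print(f"shift={shift}: " + " ".join(f"{s:.3f}" for s in sigmas))
```

The endpoints stay at 1 and 0 for any shift; only the middle of the schedule moves, which would fit the anons who see the difference mostly in fine texture rather than composition.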
>>
File: 1757421785815326.png (41 KB, 902x477)
>>107375027
>the max shift variable?
the what? I'm only using the shift node from Comfy's official workflow
>>
>>107375030
google is leagues ahead of everyone else
>>
File: file.png (1.12 MB, 1024x1024)
test slop
>>
>>107374963
>>107375001
ty both I didn't even know neo was a thing. I'm like a quarterly tourist that comes by every time something big enough to pull me away from the last thing is released. Last time was illustrious / big lust.
>>
File: psxcheeks_0018.png (1.41 MB, 896x1152)
>tfw forgot to change output file names

>>107374988
kek

>>107374997
there's already a bbw lora
>>
>>107375043
that doesn't mean they don't want to scale down their products, smaller models = less costly
>>
>>107375043
in bloatmaxxing sure
>>
>>107375040
>Comfy's official workflow
all of comfy's workflows are cursed slop. you'd think comfy would know how to use his shitty UI at this point
>>
File: 1735506433284498.png (162 KB, 478x521)
>>107375040
oh aura flow. i thought you've been posting about the shift in the flowmatch euler node
>>
>>107374960
This white girl is going to be a meme within a few weeks, if you describe somebody with generically white features she shows up every single time.
>>
>>107375011
At least I'm not haoming02 himself.
ComfyAnon is a member of the /ldg/ Discord and has been coming here to shill himself for years.
>>
>>107375060
>>107369419
>lemao, actually you don't need that custom node, the "Normal" scheduler is the exact same as the EulerDiscreteScheduler, the more you know
>>
File: ComfyUI_11603_.png (1.19 MB, 1024x1024)
>>
>>107375059
Please fuck off with your gradioshit spam. Nobody wants to hear about it.
>>
File: ComfyUI_00149_.jpg (528 KB, 1280x2048)
>>107374990
I genuinely cannot tell the difference in skin tone between any of the pictures in that example.

you're using 16 steps and I'm using 9. maybe its more noticeable at higher step counts? (but again I'm not trolling when I say I can't tell the difference in the will smith example you shared)

>>107375017
>i used it as a way to get rid of the faux jpeg artifacts when they show up and then i just kept it on all the time
well it doesn't seem to affect gen time so like i said i'll keep it at 7
and im converting to jpg before posting to save bandwidth so it may literally not matter in my case
>>
File: 1748207390545006.jpg (520 KB, 1250x1566)
>>107374946
But I said this first though...
>>
>>107374794
>write an inference library
>in C++
>somehow it's 10x slower than ComfyUI which is written in python.
The level of incompetence is staggering.
>>
File: 1764004524118162.png (3.41 MB, 1280x2048)
it took 8 attempts to get (arguably) 5 toes on each foot and not more from this position (even though I'm coping on the right foot). I hope it's just a turbo issue
>>
>>107375030
Nobody will catch up to Google for genning, it's impossible. They have access to pretty much all the video, image, and audio data they will ever need while they block everyone else from scraping it.
They don't even need to scale their models down like you think because of their TPUs. Absolute broken company when it comes to AI.
>>
>>107375001
>>107375045
Also pro FOSS
no SaaS API bullshit
He doesn't live off his UI, he does it for fun.
No investors involved
A Chinese person (same as the SOTA local models nowadays)
>>
>>107375108
it's actually the same speed looking at the benchmarks. impressive since ggerganov never made diffusion functions at all
>>
>>107374794
>lejeet
this can't be real
>>
>>107375121
if you actually believe this why are you not putting your money where your mouth is and owning GOOG?
>>
>1 hour 40min
>140 replies
STOP BEING POPULAR!
>>
File: ComfyUI_00696_.png (1.43 MB, 1152x896)
>>
>>107374957
>but if anons itt say there's a difference it literally doesn't matter to me to use shift 7 to satisfy their autisms
based
>>
File: ComfyUI_11609_.png (1.65 MB, 1024x1024)
>>
>>107375155
kek real
>>
>>107375170
B-but she is minor!!
>>
File: 1748899084100516.png (1.46 MB, 1280x720)
>>107375137
>LeJeet
>>
>>107375146
Sure just give me $100M to train some models and I'll make it happen, kek.
>>
File: Zurbo_00023__resized.png (3.77 MB, 1936x1437)
Zurbo, huh?
Any neat workflow tricks I should know about?
>>
>>107375192
>Sure just give me $100M to train some models
on their paper they said that the training cost 600k dollars
>>
>>107375155
turbo trains fine. base is pointless
>>
>>107375202
Skin sexo!
>>
>>107375202
it runs on other uis so you don't have to enable the comfy grift
>>
>>107375210
shutupshutupshutupshutupshutupshutupshutupshutupshutupshutupshutupshutupshutupshutupshutup
>>
>>107375210
shut up, base will have more concepts and will be easier to finetune, I don't want to experience the Flux era over again
>>
File: ComfyUI_00165_.png (3.17 MB, 2048x1280)
>>107375192
you didn't understand what I said at all

oh maybe you interpreted "owning" as in "pwning them", no silly I literally meant buying Google stock (ticker GOOG/GOOGL) if you think they will win the AI race. it's the only way for a commoner to get exposure to SpaceX as well which is nice
>>
File: ComfyUI_ZImage_00089_.png (1.18 MB, 1152x896)
Soulless copy of a piece by Bruno something by prompt matching.
>>
>>107375231
Because of your preference of woman I suppose that you are white and live in a first world
>>
>>107375125
sorry im giga normie Idk what the significance of SaaS API's are, or what you mean. I get the idea of a freelancer just making a UI because he loves it, that does sound appealing. Idk what the downsides of forgeUI or its forks are though
>>
File: SHUT UP.png (1.4 MB, 1280x720)
>>107375210
>>
>>107375005
would you kindly share the prompt/settings for this? thanks anyway!
>>
File: bruno-larsson-v1.jpg (550 KB, 1920x1189)
>>107375238
Original
>>
File: ComfyUI_00168_.jpg (526 KB, 2048x1280)
>deformed feet can easily ruin a gen
i feel that :(

>>107375241
>Because of your preference of woman I suppose that you are white and live in a first world
that is indeed my "childhood trauma" as discussed two threads ago >>107371549
>>
I need an ETA on when I can prompt pornographic content in the style of Murata Range with Z
2 days or 2 weeks?
>>
>one autist spams a hundred of his mudnigger gens for ten threads straight
Actually I changed my mind, ZImage sucks can we have Flux2 back?
>>
>>107375270
Oh, good intuition I have
>>
>>107375264
this is fucking slop that doesn't understand what vaporwave aesthetic actually is. the ZImage isn't any better but don't call this garbage "kino" just because a human made it
>>
File: 90f.gif (2.87 MB, 374x280)
>>107375231
>>
File: hmmm.jpg (1.15 MB, 2048x2048)
https://www.reddit.com/r/StableDiffusion/comments/1pa1oo6/nano_banana_synthid_exposed/#lightbox
Do you think Flux 2 also has this gay ass watermark?
>>
>be white
>live in the first world
>settle for fat third world brown women
Odd choice.
>>
>>107375292
Why are you always so hateful?
>>
>>107375292
that's faux vaporwave aesthetic.
same thing happened to the cyberpunk aesthetic, being pigeonholed into the lameass miami synthwave aesthetic
>>
>>107375281
2 more weeks
>>
File: file.png (92 KB, 1474x560)
>>107374621
don't do that, they'll send their men, they're in the business of SAFETY
>>
>>107375292
this
>>
File: file.png (1.31 MB, 1152x896)
>>107375292
k
>>
>>107375243
The A1111/Forge ecosystem is just dead. There's a bunch of forks in various different states that branched off at different times. You have extensions that work on one fork but not on another.

The only ones still using them are tourists who are too stupid to use ComfyUI.
>>
>>107375284
ackshually i'm doing sandniggers now
i was thinking about saying something like "I wish 4chan entered the 2010s and supported multiple images per posts" but if it's pissing you off so much maybe I'll wait a bit longer to go buy weed and keep making some more thick brown women just to piss you off

>>107375297
kekked

>>107375301
>settle
who's settling? With AI i get all the yumminess of brown women with none of the cultural problems. I can literally have my cake and eat it too

>>107375304
>Why are you always so hateful?
I'm white and I live in the first world. I'm sure you understand
>>
Any good long video workflows for either wan 2.1 or 2.2? Preferably that doesn't have the sudden jerky movement every 81 frames and doesnt fry the input image. I've tried so many bros..
>>
3d sloppers are back
Pedogooners are back
Anime sloppers are back
VRAMlets are back

Newfag here, is this what real local improvement feels like?
>>
>>107374930
kino
>>
>>107375305
>same thing happened to the cyberpunk aesthetic, being pigeonholed into the lameass miami synthwave aesthetic
okay well if we're going back to the 80s all cyberpunk became slop the minute Billy Idol made it uncool with that stinky album
>>
File: z-image_00057_.jpg (1.12 MB, 1664x2432)
>>107375301
someone is triggered lmfao
>>
>>107374773
plenty funding, plenty math PhDs, no obsession over safetyism to the detriment of model quality, and generally left alone by the ccp to do their thing (for now)
>>
>>107375319
>The A1111/Forge ecosystem is just dead
nice try comfy shill. the wane has begin. kill all grifters
>>
>>107375060
what's this? I have no idea.
>>
>>107375208
>600k
So it's over for z model finetunes, isn't it.
Nobody but furries would have that kind of money to finetune z image, and if chroma, neta and illust finetuning are any numbers to go by, it would cost about 120k to 150k to finetune this shit
>>
>>107375321
I dunno man I'm also white and live in the first world yet I prefer my women white (or yellow).
>>
File: holy shit.jpg (2.77 MB, 4626x1971)
Jesus christ this model is a fucking beast
>>
>>107375325
If your model doesn't accommodate for VRAMlets it's dead in the water. Nvidia is still selling 8GB cards today, we're stuck with that for at least half a decade
>>
>>107375319
The only surviving fork is NeoForge and it works from day 1 with the ZiT model and it's updated every day.
NeoForge has all the facilities of image gen with SDXL and also you escape the python dependency hell that Comfy has.
Also Haoming fixed memory management when he added Neta Lumina.
With Gradio you can say goodbye to python slop dependencies and broken updates.

You centralize all the models in one stable workflow.
>>
File: file.png (581 KB, 1136x654)
ok i am impressed by that result
it took me 20 minutes to get that but it kind of worked

remove the black background, the new background is white, and make the object lit with studio lights


its like 90% of the way there
>>
>base will be released before the weekend
REEEEEEEEEEEEEEEEEEEEE WHERE
>>
>>107374924
what's z?
>>
>>107375170
very cute
>>
>>107375355
I like forge and all but it's still pythonic slop. just less so than cumfart
>>
>>107375325
hey blame nvidia and amd
>>
File: z-turbo_00069_.png (2.59 MB, 1024x1536)
>>
>>107375351
The P reused as R is a nice touch but it's missing the N at the end.
>>
>>107375349
600k to pretrain the model, meaning the finetune will be cheaper
>>
File: 1733277360791202.png (294 KB, 640x320)
folding@home but finetuning lude models?
>>
>>107375351
But can it do people performing uncommon tasks? Like inflating a balloon
>>
>>107375319
>You have extensions that work on one fork but not on another.
comfy is already showing signs of this. so many custom nodes just don't work anymore
>>
>>107375358
7am Sunday in China right now tick tock chang
>>
>>107375301
>settle for fat third world brown women
Yes, I can make all of the chubby fat ass Indian women I want. Chroma is a gift from the heavens.

>>107375321
Don't listen to him, he's a fruit.
>>
>>107375370
Can you do one that has Jesus shoving her off a cliff and a God in the sky saying "never goon"?
>>
>>107375358
who said that? they said "before the weekend" when that guy asked for both the base model and the prompt rewriter, it's probably the prompt rewriter we're gonna get today or tomorrow
>>
>>107375358
>>107375380
they never said the base model is coming before the weekend, but the prompt enhancer, retards
>>
>>107375333
damn unc you got any more stories from the days of yore
>>
maybe this new z-image will save the nuclear wasteland that is /hdg/
>>
>>107375321
Everybody here has a folder with hundreds of gens of their exact preferred goon material, the difference is that most of us understand that it is OUR preferred goon material and not everybody's.
Maybe tone it down to two or three gens per thread.
>>
>>107375392
>>107375393
But we already have the prompt rewriter prompt.
>>
File: 1758893308518646.png (11 KB, 177x137)
what if he made Chroma Z but excluded all the shitty diaper fur images?
>>
>>107375334
https://www.youtube.com/watch?v=FAmqpd3JANA
>>
>>107375405
>But we already have the prompt rewriter prompt.
so they did their mission, they released the prompt rewriter thing before the week end, that was it
>>
>>107374621
I've been away. Does Flux 2 suck? I assume they carried over the anti-porn stuff, but can it even do vanilla stuff, like a big breasted woman in a swimsuit? Or do things like "big breasts" trigger their safety bullshit as well?
>>
>>107375411
It doesn't matter what he includes as long as the result looks good. And the huge difference between schnell and (future) zimage base is that zimage will be way less of a pain in the ass to finetune.
>>
>>107375373
Yeah, but iirc chroma took 250k to finetune flux which was a 12b parameters, neta cost 60k for a 2b model, so 120 to 150k seems about right to finetune z
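
The estimate can be reproduced with a straight-line interpolation between the two data points quoted here, as a back-of-envelope sketch only. The 2b/60k and 12b/250k figures are the ones from this post (recalled "iirc", so unverified), the 6b size for Z comes from earlier in the thread, and assuming cost scales linearly with parameter count is a big simplification that ignores dataset size, epochs and hardware.

```python
def interpolate_cost(params_b: float,
                     small: tuple[float, float] = (2.0, 60_000.0),
                     large: tuple[float, float] = (12.0, 250_000.0)) -> float:
    """Linear interpolation of finetune cost (USD) by parameter count (billions)."""
    (p0, c0), (p1, c1) = small, large
    slope = (c1 - c0) / (p1 - p0)  # dollars per extra billion parameters
    return c0 + slope * (params_b - p0)


z_estimate = interpolate_cost(6.0)
print(f"6b estimate: ${z_estimate:,.0f}")
```

That lands inside the 120k to 150k range above, for whatever a two-point fit is worth.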
>>
File: z-image_00060_.jpg (1.54 MB, 1664x2432)
>>107375416
>>
>>107375309
Well, I know which email address to flood with useless comments, to drown out genuine safety nanny posts
>>
File: file.png (74 KB, 236x157)
>>107375351
first time a non-pornographic gen made me cum.
>>
>>107375355
I saw this other fork being shilled on reddit:
https://github.com/maybleMyers/chromaforge
It has ramtorch so it's probably better than the neoforge fork.
>>
>>107375427
>chroma took 250k to finetune flux which was a 12b parameters
chroma is a 8.9b parameters model, but he used a lot of that training to undistill the model first, here the model is already undistilled so there's that
>>
File: 1739708920708299.png (1.51 MB, 1024x1024)
>>107375425
i dunno. i suspect his dataset is poisoned, and it's why you often get those images that look half way between realism and cartoons.
>>
Does Ostris support training from existing lora weights?
>>
>>107375423
>carried over the anti-porn stuff
yes of course, they're big on that
>Pre-training mitigation. We filtered pre-training data for multiple categories of “not safe for work” (NSFW)
>>
File: file.png (57 KB, 440x396)
>>107375392
>>107375420
>>
File: ComfyUI_00174_.jpg (540 KB, 2048x1280)
>>107375350
>I dunno man I'm also white and live in the first world yet I prefer my women white (or yellow).
Assuming you are who you are, I'm assuming you somehow picked up the asian beauty standard and that's why you like what you like.

For asians I split the difference with Filipinas/SEAs for brown, or Slavs for white. The actual bug eyes are a turn off for me, can't control it. Slavs and SEAs basically give me everything I want from the asian phenotype anyways

Western Civilization's standard of beauty (for all ages including children of both genders btw, read the Greeks) is based on Ancient Persia. (Ancient Persian women had pink nipples and a lot still do today even after getting raped by arabs.) There is no way you can have a classical education and not come to this conclusion.

>>107375396
Dude I'm probably younger than you lol I just know that cyberpunk didn't really predict the Internet which is why it's not that accurate of a description of the dystopian future anymore. Wealth inequality is mostly due to greed and not technological progress nowadays

>>107375400
There is literally nothing you can do about it. Imagine being this much of a faggot trying to tell others what to do instead of ignoring or filtering the things that make you upset.
>>
>>107375352
I'm a VRAMlet and I can run Flux 2 just fine. Running it right now, doing batches even while surfing the web with multiple tabs open and 4K porn videos paused halfway through. VRAM is overrated, system RAM is underrated for AI thats why its flying off the shelves right now. why spend 3000 when you can spend 300

>but muh gen took 3 minutes instead of 3 seconds what will i ever do
>>
>>107375399
/hdg/ refugee here?
>>
File: 1740023565326180.png (1.83 MB, 1280x720)
>>107375351
I asked claude to rewrite the prompt so that it's about a miku game instead and I got this absolute kino, holy moly...
prompt: https://files.catbox.moe/2radeu.txt
>>
>>107375465
im really tired of /g/ censorship. we see all kinds of ** and extreme anime hentai in the collages but god forbid my virgin eyes view a nipple
>>
>>107375472
its been a year since i took refuge here, and every time i look back i feel sad
>>
>>107375461
OMG ITS HAPPENING
https://www.youtube.com/watch?v=xb2fjZa_L74
>>
>>107375465
Be a dear and post with a trip so I can filter you then
>>
>>107374896
>>107374918
Just take the porn dataset from chroma and tune it on Z
>>
Friendly reminder that Chinese research teams are notorious for reneging on promises to release models if they feel they can now profit off them instead.
>>
let the chinks eat breakfast first goddamn chill
>>
>>107375465
Are you able to gen well with pillars in temples and similar types of areas like that? I have found models struggle with such ideas. An example:
blonde between two pillars on the left, between pillars in the middle is a brunette, and between the pillars on the right, is a redhead.

You should add other things, the redhead is sitting on a skiff. The blonde is balancing a pot on her head. The brunette is riding a dragon.
>>
>>107375461
Wtf does he mean by "i guess". Who is managing that Discord lmfao.
>>
>>107375485
What happened? The schizo nation attacked?
>>
>>107375461
>before this weekend
but we're already on the weekend?
>>
File: 1742953037440607.png (1.42 MB, 1291x1207)
>>107375482
the jannies do be like that doe
>>
>>107375498
is we getting agi?

I literally don't know what people are waiting on, I was out of the loop for like 3 weeks, busy with work.
>>
>>107375499
Try it, nigger.
>>
>>107375502
in general its strange with these sorts of teams where you dont really interact with them but some kind of gayass community manager
id rather talk to the guy who decided to put nipples in the model himself so i can thank him
>>
>>107375469
>3 minutes
That's ridiculous when only like 1/5 images are good. Might as well draw them yourself in that time
>>
>>107375496
It's a red flag they're planning on giving us base but are keeping the edit model hostage. They never said anything about open sourcing it, and there's no guarantee they won't just give us the distilled edit model as opposed to full version.
>>
>>107375455
Figures. What's the best thing to use nowadays? I've got a 4090 and 64GB of DDR5 system RAM.
>>
>>107375001
If only neoforge had memory management as good as OG forge. Instead the dude just commented out the part of the code that figures out how much to load into GPU memory and said fuck it, if lowvram then throw everything in RAM.
>>
>>107375496
Member when we was getting wanchaku before qwen? I member..
>>
>>107375506
any or all of this could be translation issues, it's happened before
>>107375510
chinks are in the process of saving local and may deliver but no one is for sure
>>
File: ComfyUI_00179_.jpg (573 KB, 2048x1280)
>>107375490
Why? I don't respect your beliefs or the way you use this website, and I don't need to. Learn to ignore what you don't like or leave, or keep seething

>107375499
I fucking hate you so much scabnigger you shit up every thread on this board I see you in. I'm not reading your post.

>>107375508
This makes a lot of sense once you remember that /v/ is just used to trade "hidden images" inside other images with browser extensions
>>
File: 1742372441808924.png (102 KB, 1377x768)
>>107375525
>They never said anything about open sourcing it
anon...
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>
>>107375525
In any case, my takesies backsies detector is going nuts right now. I've been burned too many times.
>>
>>107375537
>This makes a lot of sense once you remember that /v/ is just used to trade "hidden images" inside other images with browser extensions
the sink threads?
>>
Reminder that impatientlets are mentally, and usually literally, brown.
>>
I'm losing my goddamn mind right now
>>
File: 00001-398427091.png (245 KB, 1024x1024)
>>
>>107375532
>may deliver but no one is for sure
Can't they make danbooru forks with the current Turbo version too?
>>
File: ComfyUI_temp_aygte_00007_.png (2.27 MB, 1344x1024)
>>107375405
The prompt rewriter is a prompt? Is it like neta where you need to include the "You are a helpful..." shit?
>>
File: metal-gear-anguish.gif (2.38 MB, 640x360)
>>107375505
so much irony posting... so many schizos... needless console wars.... less gens.... SCAT SPAM...
>>
Reminder that patientlets are mentally, and usually literally, brown.
>>
File: Z-Image turbo.png (1.68 MB, 1280x720)
>>107375461
>>107375477
Are you ready anons? My body is ready!
>>
File: 1735478352071117.jpg (70 KB, 990x936)
>>107375564
White = higher iq = capacity for future thinking, delayed gratification, and comperhending what per capita means.
>>
Reminder that whites have jobs and can afford a top of the line gpu so they don't have to wait.
>>
>pre-SD1.5 dark ages
>2022 - SD1.5 dawn of local generation
>2024 - SDXL forks golden era
>2026 - Z forks platinum century
Am I getting the timeline right
>>
z-image is the deepseek moment for imgen
>>
>>107375511
My prompt is just an example. The point is that models suck when it comes to pillars, mostly. I have tried. You haven't tried. just set image size really small and see if it can do it. You'll find it totally doesn't get it.
>>
>>107375461
Do you think they expected their model to get this much hype? I'm sure they saw they had made something special, but sometimes you can make a great product and it won't blow up.
>>
>>107375586
Which gpu comes with the weights of z image base embedded in its vram, jamal?
>>
>>107375588
what's z-image, and does it do this:
>>107375499
>>
File: bigsleep.png (454 KB, 512x512)
>>107375587
you're missing the kino that was pre-SD
>>
>>107375588
stupid comparison
>>
>>107375351
I guess that using a text encoder (Qwen3-VL-4B) that can handle 256,000 tokens helps a lot lol
>>
>>107375587
Between each of these are obscure models that are sorely missed.
>>
>>107375607
it's a good comparison: a Chinese lab nukes the AI space the model is in, given how good it is for how efficient it is
>>
File: 1432289151.png (1.04 MB, 1344x768)
>>
>>107375587
Come on, Flux has its place in there
>>
File: file.png (742 KB, 1279x907)
>>107375599
>>107375499
i cheated and enhanced the prompt
>>
>>107375372
Wow, I didn't even catch that at first. Really cool stuff. With a bit of a touch up, it's perfect for many cases.
>>
>>107375628
flux, chroma, and noobai, all deserve mentions.
>>
>>107375618
indeed
>>
File: 1504081869.png (1.08 MB, 1344x768)
>Training will take ~9hr
Welp, time to chill I guess.
>>
I don't like indians, and I don't want them to be allowed to use electronics.
>>
i made a tung tung tung tung sahur lora for zturbo
>>
>>107375651
Wasn't the guy who released SD an Indian?
>>
>>107375351
can u pls post fullsize of the zimg
>>
File: 1740309304380633.png (2.07 MB, 896x1152)
>>107375667
>can u pls post fullsize of the zimg
>>
>>107375655
beautiful
>>
>>107375664
no, he is a brownoid from England, the most Muslim country on the planet
>>
File: 00011-883159584.png (3.02 MB, 1536x1280)
>>107375499
>>
>>107375615
what does this mean? enlighten me. i am tech illiterate. what are the different CLIP encoders and how have they advanced and is the Qwen one ultra efficient like the Chinese are breaking everything
>>
when using ai-toolkit, nothing happens when I hit start training. what am I doing wrong?
>>
>>107375673
the one in the comparison plox
>>
File: z-image_00063_.jpg (1.53 MB, 1664x2432)
>>
>>107375655
Omg epic 6767
>>
>>107375679
clip can handle 77 tokens, T5 can handle 512 tokens, Qwen 3 VL can handle 256k tokens, it means you can write a fucking bible and it won't lose track lmao
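
A rough sketch of what those limits mean in practice, assuming a plain whitespace split as a stand-in for a real tokenizer (real tokenizers split text differently, so actual counts will not match). The table uses CLIP's 77-token window and takes the T5 and Qwen figures quoted here at face value. `LIMITS` and `tokens_dropped` are made-up names for illustration only.

```python
# Context limits in tokens. CLIP's window is 77; the T5 and Qwen numbers are
# the ones quoted in the thread and are not verified here.
LIMITS = {"clip": 77, "t5": 512, "qwen3_vl": 256_000}


def tokens_dropped(prompt: str, encoder: str) -> int:
    """Return how many tokens would fall off the end of the prompt."""
    # Assumption: whitespace split, not the encoder's real tokenizer.
    tokens = prompt.split()
    return max(0, len(tokens) - LIMITS[encoder])


if __name__ == "__main__":
    long_prompt = "word " * 600  # a 600-"token" prompt under this toy split
    for name in LIMITS:
        print(name, tokens_dropped(long_prompt, name))
```

Anything past the limit is simply not seen by the encoder, which is why long descriptive prompts only pay off with the larger context windows.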
>>
>>107375664
he is Bengali
>>
File: 1098678832.png (1.82 MB, 1600x896)
>>
I HATE SPAGHETTI
>>
File: 0kektdpv1vm71[1].jpg (67 KB, 640x634)
>Sir, another open source Chinese AI model has mogged and upended the entire westoid coding base of pajeets
>>
File: ComfyUI_00701_.png (1.51 MB, 1152x896)
>>107375686
liking these
>>
File: ComfyUI_00806 - Copy.jpg (107 KB, 446x715)
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
>THIS IS NOT A DRILL
FRIEREN ON Z IMAGE
>>
>>107375692
In practice it does.
>>
>>107375693
>Bengal (/bɛnˈɡɔːl/ ben-GAWL)[1][2][a] is a geographical, ethnolinguistic, historically geopolitical region in South Asia, located north of the Bay of Bengal. Today, it is politically divided between the sovereign state of Bangladesh, the Indian state of West Bengal, and Karimganj district in the Indian state of Assam.
That's an Indian pretending not to be an Indian
>>
Here's z image
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

deport all the things.
>>
How are the character LoRAs for Z-Image Turbo so far, do they fuck up the anatomy like the ones for Flux Schnell?
>>
>>107375719
I know, go back to /pol/
>>
>>107375707
Looks the same as all the other 700 anime models
What's even the point
>>
>>107375707
76
>>
>>107375726
If I was from /pol/ I'd know exactly what every Indian state is called and wouldn't have to read Wikipedia
>>
>>107375729
>>107375729
>>
File: zimg_00008_.png (2.09 MB, 1280x1280)
ok Z image is too much fun and the quality is pretty great. also in comfyui can you make 3d models? i wanna 3d print more models
>>
>namefag
>>
>>107375694
lel he will never look like that or be a woman :^)
>>
File: 1125299015.png (1.2 MB, 1680x720)
>>107375707
But... the model already knew Frieren.
>>
my fucking lora is stuck on submitted
it will probably get refunded
>>
File: 1331637369.png (1.14 MB, 1152x896)
Although I guess without a lora some of the details are off.
>>
>>107375795
looks demonic enough to me. Does it need more demonic 76 bullshit?
>>
File: ComfyUI_temp_jtzda_00165_.png (2.05 MB, 1120x1440)
Enough ching chong girls. Prompt as follows (minus a detail), I wanted to see if Z-Image understands LLM-like descriptions.

>Virginia is a young vampire girl in a candle-lit antique living room of a medieval castle. She's sitting on a comfy sofa, showing off her thighs with a sultry expression. Her feet are on the ground. She's holding tight a tiny brown bear plushie on her lap.
>
># Virginia's appearance
>- Nationality: French, caucasian
>- Age: xx-year-old, with a youthful smooth face
>- Breasts: huge for her age, large enough to form a deep cleavage
>- Skin complexion: almost deathly pale
>- Eyes: gray
>- Hair: platinum blonde, very long, curly
>- Lips: Thick, pale pink and kissable
>
># Details of Virginia's outfit
>- An erotic black-and-purple gothic lolita dress with frilly lace details. The dress is sheer and very short.
>- Black lace choker with a silved inverted cross pendant
>- Black lace gloves
>- Black lace panties
>- Chunky Mary Jane shoes
>
># Brown bear plushie appearance
>- Evil grin
>- Round ears with beige interior
>- Black papillon
>>
I wonder if they are going to release the true base model or the one with SFT.
>>
>>107375299
flux who?
>>
File: z-image_00067_.jpg (1.43 MB, 1792x2176)
>>107375814
>Enough ching chong girls.
no
>>
File: 1756780250103050.png (2.11 MB, 1024x1536)
>>
File: vamp.jpg (562 KB, 1536x2048)
>>
>>107375814
https://old.reddit.com/r/StableDiffusion/comments/1p809wt/z_image_turbo_can_understand_json_prompting_very/

I've been prompting using 'fake' reasoning chains like "<think>blah</think>Final prompt:blah"
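
For anyone who wants to script it, a minimal sketch of both prompt styles, assuming you just need the final string to paste into your text encode node. The `<think>...</think>Final prompt:` layout is copied from the line above. The helper names and the JSON keys are hypothetical, since neither this post nor the linked thread title specifies a schema.

```python
import json


def fake_reasoning_prompt(reasoning: str, final_prompt: str) -> str:
    """Wrap a prompt in a fake reasoning chain, same layout as quoted above."""
    return f"<think>{reasoning}</think>Final prompt:{final_prompt}"


def json_prompt(fields: dict) -> str:
    """Serialize a structured description as a JSON prompt string."""
    # ensure_ascii=False keeps non-ASCII characters readable in the prompt.
    return json.dumps(fields, indent=2, ensure_ascii=False)


if __name__ == "__main__":
    print(fake_reasoning_prompt(
        "The user wants a temple scene, so describe pillars left to right.",
        "three women standing between stone pillars in a temple",
    ))
    # Hypothetical keys, pick whatever structure works for you.
    print(json_prompt({"subject": "temple interior", "lighting": "candle-lit"}))
```

Whether either format actually beats a plain paragraph is something to A/B on a fixed seed, not something this sketch shows.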
>>
new thread btw
>>107375729
>>107375729
>>107375729
>>107375729
>>107375729
>>
File: 00020-1676616692.png (1.96 MB, 1120x1440)
>>107375814
it keeps giving her bear ears for some reason
>>
>>107375953
ok otw
>>
>>107375351
Damn. Imagine coming back to image gen and getting this. Flux 1 felt like a step forward but also a step back in some ways. This feels like a true next gen.
>>
>>107375374
yeah why not
>>
>>107375040
there is also that clip text encode for lumina node, it has two preset modes (instructions) for prompt processing and gives slightly different results
>>
>>107374904
Agreed, SDXL (and illust/noob) still surprise me. 1.5/SDXL with ZIMG hires fix will be a good time.
>>
>>107375095
Lower shift can get a burnt toast/dirty skin pattern. Check the hands, it's easier to discern than the face in that image.
>>
>>107375355
Yeah, ONE workflow. For gacha and gooning that's fine.
>>
>>107374904
You still need SDXL to inpaint penises
