[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Portrait Model BTW Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107358368

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
first for zimage needs dick loras
>>
>>107360402
ur gay
>>
Blessed thread of frenship
>>
File: what.png (624 KB, 1378x1647)
624 KB
624 KB PNG
https://xcancel.com/bdsqlsz/status/1994336717587845601#m
what does he mean by this?
>>
>>107360388
No matter how much you spam this board, no matter how many models you shitters get, it will never get as good as real art and you will never fill the void with slopping computer-hallucinated garbage. Soon the novelty won't be enough and you're going to kill yourselves.
>>
youre the only one who cares about that twitter chink
>>
File: 1740057979364491.jpg (65 KB, 605x373)
65 KB
65 KB JPG
>>107360416
>>
File: 1758541997153488.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>
What model was used to make the top middle blonde girl? Didn't realize AI could make feet that realistic
>>
File: ZiMG_00130_.png (3.4 MB, 1344x1728)
3.4 MB
3.4 MB PNG
what can Z not do?!
>>
File: 1753054757973301.jpg (134 KB, 1169x1490)
134 KB
134 KB JPG
loras inc
>>
>>107360415
are you retarded? are you genuinely unable to parse the meaning? the base model wont be using noob's dataset (given they just asked for it), and would only be used if they planned to make specialized versions later on
>>
File: 1764123641647470.webm (3.93 MB, 674x1000)
3.93 MB
3.93 MB WEBM
Is there any way to gen a video like this from an image on GTX 1660S, even if it takes a long time?
>>
File: 1746668151162147.png (288 KB, 875x567)
288 KB
288 KB PNG
Z-Image Turbo
>16.7k downloads, 1.08k likes
>6.47% likes/download ratio

Flux 2 dev
>162k downloads + 681 likes
>0.42% likes/download ratio

AIEEEEEEEE
>>
>>107360446
No.
>>
>>107360443
thanks anon
>>
>>107360454
wow its almost like ignoring the vast majority of people using ai (vramlets) is a bad idea
>>
REMINDER: YOU ARE USING THE TURBO DISTILLED VERSION OF Z PLEASE READ THE PAPER SIR
https://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf
>>
>>107360455
I asked Grok and it said that it'll take about 20 minutes for 512x512 res
>>
File: ZiMG_00137_.png (3.72 MB, 1344x1728)
3.72 MB
3.72 MB PNG
>>107360438
>>
>>107360477
shut up monkey
>>
>>107360477
Speak up monkey
>>
>>107360446
Grok can, put in the img
>>
>>107360466
you won't have to ignore them if you are in with Nvidia to destroy local as a whole. do not support any saas whatsoever if you want a new GPU that's worth it in your lifetime
>>
>>107360438
show me widowmaker from overwatch
>>
>>107360477
not a single soul pretends they're using the base model lol
>>
>>107360454
why would anyone download flux 2?
>>
>tfw got 64gb of ram before the price gouging cause comfy was using a lot of ram for wan 2.2 Q8
thanks comfy.
>>
>>107360495
This, buy more nVidia GPUs. We need to support their current pricing as much as we can. Based
>>
>>107360440
I don't have particularly great expectations given that we are training the distilled model, but hopefully the results will be good enough quality.
>>
File: ComfyUI_temp_ahcuj_00043_.png (2.33 MB, 1088x1856)
2.33 MB
2.33 MB PNG
>>107360454
You can't even do img2img with flux.2 their scheduler doesnt allow it lmaoooo
>>
>>107360503
you would not believe how many have no conception of what distillation means
the ones not pretending simply dont know
>>
File: plz do the needful.png (351 KB, 760x696)
351 KB
351 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1p8m2lp/is_there_a_way_to_convert_old_loras_into_loras/
Saar can my SD1.4 lora be used on Z-image??
>>
File: ComfyUI_00092_.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>
where the fuck does ai toolkit save the loras by default? i didnt set any path for it
>>
>>107360497
>>107360521
jannies!
>>
>>107360500
It doesn't know any of the OW characters from what I can tell. The IP knowledge is really odd, it knows Tifa but not aerith
>>
is this training speed ok for a 4090

zturbo-niggerfusion: 4%| | 116/3000 [03:33<53:23, 1.11s/it, lr: 5.0e-04 loss: 2.256e-01]
>>
>>107360529
output folder.
>>
File: 1750365579842983.png (462 KB, 450x598)
462 KB
462 KB PNG
>>107360523
NOOO I NEED MY NAYANTHARA SEXY LORA
>>
File: ZiMG_00145_.png (3.73 MB, 1344x1728)
3.73 MB
3.73 MB PNG
>>107360500
no can do siree

what >>107360537 said
>>
>>107360523
It sucks people don't publish the datasets for their loras so others can train them on newer models.
>>
>>107360566
>childs
>>
File: 1738461325082693.jpg (511 KB, 1407x2111)
511 KB
511 KB JPG
>>
File: ComfyUI_00001_.png (2.46 MB, 1440x1440)
2.46 MB
2.46 MB PNG
>ynr remember sd3 not being able to gen people lying in grass or flux having no concept of things as basic as feet
We've come a long way bros
>>
File: 1736748934106859.jpg (1.92 MB, 2453x3680)
1.92 MB
1.92 MB JPG
>>
>>107360599
what a disaster holy shit. and then flux came out right after
>>
File: zimg_0032.png (2.58 MB, 1664x1216)
2.58 MB
2.58 MB PNG
>1500 steps
stopped training, not bad
>>
What are the best web based tools to generate models at this point? Do they compare to Comfy UI?
>>
>>107360438
a tachikoma
a velociraptor
>>
>>107360626
did you use the default settings?
>>
>>107360629
>What are the best web based tools to generate models at this point?
comfy ui unless you're a schizo
>>
>>107360529
it sends it directly to me, thanks
>>
>>107360626
For training a distilled model, it looks like it transferred surprisingly good
>>
The coolest thing about local diffusion (at first I was going to say the most dangerous) is that every new idea that leans erotic at all can instantly be turned into a complete fetish because you have the tools at your disposal to jerk off to anything you can think of. Ever since reading that Anna Khachiyan tweet about how the real white/PoC divide is pink nipples vs brown nipples I immediately became obsessed with genning 'borderline PoC' ethnic groups and prompting for light pink nipples... she is 100% correct btw

When I started writing this post I was going to act like this is a bad thing, but then I thought about it and actually this is a very good thing. This is the reassertion of the imagination against the sloppification of human sexuality by porn. If I didn't have this I'd just jerk off to stepsisters stuck in dryers or whatever, because that's all that there is. I feel more alive when I'm jerking off to mid asian girls with white woman tits, or when I discovered that I'm more aroused by the sight of a woman texting on her iphone than by any overtly sexual pose.
>>
>>107360599
it's funny how the history repeats itself
>SAI used to be based and then they released the disastrous SD3
>BFL gets created and replaces SAI with Flux 1

>BFL used to be based and then they released the disastrous Flux 2 dev
>That alibaba team gets created and replaces BFL with Z-Image
>>
>>107360629
comfyui is a web based tool. you can't run it without launching a server. my question is, where is the professional desktop apps that don't have telemetry
>>
>>107360666
>That alibaba team
they existed before and made some of the most popular llms for local
>>
>>107360666
BFL was never based. They've always had censored neutered models. There's a reason no one used Flux1 for NSFW.
>>
File: WanVid_00005.webm (810 KB, 960x720)
810 KB
810 KB WEBM
is onetrainer one of these shits that requires keys for gated models or can you just direct it to one you procured through other means
I know it's open source but I'm lazy
>>
>>107360440
the model is mid compared to chroma though. Maybe it's great for vramlets
>>
use comfyui to support saas for local. comfyorg cannot exist without taking your local gpus away from you so you pay for API nodes
>>
File: pope.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>preview looks like what i want for 1-2 steps then steers hard into some generic image
is this cuz it's a distill? I have to i2i chroma gens to get a pope without a white cassock.
>>
>>107360686
blind furry fucktard
>>
File: ZiMG_00154_.png (3.79 MB, 1344x1728)
3.79 MB
3.79 MB PNG
>>107360643
no gits no dinosaur specific knowledge sadly
>>
A noobai tier Z finetune will literally destroy dicks
>>
File: 1756996936866606.png (190 KB, 521x493)
190 KB
190 KB PNG
I've seen claims that Z-Image is overtrained and when trying to generate things out of distribution it quickly degrades in quality

Is this true?

Can it generate Hatsune Miku drinking UV fluid while sitting on the lap of a humanoid shrimp with the head of a T-Rex?
>>
>>107360694
seething vramlet kek
>>
>>107360686
You wished people would've been hyped by chroma like we're hyped by Z-image, the jealousy is off the charts lmao
>>
>>107360644
>>107360661
default settings indeed
https://civitai.com/models/2173607?modelVersionId=2447725

would love to see some feedback with your gens using this
>>
>>107360683 not one trainer, the other one that isn't diffusion pipe or musubi
>>
>>107360666
Not SD2? And SD3.5 was partially redeeming but couldn't salvage their rep.
Similarly BFL fucked up way earlier than 2.0.
>>107360676
Not the same team. Qwen Image is what the Qwen team made, also great but not as great. Tongyi is different guys.
>>
>>107360704
because you need more resources to run chroma and train for chroma.

People here don't have the cards. Z-Image is great if you call you can muster is 16 GB, or even a sad 24GB
>>
>ask z image turbo to make an image of milena velba topless
>it actually tries making her proper nipples
so close...
>>
>>107360664
I don't know. Porn art communities had a similar freeing effect on the development of unique sexualities and look where that led. What if every man has the potential to develop his own unique idiosyncratic sexual 'thing' comparable to furryism/etc.? Should we be helping him develop that?
>>
>>107360704
He's right. It's not good for NSFW, but that's fine, because it wasn't trained for. When it does get NSFW capabilities, it's going to kill Chroma 100%.
>>
File: zimg_0036.png (2.26 MB, 1664x1216)
2.26 MB
2.26 MB PNG
>a photo of a goth woman, bangs, black hair woman in micr-bikini in a kfc

i'm pretty happy with this for a 45min train on a 3090
>>
>post real images of women
>no one even questions if its not ai
ai won
>>
>>107360664
this post made me depressed
>>
>>107360497
>>107360521
You're a sick fuck
>>
>>107360626
qrd? what did you train?
>>
>>107360664
>I discovered that I'm more aroused by the sight of a woman texting on her iphone than by any overtly sexual pose.
you have a courtship disorder.
https://en.wikipedia.org/wiki/Courtship_disorder

actually, I bet many anons in this thread do as well
>>
>>107360699
Kinda. The speed is what's impressive. Can't do NSFW beyond topless girls.
>tranny jannies aren't letting me upload images due to "abuse"
It looks pretty good ill try tomorrow.
>>107360733
who cares women aren't real women anyways.
>>
>>107360686
bro your izzat just went down.
>>
>>107360716
>Tongyi is different guys.
they aren't new faggot they made glm. you are fucking stupid
>>
File: 1759309708986201.png (2.36 MB, 1280x1280)
2.36 MB
2.36 MB PNG
>>107360699
>Is this true?
not true, I haven't seen a weird image at all, it allways looks good
>Can it generate Hatsune Miku drinking UV fluid while sitting on the lap of a humanoid shrimp with the head of a T-Rex?
the prompt adherence isn't godlike, we'll have to wait for the base model and get to use CFG to get something as complex as this
>>
File: 1745449725364553.png (1.75 MB, 1280x2048)
1.75 MB
1.75 MB PNG
>>107360699
>I've seen claims that Z-Image is overtrained
It's true
the typical chinese superficial work
>>
This is my hatsune miku gen if someone wants to replicate it
>focus of the image is hatsune miku drinking glowing antifreeze out of an automotive container that says "antifreeze" on it. The antifreeze is glowing bright teal. Miku is sitting in the lap of a giant shrimp, with it's arms around her. The shrimp is on the head of a T-Rex which is out of focus. photorealistic.
>1024x1024 euler simple
>seed:610241634774802

>>107360686
It does photorealism better, has better prompt adherence, and uses less vram. NSFW will come soon.
>>
>>107360699
>overtrained
Stop using words you don't understand
>>
File: ZiMG_00161.jpg (1.62 MB, 1344x1728)
1.62 MB
1.62 MB JPG
>>107360705
niceeeeee
>>
>>107360751
Ok thanks for telling me what a Jew whose claim to fame was measuring blood flow to gay men's penises thinks about my feelings
>>
>>107360759
GLM is Zhipu.ai, ZAI. They only make llms, nothing else. Stop being disingenuous, nigger.
>>
>>107360788

>https://civitai.com/models/2173607?modelVersionId=2447725

Catbox the uncensored?

>>107360705

Also, did you caption for tattoos in the training?
>>
>>107360416
https://realorai.dev/
>>
File: zimg_0044.png (2.69 MB, 1344x1728)
2.69 MB
2.69 MB PNG
>>107360746
trained a lora on angel youngs body (face removed) for zimage, 18 well-captioned images, 1500 steps with ai toolkit, default settings.

>>107360788
heartbreak and joy when someone uses your own lora better than you can
>>
>>107360821
Nice
>>
File: 1764341324572859.jpg (555 KB, 1080x1270)
555 KB
555 KB JPG
thank you for supporting comfyui and the acceleration of wafer shortages for local!!!! you guys are so great!!!! please subscribe to comfy cloud!!!!
>>
File: sar.png (11 KB, 503x107)
11 KB
11 KB PNG
>>107360523
hello saar, why my qwen and flux loras dont work with z-images I need them to generate boobs and vegene saar
>>
>>107360767
>>107360771
insane mogging
>>
>>107360831
gpu prices about to soar, again
>>
>>107360751
>you have a mental disorder anon! this disgusting pedo jew from the tribe that views sucking baby penises as normal wrote so decades ago!
perfect goy
>>
>>107360831
>I need 80 trillion RAMs
Sam Saltsbergmann's, 2025
>>
>>107360842
>insane mogging
-> >>107360767 for that one I asked claude to make the prompte more detailled, I translated the prompt to chinese and I went for 50 steps, that's the absolute limit you can push that model on prompt adherence
>>
>>107360831
meds
>>
File: zimg_0011.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
>>107360814
nope tatts are not captioned
>>
>>107360821
Might want to be careful Civitai get really autistic about real people loras now, even if the face is cropped out
>>
>>107360861
they can take it down, i just wanted to toss it to y'all
>>
>>107360446
>>107360718
i can run chroma on my 3060
and i can run it at ok speed (1min per 20step pic i think, maybe im misremembering (definitely under 60s). )
so sucky sucky my pp
chroma's better for nsfw yes of course. i dont care cuz i never jerk off to ai porn because it's like reading a book you wrote. not fun.
>>107360485
yea its possible
maybe try SDNQ quants of wan, or ggufs
you'll need a bit of ram tho..
>>
>>107360833
kek
>>
>>107360875
nipples still look weird. were there any close ups of the breasts in the training data? either detailed nipples are hard to train or it's because of the low resolution making them look like that.
>>
>>107360751
Ok I'm humoring you by actually reading this and what the fuck are you talking about? I don't have any of the weird -philias he's describing as typical of this spectrum of disorders. Did you even read my post? I said I find it erotic to watch a woman use her phone. That's because a woman's mind is more engaged and she's less self-conscious, and I am attracted to women.

He's talking about guys who want to rub their crotch on women, rape them, watch them undress, pedophiles, etc.

Try to imagine that this is 100 years ago. There are two men. One of them says he fell in love with a woman while watching her playing a game of Whist or whatever. The other one says that he likes to imagine big fat nuns with breasts the size of boulders naked and contorted like pretzels, drooling for big black penis. You just diagnosed the first guy as the weird pervert
>>
omg I'm gonna traaaaaaaain
>>
File: ZiMG_00169_.png (3.6 MB, 1344x1728)
3.6 MB
3.6 MB PNG
>>107360918
Im not the trainer.

>>107360821
he is
>>
>>107360936
>Im not the trainer.
then im not asking you
>>
File: 1755707552680206.png (638 KB, 576x512)
638 KB
638 KB PNG
>>107360565
Make a website that can host petabytes of storage that won't delete stuff when people complain then
>>
File: 1744571129562373.png (742 KB, 924x1470)
742 KB
742 KB PNG
Does AI-Toolkit have mass taggers?
>>
File: ZOOMER AND HIS HAG.jpg (166 KB, 710x474)
166 KB
166 KB JPG
>>107360930
I can't train on my vramlet card anon...
>>
File: training.png (41 KB, 596x481)
41 KB
41 KB PNG
>>107360930
I'm already training too
>image expending $6000 on a GPU with the same speeds as a 4090

lmaoooo
>>
>>107360875
why does the skin look so bad in z-image. I'm getting the same shit, like the body don't look old, but it's all gross looking like that.
>>
File: zimg_0061.png (3.02 MB, 1344x1728)
3.02 MB
3.02 MB PNG
>>107360918
no this was never about adding breasts to the model, there are nsfw images in the set but no explicit close-ups.

i am going to try running this with a titty detailer (think face detailer but masked for nips) which i doubt will work. but training a nsfw lora will mean way more steps is my guess. that's a later experiment.

also you can push the lora kinda hard? (1.5)
>>
File: ZiMG_00173_.png (3.43 MB, 1344x1728)
3.43 MB
3.43 MB PNG
>>
>>107360972
you have to increase the shift to remove that noise pattern, that tends to make people look older than they should
>>
>>107360962
How VRAMlet?
Likely you can get fp8 training working unless turbo vramlet.
>>
File: ComfyUI_01926_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>107360971
I didn't pay for this GPU anon, but I could, if I really wanted to buy one, they're not that much.
>>
>>107360977
>i am going to try running this with a titty detailer (think face detailer but masked for nips) which i doubt will work
i made a bunch of these for flux and they worked pretty well. genetalia as well. it usually went person lora + nude lora -> nipple/genital detailer. i'm hoping that z-image will be better att learning this so this whole process can be replaced.
>>
>>107360438
NSFW
>>
File: lmao.jpg (218 KB, 949x1003)
218 KB
218 KB JPG
>>107361010
>I don't need that vram anyways said anon!
>>
File: 1760282226755901.png (529 KB, 676x507)
529 KB
529 KB PNG
hey look, it's a plate if z-image nipples
>>
>>107361010
thank you for supporting rising GPU prices!
>>
>>107361014
Why?
>>
gib teto zimage lora
>>
>>107360971
>AM4
>shitty PCI Bus speeds
lmao
bro the best part about a 5 series is the extra memory bandwidth from the 16x lanes. Your much much slower with that outdated board.
>>
>>107361010
>if I really wanted to buy one, they're not that much.
Paying $10k for something that would be outdated within 3-4 years is definitely expensive for most people.
>>
>>107360962
You can train Chroma loras on 8gb vram in OneTrainer, Z-Image Turbo is smaller than Chroma
>>
>>107361031
don't care until the base model releases
>>
>>107360737
confirmed brown nipples
>>
>107361021
reverse psychology post, do not engage
>>
why not just wait for the base model? you'll only have to redo any loras you make
>>
>>107361040
this. 0 point in making loras for turbo aside from wasting 1hour of compute.
>>
>>107360666
sai and bfl were never based, they just released more than nothing
>>
>>107359297
>oof, is there some uncucked qwenvl finetunes there?
Yeah, no node to load them, every nodes pack I tried use the base model and can't load any finetunes at all
>>
>>107361024
I didn't pay for the card. nmp

I might also not pay for a fully loaded C210 with 3x GPUs for kicks simply for the seethe here.
>>
>>107361002
12gb
>>
>>107361048
Zoomers are impatient. Tiktok has fried their attention spans.
>>
File: zimg_0038.png (2.25 MB, 1664x1216)
2.25 MB
2.25 MB PNG
>>107360977
lora grid as always
https://files.catbox.moe/nmapij.png
>>
>>107360438
diaper porn
>>
File: 1748080035116901.jpg (91 KB, 472x838)
91 KB
91 KB JPG
>backlog of wan2.2 loras to make
>backlog of chroma loras(now replaced with z-image loras)
>backlog of SDXL loras
>backlog of potential wan ideas using those loras
im drowning. theres no way to keep up. there's simply not even time
>>
>>107360438
More than one face from a single prompt
>>
File: ComfyUI_temp_hepqu_00014_.png (2.54 MB, 1088x1856)
2.54 MB
2.54 MB PNG
>>107361076
damn thats amazing, even at 512

Z-model is truly a god sent
>>
File: 1763110267352870.png (63 KB, 752x696)
63 KB
63 KB PNG
>>107361054
>Yeah, no node to load them
you technically can, if you replace the safetensor files from the base model by the finetuned model, you can find the files here
>ComfyUI\models\LLM\Qwen-VL
>>
>>107361018
With this multigpu offloading magic we have now this is truer than it used to be.
>>
>>107361048
because clicking a button and forgeting about it for 45 minutes is free and testing the lora afterwards is gonna be fun enough
>>
>>107361135
the issue is it only works for ngreedia and cumfart keeps breaking it every update
>>
>>107361032
this, we should all support nvidia by buying their latest product
>>
>>107361167
that will still be true when the base model is out
>>
File: 1728148180509193.jpg (125 KB, 826x871)
125 KB
125 KB JPG
There are plenty of people trying to pass off AI content as handmade, yet no one is trying to pass off their handmade content as AI generated. Why is that?
>>
>>107361179
and? its not out and it wont be in 45 minutes.
>>
>>107361189
because normies seethe about AI content
>>
>>107361179
my dick can't wait until base model is out
unless you want to suck it in the mean time of course
>>
>>107361189
There are plenty of people trying to pass off non-mentally retarded low IQ toxogroid comments as good, yet no one is trying to pass off their high IQ comments as coming from a mentally retarded low IQ toxogroid. Why is that?
>>
>>107361129
I think some anons tried a few threads ago but it didn't work because you had to change another file too, but then the node would dl the base model again instead of using the finetune
>>
>>107361060
I think you might be onto something
>>
>>107361179
saar, the pajeets are waiting for the 1girl instagirl lora for the zmodel, what part don't you understand about that?
>>
>>107361208
>yet no one is trying to pass off their high IQ comments as coming from a mentally retarded low IQ toxogroid.
i do that every day on the internet
>>
File: ComfyUI_temp_jsxpg_00016_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>107360699
>>
File: 1746211695663046.png (1.18 MB, 1039x1026)
1.18 MB
1.18 MB PNG
>>107360699
>>107361220
>So, what did I won?
>>
>>107361179
a new SOTA 2B model might come out in the meantime
strike while the irons hot
>>
The other day I tried doing some gens on Nano Banana Pro of a baseball game seen from the crowd. There was something off about the gens, and I realized it was that the crowd were all eerily similar to one another, like it was tiling the same poses, people, outfits, etc. SD/Flux/Chroma/etc would have made the crowd an unintelligible mess, a gory blur, but it would have captured some truth about the randomness of a crowd. Nano Banana Pro produced a clone army, each one believable and realistic, but the overall effect was completely fake, and depressingly lifeless.

We are making trade-offs here. I mention this because Z-image is like a local SeeDream or NBP, and has the same shortcomings. Is this the future we want for image models?
>>
File: oh really?.png (131 KB, 498x276)
131 KB
131 KB PNG
>>107361240
>a new SOTA 2B model might come out in the meantime
>>
Everyone is glazing yet another image model and I'm sitting here wondering if we'll ever get a video model that can do more than 5 seconds at a time.
>>
>>107361278
You're here forever (5 seconds).
>>
>>107361189
>Many manufacturers try to pass off their machine-made factory-produced goods as handmade, yet no artisan tries to pass off his goods as mass-produced. This means people will always continue to buy hand-made artisanal goods over the worthless alternatives.
>>
I only wanted a better sdxl with a higher channel vae and no fuckhueg encoder but they always manage to stuff nlp garbage into everything. really annoying
>>
>>107361278
LTX2 will do 10 sec, according to the devs, dunno about the parameters and quality though
>>
>>107361334
This but also trained on porn like SD1 was
>>
>>107361059
I would have tried my lack with fp8 and batch size 1
>>
>>107361354
we can never have a 1.x dataset ever again :(
>>
>>107361360
Why?
>>
>>107361386
because I said so.
>>
>>107361386
SAFETY ANON. THINK IF THE CHILDREN
>>
>>107361278
this is the real deal. it isn't like net lumina or flux 2 which only had 2 anons shilling it. literally everyone is praising ZIT.
>>
>>107361354
Why lie? Z has more porn in its dataset than 1.5 clearly
>>
File: Z-image turbo.png (1.7 MB, 1024x1024)
1.7 MB
1.7 MB PNG
>>
>>107361386
Because it was trained with a dataset pre-AI era, meaning no synthetic slop, after 1.5 was released >they raided the LAION dataset
>>
>>107361400
No it doesn't. It can't do genitalia at all nor does it understand NSFW content. Nudity =/= Porn.
>>
>>107361404
>Because it was trained with a dataset pre-AI era, meaning no synthetic slop
it's not hard to reproduce ca, just gather images that were uploaded on the internet before 2022
>>
You now remember Cstaber
>>
>>107361415
>It can't do genitalia
Post 1.5 genitalia (I know you won't)
>>
Has anyone tried the uncucked version of the text encoder of Z-image?
https://huggingface.co/huihui-ai/Huihui-Qwen3-4B-abliterated-v2
>>
You get better genitalia with Z-Image if you prompt in Chinese. No I'm not kidding.
>>
>>107361446
yes.
it makes almost no difference. i don't see the point in it until we get NSFW finetune.
>>
>>107361427
i still have my first gens from using it. good times
>>
>>107361455
i only get chinese women if i prompt in chinese
>>
>>107361396
>THINK IF THE CHILDREN
( ͡° ͜ʖ ͡°)
>>
>>107361401
Flux 2 had momentum for less than a day, now it's dead...
>>
>>107360831
>my 4070 ti super is 2 years old
>it will probably die on the next 4 years

I wonder if I should stock on ram, my 32 can't handle wan2.2 anymore.
>>
Do we know how big the nondistilled Z-Image is gonna be? Also 6b or some bloatmaxxed DOA abomination?
>>
>>107361456
https://huggingface.co/PantheonUnbound/Satyr-V0.1-4B What about this one?
>>
I wish I could filter anything that involves '1girl' on Civitai. So tiresome.
>>
>>107361427
I was more of a dstaber chad myself
>>
>>107360683
HE'S LITERALLY ME FR
>>
>>107361510
No idea, and I can't test because I'm locked in a 2hour gen currently.
>>
>>107361509
READ. THE PAPER.
They set out to disprove "scale at all costs". It'll be 6B :3
>>
>>107361509
No one knows yet.
>>
>>107361415
It understands "spreading labia", and things like that, but it doesn't render genitals very realistically in all cases, male ones especially.
>>
>>107361526
>I'm locked in a 2hour gen currently.
bro using Kandinsky 5 20B
>>
File: 1760634788003523.png (3.99 MB, 1248x1824)
3.99 MB
3.99 MB PNG
>>
File: Untitled.png (1.57 MB, 712x1109)
1.57 MB
1.57 MB PNG
lora #2, bustin it out
>>
>>107361509
on the paper they say it's the same size, but that nostradamus chink says he's not allowed to say its size
https://xcancel.com/bdsqlsz/status/1994103556312584685#m
>>
>>107361514
post one of your gens to see your elevated taste
>>
>>107361240
source?
>>
>new lora on civitai

>but the fag likes tats and didn't tag them so EVERY girl you gen with that lora will have tattoos

what the fuck! it's only homos making z-turbo loras!
>>
>>107361278
Who cares
Until video models look like sora 2 they are all fucking slop
Enjoy images
>>
>>107360821
Can it do variety? Or is it still always same seed?
>>
File: zimg_0053.png (2.82 MB, 1344x1728)
2.82 MB
2.82 MB PNG
>>107361615
>begging
>choosing
>>
>>107361651
truth nuke
>>
>>107361665
you can beg and choose to be a homosexual but i won't be.
>>
File: ohh shh you.jpg (1.33 MB, 5440x3072)
1.33 MB
1.33 MB JPG
https://xcancel.com/Ali_TongyiLab/status/1994314571700429267#m
>we launched Z-Image: an open-source, 6-billion-parameter model that delivers top-tier image generation for everyone, everywhere.
>But as always, the real magic came from you.
aww
>>
File: adbme2.jpg (65 KB, 675x499)
65 KB
65 KB JPG
>>107361514
>>
File: 1737352968997740.jpg (30 KB, 640x76)
30 KB
30 KB JPG
>>107361594
Where in the paper does it mention the parameter size for Z-Image? (Not Turbo) Show me. The only sentence could I find states the "Z-Image series" which presumably include Z-Image-Turbo & Z-Image-Edit. although Z-image base will be publicly released, it's not going to be advertised as a usable product. It's purely for training to use with Turbo/Edit.

https://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf
>>
for me, it's
1girl, pronebone
>>
>>107361742
>I don't consider Z-image base to be part of the Z-image series
are you retarded?
>>
1girl, standing, peace sign, 2.5D masterpiece by Greg Rutkowski
>>
File: 1738239458125371.png (214 KB, 1069x511)
214 KB
214 KB PNG
>>107361742
sure
>>
>>107361594
The only logical conclusion is they are releasing a distilled 6B "Base", but their full model. We've been cucked.
>>
File: stare5.jpg (34 KB, 512x512)
34 KB
34 KB JPG
The base model is not going to fit into 8gb VRAM, right?
>>
>>107361756
Base isn't for gens you moron. that defeats the purpose of turbo. Are you fucking stupid?
>>
>>107361769
Neither can Turbo
>>
>>107361705
this is like reaching the credits in my favorite game and getting that personal thank you from the devs
and we haven't even reached the credits yet

>>107361720
based
>>
>>107361762
>but their full model
But not*
>>
File: 1760151810556156.png (385 KB, 566x490)
385 KB
385 KB PNG
I have a dream, that future models wont be poisoned with cartoon and anime slop, and that the training data will only consist of the most beautiful of photographs
>>
>>107361771
(You)
>>
File: 1757240106135454.jpg (58 KB, 976x850)
58 KB
58 KB JPG
>>107361769
the clip file alone is 8gb
>>
>>107361278
you can already have pretty good 20s gens with some small transition problems with motion
loop workflow from https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
>>
>>107361705
>xcancel
Criiinge.
>>
File: stare4.jpg (30 KB, 340x296)
30 KB
30 KB JPG
>>107361794
>>107361774
But I can run it with 8 GB VRAM and 16 GB RAM.
>>
How is anon this retarded. Your question is answered right in the paper. READ with your FUCKING EYES
>>
>>107361812
shut up Elon
>>
>>107361822
shut up monkey
>>
File: 1760255811847781.jpg (529 KB, 1284x1133)
529 KB
529 KB JPG
>>107361820
but the speed losses
>>
>>107361820
What about 6GB?
>>
for anyone who finds Z-Image slow, make some gens with Chroma and then run Z afterwards, it'll feel like a dream now
>>
File: ComfyUI_00079_.png (883 KB, 1024x1024)
883 KB
883 KB PNG
>>107361757
zit doesn't know Greg Rutkowski

it's over
>>
>>107361822
The paper does not state Z-Image-Base is 6B, Anon. Yes, it's highly likely it is small, but it does not explicitly state this. That is NDA protected. We will not know until it is released. How hard is this to get through your thick skull?
>>
File: tgawgtfwagtfawgtwa.png (951 KB, 934x783)
951 KB
951 KB PNG
New z-image lora just dropped and its..

..something we can already gen

https://civitai.com/models/1750662/photorealistic-ai-influencer-woman877-character-lora-sd15-or-sdxl-or-flux-or-qwen?modelVersionId=2447895
>>
File: stare6.jpg (29 KB, 512x512)
29 KB
29 KB JPG
>>107361836
As long as I can generate realistic big breasted females in a reasonable amount of time, I don't care.
>>
>>107361864
>what if I make a woman lora on this model specialized on 1woman
some people are fascinating to watch desu
>>
>>107361842
Chroma HD Flash is fast enough if you're not a vramlet.
>>
>>107361863
I don't know how one reads their intent in the introduction and doesn't deduce that the base model will also be 6B. Also how does one see "Z model suite" and think that includes everything except for base. Reading comprehension.
>>
File: 1763880080762u.jpg (37 KB, 971x845)
37 KB
37 KB JPG
>>107360537
I hope that it's due to their uniform data coverage algorithm for distillation and base model will do it fine. It also confidently generates twilight sparkle whenever something from pony universe is mentioned.
>>
>>107361884
yea, half the z-image loras released can already be done natively. im not going to say anything though. let people have their fun
>>
>>107361863
>The paper does not state Z-Image-Base is 6B, Anon.
>>107361760
>>
>>107361843
zit artist knowledge is actually abysmal
>>
>>107361895
Time to move on from ChroMeme
>>
luv me wan
luv me noob
luv me zit
'ate saas
'ate nvidya
'ate uncomfyui
simple as
>>
>>107361884
>1girl lora based on SDXL 1girl images
>>
>>107361916
based beyond reason
>>
>>107361842
ZIT is twice as fast as SDXL on my machine, and moreover a four-step ZIT iteration is faster than a thirty-step SDXL iteration. If that is slow then what would be considered fast exactly?
>>
>>107361895
flash looks like shit when i try it, you got a workflow?
>>
Stop the press who is that?!
-had to do the reference, but woah suddenly a new lora appears and it's actually a character lora, one Z can't do.

https://civitai.com/models/2173778/harley-quinn-classic-clown-princess-batman-arkham-knight-z-image-turbo-lora?modelVersionId=2447919
>>
>>107361906
Anon, my guy, can you not read?
This is under 4.5 "Few-Step Distillation". The "6B foundational model" they are referring to is Z-Image-Turbo, not base. The base isn't the "leap in efficiency". That's turbo.

Z-Image-Base is the "full" foundational model. Z-Image-Turbo is the distilled foundational model.
>>
>>107361864
>>107361884
>>107361919
The fucking ridiculous self congratulatory description with links to his profiles and guides when it's the worlds most basic lora is pretty funny
>>
File: this.png (188 KB, 629x400)
188 KB
188 KB PNG
>>107361938
>The "6B foundational model" they are referring to is Z-Image-Turbo, not base.
>>
>>107361938
>>107361900
>>
>>107361936
i hate that nobody did this Harley for Noob/Illustrious but z-image gets it.
>>
>>107361950
>section says "few step distillation" in bold letters
>durr dey must mean the base model
(You)
>>
>>107361864
>>107361939
whats funny to me, they train those 1girl insta loras on sdxl/fluxqwen/image slop
>>
Enough of this. When Base is released and if it's bigger than 6B I'm going to spend everyday calling you a retard.
>>
File: b r u h.png (2.25 MB, 3387x1123)
2.25 MB
2.25 MB PNG
you have to be fucking kidding me, the first lora I would make would be fucking cctv camera style or 1980 screenshot tv style or some shit, not something that the model can already do
>>
>>107361975
You vastly overestimate the average prompters ability to prompt
>>
>>107361975
Why don't you then?
>>
>>107361994
Why don't you then?
>>
>>107362002
no gen no onyon
>>
File: looks at 4chan.png (661 KB, 976x850)
661 KB
661 KB PNG
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/discussions/25#69296b7daf6bec882aa53bf9
>I think 4chan users are very happy
is it true? are you happy anon?
>>
>>107361975
7k downloads for photorealistic AI influencer woman..? BUT YOU CAN LITERALLY DO THAT WITHOUT THE LORA

god damn I FUCKING hate RETARDED NORMIES
>>
>>107362002
My vram is already occupied, sorry!
>>
China: "gaijin, we have finally beaten scale at all costs"
Anon: "hurrdurr base model be big"
>>
>>107361975
the first lora i would make is one to verify i can make a lora successfully, the second one is one to see if i can introduce new concepts. (attempting a vagina lora)
>>
>>107361975
>I would make
>would make
uh huh
>>
>>107362017
it only has 13 downloads. they released that lora for multiple models and civitai counts them all
>>
why do we hate flux now
>>
>>107361843
No base model since SD15 has known Greg Rutkowski's art style

He was once a legend, now just some bum on the street
>>
why are you all seething about people taking advantage of retards with their 1girl slop loras that shouldnt exist? just make them yourself and make money from said retards?
>>
>>107362035
>now
>suddenly
>all of a sudden/all of the sudden
>>
>>107362011
No I'm expecting the worst and will only believe the base model is good and useable once I see it with my own eyes
>>
>>107362037
SD1.5 remains unbeatable for breadth of knowledge, and this perplexes and distrubs me.
>>
File: 1763590204159859.png (346 KB, 647x1540)
346 KB
346 KB PNG
>>107362035
>why do we hate flux now
https://huggingface.co/black-forest-labs/FLUX.2-dev
>>
>>107362011
I'll be happy when the base model releases and someone makes a finetune of it that becomes what Chroma should've been.
>>
>>107362051
but if flux turbo was distilled from base it means by definition the base model is better no?
>>
>>107362011
I am but this isn't exclusive to 4chan.
>>
>>107362039
i think it's really funny in open source when someone provides a free tool to the community and the community is upset about it
>>
File: file.png (138 KB, 1889x930)
138 KB
138 KB PNG
I'm trying to train a lora with ostris' ai toolkit, i started the job and after 10 minutes nothing happened, no network activity, no models were downloaded, no vram was used.
Please help.
terminal log (no errors): https://paste.centos.org/view/eb15391f
>>
>>107362061
This. It just needs to recognize basic NSFW concepts(genitalia, insertions, sex positions) and it'll be golden.
>>
>>107362035
>now
everyone hated flux once it turned out that it's pretty much unsalvageable
>>
>>107362039
>>107362079
so you're NOT the guys who were just whining about 1girl gens here and civitai right?
>>
>>107362071
he doesn't understand what you mean unfortunately
>>
>>107362055
poorfag can't afford to run that fat hog
>>
File: why?.png (3.09 MB, 1790x1277)
3.09 MB
3.09 MB PNG
>>107362115
why would you run a giant plastic generator?
>>
>>107362103
I'm talking about the size, if it's some 30b blob it'll be DOA like all the other bloatmodels before it
>>
>>107362055
kek

what an absolute shit company, the only 'innovation' they do is coming up with new methods to cripple their models with censorship
>>
i WILL continue to gen 1girl, slampiggy and i WILL cum.
>>
>>107362082
post job settings. did you setup the Hugging Face Token so it can download the model? check the hub folder to make sure its not empty

if its empty its not working
>>
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
DOWLOAD AND SAVE EVERY THING
https://m.youtube.com/watch?v=xm9MGlOddVY
>>
>>107362128
it's the same size though >>107361760
>>
Dev and Schell are both 12B. Why would base Z be larger than turbo?
>>
>>107362082
did you set up your .venv correctly? it's probably crashing before it starts and windows is gay so doesn't show you the cmd output. try to activate your venv and run the job via the command like so:

python run.py output/job-name/.job_config.json

you should then see what's wrong
>>
File: UKoDmy.gif (2.08 MB, 400x224)
2.08 MB
2.08 MB GIF
>>107362011
me rn fr
>>
>>107362135
This is old and no one cares. I will continue to gen cunny.
>>
>>107362123
Because it's SAFE!

Don't you want to be SAFE !?

Surely you do, pixels can hurt
>>
>>107362123
ew, wtf is up with that chin
>>
comfy should be dragged out on the street and shot
>>
>>107362134
i need a huggingface token to download ungated models? ill make an account and try again, thanks anon
>>107362144
im on debian 13 but ill try that, thank you anon
>>
>>107362153
We enjoy our SAFE and FUN models here at /ldg/
>>
>>107362147
kek, same
>>
>>107362164
you don't need a hf token for z-image training
>>
I bet Flux 2 can make a mean "funny cat astronaut on the moon riding hotdog car" image
>>
>>107362141
Flux Dev and Schnell are both distilled from the proprietary Flux Pro model, which I would assume is larger than 12b.

That said it's not necessarily the case than Z-Image Base is larger, but it seems likely, if so I would wager ~10-12b
>>
File: 1754881217898.jpg (1.11 MB, 5380x3518)
1.11 MB
1.11 MB JPG
>>107361910
Z is a neat model to use when you want to gen images that look completely accurate, but it's not as fun for NSFW prompting as Chroma. Even if it gets an NSFW tune, unless the seed variety is solved it will remain inferior. Obviously, there are things it will always do better than Chroma for now, such as backgrounds (which could be fixed with a large scale Chroma tune) or out of the box artist style/celeb concept knowledge.
>>
>>107362195
>but it seems likely
Based on previous models? If anything they seem to claim they've beaten that paradigm
>In this work, we present Z-Image, a powerful diffusion transformer model that challenges both the "scale-at-all-costs” paradigm and the reliance on synthetic data distillation. We demonstrate that neither approach is necessary to develop a top-tier image generation model.
>>
>>107362201
no one said that it solved the seed variation, it's still a problem and it'll likely be better on the base model, we're praising the model for other stuff, great details, great skin texture, great lightning, it's probably the most realistic local model and it's only 6b, that's an achievement like it or not
>>
Scheduler and sampler for Z? did we settle with euler simple or what? Some testing I did yesterday with euler simple got more consistent img2img results, but less realistic
>>
File: WAS IT SO HARD??.gif (1021 KB, 233x131)
1021 KB
1021 KB GIF
>>107362227
>and the reliance on synthetic data distillation.
I knew this model looks good because they finally stopped training their model on synthetic shit, THANK YOU
>>
>>107362201
Love this. Although if you'd turned down guidance and used some other tricks you could have got a lot more variety out of Flux.

Always good to see a good infographic shaming for these "better" models. Used to make many of them myself
>>
>muh seed variance
change the prompt you lazy bastard
>>
File: Nano Banana Pro.png (1.91 MB, 1408x768)
1.91 MB
1.91 MB PNG
>>107362251
>Always good to see a good infographic shaming for these "better" models. Used to make many of them myself
now the infographics are AI generated, get with the times grandpa!
>>
>>107362144
weird as fuck,
python run.py output/TetoLora/.job_config.json

works completely fine. thank you so much anon
>>
>>107362227
>Based on previous models?
Primarily because Z-Image Turbo is very focused on realism and portrait shots, for the base model they probably want give good coverage for a wider range of concepts, which likely means a somewhat larger model.
>>
File: stare3.jpg (59 KB, 1024x960)
59 KB
59 KB JPG
>>107361820
fucking comfynigger updated something and it stopped working
>>
>>107362238
gloss over the paper. its surreal to see a lot of things anons claimed about that being explicitly stated and proven in an academic setting
>>
>>107362283
>Pulling between releases
You took the shitty odds, it's on you
>>
File: flux comparison.jpg (799 KB, 2526x1602)
799 KB
799 KB JPG
>>107362201
It has just been one downgrade after another. It's all so tiresome.

This is why I am bearish on AI in general. "It's so over, they're getting so good now." No.
>>
File: 54454554645.jpg (555 KB, 2394x2018)
555 KB
555 KB JPG
>>107362230
>great details, great skin texture, great lightning

Of course, great prompt adherence. Some gens, sure it's the most realistic it can be. But Chroma is consistently more realistic and less slopped (of course at cost of occasional bad outputs). It's only 8b you know (with a Flash version), not like we're comparing 30b model to a 6b.
>>
>>107362282
>which likely means a somewhat larger model.
That's my point. They seem to allude that you don't actually need more params for competent models. Nowhere in the paper did I see them even suggest the opposite. But we won't know for sure until release.
>>
>>107362318
>occasional bad outputs
lol
>hands out of frame
lmao
>>
>>107362312
>comparing a """"mostly base"""" mix to a foundational model
Give me one reason to not disregard everything in your post
>>
>>107362318
I swear you motherfuckers are literally blind if you think that chroma shit actually looks good. Bitch looks like a mutant
>>
File: stare10.jpg (23 KB, 512x512)
23 KB
23 KB JPG
>>107362283
it was async offload
--disable-async-offload helped
no, I am NOT going to report it
>>
>chromasome, flux cucks, that one cumfy schizo, pepe poster, and pedochad all joining forces to fuck /ldg/'s shit up

i don't even have words at this point, this is one fucking LOADED roster.
>>
>>107362123
you keep posting that but it really shows how z prompt adherence is notably worse
>>
Oh my fucking god i hate comfy so fucking much it's unreal
>>
File: 1748979911748511.png (774 KB, 640x787)
774 KB
774 KB PNG
z-image isnt even that good
>>
>>107362346
Do you know what perspective and focal lenght is you absolute retard ?
>>
>>107362369
how about you work on your shitty ui instead of complaining, ani.
>>
>>107362376
Do you fur brained mega retard?
>>
File: 2517960256.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>107362318
I like chroma well enough, but it's not only much slower it's also less stable. If z-image is as trainable it will simply supersede it.
>>
total spaghetti death
>>
Does AniStudio even support Z-Image-Turbo, the latest popular image model? Why would anyone waste their time with a UI that can't keep up with current trends?
>>
>>107361931
Here you go anon
https://files.catbox.moe/1qe8zt.png
>>
>>107362341
Base SDXL would have demonstrated even greater variety, it just would have taken longer to cherry pick the best gens. A Flux finetune would have performed even worse on this metric, not better.
>>
>>107362376
That perspective on chroma is fucked bro lol. Honestly should never praise perspective on any AI model, all of them are trash.
>>
>>107362356
>6b model has slightly less prompt adherence than a 32b model
>6b model looks like a realistic photo of a person, 32b model looks like a person made of plastic
Flux 2 is dead
>>
you heard it from the chinaman's mouth

base is gonna fuckin rock
>>
>>107362413
They both have strong default styles that are mutable with prompting. You just like the z image smartphone photo style more than the flux hyperprocessed advertising photo look.
>>
File: ComfyUI_09093_.png (2.43 MB, 1152x1152)
2.43 MB
2.43 MB PNG
>>107362391
It will definitely catch up in stability and provide more coherent images, there's no doubt about its potential. The one thing that's concerning is the lack of seed variety. The concern is that maybe this is the reason why it's more coherent, and when one attempts to tune it then it loses some coherence. The in that case, how much better than Chroma is it truly?
>>
>>107362436
Sure Jan
>>
File: ComfyUI_09094_.png (2.57 MB, 1152x1152)
2.57 MB
2.57 MB PNG
>>
>>107362413
brutal
>>
>>107362398
It still uses python which is the funniest part
>>
>>107362445
ChroMeme will never catch on, give it up
>>
>>107362436
>just like the z image smartphone photo style more than the flux hyperprocessed advertising photo look.
what red blooded burger actually prefers the second one? only pooworlders and grifters like the second look
>>
>>107362429
>I recommend also trying with different time shift values
anon found about that first though!
>>
File: im traaaaaaining.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
tried training a lora for the first time, it went... better than i could have imagined
>>
>>107362490
>average chromosome gen
>>
>>107362452
it's overbright and the colors are oversaturated, I don't know how you can enjoy this shit, it doesn't look natural at all
>>
>>107362445
>The in that case, how much better than Chroma is it truly?
better by a lot because it's smaller and thus doesn't need a supercomputer cluster and furry millionaire to modify it
>>
File: ComfyUI_09097_.png (2.43 MB, 1152x1152)
2.43 MB
2.43 MB PNG
>>
>>107362495
looks like pooCPlustrious
>>
>>107362452
it took 9094 seeds to get one with only slightly mangled hands

look I wanted chroma to be good too, but it's time to stop
>>
>>107362501
wtf is that thing on the left lol
>>
https://www.reddit.com/r/StableDiffusion/comments/1p94pgz/implementing_nag_for_zimageturbo/
>Implementing NAG for Z-Image-Turbo?
>I found someone on a Korean forum who's claimed to have implemented a version of it based on this repo using AI (https://arca.live/b/aiartreal/155108228), and it appears that he was able to get negative prompting to work using NAG with a CFG at 1.0, but he did not provide the code.
FFFFFFFFFF
>>
>>107362318
that's grim for chroma. thousands of dollars wasted lolol
>>
I'm going to craft a titty elf prompt for Chroma Flash and Z-Image. I will craft the prompt without any testing of the outputs, so that the prompt will not be made to specially fit one model or the other. I will select the first 15 SFW images produced by each and make an infographic. This will incontrovertibly PROVE which model is better.
>>
>>107362533
Tim Shift
>>
Bake?
>>
>>107362496
his gen settings do that, chroma doesnt look like that ootb
>>
File: many such cases.png (309 KB, 653x565)
309 KB
309 KB PNG
>>107362318
>right is the woman's photo on tinder
>left is the same woman during the date
>>
File: ComfyUI_19367_.jpg (930 KB, 2048x2048)
930 KB
930 KB JPG
>>
File: 1761793099581729.png (470 KB, 1528x1378)
470 KB
470 KB PNG
>>107362372
TRVKE
>>
>>107362391
>it's also less stable
That's the appeal. When you eliminate this the model becomes useless.
>>
>>107362372
>>107362562
wait for the base model to come out before reaching any conclusion
>>
>>107362562
>Here's my list of flaws...
>seed-variety
>seed-variety
>seed-variety
>seed-variety
>composition
>>
z-ai-edit when
>>
>>107362496
That's because I'm using the Flash version and I'm not adjusting params. There are experimental samplers that get rid of that
>https://github.com/ClownsharkBatwing/RES4LYF

Alternatively if you're not a VRAMlet you just merge the HD delta weights with regular HD and the problem is gone.
>>
>>107362562
so a huge ass text just to say "no seed variance" lol
>>
>no proper nsfw lora in 24h
its over
>>
>>107362534
GUYS! GUYS!
LISTEN!!!
you are going to think I'm crazy but get this.
there is a Korean rice picking forum where some guy
get this
some guy actually got NAG to work with zit
I know, I know. I'm taking crazy pills but it's true!
he made some images with it AND had negative prompts!
he didn't share the code!!! omg what a meanie!
anyways thought I should share this since it will be IMPOSSIBLE to do negative prompts
anyways see you tomorrow when base z is out take care bye bye
>>
>>107362562
posted by the seething flux.2 dev

the pros outweights the cons of this model, also this just a turbo version, what a fucking autist
>>
File: ComfyUI_09099_.png (1.74 MB, 1152x1152)
1.74 MB
1.74 MB PNG
>>
File: nice one.png (480 KB, 750x1000)
480 KB
480 KB PNG
>>107362594
>anyways see you tomorrow when base z is out take care bye bye
kek
>>
there doesnt seem to be the option to train zimage in fp16 in ai-toolkit?
>>
Not trolling I will miss XL and by extension CLIPs ability to give you random enough kino that you can just keep pressing gen with random seeds
>>
>>107362562
Can image variation be solved with the dynamic prompting? Did someone already implement the workflow for that?
>>
>>107362594
ok
>>
>>107362587
https://civitai.com/models/2173844/z-image-turbo-facial?modelVersionId=2447989
>>
File: 1750771617660.png (923 KB, 860x823)
923 KB
923 KB PNG
Any other UIs for Z-image? I just cant into node autism.
>>
>>107362620
neoforge
>>
>>107362608
under Quantization set it to - NONE - instead of fp8
>>
>>107362610
>dynamic prompting
You mean basic wildcards? Trivial to "implement" kek
>>
>>107362620
Use SwarmUI
>>
File: sdxl vs dalle round 2.png (3.9 MB, 2355x1898)
3.9 MB
3.9 MB PNG
>>107362562
These downsides are the obvious trade-off they made to get the upsides. I've been beating you faggots over the head with this point since the SD1.5 days when you all went for finetunes over base.

"If only we could get the good hands and anatomy of PerfectBodyTune 6.0 with the variety of base SD... " (nobody is close to accomplishing this)

There has not been any big transformative revelation to make these models better by leaps and bounds. They have been making bad trade-offs to optimize for what the most retarded genners prefer at the expense of everything that made SD cool. The direction this was headed was obvious from the beginning and nobody was willing to listen. Now here we are.

Z Image is shit.
>>
>>107362619
I'm sure he trained that lora with synthetic shit, it's so slopped
>>
>>107362620
I got it running on neoforge, not as good as OG forge, but at least it's not noodles.
>>
>>107362630
thanks
>>
>>107362637
>There has not been any big transformative revelation to make these models better by leaps and bounds. They have been making bad trade-offs to optimize for what the most retarded genners prefer at the expense of everything that made SD cool.
Chroma exists though, if you are desperate for seed variation at the expense of good anatomy + good perspective, you have that lool
>>
>>107362637
>muuhh dalle3
why are you praising this model, it's the most plastic model ever
>>
>>107362637
This is what gambling rng addiction does to a motherfucker. Don't be like him kids
>>
>>107362662
You think that image is praising Dall-E 3?
>>
>>107359883
You getting 1.5-2s per iteration on 3090, I'm getting double that training zimage on ai toolkit, I do have 120 images in my dataset, I assume that's the reason? The rest of my settings are default, fp16
>>
>>107362685
look at his post he praises the seed variation comming from dalle 3
>>
>>107362701
Did you read the text under the images?
>>
>>107362697
Also my images are 1024
>>
>>107362549
gimme a sec
>>
File: ComfyUI_09107_.png (2.01 MB, 1152x1152)
2.01 MB
2.01 MB PNG
>>107362575
Yes, seed variety is important
>>
LoRA Training started
See you guys in 30 min
>>
>>107362723
those chromakeks must be seething so hard knowing that their model will never have the popularity of Z-image lmao
>>
If the base model is really also 6b I'll cum
>>
File: spsosm.jpg (113 KB, 1024x1024)
113 KB
113 KB JPG
>>107362662
good old days
>>
>>107362738
Oh noo I hate being in a small elite of posters, I wish I shared a slop trough with all the unwashed masses.
>>
>>107362610
It can be solved by running your basic ass prompt through their "prompt enhancer".
It will always add unmentioned details to the image but it will add different things each time.
>>
File: ComfyUI_09117_.png (2.21 MB, 1152x1152)
2.21 MB
2.21 MB PNG
>>
>>107362793
>>107362793
>>107362793
>>
>>107360558
i print this for my wall
>>
>>107360699
oh god, now linux kernel stolen and hacked? what do
>>
>>107362723
prompt before new threaaaad
>>
>>107362842
nyo :3
>>
>>107361311
Many "artisans" try to pass of their goods as higher quality when if fact they're not.
>>
>>107361584
BLEWBS!
>>
File: lmao.jpg (143 KB, 912x1836)
143 KB
143 KB JPG
>>107361107
yes in one prompt it can
how? figure it out.
those tricks and tips and workflows from interents are mostly retarded

i figured out (somewaht new to comfy) how face detailer works almost as fast as in auto and forge
>>
>>107364235
kek this is pretty good, post that on the new thread though >>107362793
>>
>>107360506
far sighted "entrepreneurs" updooting their services since flux was all the rage not too long ago
>>
>>107362759
WHat model is thiss???
>>
>>107362562
it is trained using chineses mandarin speak,
that ledditor prompts are not clear instruction even in english.

last night i used llm to chinesify my prompt and told it as best as you can as chineses would write it in their language,
same paramters test and language change resulted in:
using english prompt = person does not hold the coffee cup in one hand and only leans on the counter with other. coffee cup is on the kitchen counter.

using chinese promp = person held the coffee cup + everything else was more realistic
>>107364241
i deleted it -.-
repost if you wish.
that is what loser gets for not winning nobel peace prize kek



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.