/g/ - Technology

File: 1768026987033384.png (1.7 MB, 1024x1024)
Z-Image Base edition again

Discussion of Free and Open Source Diffusion Models

Prev: >>107987000

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debora
https://rentry.org/ana
>>
>>107988202
Where's the collage you lazy worthless shitface
>>
>>107988168
What? I think you have been confused with someone else bud. I love ComfyUI.
>>
File: Zimage_base__00053_.png (1.58 MB, 1024x1024)
i can still gen faster than i can post so that's pretty nice
>>
i made a lora, anyone wants it?
>>
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107988219
can you test if base can make a woman giving birth
>>
File: ComfyUI_00221_.png (1.15 MB, 1280x720)
>>
File: o_00052_.png (1.88 MB, 896x1152)
>>
>>107988222
Only if it's for cute feet.
>>
WHERES THE FAGOLLAGE??????????
>>
File: Zimage_base__00055_.png (1.98 MB, 1024x1024)
>>
File: Zimage_base__00081_.png (1.63 MB, 1024x1024)
Yeah base is good, china won
>>
>>
>>107988202
You sdg fags already ruined one general, isnt that enough
>>
>>107988224
where is the comfyanon rentry? dude literally hires people to schizo for him
>>
File: ComfyUI_00220__sbs.jpg (245 KB, 2078x720)
>>
File: 1749723443194040.png (1.42 MB, 1024x1024)
>>107988224
working extra hard today
>>
>>107988202
Imagine you go to next leg thread but then
>Maintain Thread Quality:
https://rentry.org/debora
https://rentry.org/ana
>>
File: o_00053_.png (1.82 MB, 896x1152)
>>
>>107988253
I'm not imagining and it's happening!!!
>>
File: 0012.png (403 KB, 1296x1328)
base can do pixels too
>>
File: ComfyUI_06205_.png (1.65 MB, 1024x1024)
>>107988242
Little known fact, also dope on ze mic
>>
>>107988261
how long before you revert to big lipped baboon?
>>
File: Zimage_base__00056_.png (1.33 MB, 1024x1024)
>>107988226
>>
>>107988217
>I love the enshitification of everything
nu /g/ sucks
>>
File: ComfyUI_06251_.png (1.48 MB, 1024x1024)
>>107988241
Not as much like Mr. Bean but I'm done for now
>three arms
>>
qwen image 2512 wins :)
>>
>>107988259
>pixels actually all the same size
What the fuck?
>>
File: ComfyUI_41483_.png (1.53 MB, 1024x1024)
>>
File: Zimage_base__00057_.png (1.29 MB, 1024x1024)
>>107988226
>>107988284
>>
>>107988257
UWU are you who I think you are?
>who do you think i am
Nyo.....
>>
>>107988299
can you please make something else instead of slopstyle garbage? this one is too fried like every single one of your gens
>>
>>107988284
bitch is growing a tit out of her neck
>>
>>107988299
Did those ace step people give you the weights and code yet?
>>
>>107988305
comfy doesn't know how to gen. only math and wrapping torch. it's all he's good for
>>
>>107988299
Honest opinion on Z-Image?
>>
>Hitler had to leave for more important business
I wonder what ZIB meant by adding the little rainbow flag
>>
>>107988299
Fuck you and your friends trani and debo, troonffy
>>
>>107988320
why are you asking the anon with worst taste in the thread and is barred from being honest due to nda?
>>
>>107988241
>three arms
Is this bait
>>
File: 128607729.png (3.6 MB, 1312x1584)
>>
File: 0TjtTtMw8s.png (1.04 MB, 1024x1024)
censored for the christian 4chan board
>>
>>107988301
*pukes*
>>
How do I tell if something is wrong?
>>
>>107988364
zbase loves extra hands
>>
>>107988320
Not worth using for inference over Turbo or Distilled Kleins IMO
>>
>>107988299
why don't you add security to remote sessions? hackers keep installing malware on people's computers because of you
>>
>>107988368
look at samples
>>
>>107988373
wrong
>>
File: ComfyUI_07407.png (3.42 MB, 2048x1280)
>>107988317
I thought he was stealing prompts whenever his application phoned home? He should know how to do it by now...
>>
>>107988384
is that base?
>>
>>107988380
Disabled to save time.
>>
>>107988394
No. I haven't gotten around to training just yet (I just woke up).
>>
>>107988384
see >>107988377
it's incompetence and corpo compliance
>>
>>107988301
1633.112 doesn't seem like a healthy measure of any kind of vitals
>>
>>107988373
Of course not. No base model should be used for inference. I guess it's too early to tell how Z-Image loras compare with ones made in Turbo.
>>
>>107988373
stick to coding retard
>>
>>107988380
Body horror is already showing at the 100th step.
>>
File: Zimage_base__00062_.png (1.56 MB, 1024x1024)
>>107988401
i think she got the high score
>>
>>107988402
>No base model should be used for inference.
But this is the base model prompters general...
>>
File: 44295916.jpg (793 KB, 1936x1985)
>>
>>107988202
Glad base is finally here. Now begins the wait for Booru tags.
>>
>>107988373
listen to >>107988404
>>
>>107988397
get the horse faced girl lora out asap
>>
>>107988422
>booru tags
esl zoomer loser
>>
>>
>>107988456
base slop
>>
>/lmg/ - Local Models General
>/ldg/ - Local Diffusion General
>/sdg/ - stable diffusion general
why the FUCK do you fags need THREE of the exact same generals?
>>
>>107988422
illiterate
>>
>>107988462
im here for it
>>
>>107988467
this is an ai board now, nigger
>>
>>107988467
lmg is more about chatbots imo.. sdg is just a retard running a script that produces garbage shit pics all day long thinking its so clever
>>
>>107988467
>THREE of the exact same generals?
lmg is for llms
ldg for image gen
sdg was for image gen, idk why people separated, maybe they felt too many ppl talking about saas there? idk 4chan users are mentally unstable, barely anyone uses saas, and there is also the saas specific general that you missed (/DE3/)
>>
>>107988299
that's the best you can do?
>>
>>
File: z_imageBASEd_00144_.jpg (536 KB, 1520x1728)
>>
Is omni base out or just "base". Also why is "base" so small, they said that turbo was size distilled
>>
File: kt_A-s5vMQ6L-_sUjNUCG.jpg (1.58 MB, 4400x1356)
>>107988506
>>
>>107988506
just base, no omni for editing
>>
>>107988498
4 years, no improvement. actually he had higher res pics when he was on a1111
>>
>>107988504
are you using a lora?
>>
>>107988230
One Ogre! One Swamp! One Shrek!
>>
>>107988512
that doesnt answer my question retard
>>
>>107988516
lmfao
>>
>>107988516
>he had higher res pics when he was on a1111
KEK!
>>
>>107988512
So I guess the size distillation was for this model, meaning this is a distill and not a true base model
>>
>>107988536
>>107988516
what is the creator of a1111 doing nowadays
>>
>>107988543
living the high life with all those github stars
>>
>>107988543
dota
>>
>>107988299
ACEStep when?
>>
File: nyx-assassin.png (54 KB, 256x144)
>>107988543
>>
File: z_imageBASEd_00165_.jpg (698 KB, 1520x1728)
>>107988517
nah dude

 A photo of Billie Eilish. A photo showing a close-up of a young woman with fair, almost pale, fair skin and long, straight black hair. She has round large blue eyes, well-defined eyebrows, and long eyelashes. Her mouth is open wide, revealing a set of silver braces and a tongue that is partially visible. Her lips are slightly parted. Her right hand, with long, manicured nails painted in a light purple color, is gently holding her chin. The background is blurred but shows a black couch and a wooden floor. The lighting is bright and even, highlighting her facial features and the texture of her skin. The style of the photo is explicit and erotic. The overall atmosphere is intimate and provocative. 
>>
>>107988559
ace step base
>>
>>107988568
im pretty sure cumbox blocked my ip for some reason
>>
>>107988467
/adt/ not mentioned but it first started as a joke and wasnt that bad but now its a containment for the pedos
>>
>>107988574
yeah i did, stop
>>
>>107988568
>image contains errors
catbox died again?
>>
what are we using to train loras for base? one trainer?
>>
>>107988585
i haven't been doing shit faggot
>>
File: Zimage_base__00068_.png (1.8 MB, 1024x1024)
>>107988570
>>
>>107988589
He's gonna take his sweet time like with other models so ai toolkit for now.
>>
>>107988570
prompt for picrel?
>>
>>107988599
kys pedo
>>
So what tricks do and don't work? I saw that sageattention is not compatible for now or ever. The precision tricks seem to work fine from what I've seen. Anything else? Does torch.compile or async offloading work? What about https://github.com/pollockjj/ComfyUI-MultiGPU?
>>
>>107988600
>for now
are you saying one trainer is better?
>>
File: 1768681026637986.jpg (47 KB, 828x798)
oh i thought you meant cartoon lolis
>>
>>107988608
It's not turkish
>>
>>107988599
Fuck off.
>>
>>107988606
>clicks image when original anon requested them knowing what it was
retard
>>
>>107988575
containment for pedos? then what is this shit?
>>107988568
>>107988599
>>
Still waiting for the real base model. Does this mean it's delayed again?
>>
>>107988616
huh?
>>
File: z_imageBASEd_00167_.jpg (715 KB, 1520x1728)
>>107988604
 candid photo of 29-years-old woman with huge massive hyper-tits covered by black skintight sleeveless top, she has long wavy red hair, fair skin, and freckles. She has green eyes. Photo was taken inside 7-11, close the exit door. She is looking directly at the camera. The texture of her hair is soft and slightly tousled. She looks annoyed. Her shirt is made out of sturdy fabric of some sort. She wears a Bird's Nest as a hat, In the bird's nest is a Large white Seagull. Glass door. Shelf with Coca-Cola advertisement. 
>>
>>107988575
but sometimes they still spawn in here like >>107988599 so idk
>>
Anyone found what the best sampler is for Z-base and/or some magic tokens? It definitely has better prompt understanding than turbo but the things it seems to struggle on are eyelashes and occasionally lips.
I've tested inverse squared and beta for the scheduler with more steps and it will occasionally fix the issue but not as much as I would like. It might just be the way it is without a second hires pass or without waiting on finetunes.
>>
>>107988633
yeah
>>
>>107988633
proof?
>>
>>107988633
heun, res2s and other strong samplers. 15-20 steps. cfg 3.5+. Euler and the generic ones are too weak.
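
For anyone comparing CFG numbers, here is a minimal sketch of what the scale does, assuming the standard classifier-free guidance formula `uncond + scale * (cond - uncond)`. The function name and the toy numbers are made up for illustration; this is not Z-Image's or ComfyUI's actual code.

```python
def cfg_combine(uncond, cond, scale):
    """Standard classifier-free guidance: move the prediction away from the
    unconditional output, along the direction of the conditional one."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

# Toy "noise predictions" (hypothetical numbers, not real model output).
uncond = [0.0, 1.0]
cond = [1.0, 3.0]

low = cfg_combine(uncond, cond, 1.0)   # scale 1 returns the conditional prediction unchanged
high = cfg_combine(uncond, cond, 6.0)  # a larger scale extrapolates further past it
```

The higher the scale, the further the result is pushed beyond the conditional prediction, which is one plausible reason high values can look "fried" on some models.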
>>
>>107988652
>cfg 3.5
retard, 6 is min
>>
>>107988619
i don't know what you're talking about, i dont read schizo threads
>>
>>107988629
code blocks dont wrap hon just paste it
>>
>>107988599
Woah based!!! Then my imoutotv jappies will work just fine, I have a lot of datasets ready to train. How many images you used and can you share the settings?
>>
>>107988653
Lol enjoy your fried gens.
>>
File: 3321015.png (2.78 MB, 1728x960)
>>107988566
Now you made me wonder, which characters does he play?
>>
>>107988652
You're supposed to use 50 steps with euler
>>
>>107988620
we try our best man, can't contain them all.
>>
>>107988633
i use CFG 3, dpmpp_3m_sde_gpu, and beta scheduler - seems to work pretty well
>>
>>107988299
slop ui
>>
>>107988663
>believing official settings
>ever
>>
>>107988660
and you enjoy your sketches
>>
File: Zimage_base__00069_.png (1.49 MB, 1024x1024)
>>
>>
>>107988676
ai slop
>>
>>107988671
You are a moron. I hope you realize res2s is just running the model for 30-40 steps anyway, but you think it's "strong" because the UI says 20 instead of 40
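
To make the step-count point concrete, here is a toy sketch that counts model calls for a one-stage sampler (Euler) and a two-stage one (Heun). It assumes res2s is a two-stage method as its name suggests; the toy ODE and all function names are illustrative and not taken from any real sampler library.

```python
def make_counting_model():
    """Wrap a toy derivative function so we can count how often it is called."""
    calls = {"n": 0}

    def model(x, t):
        calls["n"] += 1
        return -x  # toy ODE dx/dt = -x, standing in for one denoiser call

    return model, calls

def euler(model, x, ts):
    for t0, t1 in zip(ts[:-1], ts[1:]):
        x = x + (t1 - t0) * model(x, t0)  # one model call per step
    return x

def heun(model, x, ts):
    for t0, t1 in zip(ts[:-1], ts[1:]):
        h = t1 - t0
        d0 = model(x, t0)            # first call: slope at the start of the step
        d1 = model(x + h * d0, t1)   # second call: slope at the Euler guess
        x = x + h * 0.5 * (d0 + d1)  # average the two slopes
    return x

steps = 20
ts = [i / steps for i in range(steps + 1)]

m, euler_calls = make_counting_model()
euler(m, 1.0, ts)
m, heun_calls = make_counting_model()
heun(m, 1.0, ts)
print(euler_calls["n"], heun_calls["n"])  # prints "20 40"
```

So "20 steps" of a two-stage sampler costs about as much compute as 40 Euler steps. Real implementations may skip the second call on the final step, so the exact count can be one lower.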
>>
>>107988629
much obliged my zigga
>>
mid release but at least it was expected
>>
>>107988678
sexo
>>
>>107988683
not how res works retard
>>
I am the ONLY ONE waiting for qwen image edit 2512...
>>
File: 33.jpg (470 KB, 2068x1497)
>>107988655
Not my problem normie

>>107988659
Around 400 images, though that's mostly just different outfits. I'm sure you can get excellent quality with as few as 50 images.

>can you share the settings?
It's the default settings for Z-Image, though I may experiment with other options later. The real test is how flexible it will be with other loras.
>>
>>
>>107988698
turn off the 512 and 768 resolutions if you have a varying dataset because it's gonna shrink everything to 512
>>
>>107988698
Thanks friend having known settings to start with is always good, I'll for sure experiment as well.
>>
>>107988711
that has been fixed
>>
>>107988698
also, why are you running weighted?
>>
>>107988694
Don't look up what res2s stands for anon
>>
>Lodestone does it
>Why tune it? There's body horror, that sucks

>BFL does it
>Why tune it? Z (nonexisting) is le better

>Z base does it
>They are the best model in the world!

So the formula for body horror acceptance is just gaslighting everyone into thinking your model is good.
>>
fuck you comfy im not installing your update
>>
So One Trainer > ai toolkit > kohya ss
>>
>>107988729
An aspect of chinese culture.
>>
File: Zimage_base__00074_.png (1.58 MB, 1024x1024)
>>107988226
>>
File: ZibVSZit.jpg (3.31 MB, 2048x1536)
>>
>>107988739
skill issue
>>
>>107988739
Sop & Sloppier
>>
File: 014 - proxy..jpe~01.jpg (48 KB, 655x214)
>>107988738
>>
>>107988738
imagine the smell
>>
>>
>>107988739
add negatives
>>
File: 55.jpg (280 KB, 583x778)
>>107988711
I heard using multi resolutions helps the model with fine detail. I never noticed any quality degradation

>>107988717
[spoiler] i am also an imoutotv chad[/spoiler]
good luck!

>>107988723
that's the default timestep it was set to, so I assumed jaretburkett thinks it's the best for z-image. I will try sigmoid next.
>>
>>107988762
If 20 steps euler with a positive prompt isn't enough then its trash
>>
>every model needs to accept my sdxl settings waa
>>
File: 58924375684237.jpg (104 KB, 1000x994)
>>107988202
FUCK LOCAL DIFFUSION, ALL MY NIGGAS USE GLOBAL DIFFUSION
>>
>>107988775
that's my band's name
>>
>>107988739
How many times does it need to be stated the distilled models are better for inference.

God damn thick skulls don't listen.
>>
>>107988653
maybe if your prompt is 4 paragraphs long

>>107988652
Heun doesn't look any better to me

>>107988669
dpmpp_3m_sde_gpu and beta scheduler is the best combo i've tested so far I think but it's hard to tell. I think I still need to stabilize my prompt some more.

>>107988705
You had literally all of latent space to explore.
>>
File: z-image-fp_00075_.png (2.53 MB, 1096x1608)
z-image won
>>
>>107988773
this but unironically
>>
File: Zimage_base__00077_.png (1.79 MB, 1024x1024)
>>
I wonder if google is going to push for a quick release of nano banana pro 2 or something now that base is out, results are about the same but much cheaper even via api
>>
>>107988501
mama im comin home
>>
File: basedpepe.png (224 KB, 521x937)
>>107988780
your band doesn't have a bass guitarist, it has a based guitarist
>>
>>107988299
how do you feel about your former associate dedicating his life to FUDding you and having petty squabbles on an anonymous message board?
>>
>>107988809
The base model is not out, this is still a minified model with no editing capabilities. The only use of this model is for making slightly better loras
>>
>>107988819
prompt?
>>
>>107988819
bazinga
>>
>>107988633
This is what I told another anon in the last thread but nobody listens.
The SFT "fixes" the small things that are actually sovl. That's why you need the base model, not the SFT one.
>>
>>107988833
>"Tell the guy who said 'prompt?' that he's a faggot"
>>
File: 0859181266.png (2.13 MB, 816x1472)
>>107988819
>>
Can Z-image base make a believable fake "femanon timestamp"?
Pic related was genned with chromaHD for a "fuck the UK ID" contest on the unstable diffusion discord server
It looks like shit but the guy claimed he fooled other discord servers and the r/smallbreastsGW mods
And while I believe neither of those groups is particularly smart, I'm also wondering, can Z base do better?
>>
File: Zimage_base__00080_.png (2.03 MB, 1024x1024)
>>
>>107988792
Yes, was there ever any doubt ?
>>
>>107988809
Kek what? Klein is what Google would be worried about. Have you even tested the models? Image quality wise, only Klein has caught up, and almost entirely in knowledge as well.
>>
>>107988857
>that hand
>>
>>107988876
if that was true the thread would be talking about it, you lost
>>
>>107988886
just as in the source image, perfection
>>
File: DAT.jpg (30 KB, 594x334)
>>107988886
>>
File: 1745595486150370.mp4 (2.76 MB, 392x220)
>>
>>107988876
Some people on these threads are either shills or they have no clue what SOTA API actually looks like. Like if you've tested it, you know Z is shit.
>>
>>107988896
holy shit.. great material for a horror movie there
>>
>>107988864
zit and klein can
>>
>>107988219
sup
>>
>>107988897
Z models are license friendly and not cucked to shit. People will use it over Klein no matter how hard you shill.
>>
>>107988897
shills? .. as in getting paid to hawk free models?

lmao you're fucking retarded and should probably go touch grass
>>
>>107988914
I'm married with 2 kids, a stable job and a mortgage. You?
>>
>>107988864
why, you thinking about getting approved to post on r/gonewild ?

>you have to go back
>>
>>107988888
Thread is infested with shills.
>>
>>107988922
triggered big boi

"i feel like im successful, therefore I'm correct, how about you?"
>>
>>107988896
I'm gonna show this to my kindergartners.
>>
>>107988914
>being this new
this is 4chan, where everyone you don't like is a shill
>>
>>107988930
You lost.
>>
>>107988933
you should.. seeing some shit like that as a kid would have fucked me up
>>
>>107988922
proof?
>>
>>107988896
ngl when it started vomiting out fried chicken i got hungry
>>
>>107988876
>Klein has caught up, and almost entirely in knowledge as well.
LOL
>>
File: 1740649275754984.png (267 KB, 727x525)
comfyui rekting normies again by silently failing if sage is enabled lmao, cant believe this is still happening
>>
>>107988947
he's a kiddy diddler on 4chan.. dont ask for proof

>>107988896 If this was cleaned up and whatnot it would be pretty awesome for a movie idea
>>
>>107988832
I have one in training, by the looks of it it's going to be exactly the same quality as a version I did on Turbo already with Ostris V2 adapter.
>>
File: z_imageBASEd_00175_.jpg (323 KB, 1016x1152)
>>
File: Zimage_base__00083_.png (1.77 MB, 1024x1024)
>>
>>107988767
>good luck!
I got em all as well. And a few loras for 1.5 and SDXL, trained along the years.
>>
>>107988956
1. you shouldn't have sage globally enabled because it breaks other models like qwen.
2. you would get a black image if sage didn't work, not that.
3. sage is not enabled in the default z-image workflow.
>>
ZIB is the least slopped/most SOVLFVL official model drop we've had in a long time. In terms of style capabilities, I need to test it more, but it might actually be stronger than chroma. And the anatomy and geometry isn't fucked up either.

If your goal is anything but photorealistic 1girl standing, this is a huge upgrade over ZIT and FKB. China delivered.
>>
>>107988927
>Tripfag
>Instantly thinks of porn
Pretty on character, keep going
>>
>>107988912
I don't think so. It's like comparing SDXL to Flux. The difference is too great to be overlooked by tuners.
>>
>>107988886
base is crap, zit is crap.

Flux4Lyfe
>>
File: Zimage_base__00084_.png (1.73 MB, 1024x1024)
>>
>>107988980
porn is for faggots
>>
>>107988986
>It's like comparing SDXL to Flux
No it's not. First off, Klein 9B is an edit model. You're comparing it to Z-Image, which is just a base model. Z-Image-Edit isn't out yet to even make such claims. You have no idea what you're talking about. Fuck off.
>>
Haven't updated my filter in a while
>>
>>107988976
>Posts a lame image you can probably do better with Klein

I warned people this would happen a couple of threads ago. The shills are out in full force.
>>
>>107989006
>Filtering anything ever
What a fucking pussy
>>
>>107988967
I like the chroma version better desu
>>
>>107988967
catbox? I'm only getting SD1.5 quality out of base
>>
>>107989018
I guess you've never seen a fluxgirl gen...
>>
i don't like that second pass loses saturation
it's over z-image lost
>>
>>107988976
lora shitmix
>>
File: Zimage_base__00090_.png (993 KB, 768x768)
>>
>>107989001
>First off, Klein 9B is an edit mode
No it is not. It can do both. It comes in 2 varieties, 4B/9B. Aside from being bundled with both (making it inherently superior to separate models) it is technically much better due to a better VAE which captures more details in photographs. It also comes with base and distilled models, so it's just as easy if not easier to train than Z.
>>
>>107988973
>1. you shouldn't have sage globally enabled because it breaks other models like qwen.
it works with most models aside from qwen basically, and saves a lot of time, its easiest to have it enabled globally with a dev blacklist for unsupported models instead of it silently failing
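
A rough sketch of the "enable globally, blacklist known-bad models, fail loudly" idea. Everything here is hypothetical (the function name, the denylist entry, the backend strings) and none of it is ComfyUI's or SageAttention's actual API; which models really break is also an assumption.

```python
import warnings

# Hypothetical denylist; the entry below is a placeholder, not a tested fact.
SAGE_DENYLIST = {"example-unsupported-model"}

def pick_attention_backend(model_name, sage_requested):
    """Return the attention backend to use, warning instead of failing silently."""
    if not sage_requested:
        return "default"
    if model_name in SAGE_DENYLIST:
        warnings.warn(
            f"sage attention is not supported for {model_name}; using default attention"
        )
        return "default"
    return "sage"
```

The point of the design is that the fallback is visible in the log, so a bad gen can be traced to the attention backend instead of being blamed on the model.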
>>
>>107989042
nostril
>>
>>107988973
>>107989058
also sage does work with qwen as of latest
>>
File: F2Kb__00019_.png (2.06 MB, 1024x1024)
>>107988990
It's so transparently, again, a shitmix.

>>107989035
it's a shitmix. I'm downloading "base". I can tell you it won't be able to do "creative" inputs. People only kind of notice, they don't realize actually z is not really refusing, it's not even that it's not sufficiently trained, it's that it can mostly just do whatever loras it has.

Here's Klein 9B base. This is a forced creativity gen, so instead of an LLM to come up with something, I use my "pick" subgraph to basically do like a pack of cards for nouns, adjectives, and verbs. I plan on making a much better system at some point.

THIS is how you test diffusion, because what's going on is the alien races are corrupt, they cheat tests. But they can't cheat random sampled points of the gamut of possible gens, which is what I'm forcing here!
>>
>open comfyui z base template
>defaults to res_multistep/simple
thoughts?
>>
>>107988967
JuggernautXL
>>
>>107989075
res multistep always has the worst graininess artifacts, cant imagine how thats default
>>
>>107989071
schizo babble
>>
This namefag talks big but has no idea what he's saying
I shouldn't be surprised
>>
>>107989079
> finetune vs base
retard
>>
so will they ever update sageattention for qwen/z image?
>>
File: 118945654.jpg (240 KB, 832x1216)
>>107988976
JuggernautXL
>>
The true test of Z-image will be whether or not NSFW loras actually work correctly in Turbo. If they're highly flexible and don't change the subject/slop the results, we'll be cooking.
>>
i masturbate quite a lot
>>
File: Flux2-Klein_00406_.png (1.15 MB, 1280x720)
>>
File: Zimage_base__00095_.png (1.14 MB, 768x768)
>>107989071
did you take your meds?
>>
>>107989075
>>107989081
i agree. res samplers are too grainy. euler/simple honestly just werks.
>>
>>107989096
>2023 finetune 3.5b model 30sec/gen vs 2026 base model 6b 300 sec/gen
>>
File: Flux2-Klein_00945_.png (1.14 MB, 1024x1024)
>>
>>107989113
+1 This.
Edit: Thank you for the gold, Kind stranger!
>>
File: 118940918.jpg (1.04 MB, 2803x4096)
>>107989042
JuggernautXL
>>
>>107989115
2026 > 2023
>>
I'm assuming that ZiB does not require CumUI update.
>>
File: 00052-3199127437.png (1.15 MB, 896x1152)
cyberrealistic pony v8
>>
>>107989016
well anon, if klein is better people will naturally start using it more eventually. if z becomes the main model for realism image gen, then that means it was better. not much else to it.
>>
>>107989138
tracer got fat
>>
File: 118940910.jpg (2.11 MB, 4096x2803)
>>107989071
JuggernautXL
>>
>>107989138
yeah, we can tell. SDXL realism checkpoints always have the same looking face/texture. it looks very fake.
>>
>>107989139
This.
>>
File: Flux2-Klein_00421_.png (825 KB, 832x640)
>>
File: 118940912.jpg (994 KB, 4096x2803)
>>107988241
JuggernautXL
>>
>>107989151
that game sucked so hard.. not sure why anyone liked it
>>
Sad that the latest Qwen is too big to train & fine-tune, because it kills all other models for realism and text.
>>
>>107989101
Base loras don't work correctly in Turbo.
>>
There's already a Z-Image Base section on CivitAI, guess they realized it's the future of local training.
>>
>>107988905
I don't know about that, the text on those always come out a little too perfect, I doubt it could sell the illusion
>>
>>107989145
>same looking face
i doubt you've already seen that face in other SDXL pics
>>
File: 119008824.jpg (1.37 MB, 2803x4096)
CyberRealistic Pony v14
>>
Do you have to update to get base to work?
>>
File: Flux2-Klein_00429_.png (1.57 MB, 1024x1024)
>>107989155
That was and still very much is kinosoul.
>>
>>107989190
how horrifying
>>
>>107989190
Now prompt the same picture from different angles.
Yeah, that's what I thought.
>>
updooting. wish me
>>
File: Comparison.jpg (3.15 MB, 3072x1536)
>>
>>107989228
brutal mogging.
>>
>>107989228
oh yeah prompt was
```professional DSLR photograph of the iconic character Princess Peach from the Super Mario video game series standing on top of a Capybara which is in turn standing on top of a wooden stool. Princess Peach is holding a sign that reads "IN THE LAND OF OZ, WHERE THE WOMEN WEAR NO BRAS, / AND THE MEN DON'T CARE, 'CAUSE THEY WEAR NO UNDERWEAR"```

28 steps gen / 28 steps high res for both, with Euler A Normal @ CFG 5
>>
File: z-image-upscaled_00010_.png (2.33 MB, 1096x1608)
>>
>>107989145
And ZiT face isn't the same Chinese woman?
Localkeks don't have enough humility to try new SDXL checkpoints that match the most bloated local models out there.
I bet localkeks filter CivitAI models by GPU requirements, don't they?
>>107989110
JuggernautXL
>>
File: z-image_00002_.png (2.07 MB, 1024x1024)
>>107989228
You need to be creative forcing.

I am currently using just a subgraph to do this. Basic idea:

You have a literal deck of cards. 78 tarot cards, each one is an animal. NOUNS.

You have another tarot deck. It's 78 animal VERBS.

you pick 2 animal cards and 1 verb card.
animal1 verb1 animal2

and on and on.

Have a deck of cards for adjectives. Like I have 78 colors for the animal.

78 different high fantasy characters (generic). and 78 actions, and 78 style types.

so you wind up with say a metal age wood elf enchanting.

I haven't added objects yet.
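
The deck idea above can be sketched in a few lines of Python. This is only an illustration under assumptions: the decks are tiny stand-ins (the post describes 78 cards each), the words are borrowed from the example prompt in this thread, and `draw_prompt` is a made-up name, not the actual subgraph.

```python
import random

# Tiny stand-in decks; the post describes 78 cards per deck.
ANIMALS = ["bat", "sloth", "alligator", "turtle"]
VERBS = ["dives", "paddles", "burrows"]
COLORS = ["yellow", "silver", "rust"]

def draw_prompt(rng):
    """Draw two different animals, one verb, and a color for each animal."""
    animal1, animal2 = rng.sample(ANIMALS, 2)  # two distinct cards from the noun deck
    verb = rng.choice(VERBS)
    color1, color2 = rng.choice(COLORS), rng.choice(COLORS)
    return f"{color1} {animal1} {verb} {color2} {animal2}"

rng = random.Random(0)  # fixed seed so a given draw can be reproduced
print(draw_prompt(rng))
```

Seeding the generator means the same "hand" can be dealt to two different models, which is what makes the side-by-side comparison fair.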

We can END the grift of the shitmixers.

picrel is zbase. versus the same promptu hero:
>>107989071
>>
>>107989139
You do realize it's not subjective improvements, but an objectively better architecture with Klein? It doesn't matter what techlets use, but rather what an educated would-be finetuner would use.
>>
File: 118864974.png (1.1 MB, 832x1216)
>>107989257
>>107989260
>>
File: Flux2-Klein_00439_.png (1.49 MB, 1024x1024)
>>
>>107989260
this is the promptu that broke it, just now. FIRST ONE I TRIED:
a photograph. bleach washed kodachrome. the location is silver-thread storm-peak , in the left: bioluminescent the-fate-spinner

possessing , in the middle: arcane the-guardian-beast diminishing , in the right: feathered reflecting the-sorcerer warping , a woman running amidst yellow colorful bat dives dust webbed sloth , while sloth paddles silver armored alligator , while alligator burrows rust energetic turtle

You can understandably "fix" it with an llm, and allegedly zbase tries to fix it, but the fact is that it shouldn't collapse into trash just because it receives a highly unusuaru promptu.
>>
People are still in the bargaining phase with Z-image.
It's pretty clear to me that their distillation techniques were doing most of the heavy lifting. The base is simply nothing special and they probably knew this and tried to fix it.
>>
>>107989160
proof?
>>
File: 118838380.jpg (248 KB, 832x1216)
JuggernautXL released in May 2025, but I guess localfags were too busy writing purple prose for failed Chroma versions, weren't they?
>>
File: 1751886329373570.jpg (63 KB, 960x720)
>>107989289
>be brown
>be low iq
>dont know what a base model is
>never trained a lora on it
>"It's pretty clear to me that..."
>>
>>107989260
zbase with llm "fixing" the prompt to:
>A photograph in bleach-washed Kodachrome style, set at Silver-Thread Storm-Peak. On the left, a bioluminescent entity called The-Fate-Spinner glows with ethereal light. In the center, an arcane Guardian-Beast fades into mist, its form diminishing. On the right, a feathered Sorcerer reflects energy while warping space. A woman runs through the scene, surrounded by yellow bat-like creatures diving through dust. Nearby, a sloth rides a silver-armored alligator, which burrows into the ground beneath a rust-colored, energetic turtle.

it's genning. lulz
>>
File: Flux2-Klein_00442_.png (1.32 MB, 1024x1024)
>>
File: 118835599.jpg (291 KB, 1280x1856)
>>107989204
JuggernautXL
>>
File: 1768696485992385.jpg (128 KB, 1300x1150)
>>107989308
time to share your lora
>>
File: z-image_00003_.png (1.61 MB, 1024x1024)
>>107989289
It's muuuuch worse than that.

As a white man, I am euphoric.

>>107989307
>>107989071
I will also give the enhancer prompt to Klein 9b base
>>
>>107989306
I'm afraid we're past the denial and anger phase, anon. Now we're at bargaining.

>Finetuners will fix it.
>it can still train LoRAs for the distill
>It can be improved
>>
>>107989318
>still doesnt know what a base model is
I'm afraid you're still brown, jamal.
>>
>>107989228
> nooo it's for training you retar!1
>>
File: 118835598.jpg (323 KB, 1280x1856)
>>107989204
JuggernautXL
>>
>>107989322
Hmm well, I don't know. It's just like well.... hmmm. Why is flux klein's base model so much better then. It's just really puzzling to me. Yeah? Hmm.
>>
can anons share their z-image loras running on zit?
>>
Where are all the licensefags? Z is way more permissive than Klein regardless of what you believe about the capabilities of either model. You really think BFL wants coom in their model? China included celebs they don't care
>>
can someone recommend me a lora stacker that works? rgthree used to work but some recent update bricked it so now it makes my gens take 50 gazillion years
>>
Why can't you niggers just be fucking normal and enjoy things
>>
>>107989332
Why can't they or lodestone fix hands? It's mysterious.
>>
>>107989281

Why are you prompting schizobabble?
>>
Sdxl girls are cute desu
>>
>>107989359
Pushing boundaries reveals information about models.
>>
>>107989338
you're not making any sense in the least. yea, the license is regular open source.
>>
>>107989332
>>107989313
>>107989301
Slop
>>
>SD3 IS JUST A BASE MODEL, FINETUNES WILL FIX IT!!!
>>
>>107989239
if that's the prompt, then zimage won, while klein added a bunch of extra crap
>>
File: z-image-upscaled_00012_.png (2.47 MB, 1096x1608)
>>
Z is the future, kleinfags coping and roping
>>
>>107989366
It's true. Shame about the hands. we gotta fix this hand and feet situation lol like what's the problem?

>>107989381
slop is pols backwards.
>>
>>107989289
Retarded or trolling ?

Turbo was made to create very aesthetic images out of the box

Z-Image was made to be great to train on, for lora and finetuning, with as little aesthetic bias as possible while retaining great overall quality, it will not be as aesthetically selective when generating, the aesthetics are what you will decide when you train
>>
>>107989397
bargaining.

I'll let you sort the depression phase out and then we can talk again when you've reached acceptance.
>>
File: F2Kb__00021_.png (1.92 MB, 1024x1024)
>>107989317
>>107989307
now using the llm enhanced edition on Klein 9b base


>>107989389
>>107989392
^ not white people.

z image base is the best model ever - if you aren't white!
>>
>>107989381
*Easy, quick, affordable and deterministic slop. Not your Auraflow, Clownsampler, ResAlife, Kijai, Nunchaku snake oil
>>
>>107988512
Does "Edit" mean inpainting?


