/g/ - Technology



File: gas3f8.png (2.08 MB, 2048x2048)
Z-Image Base edition again

Discussion of Free and Open Source Diffusion Models

Prev: >>107985210

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Based thread of Zenship
>>
lol i forgot the title
>>
can we extract the difference between turbo and base to make a turbo lora for base?
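>>
>>107987027
in principle yes. rough idea, not anyone's official method: subtract the base weights from the turbo weights layer by layer, then keep only the top singular directions of that difference as the lora. minimal numpy sketch below, assuming both checkpoints share layer shapes. `extract_lora`, the toy matrices and the rank are all made up for illustration, and real extraction tools may do this differently.

```python
import numpy as np

def extract_lora(w_base, w_tuned, rank):
    """Low-rank approximation of (w_tuned - w_base) via truncated SVD."""
    delta = w_tuned - w_base
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    # Split each singular value evenly between the two factors
    sqrt_s = np.sqrt(s[:rank])
    lora_up = u[:, :rank] * sqrt_s            # shape (out, rank)
    lora_down = sqrt_s[:, None] * vt[:rank]   # shape (rank, in)
    return lora_up, lora_down

rng = np.random.default_rng(0)
w_base = rng.standard_normal((64, 32))
# Toy stand-in for "turbo" weights: base plus an exactly rank-4 update
true_delta = rng.standard_normal((64, 4)) @ rng.standard_normal((4, 32))
w_turbo = w_base + true_delta

up, down = extract_lora(w_base, w_turbo, rank=4)
rel_err = np.linalg.norm(up @ down - true_delta) / np.linalg.norm(true_delta)
print(f"relative reconstruction error: {rel_err:.2e}")
```

on real weights the difference won't be exactly low rank, so expect some loss at small ranks.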
>>
File: Zimage_base__00022_.png (1.93 MB, 1024x1024)
>>
File: ComfyUI_06208_.png (1.54 MB, 1024x1024)
is this the Chinese Culture you folx keep talking about?
>>
>>107987000
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
why are these in the OP? it's just schizobabble
>>
wait zimage base is released finally? wow

how long to make one image in a 6600
>>
zzzzzzzzzzzzzzzzzzzz
>>
>>107987023
it happens, i got intimidated by the options while seeing if i could bake and gave up kek
>>
File: Zimage_base__00032_.png (2.14 MB, 1024x1024)
>>107987038
5090 takes 30 seconds
>>
Blessed thread of frenship
>>
>>107987034
kill ani
>>
File: o_00039_.jpg (1.06 MB, 2304x1792)
>>
>>107987043
i baked once and another anon beat me to it by exactly 8 seconds, so there was a duplicated thread and i felt bad. will never bake again.
>>
File: z_imageBASEd_00106_.jpg (588 KB, 1264x1688)
>>107987027
I wouldn't be surprised if they make a new turbo from this version
>>
where is the safetensors file for zimage base? I only see split files on the site
>>
>>107987058
i like it
>>
>>107987048
so about half an hour on my 6600
>>
>>107987038
>6600
probably half an hour, it's already slow on a 4070
>>
>>107987034
literal human garbage
>>
>>107987034
>>107985366
>>107984742
>>107983447
>>
File: <3.png (1.88 MB, 1152x960)
>>
>>107987034
let me guess, "not ani"
>>
>>107987034
>32 stars
>>
>>107987034
squirm, you were already caught red handed
>>
File: z-image_00021_.png (2.3 MB, 1440x1024)
to the anon who recommended 200 steps res_2s, i know you were memeing but i tried it anyway, this image took 5 minutes to generate on a 5090
>>
>>107987034
kys schizo
>>
>>107987065
https://huggingface.co/Comfy-Org/z_image/tree/main/split_files/diffusion_models
>>
>>107987034
>trani
>>
File: z_imageBASEd_00113_.jpg (584 KB, 1264x1688)
>>
>>107987127
that's actually really good though, but it's better to wait until there is a "turbo" lora. i had similar results with XL and 200 steps, but then the PCR lora for XL with 32 steps had almost the same result.
>>
How good is Z-Image-i2L for anime artstyles?
>>
>>107987034
no wonder why comfy didn't want to work with you
>>
So can it generate copyrighted stuff?
>>
>>107987034
go work on your vibe coded wrapper
>>
>>107987099
I can't believe this story you are telling me, it's ~mahcaaab~
>>107987156
>>107987172
It apparently knows Hazbin Hotel
>>
Base always puts my 1girls into ruined decrepit buildings, maybe they are supposed to be eastern European
>>
can this idiot just kill himself already?
>>
File: 1552080261076.jpg (29 KB, 400x400)
>ai toolkit needs diffusers and can't take single file
Why is this still a thing in 2026?? JUST TAKE THE FILE
>>
File: z_imageBASEd_00115_.jpg (652 KB, 1264x1688)
K Perry

>>107987186
that looks like Bangladesh
>>
>>107987034
lolcow
>>
>>107987197
funnily enough, i'm kind of a metalhead but went to see her live a few months ago just for the memes, and she used AI slop backdrop videos for her whole show, it made it look cheap
also i think i was the only straight dude there no joke
>>
damn those gen times are horrid but at least the results are coherent enough to justify it somewhat
>>
You guys do realize that this isn't the true base model right? It's the SFT one.
>>
File: o_00042_.jpg (1.17 MB, 2304x1792)
>>
>>107987034
kys
>>
>>107987221
yeah but the true base probably has even worse output, and which one of us has the money to do our own SFT?
>>
Arguing in favor of a model that is just blurred mush compared to one so detailed you can see the pigment on her skin is just silly. Plus one of them is an edit model on top of what it can already do. Both of them still have to be tuned at the end of the day, and only one is worth tuning, regardless of censorship. Unless you want your model to be stuck looking like a blurry mess not representing reality.
>>
>>107987027
Someone is probably already doing this
>>
File: z_imageBASEd_00117_.jpg (795 KB, 1264x1688)
>>107987207
no shame in that, some popstars can put out real cool shows
>>
File: 1746643165921518.png (32 KB, 621x310)
I'm not sure what this node does except double the gen time
>>
File: z_imageBASEd_00118_.jpg (500 KB, 1264x1688)
they didnt cheap out with the celebs
>>
Thanks for the good times, I really enjoyed them and can now put this chapter behind me.
Even though I was mean to you, I'm glad you're so happy.
I'll be back in two weeks for the finetunes.
>>
File: 1654119193679.png (422 KB, 600x754)
Is there a way to change the default DL directory of hf cli?
>>
>>107987235
Did you just have a stroke, anon?
>>
File: z_imageBASEd_00119_.jpg (796 KB, 1264x1688)
>>107987260
windows has environment variables
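>>
>>107987260
to spell it out: as far as i know the hub cache follows the HF_HOME environment variable, and downloads land in $HF_HOME/hub unless HF_HUB_CACHE is set. the sketch below only builds the environment and shows where files should end up. the path is hypothetical and the helper names are mine, not part of huggingface_hub.

```python
import os
from pathlib import Path

def hf_env(cache_root):
    """Return a copy of the environment with HF_HOME pointing at cache_root."""
    env = dict(os.environ)
    env["HF_HOME"] = str(Path(cache_root))
    # An explicit HF_HUB_CACHE would take priority over HF_HOME, so drop it
    env.pop("HF_HUB_CACHE", None)
    return env

def expected_hub_cache(env):
    """Where downloaded repos should land: HF_HUB_CACHE if set, else $HF_HOME/hub."""
    if "HF_HUB_CACHE" in env:
        return Path(env["HF_HUB_CACHE"])
    return Path(env["HF_HOME"]) / "hub"

env = hf_env("/mnt/models/hf")  # hypothetical target directory
print(expected_hub_cache(env))
```

pass `env` to `subprocess.run(..., env=env)` when calling the cli, or set the variable system-wide. for a one-off download, `--local-dir` on the download command should also work.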
>>
>>107987257
if you scrape the internet the dataset is bound to have lots of celebs over-represented, you'd have to go out of your way to filter them out like the western models to make it not know them
>>
>>107987240
>>107987257
>>107987270
>same fake celebs like for zit
waiting weeks for that...
>>
>>107987232
That's literally the point, and you don't need millions of images / cash to do SFT.
>>
is ran ok?
>>
>>107987285
Who is Ran? Is that your bull?
>>
>>107987235
Stop trolling them. Z bros suffered for a long time and finally won. Let them enjoy their success.

(Who would have thought that I could say the same thing in /g and /pol at the same time and make sense?)
>>
What's the safe way to update Ostris AI-toolkit? Last time I updated I did a git pull and it completely broke my installation.
>>
>>107987300
just get the easy install
>>
uh oh. melty!
>>
File: z_imageBASEd_00120_.jpg (566 KB, 1264x1688)
>>107987272
nice change from using klein
>>
WOAH MAMA
>>
File: z_imageBASEd_00122_.jpg (479 KB, 1264x1688)
>>
>Julien
>>
File: o_00043_.jpg (1.13 MB, 1792x2304)
>>
>>107987027
Yes but it has a 100% chance of not being as good as just using Turbo
>>
>>107987034
pedo
>>
File: image.png (1.24 MB, 1024x1024)
>widowmaker from overwatch according to base
>>
File: 544546546545645.png (72 KB, 1259x332)
ACEStep 1.5 is almost here

https://ace-step.github.io/ace-step-v1.5.github.io/

No idea why some of the samples are so bad and sound slopped, especially those first ones, I guess Chinese devs aren't really good at judging output quality. I'm still hopeful thanks to their playground samples that we'll get kino when the model is out.
>>
>>107987272
How do they wind up with NLP captions that state the name of the person properly and reliably, though?
>>
what's with the meltdown today?
>>
>>107987365
am I a KOL? who do I see to get an assessment for KOL status?
>>
>>107987370
unpaid indian hollywood gooners
>>
>>107987371
just ignore the falseflag. hes trying to confuse newfags still by making it seem like someone is genuinely sperging out at the "why are these links in OP" ritual post
>>
>>107987315
>>107987330
>>107987331
1pass or you upscaled?
>>
>>107987371
the "proof" ani is the schizo spammer was a load of shit so schizo has to slide the thread with ani hate so people forget
>>
>>107987365
the quality sounds fine but that first "heavy metal" sample sounds like pop rock
>>
>>107987401
Cate Blanchett one is 1pass
>>
>>107987380
Key opinion leader. Business lingo.
>>
>>107987401
>>107987414
2pass others
>>
>>107987371
this is the same shit every day. troonjak complains about the rentries (as ani) then calls it out with no proof as usual
>>
>>107987417
I thought the dude was a developer, he's a businessman?
>>
>diverse outputs but body horror
>>
File: 1759814444513.png (2.32 MB, 1152x1792)
>>107987349
>a tattoo of Hatsune Miku portrait in "fumo" style
>panties slightly eaten by the butt
These failed. The rest of the prompt is genned correctly, if we forgive it for not knowing how to do a tattoo when the image is 2d manga.
>>
>>107987397
I like how the other two replies to that post just reinforce your post kek ani tries really hard to muddy the waters
>>
>>107987423
yeah I'm sure a random anon spends every second of the day shitting on a no-name developer. that totally makes sense
>>
>>107987448
Does it even know fumo in general?
>>
tl;dr, germs and chinkoids are based?
>>
File: 49765742.png (1.42 MB, 1072x1072)
>>107987193
>>
>>107987469
A good question, for another time.
>>
>>107987471
yeah both models are great in their own ways and we'll be eating good in the future.
>>
>>107987365
The important thing is if it works well with finetuning, I wonder what guys like:

https://www.youtube.com/@Retrosync/videos

https://www.youtube.com/@STILLINTHENIGHT-m9s

use, it doesn't sound like Suno so I assumed they were finetuning something
>>
>>107987471
they are frens for sure
>>
>>107987450
welcome newfriend
>>
File: z_imageBASEd_00129_.jpg (472 KB, 1264x1688)
almost knows Saoirse

>>107987444
thats great
>>
>>107987471
Klein will prolly end up better for art shit and general sfw, but having body horrors both on base and distill is kinda eh. Flux gonna flux.
>>
HOLY FUCK ZIB LOOKS LIKE SHIT WE LOSTED
>>
>>107987498
heun simple cfg4 15 steps
>>
File: z-image_00026_.png (2.52 MB, 1440x1024)
for science, 500 steps res_2s beta
generation time 15 minutes per image on a 5090
not trying this again.
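>>
>500 steps res_2s beta
>15 minutes per image on a 5090
quick back-of-envelope on the numbers posted in this thread, nothing more. the times are rounded anon reports, so treat the per-step figures as approximate. `seconds_per_step` is just a helper name i made up.

```python
def seconds_per_step(total_minutes, steps):
    """Average wall-clock seconds spent on one sampler step."""
    return total_minutes * 60 / steps

# (minutes, steps) as reported in the thread, both on a 5090
reports = {
    "res_2s, 200 steps": (5, 200),
    "res_2s beta, 500 steps": (15, 500),
}
per_step = {name: seconds_per_step(m, s) for name, (m, s) in reports.items()}
for name, value in per_step.items():
    print(f"{name}: {value:.2f} s/step")

# 30 s on a 5090 vs an anon's guess of half an hour on a 6600
slowdown = (30 * 60) / 30
print(f"implied 6600 slowdown: {slowdown:.0f}x")
```

so step count scales the wait roughly linearly, which is why 15 to 20 steps is where most anons are settling.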
>>
OH NONONONON LOCAL BROSS THIS IS BAD VERY BAD!
>>
>>107987496
Klein editing is still good, and will keep it around at least until Z-Image Edit is out

But for t2i, Klein is already dead, same for large scale finetunes
>>
i dont see res_2s
>>
>>107987504
can't do oppai loli futa scat so it sucks
>>
>>107987524
install res4lyf nodes
>>
>>107987471
more than SAI mutts for sure
>>
>>107987524
search for res4lyf in nodes in manager and get that. they are external samplers
>>
CAN YOU HEAR THAT? IT’S THE SOUND OF NEW COMFY CLOUD USERS, SO ASHAMED AND DISAPPOINTED WITH LOCAL MODELS!
>>
ANI WONNED
SCHIZOS WONNED
CATJACK WONNED
BOT WONNED
COMFY CLOUD WONNED

/LDG/ LOSTED
LOCAL LOSTED
>>
all trani's fault btw
>>
>>107987154
oh shiiiii
>>
>>107987529
>>107987533
thank you kindly sars
>>
>>107987521
It's not though, we're literally getting both Klein Chroma and Z Chroma already. Z Base hardly has perfect anatomy also.
>>
are we dealing with one or multiple schizos atm?
>>
>>107987521
Wrong. All I will say is compare these two outputs.

>>107977133
and
>>107987490

Then look at a real photograph of a woman

https://www.pexels.com/photo/portrait-of-woman-19248753/

You will see what problems exist with Z and why it will never replace Klein. I will give you a hint, Z simply isn't realistic, it is lacking in detail, and you can especially see that when you zoom in. This is a technical problem that can't be fixed with finetunes due to Flux.2 having a better VAE.
>>
>>107987557
yeah
>>
>>107987557
one having a mental breakdown
>>
>>107987471
give it some time (was the same when sdxl released) but we are eating very good anon
>>
>>107987557
pretty sure its 1 or 2 people at most lol, there are also people who see the shit happening and jump in for lulz
>>
>>107987565
z-base release triggered some monkey paw shit, made the schizo go completely insane
>>
>>107987561
Have to disagree with that one
>>
schizo based his whole personality on z-image-base never being released and is now facing existential crisis
>>
>>107987557
proof?
>>
>>107987565
his "proof" was a nothingburger as usual and he just snapped
>>
>>107987553
>we're literally getting both Klein Chroma and Z Chroma already
lodestones has said both of these are just experiments which can (and most likely will) end at the drop of a hat. I'm 200% certain that Z-Image Base is what he will make the next long-term Chroma finetune on, likely paired with the Flux2 VAE.
>>
>>107987575
He's priced out and lonely. This retardation points to the OG schizo.
>>
>>107987561
This doesn't just apply to closeups. The same applies to body shots, holding objects etc... You can see so much texture on her skin, individual hair fibers etc... This would allow not just for real photographic LoRAs, but actual photoreal porn for the first time from an image model. Previously only NBP had any resemblance to real photographs, and now Klein surpasses it. A hypothetical Z edit would not be as good as Klein.
>>
Gen times are crazy but I really like the base so far otherwise
>>
>>107987592
ZiB has flux1 vae tho.
>>
File: 1744994362140310.png (55 KB, 168x168)
>>107987510
>>
>>107987603
rotate 180 degrees. enhance.
>>
>>107987603
>thread schizo watching others having fun and enjoying life
>>
File: watcher5.jpg (17 KB, 289x166)
>>107987603
I love watchers.
>>
>>107987601
And yet it is better than Klein, both in anatomy and overall realism.

In other words, the VAE is not the issue.
>>
Too much body horror. Base seems censored.
>>
>>107987561
>I like this shitty looking style and klein can only do this shitty looking style, so klein is better
..yeah...no...
>>
Cate is based
>>
four months of hype for this?? holly sloppa
>>
Has anyone tried their z base lora yet? 2500/3000 here on a 3090.
>>
>>107987592
vae meme again, this shit is as stupid as the morons that go on about abliterated TE lmao
>>
File: kt_A-s5vMQ6L-_sUjNUCG.jpg (1.58 MB, 4400x1356)
>>107987618
>>
Base is out?
really?
>>
catjak needs to stop and get help. it's been years of this psychosis. you just need to accept there are people more talented than you
>>
>muh boogeyman
>>
>>107987636
Are you training on ai toolkit? Is it possible to train locally? Never trained anything in my life but I want to give it a try.
>>
>Julien
>>
>>107987645
yeah it is a new age of trying out settings, gennings, epic gen times, and schizo meltdowns
>>
do wez got onetrainer support for base yet
>>
File: o_00047_.jpg (898 KB, 2560x1536)
>>
>>107987641
if we here why Z Image Turbo already exist, I am confus
>>
>>107987664
euler 20 steps wins again
>>
IS THIS THE CHINESE CULTURE??
>>
File: 1763730015125731.png (4 KB, 787x53)
>>107987665
i literally just did this and it runs
no idea about the results yet lul
>>
>>107987660
>Are you training on ai toolkit?
yes

>Is it possible to train locally?
of course

>Never trained anything in my life but I want to give it a try.
grab 100~200 images of something you want, describe them(you can use ai to automatically describe them too) and then use https://github.com/ostris/ai-toolkit to make it. im using the default template for z-image and so far the samples are looking very promising.

i just need to see how well it performs in z-image-turbo.
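>>
>grab 100~200 images of something you want, describe them
before you start a run it's worth checking that every image actually has a caption. sketch below assumes the trainer expects a .txt file with the same basename next to each image, which is my understanding of the usual ai-toolkit dataset layout. the extension list and the `missing_captions` helper are my own.

```python
from pathlib import Path
import tempfile

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}

def missing_captions(dataset_dir):
    """Return image files that have no same-named .txt caption next to them."""
    root = Path(dataset_dir)
    images = sorted(p for p in root.iterdir() if p.suffix.lower() in IMAGE_EXTS)
    return [p for p in images if not p.with_suffix(".txt").exists()]

# Tiny self-check on a throwaway folder
with tempfile.TemporaryDirectory() as tmp:
    tmp = Path(tmp)
    (tmp / "001.png").touch()
    (tmp / "001.txt").write_text("a photo of a cat on a sofa")
    (tmp / "002.jpg").touch()  # no caption on purpose
    demo_missing = [p.name for p in missing_captions(tmp)]

print(demo_missing)
```

point `missing_captions` at your real dataset folder and caption whatever it lists before training.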
>>
Where were you when troongy labs oversold and underdelivered?
>>
>>107987616
>the VAE is not the issue.

If you're okay with your outputs looking like mush/slop. Who cares that it's better at anatomy? It can't not be more realistic. See >>107987561
What good is something that looks further from ground truth? Plus Klein is actually easier to work with because it's 4B and it has Edit capabilities, the bulk of what BFL worked on probably and why it still shits on Base even with all the perceived "anatomy improvements" to it. If both models are tuned with large 5M uncensored datasets, only one of them comes out on top, and that is clearly Klein due to its technical superiority, that is what you anons need to get thru your thick skull.
>>
>>107987665
It *seems* to be working without any changes if you use BF16, int8 seems like it needs some fixing
>>
>>107987683
AMBATUTRAAAAAAAAAAAAAAAAIN
>>
>>107987690
>If you're okay with your outputs looking like mush/slop.
But it doesn't, Z-Image Turbo is better quality than Klein, it's not slopped, AND it does correct anatomy.
>>
>>107987689
that is every AI company
>>
>>107987690
And here's something that all AI researchers can agree on. A minor hand issue is generally preferred over the entire image being low quality and looking slopped. One can be fixed, the other outputs should be discarded. Now imagine if they based their research solely on whether it outputs accurate anatomy. Total disaster.
>>
File: z-image_00034_edit.png (1.67 MB, 832x1216)
>>107987504
cfg 4 seems high
>>
>>107987700
turbo is boring as fuck every gen looks the exact same, professional studio quality photo-shoot in same poses same faces same shit the whole model is "good" because it can generate like six images in total with zero variety
>>
man I didnt think it would happen ever, and now its really here.
feels good man.
>>
You won't see real discovery in this arch until non realistic or 3D focused model drops. The real seekers have no interest in realistic shit. I have zero interest in realistic shit so I will stay on standby.
>>
>>107987730
ace is out??
>>
https://rentry.org/ranfaggot
>>
File: z-image_00036_.png (1.83 MB, 832x1216)
>>
>>107987732
I mean to say art and anime focused model drops. I will repeat realistic or 3D slop has zero interest to us.
>>
how come /adt/ is immune to the retarded schizo trolling?
>>
imagine coping about your 15 min 500 step "for science" gens that look like 3d SDXL slop.
>>
>>107987743
Because the two schizos that torment this thread hate that they lost every step of the way. This isn't the dev schizo this is the horned schizo who is of lower IQ and on disability.
>>
>>107987700
If we had the Edit model, you'd see how wrong you are. But we don't have the Edit model, because they know it sucks, hence we can't compare how both models edit a real photograph. However, it's quite obvious to see by looking ahead at their paper and the outputs base is producing.
>>
File: screenshot.1769552008.jpg (69 KB, 515x146)
>>107987743
because nobody posts there. they have 5 day long threads.
>>
>>107987743
Because the schizos are from there genius!
>>
>>107987772
>the bart, the
>>
File: z-image_00037_edit.png (1.74 MB, 832x1216)
so much sovl...
it's over the chinks won
>>
>>107987760
and that's bad because....?
>>
But can it put teenage versions of my favorite celebs in compromising positions tho? The second part is very important.
>>
File: 1740078650422397.png (1.63 MB, 1024x1024)
>>107987751
n*gbo? I thought he was kill
>>
So is base good or what?
>>
OHHH THE GREAT LOCAL DREAM IS DEAD,
>>
>>107987743
the house is empty
>>
>>107987788
we dont know
>>
The schizo was right about one thing: we lost.
>>
>>107987751
>dev schizo
if he isn't participating in schizoposting, is he a schizo at all?

>horned schizo who is of lower IQ and on disability
who?

and I'm guessing ranschizo is the second schizo you are referring to but he never hides it and it's obvious
>>
Z Base doesn't seem to know Hitler
>>
>>107987787
It's been one day of him fucking off because his thread is dead. He has historically done retarded shit like this whenever something new happens to spread FUD. Remember when he was spamming sora gens until that was shot?
He does the same low IQ schizo spam and is just having a melty because he thinks he gets a pass because of the other retard shitting his pants. The biggest tell is that his brand of griefing is dumber and is just him throwing shit at a wall hoping it sticks.
>>
See you in two weeks for the finetunes that will somehow make it worse.
>>
File: 3.jpg (172 KB, 2070x441)
my cake is almost done baking..too bad I can't post the results because i'd get banned
>>
File: z_imageBASEd_00140_.jpg (617 KB, 1520x1728)
>>
>>107987821

Catbox example gens.
>>
quantized z-image when
>>
my bad, it does know Hitler after all
>>
hopium tanks? empty.
>>
>>107987835
kill ani


