Z Image Won Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107988202

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
nom nom nom
>>107989768>not even a MB you can make the collage bigger anon
Inb4 during and after schizo apocalypse
>>107989768>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy did you put these ones in the op? the new ones are better because they weren't written by a mentally ill troonoid
>>107989789Kill ani
>>107989789maybe we can include this one instead?https://rentry.org/barbie
Blessed thread of frenship
>>107989768>>107989785
>>107989807Nigger
>>107989784
4chanXT automatically converts to JPG when the image is too large.
>>107989810Yeah rushing at just 405 replies, damnKill ani
Been trying to narrow down how to get crisper gens with LTX-2 without getting the usual cooked look.
Winner settings so far:
>Base model
>Step 1: 8 steps manual sigma, CFG 1, Distilled lora at 0.6, euler
>Step 2: Manual Sigmas CFG 1, Distilled lora at 0.8, res_2s
Things to try:
- Step 1: CFG 4-5 with default negative prompt, 30 steps euler, no distill, step 2 at 0.8 distill, res_2s
- bump generation resolution to 1535 longest side.
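For anyone who wants to A/B the recipe above, here is a minimal sketch that writes the two "winner" passes down as plain Python data. The class and field names (`SamplerPass`, `distill_lora_strength`, and so on) are made up for illustration and are not ComfyUI node parameters; the values are only the ones quoted in the post, and anything the post does not state is left as `None`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SamplerPass:
    """One sampling pass of the two-pass LTX-2 recipe (hypothetical structure)."""
    sampler: str
    cfg: float
    distill_lora_strength: float
    steps: Optional[int] = None  # None means the post did not give a step count
    schedule: str = "manual sigmas"


# Values copied from the "winner settings" in the post, base model assumed.
WINNER = (
    SamplerPass(sampler="euler", cfg=1.0, distill_lora_strength=0.6, steps=8),
    SamplerPass(sampler="res_2s", cfg=1.0, distill_lora_strength=0.8),
)

for index, p in enumerate(WINNER, start=1):
    steps = p.steps if p.steps is not None else "unspecified"
    print(f"Step {index}: {p.sampler}, CFG {p.cfg}, "
          f"distill LoRA {p.distill_lora_strength}, steps {steps}, {p.schedule}")
```

Keeping the passes as data like this makes it easier to log exactly which combination produced which gen when trying the variations listed under "Things to try".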
>>107989806what about blimbo and sticker anon? where is colonscopy rentry?
>>107989820
oh, you can use this
Bakery: https://rentry.org/ldgcollage
it allows you to set the megapixel
>>107989811
>>Klein
>solar panels upside-down
how... lora-like
378/500, the result is incoming.
>>107989848meant *zbase
I tried downing a gguf of z base but it no work in Comfy yet? I also nuked NAG at the same time pls halp
>>107989832Yeah that's how this collage was created but it's easier to let it be converted instead of guessing with that.
So single subject ZIB loras are fine on ZIB but slightly worse on Turbo. There's no benefit whatsoever in training a ZIB lora and using it on ZIT, vs just training on ZIT with Ostris V2 adapter.
You faggots need Jesus and a GIRLFRIEND.
Professional enterprise diffusor here. These models are only interesting if you've never played with Nano Banana Pro before. Once you unlock the power of ComfyUI API Nodes you'll never go back
zbase is crap, but ace step will save China's face
https://ace-step.github.io/ace-step-v1.5.github.io/
>>107989884send me a dickgirl plz
>>107989871
I already got better results on dedistilled than with the adapter before, despite conflicting claims. I learned not to trust statements of random anons ITT, better to try things yourself
>>107989885
SOON
>>107989921>custom advanced snakoil nodeI'm afraid
>>107989921How bout you shut the fuck up and Show shit when it's finished grease golem
>>107989940I'd be soooo ashamed to be asian rn after zbase released.
>>107989931What is she aiming at
>>107989909I'm saying I did what my comment said, my self.
come on, ace step, what's the hold up?
>>107989955the guy posting loli in the previous thread
Been using ComfyUI. It's pretty good, but I have been struggling with image manipulation. Simple shit like changing the color of shirt or changing the background setting for a person standing at a bus stop. That sort of thing. It's a pain in the ass trying to do this in Comfy. Been following guides and trying to get it to work but failing every time. Is there an alternative to comfy that's easier to learn and use? I'm on Linux with AMD btw
>>107989998
If you mean inpainting, try Forge or one of its forks
>>107989998>Been using ComfyUI. It's pretty goodstop lying, faggot
diffsynth style lora in 5 minutesthis is based
zimage edit WHEN
Ok guysI gave you time.What is z image now?Complete:Z image is ...
>>107989998you're using an edit model, right?
>>107990032Nothing special.
>>107990032basically what I expected base to be, not interesting for inference
is something wrong with ZImage?
So how is ZBase vs. ZTurbo in terms of image quality? Given that they claim it only has "high" quality vs. Turbo's "very high".
Klein is great.
ok I downloaded and tested Z image base, anyone know what the optimal settings are for this?
>>107989992ooh perty
>>107990047Body horror problem like Klein.
>>107990053I can't gen anything without artifacts, regardless of sampler+scheduler+steps combo on my 5090
>>107990061sure, type man rm
>>107990053
Non-Turbo has more variety and may give you 2D- or CG-looking artstyles about half the time without being asked, at least in my testing. Turbo tends to give you a more-realistic look more easily.
>>107990029unrefined, just like it says on the box.you niggers expecting a finished product are stupid.it's like buying a giant chunk of marble and getting pissed that it doesn't morph into the statue of david.
>>107990114I'm white.
>>107990109
Dunno if it's true in your case but apparently sage attention affects zimage gens pretty badly.
>still no ramtorchthe quantcuck nightmare continues
>>107990088Not nearly as pronounced as my Klein 4B distilled testing. Sometimes a finger or eyeball can be off.
>>107990116>chunk of marbleahahahahahahahahahahahahHA-HA
z base is kino, it's so good at styles
`a professional DSLR photograph of a busy street in San Francisco.`
>>107990061would breed both
>>107990130kino style
Honestly, flux always was also a shitmix. It's a problem. This is shitmixing, it isn't prompt following, because it's just triggering the "lora". That's why you get zit face and flux face.
>>107990125none of these models have that much of an anatomy problem unless you're using them wrong, it's a pretty dumb way to judge them, none of them are anything remotely like e.g. SD3 (which was just broken)
>>107990130this doesn't reflect who jesus was
>>107990034No I was using stable diffusion 1.5. I am just learning this stuff. I can generate images which are coming out great and it wasn't hard setting that up with the nodes. But trying to change existing images sucks (because I don't know how to do it)
>>107990122thanks, I knew something was wrong with my gens.
>>107990061that's basically what it looks like, IDK what you're expecting really
>>107990152I wish I was this retarded it's probably bliss
>>107990130Sadly, they can stamp stuff. It's not capable of putting a turtle in a dress. it can do a turtle. and a dress with a woman in it... stamp. stamp. stamp. Once you see the stamps you can't unsee it. stamp stamp stamp.
Before going to sleep, I laughed at the Z boys hard, and after waking up, I realize that there are the first signs of contact with reality, but their brains are still in a state of cognitive dissonance, ready to deny reality.And I have to laugh myself half to death again.The prime example of human stupidity caught up in herd mentality. Kek
>>107990155It represents who Jesus is, the destroyer of jewish abominations like non-white immigration.
>>107990172you believe in st.paul heresy
>>107990130aesthetic tuning completely rapes the artistic capabilities of models, its infuriating, base is much superior for this kind of stuff, this is also why sd 1.4/5 was peak kino for artsy stuff, it was trained on raw internet material with no sloppy aesthetic tuning, pure unfettered art.
>>107990166You'll never matter.
>>107990157that's your issue. look into qwen edit. default workflows are available in comfy
SOVL
>>107990175You have no potential to be worthy of my enormous value.
>>107990137klein won
>>107990188>stamps>>107990179^ why are the phones the wrong way?stamps.stamp stamp stamp stamp.banana banana banana banana.
Zisters, how we doin?
>>107990125Klein frequently has missing limbs.
Z won. Again.
>>107990225unmatched kino very nice anon
>>107990235thanks
>>107990032
>>107990130kek
>>107990168> cognitive dissonanceI'm having such a good time
>>107990225Prompt?
>>107990225>image you could do on tons of modelsok
>>107990277
that post is retarded if you read it though, and he's wrong, they DON'T work better on ZIT if they're trained on Base than the same one already trained on ZIT would
Where are the sampler / scheduler grids
>>107990285Stop asian hate
>>107990137>photographambiguous
I think I completely forgot how to negative prompt well after using turbo and kedit I am lost
lora dumpster firesave the asian race from shame, ace step 1.5!
>>107990307I member when women didn't have glued on fingernails.
>>107990125
>>107990153
>>107990221
Example pics. Klein 4B-dist at 768x768 was pretty screwy for me, so I went up to 1024 after a few tries. That fixed the eyes, but still fucked up the horse legs repeatedly.
>>107990288Everything is retarded, every single post, comment and upvote around this topic.That's the funny thing about it.Especially the comments that compare it to SD1.5 and seriously argue how much freedom they have with this model.Really, this is the best model release I've ever had. I just enjoy and delight in being.
>>107990315prompt?
>>107990307it's a logically constructed English sentence that should not result in a photographer, or a DSLR camera by itself, with this kind of model.
>>107990321>Young Greek woman with yellow eyes and dark red hair with bangs and sidelocks, tied in a flat bun behind her head with a dark purple ribbon. Wearing a white sleeveless tunic with a dark purple corset, and a wide dark purple gold-patterned sash over her right shoulder, across her chest, and around her waist. Her arms are bare, with dark purple fingerless gloves covering only her hands. Black thighhigh stockings and pantyhose with garterbelt, black harness straps hanging from her waist, gold armored sandals, and a black choker around her neck. She is glaring fiercely at the viewer and aiming a white bow and arrow, while riding a large black warhorse with a white mane, amber eyes, and black-and-gold faceplate, rearing its front hooves off the ground in a cloud of dust. The horse and rider are in a sunlit canyon under a blue sky. Dynamic close-up action shot from below. Cinematic photo.
>>107990315
9b base, ok?
I did download 4b, but I haven't moved it to my comfy, cuz into why person of non-white appearance?
>>107990320based
>>107990315
>>107990344
if increasing the steps worsens the images (artifacts) you can try res_2s or res_3s. they're 2x and 3x slower respectively, but they tend to help a bit with this kind of stuff
>>107990344when are we going to get a model that can actually draw a bowstring over a face?
>>107990315768x768? Are you generating on a potato, my dear Saar?My first z base gen (hugging space)> A smelly indian sitting in front of its computer. His computer is a potato.Im impressed
>make the girl in source image follow dance from a reference videohow do I do it? best model for this?
>>107990345
I tried 9b-dist once, but it was either really slow or flat-out too big for my hardware to run, so I didn't download 9b base.
Klein.4B
>>107990377
this is flux-2-klein-base-9b-fp8: >>107990378
picrel is zbase. :rolleyes: (yes I know this isn't a forum)
Trying to decide. Delete zbase or no?
>>107990363
iGPU laptop. I can go bigger, but not without kicking the text encoder out of RAM, which slows down prompt testing/iterating.
>>107990388
prompt:
A photograph in bleach-washed Kodachrome style.
A bioluminescent turtle has orange fabric stretched on it and giant black pump heels fitted to its feet awkwardly.
there is a runway event, we see the backs of phones.
****
the only thing I like about zbase is how good it is at generating inhuman monsters with weird little eyes like that. freaky!
Retard here.Could I generate coomer slop on a 3060ti?What are the least amount of steps I could take to get there?I perused some of the links in the OP but I don't know most of these terms.Thanks for any help, I'm literally stupid
>zib gen without sage attention : 360s
>zib gen with sage attention 2++ : 301s
yeah I'd really like the weird artifacts issues to be corrected, the speedup is appreciable
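To put a number on "appreciable", a quick sketch of the arithmetic using only the two timings quoted above. The function name `speedup` is just an illustrative helper, and a single pair of timings says nothing about run-to-run variance.

```python
def speedup(baseline_s: float, optimized_s: float) -> tuple[float, float]:
    """Return (speedup factor, fraction of wall time saved)."""
    factor = baseline_s / optimized_s
    saved = (baseline_s - optimized_s) / baseline_s
    return factor, saved


# Timings from the post: 360s without sage attention, 301s with it.
factor, saved = speedup(360.0, 301.0)
print(f"{factor:.2f}x faster, {saved:.1%} less time per gen")
# prints "1.20x faster, 16.4% less time per gen"
```

So the gain is roughly a sixth of the time per gen, which only matters if the artifact problem goes away.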
>>107990388Please don't take that away from me. Just believe in it for two more weeks.
>>107990344
>rearing
i have a theory that using terms like "rearing its front hooves" isn't clear to the model so it just scrambles the legs. "a horse lifting its front hooves while standing only on its hind legs" is clearer. i've had to do a lot of restructuring prompts to avoid this kind of thing but once the prompt works it works well
>>107990410
Ace step 1.5 is available now (If you are an influencesaar.)
>>107990328Zimage is an ESL model
>>107990378
>>107990388
Z-Image gets the phone camera angles more correct, interestingly. I am impress.
>>107990413
Yes, but it won't be very fast and you'll be limited to quants (these are sort of like "compressed" versions of models). If you have an okay amount of RAM (32GB at least) you can utilize that as well, but VRAM is always going to be faster.
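A rough back-of-envelope sketch of why quants matter on an 8 GB card like the 3060 Ti. Everything here is an assumption for illustration: the 6e9 parameter count is a placeholder (check the model card of whatever you actually download), the bit widths are nominal (real quant formats carry some extra overhead), and the estimate covers the diffusion model weights only, not the text encoder, VAE, or activations.

```python
def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


N_PARAMS = 6e9   # assumed parameter count, not taken from the thread
VRAM_GIB = 8.0   # 3060 Ti

for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    size = weight_gib(N_PARAMS, bits)
    verdict = "fits in VRAM" if size < VRAM_GIB else "needs offloading to system RAM"
    print(f"{label}: ~{size:.1f} GiB of weights, {verdict}")
```

Whatever does not fit gets shuffled between system RAM and VRAM, which is where the "won't be very fast" part comes from.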
is base out?
>>107990473not yet, 2 more weeks
>>107990473yeah
>>107990199>klein wonExcept that's not what SF looks like
What a bummer.Now that z base is a flop, we don't have a base model that would make sense as the next generation of finetunes.Total diffusion death.
>>107990433
saucy gal
>>107990452
yeah. funny how it just threw the shoes in there like uh. done.
>>107990363
Klein's putting it up. Working on seeing if I can prompt the idea of a computer that actually is potato shaped, and I think maybe I need to make it a potato or something that happens to have computer features.
>some seeds are worse than others It's real
>>107990371/g/ is so useless
>>107990483whoa, calm down bro!!!
Z image is such a shitshow, I'm returning to SDXL and ControlNet. You losers can keep pretending it's a finetuner's dream or whatever cope you need, later, retards.
Has ChatGPT always been racist towards Indians too?
base + lora isn't that bad for a first try
>>107990565i got brave's ai to create an entire list of racist quotes that it came up with on its own, using southern vernacularmaking ai misbehave is a lot of fun
>>107990547
the dust is starting to settle aaaandd... ouch.. yeahhh that's gonna leave a mark. looks like z-image flopped massively
Noob Z will be another overfitted trainwreck and Tongy Labs will shit out yet another half assed overfitted finetune like clockwork.
>>107990570man torso
>>107990570jej
>>107990592
>>107990604
iterating is gonna be slow but there's plenty of gains to make
>>107990589Noob Z is not even happening. Nobody believes that shit anymore, especially when they can't even deliver the rest of the z-slop family. It was but another marketing trick to get people to tune in
>>107990610none of this is better than ZIT in any way
>>107990610whats your dataset like
>>107990604
looks like the <think> trick still works
>>107990548
nice
>>107990611This is textbook chinese culture, ame story with llms they sold vapor with DeepSeek, everyone lost their minds, and then every other model release afterward turned into low effort slop.
Any point in still combining NAG with zimage base being able to go cfg>1?
>>107990413
for comfy, follow the setup instructions in their docs
https://docs.comfy.org/installation/manual_install
it's pretty straightforward but you'll have to install a few things and figure out how virtual environments work (think of it as a condom for comfyui)
once you have comfy running (it won't open a window on its own, it'll print a local address that you open in your browser and that's the interface), you can just browse templates and choose the one you want to use (wan2.2 for video for example). you'll have to download the model files (which are huge) into the appropriate folders, but it tells you where to put them. if you don't want to learn how to do these things i guess you can just use the installer, but i've heard that's limited in function
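On the virtual environment part: a minimal sketch for checking whether the Python you are about to install packages into is actually inside a venv (the kind created with `python -m venv`). It relies only on the standard library; the helper name `in_virtualenv` is made up here, and it does not cover conda environments, which work differently.

```python
import sys


def in_virtualenv() -> bool:
    """True when running inside a venv-style virtual environment.

    Inside a venv, sys.prefix points at the environment directory while
    sys.base_prefix still points at the base interpreter installation.
    """
    return sys.prefix != sys.base_prefix


if in_virtualenv():
    print(f"inside a virtual environment: {sys.prefix}")
else:
    print("NOT in a virtual environment, packages would go to the base Python")
```

Running this before installing requirements is a cheap way to avoid dumping everything into the system Python by accident.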
>>107990646
It was 100x slower with NAG. I still had it hooked up when I first tried it and it would've taken 15min for a 1024*1024 gen.
>>107990570i came buckets
>>107990570>isn't that badInsane cope, your dopamine expectations were manipulated to death by Chinese culture.
>ltx2verdict?
>>107990629
><think> trick
what's that? unless you mean llm prompt generation