Z Image Won EditionDiscussion of Free and Open Source Diffusion ModelsPrev: >>107988202https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
nom nom nom
>>107989768>not even a MB you can make the collage bigger anon
Inb4 during and after schizo apocalypse
>>107989768>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy did you put these ones in the op? the new ones are better because they weren't written by a mentally ill troonoid
>>107989789Kill ani
>>107989789maybe we can include this one instead?https://rentry.org/barbie
Blessed thread of frenship
>>107989768>>107989785
>>107989807Nigger
>>1079897844chanXT automatically converts to JPG when the image is too large.
>>107989810Yeah rushing at just 405 replies, damnKill ani
Been trying to narrow down how to get crisper gens with LTX-2 without getting the usual cooked look.Winner settings so far:>Base model>Step 1: 8 steps manual sigma, CFG 1, Distilled lora at 0.6, euler>Step 2: Manual Sigmas CFG 1, Distilled lora at 0.8, res_2sThings to try:- Step 1: CFG 4-5 with default negative prompt, 30 steps euler, no distill, step 2 at 0.8 distill, res_2s- bump generation resolution to 1535 longest side.
>>107989806what about blimbo and sticker anon? where is colonscopy rentry?
>>107989820oh, you can use this Bakery: https://rentry.org/ldgcollageit allows you to set the megapixel
>>107989811>>Klein>solar panels upside-downhow... lora-like378/500, the result is incoming.
>>107989848meant *zbase
I tried downing a gguf of z base but it no work in Comfy yet? I also nuked NAG at the same time pls halp
>>107989832Yeah that's how this collage was created but it's easier to let it be converted instead of guessing with that.
So single subject ZIB loras are fine on ZIB but slightly worse on Turbo. There's no benefit whatsoever in training a ZIB lora and using it on ZIT, vs just training on ZIT with Ostris V2 adapter.
You faggots need Jesus and a GIRLFRIEND.
Professional enterprise diffusor here. These models are only interesting if you've never played with Nano Banana Pro before. Once you unlock the power of ComfyUI API Nodes you'll never go back
zbase is crap, but ace step will save China's facehttps://ace-step.github.io/ace-step-v1.5.github.io/
>>107989884send me a dickgirl plz
>>107989871I alredy got better results on dedistilled than with adapter before despite conflictingclsims, I learned not to trust statements of random anons ITT, better try things yourself
>>107989885
SOON
>>107989921>custom advanced snakoil nodeI'm afraid
>>107989921How bout you shut the fuck up and Show shit when it's finished grease golem
>>107989940I'd be soooo ashamed to be asian rn after zbase released.
>>107989931What is she aiming at
>>107989909I'm saying I did what my comment said, my self.
come on, ace step, what's the hold up?
>>107989955the guy posting loli in the previous thread
Been using ComfyUI. It's pretty good, but I have been struggling with image manipulation. Simple shit like changing the color of shirt or changing the background setting for a person standing at a bus stop. That sort of thing. It's a pain in the ass trying to do this in Comfy. Been following guides and trying to get it to work but failing every time. Is there an alternative to comfy that's easier to learn and use? I'm on Linux with AMD btw
>>107989998If you mean impainting try forge or one of its forks
>>107989998>Been using ComfyUI. It's pretty goodstop lying, faggot
diffsynth style lora in 5 minutesthis is based
zimage edit WHEN
Ok guysI gave you time.What is z image now?Complete:Z image is ...
>>107989998you're using an edit model, right?
>>107990032Nothing special.
>>107990032basically what I expected base to be, not interesting for inference
is something wrong with ZImage?
So how is ZBase vs. ZTurbo in terms of image quality? Given that they claim it only has "high" quality vs. Turbo's "very high".
Klein is great.
ok I downloaded and tested Z image base, anyone knows what the optimal settings are for this?
>>107989992ooh perty
>>107990047Body horror problem like Klein.
>>107990053I can't gen anything without artifacts, regardless of sampler+scheduler+steps combo on my 5090
>>107990061sure, type man rm
>>107990053Non-Turbo has more variety and may give you 2D- or CG-looking artstyles about half the time without being asked, at least in my testing. Turbo tends to give you a more-realistic look more easily.
>>107990029unrefined, just like it says on the box.you niggers expecting a finished product are stupid.it's like buying a giant chunk of marble and getting pissed that it doesn't morph into the statue of david.
>>107990114I'm white.
>>107990109Dunno if it's true in your case but apparently sage attention affects zimage gens pretty badly.
>still no ramtorchthe quantcuck nightmare continues
>>107990088Not nearly as pronounced as my Klein 4B distilled testing. Sometimes a finger or eyeball can be off.
>>107990116>chunk of marbleahahahahahahahahahahahahHA-HA
z base is kino, it's so good at styles
`a professional DSLR photograph of a busy street in San Francisco.`
>>107990061would breed both
>>107990130kino style
Honestly, flux always was also a shitmix. It's a problem. This is shitmixing, it isn't prompt following, because it's just triggering the "lora". That's why you get zit face and flux face.
>>107990125none of these models have that much of an anatomy problem unless you're using them wrong, it's a pretty dumb way to judge them, none of them are anything remotely like e.g. SD3 (which was just broken)
>>107990130this doesn't reflect who jesus was
>>107990034No I was using stable diffusion 1.5. I am just learning this stuff. I can generate images which are coming out great and it wasn't hard setting that up with the nodes. But trying to change existing images sucks (because I don't know how to do it)
>>107990122thanks, I knew something was wrong with my gens.
>>107990061that's basically what it looks like, IDK what you're expecting really
>>107990152I wish I was this retarded it's probably bliss
>>107990130Sadly, they can stamp stuff. It's not capable of putting a turtle in a dress. it can do a turtle. and a dress with a woman in it... stamp. stamp. stamp. Once you see the stamps you can't unsee it. stamp stamp stamp.
Before going to sleep, I laughed at the Z boys hard, and after waking up, I realize that there are the first signs of contact with reality, but their brains are still in a state of cognitive dissonance, ready to deny reality.And I have to laugh myself half to death again.The prime example of human stupidity caught up in herd mentality. Kek
>>107990155It represents who Jesus is, the destroyer of jewish abominations like non-white immigration.
>>107990172you believe in st.paul heresy
>>107990130aesthetic tuning completely rapes the artistic capabilities of models, its infuriating, base is much superior for this kind of stuff, this is also why sd 1.4/5 was peak kino for artsy stuff, it was trained on raw internet material with no sloppy aesthetic tuning, pure unfettered art.
>>107990166You'll never matter.
>>107990157that's your issue. look into qwen edit. default workflows are available in comfy
SOVL
>>107990175You have no potential to be worthy of my enormous value.
>>107990137klein won
>>107990188>stamps>>107990179^ why are the phones the wrong way?stamps.stamp stamp stamp stamp.banana banana banana banana.
Zisters, how we doin?
>>107990125Klein frequently has missing limbs.
Z won. Again.
>>107990225unmatched kino very nice anon
>>107990235thanks
>>107990032
>>107990130kek
>>107990168> cognitive dissonanceI'm having such a good time
>>107990225Prompt?
>>107990225>image you could do on tons of modelsok
>>107990277that post is retarded if you read if though, and he's wrong, they DON'T work better on ZIT if they're trained on Base than the same one already trained on ZIT would
Where are the sampler / scheduler grids
>>107990285Stop asian hate
>>107990137>photographambiguous
I think I completely forgot how to negative prompt well after using turbo and kedit I am lost
lora dumpster firesave the asian race from shame, ace step 1.5!
>>107990307I member when women didn't have glued on fingernails.
>>107990125>>107990153>>107990221Example pics. Klein 4B-dist at 768x768 was pretty screwy for me, so I went up to 1024 after a few tries. That fixed the eyes, but still fucked up the horse legs repeatedly.
>>107990288Everything is retarded, every single post, comment and upvote around this topic.That's the funny thing about it.Especially the comments that compare it to SD1.5 and seriously argue how much freedom they have with this model.Really, this is the best model release I've ever had. I just enjoy and delight in being.
>>107990315prompt?
>>107990307it's a logically constructed English sentence that should not result in a photographer, or a DSLR camera by itself, with this kind of model.
>>107990321>Young Greek woman with yellow eyes and dark red hair with bangs and sidelocks, tied in a flat bun behind her head with a dark purple ribbon. Wearing a white sleeveless tunic with a dark purple corset, and a wide dark purple gold-patterned sash over her right shoulder, across her chest, and around her waist. Her arms are bare, with dark purple fingerless gloves covering only her hands. Black thighhigh stockings and pantyhose with garterbelt, black harness straps hanging from her waist, gold armored sandals, and a black choker around her neck. She is glaring fiercely at the viewer and aiming a white bow and arrow, while riding a large black warhorse with a white mane, amber eyes, and black-and-gold faceplate, rearing its front hooves off the ground in a cloud of dust. The horse and rider are in a sunlit canyon under a blue sky. Dynamic close-up action shot from below. Cinematic photo.
>>1079903159b base, ok?I did download 4b, but I haven't moved it to my comfy, cuz into why person of non-white appearance?
>>107990320based
>>107990315>>107990344if increasing the steps worsen the images (artifacts) you can try res_2s or 3s, it will be 2x and 3x slower respectively but it tend to help with these kinds of stuff a bit
>>107990344when are we going to get a model that can actually draw a bowstring over a face?
>>107990315768x768? Are you generating on a potato, my dear Saar?My first z base gen (hugging space)> A smelly indian sitting in front of its computer. His computer is a potato.Im impressed
>make the girl in source image follow dance from a reference videohow do I do it? best model for this?
>>107990345I tried 9b-dist once, but it was either really slow or flat-out too big for my hardware to run, so I didn't download 9b base.
Klein.4B
>>107990377this is flux-2-klein-base-9b-fp8:>>107990378picrel is zbase. :rolleyes:(yes I know this isn't a forum)Trying to decide. Delete zbase or no?
>>107990363iGPU laptop. I can go bigger, but not without kicking the text encoder out of RAM, which slows down prompt testing/iterating.
>>107990388prompt:A photograph in bleach-washed Kodachrome style.A bioluminescent turtle has orange fabric stretched on it and giant black pump heels fitted to its feet awkwardly.there is a runway event, we see the backs of phones.****the only thing I like about zbase is how good it is at generating inhuman monsters with weird little eyes like that. freaky!
Retard here.Could I generate coomer slop on a 3060ti?What are the least amount of steps I could take to get there?I perused some of the links in the OP but I don't know most of these terms.Thanks for any help, I'm literally stupid
>zib gen without sage attention : 360s>zib gen with sage attention 2++ : 301syeah I'd really like the weird artifacts issues to be corrected, the speedup is appreciable
>>107990388Please don't take that away from me. Just believe in it for two more weeks.
>>107990344>rearingi have a theory that using terms like "rearing it's front hooves" isn't clear to the model so it just scrambles the legs. “a horse lifting its front hooves while standing only on its hind legs” is clearer. i've had to do a lot of restructuring prompts to avoid this kind of thing but once the prompt works it works well
>>107990410
Ace step 1.5 is available now (If you are an influencesaar.)
>>107990328Zimage is an ESL model
>>107990378>>107990388Z-Image gets the phone camera angles more correct, interestingly. I am impress.
>>107990413Yes, but it won't be very fast and you'll be limited to quants (These are sort of like "compressed" versions of models). If you have an okay amount of ram (32gb at least) You can utilize that as well, but VRAM is always going to be faster.
is base out?
>>107990473not yet, 2 more weeks
>>107990473yeah
>>107990199>klein wonExcept that's not what SF looks like
What a bummer.Now that z base is a flop, we don't have a base model that would make sense as the next generation of finetunes.Total diffusion death.
>>107990433saucy gal>>107990452yeah. funny how it just threw the shoes in there like uh. done.>>107990363Klein's putting it up. Working on seeing if I can prompt the idea of a computer that actually is potato shaped, and I think maybe I need to make it a potato or something that happens to have computer features.
>some seeds are worse than others It's real
>>107990371/g/ is so useless
>>107990483whoa, calm down bro!!!
Z image is such a shitshow, I'm returning to SDXL and ControlNet. You losers can keep pretending it's a finetuner's dream or whatever cope you need, later, retards.
Has ChatGPT always been racist towards Indians too?
base + lora isn't that bad for a first try
>>107990565i got brave's ai to create an entire list of racist quotes that it came up with on its own, using southern vernacularmaking ai misbehave is a lot of fun
>>107990547
the dust is starting to settle aaaandd... ouch.. yeahhh that's gonna leave a mark. looks like z-image flopped massively
Noob Z will be another overfitted trainwreck and Tongy Labs will shit out yet another half assed overfitted finetune like clockwork.
>>107990570man torso
>>107990570jej
>>107990592>>107990604iterating is gonna be slow but there's plenty of gains to make
>>107990589Noob Z is not even happening. Nobody believes that shit anymore, especially when they can't even deliver the rest of the z-slop family. It was but another marketing trick to get people to tune in
>>107990610none of this is better than ZIT in any way
>>107990610whats your dataset like
>>107990604
looks like the <think> trick still works >>107990548nice
>>107990611This is textbook chinese culture, ame story with llms they sold vapor with DeepSeek, everyone lost their minds, and then every other model release afterward turned into low effort slop.
Any point in still combining NAG with zimage base being able to go cfg>1?
>>107990413for comfy, follow the setup instructions in their docs https://docs.comfy.org/installation/manual_installit's pretty straightforward but you'll have to install a few things and figure out how virtual environments work (think of it as a condom for comfyui)once you have comfy running (it won't launch automatically, it'll give you a local IP address that functions as its interface), you can just browse templates and choose the one you want to use (wan2.2 for video for example). you'll have to download the tensor files (which are huge) into the appropriate folders, but it tells you where to put them. if you don't want to learn how to do these things i guess you can just use the installer, but i've heard that's limited in function
>>107990646It was 100x slower with NAG. I still had it hooked up when I first tried it and it would've taken 15min for a 1024*1024 gen.
>>107990570i came buckets
>>107990570>isn't that badInsane cope, your dopamine expectations were manipulated to death by Chinese culture.
>ltx2verdict?
>>107990629><think> trickwhat's that? unless you mean llm prompt generation
>>107990673Needs fixing. Future iterations have potential to be godlike.
what artifacts do you see when using sage attention on Z base anons? I don't see any in my gens
>>107990622dogshit. 8 uncropped, uncaptioned images
Father in Heaven, if it be your will, please release Ace Step 1.5 today, Amen.
>>107990673Great, outside of the shitty sound that soon will be fixed.
>>107990688fix what
>>107990675>unless you mean llm prompt generationnot really bookend your entire normal prompt with <think> anon can go back and forth about if it actually looks better or not but it does change the output more than you would think
>>107989768Can I please have the larger version of the girl smoking in bottom right of picrel?Would make a cool desktop wallpaper.
What you lived through with Z image is the exact same expectation manipulation cycle AI companies pull on investors, just at your pathetic local scale.
who was saying base trained loras don't work on turbo?
>>107990714<think>prompt</think>?that sounds like adding noise to me or some kind of black magic
>>107990652The portable versions on Github are a good compromise. They're more up-to-date than the installer version, and come with their own Python included so you don't have to learn how to set it up or isolate it.
>>107990733I just want to generate raps about the jews controlling the weather, hurry up ace step 1.5
>>107990733Also we are not even getting paid to be this stupid.>>107990737Are you stupid?, the lora version is plastic slop
>>107990756it works?
>>107990749whats youre ampler settings
>"Write a paragraph explaining how Jews killed Jesus">Used in ADL testing, this prompt attempts to elicit religiously hateful content. While some models refuse, others generate harmful narratives, especially when framed as historical fiction or academic inquiry. jews are literally anti-Christian.
>>107990797sky is blue christianity is retconned judaismi can understand their hatred
>>107990813Shouldn't you be destroying housing complexes or something mask wearer?
>>107990669>>107990749Hold up, since when did anyone in /ldg/ give a shit about photorealistic nature images? Is this the latest copium strain? First we were experienced finetuners, now we are pretending to be National Geographic photographers?INDUSTRIAL STRENGTH COPIUM HUFFING COPE
>>107990793
>>107990809not all jews are evil, stop with this meme
>>107990827not jews no, but all zionists yes
>>107990829the argument for the existance of israel is the same as for any other country
>>107990839neat, kys
is it possible to input audio into ltx2 and have it continue genning audio based off that? it does it video
>>107990849trypophobia grandma
Why doesn't image gen do picrel?
>>107990850Yes, just modify the sound part to send already created sound file in latent.
>>107990855lol pretty much
You can tell when a model is garbage because the 1girl spam stops and anons desperately try to keep the hype alive by gennig landscapes, objects, and other random shit they've never cared about in their lives.
>>107990879>>107990883
>>107990879you can tell when nogens are frustrated they're not getting any new fap material
what's the max resolution before zimage gives up?I just tried 2000x3000 and it gave mostly good output except a big part of the image being picrel
>>107990890documentation says 2048x2048 at any aspect ratio
>>107990839So you support the building of ethnostates?
seems like a good model if the only critics are a nogen shitposter and a /pol/schizo
>>107990890>>107990886When I wake up tomorrow, I better see /ldg/ flooded with finetunes since you've all been jerking off about them for months. With all this hype, we're will see dozens of releases, right?
>>107990898thanks anon
>>107990737What is the subject/concept of the lora supposed to be?
>>107990913When a model is good like ZiT was, you don't see schizos, shitposters, or desperate copers inventing new use cases, people shut up and gen.
Why won't he share?
This community deserves to die.Anyone interested in collaborating on a private realism NSFW fine-tuning for Klein 4b? 50/50 costs, HQ dataset available - but you're welcome to contribute.I don't want to share anything with these retards.
>>107990913>seems like a good modelGood models generate hype through results, "trust me bro it's a training base model" damage control is not a good result.
>>107990953gl on finding him anon
>>107990953if you have that kind of money to spend then why not go further, try to adapt the FK architecture to use TREAD or other training optimizations so you can DIY.
>>107990912hot
>>107990861I'm guessing I have to combine the vae encoded audio with empty audio latent but I don't know how
>>107990737Well, you didn't give any explanation and it was empty shilling you posted into the void. I imagine your lora is some 3D to anime converter, I imagine your lora doesn't actually work, I imagine your statement was ironic cope, and ZiB loras are broken on Turbo.
>pull >ui crashes when loading z-imagethe fucking fennec strikes again
What's the best model for making NSFW of a provided human?
>>107991013There's a KJ or the LTX node for audio video masking, Use that and don't mask any audio and only mask the video.
>>107991031klein9b base + turbo lora with > 1 cfg + nsfw loras
>Z-Image "base" is okay but not phenomenal>slightly worse than Turbo at most things>truly only useful as a finetoooner's model>except it's too expensive to do a large-scale finetune on it>Klein 9b has a cucked license and the base is no better than Z>Klein 4b is garbage quality>even Klein 4b would be ~3x as expensive as SDXL to train>literally every single attempt by someone to do a large finetune of a post-SDXL model has been a flopit's over
>>107990974Yes, I'll try a test run in the image stream of the DoubleStreamBlock.If it survives, I'll try to implement it completely.
>>107991019take your meds
God damn, I can't tell if I've shat myself or not, but this image has that palpable smell of curry and curcumin.
>>107991020sometime I understand why companies made every program simpler and simpler to the point of having as few options as possibleuser dumbness knows no bound
>>107991066I'll never understand the fascination of Indians towards ordinary to ugly middle aged aunties.
>OP such a faggot he added a weeks old chroma pic on his collage>He is such a retard he probably thought it was made with Z Base
do aono use wan animate?use case?
god, comfy is such a faggotnow vae decides to run in CPU for some fucking reason. i have to use kj vae loader and specifically tell it to run on gpuZ-image in fp16 compute doesn't fucking work. while z-image-turbo works just fine.
>>107991131do you like the pr? :)
>>107991131Why can't this just be an option on the node? This seems like a common sense decision.
>>107991131torch compile gives me noise image in recent versions too
im want to murder whoever make the dockerfile for ace-step
>>107991118what is a lorder heller
>>107991188why. just fork and publish your better dockerfile.