Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107360388

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
I HATE SPAGHETTI
>>107362800What local model can I use for this? Is there a comfy workflow incorporating it?
>just one more 3gb + 3gb + 3gb pytorch blob bro
>>107362825api nodes
Can anyone qrd training 16-32-64 channel lora differences?
Blessed thread of frenship
>>107362825I used GLM 4.6 and it's pretty good. It can even crack a joke while thinking.
Bruh?
>>107362868no
I have a 4070 and each training step takes 18 sec on 256 res. Is this normal? I need my big ass lora.
I rate Chroma, glad only a few know how to use it
>>107362886I agree..thank the gods only a few use it...
anything better than this for automatic aspect ratio?
SHUT THE FUCK ABOUT CHROMAhttps://files.catbox.moe/bwv1fc.mp3
>>107362885Ok nevermind after the first save it speeds up considerably. TIME FOR BIG ASSES YES!!!
>>107362886both z and chroma are installed anyway, kek
>>107362916
Ask a chatbot to write an aspect ratio node that takes into account the final res being 1MP, so you'd only have to define width and height as a ratio and not "primary_dim". If you want to scale it larger, use a math node to multiply the sides by whatever factor.
>>107362978can you speak english man
>>107362979Tell me more about your sampler/scheduler/steps setup please?
>>107362886i heard that Res4lyf is causing memory leaks, are you having any issues when using it with chroma?
>>107362989
>>107362916Depends on what you mean by better, but I am using Flux Resolution Calc by controlaltai. Only learned about it last night, but it seems functional?
>monitoring>takedown noticesDoes BFL think they're the fucking police?What the hell
>>107362920kek
>>107362842>Amateur photograph of a sexy Japanese female cop, going up stairs outdoors, as viewed from behind and below
>>107363017they call the internet watch cops foundation
>>107362878which one? zimage?I expect breasts, ass, body fat, penis size, age slider loras to be a thing anyway, if civitai doesn't ban them to oblivion of course
>>107363022thx :3
>Click "Stop Job"
>Nothing happens
>Have to edit the database entry to change its status to "stopped"
I was expecting better.
>>107362985
Arguably the safest resolutions to work with are based on 1MP, i.e. the total pixel count of a 1024x1024px gen. Your node should take that into account when calculating a specific resolution from an arbitrary ratio, e.g. 2:3. The prompt you'd give to a chatbot would look something like "give me a custom comfy node that will take two variables, width and height as ratio, and output the width and height of an image that respects 1MP sizes." I would share mine but I don't have access right now so I apologize that you have to parse the meaning of my shitty explanation. I hope this makes sense.
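The calculation described above can be sketched in plain Python. This is only a minimal sketch of the math, not a ComfyUI node: the function name `resolution_for_ratio`, the `megapixels` parameter, and snapping to a multiple of 64 are all assumptions made for illustration (the right multiple depends on the model), and the result would still need to be wrapped in whatever node interface you use.

```python
import math


def resolution_for_ratio(ratio_w, ratio_h, megapixels=1.0, multiple=64):
    """Return (width, height) near `megapixels` MP for the ratio ratio_w:ratio_h.

    Hypothetical helper: 1 MP is taken as 1024*1024 pixels, and both sides are
    snapped to the nearest `multiple` (64 is an assumption, not a model rule).
    """
    if ratio_w <= 0 or ratio_h <= 0:
        raise ValueError("ratio components must be positive")

    target_pixels = megapixels * 1024 * 1024
    # Solve width * height = target_pixels with width / height = ratio_w / ratio_h.
    height = math.sqrt(target_pixels * ratio_h / ratio_w)
    width = height * ratio_w / ratio_h

    def snap(value):
        # Round half up to the nearest multiple, never going below one multiple.
        return max(multiple, int(value / multiple + 0.5) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    for ratio in [(1, 1), (2, 3), (16, 9)]:
        print(ratio, resolution_for_ratio(*ratio))
    # Same ratio input, larger pixel budget.
    print("4MP 1:1", resolution_for_ratio(1, 1, megapixels=4.0))
```

Because of the snapping, the output only approximates the requested ratio and pixel count; a smaller `multiple` gets closer to both.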
>1500 steps, almost pusshttps://files.catbox.moe/vcfa0o.pnghttps://files.catbox.moe/sxkrh3.png
>>107362978>>107363014I guess I can go with 4MP (2048x2048) for z-image and add a ratio to it if that's possible, thanks for the idea anon
>>107362920>tranimefag seething about chromakino from a guy who made ramtorch which enable ostris to get us to train qwen image on 24gb of vram easily along with many future modelsits ok if youre a promptlet who doesnt care about realism
Does ComfyUI work better on Linux than Windows 11?
any lora masters here? what should i be going for with dataset sizes? 20 images? 100? i mostly do illu/noob/chroma
>>107363055>>107363042Yeah, just tell the chatbot that you want sizes like 4MP or whatever but you only want to input just the ratio and have the node calculate exact size based on that. At least that's what I have.
>>107363002Thanks!
>>107363058>who doesnt care about realismyou call that realism chromakek?
>>107363062it might delete your entire output and model directories on update if you are on linux
>>107363034>if civitai doesn't ban them to oblivion of coursehopefully people will share them here, the age and breasts one was amazing to do "timelapse" of a character
>>107363022Z-Image attempt. Same prompt. Even with the best of my prompt engineering I can't get it to do this.>Amateur photograph, split view of a beautiful Korean idol woman with pink hair who is seated in her office, diligently working on patient charts. She is a model dressed in a crisp, light blue nurse uniform, embodying a sense of dedication and care in her role. On the left side, a close-up shows her face as she looks up with a warm, reassuring smile, making an "okay" sign with her hand. The right side shows POV, first person view of her sitting with her bare feet underneath the desk, with her panties loosely rolled down around the ankles. Her fingernails are painted light pink.
>>107363017>tfw banned from using Flux
>>107363017
They're sucking up to big tech; the only chance BFL has of avoiding bankruptcy is being bought up by big tech.
Big tech wants censorship in local, since avoiding SAAS censorship is a huge reason to go local.
All western AI companies have gone all in on 'safety' for their open models, intentionally crippling them in some cases, like BFL with Flux, and setting licensing terms which mean they can just terminate your right to use the model if they feel you are doing something they don't like.
Thankfully China has come along and thrown a massive curve ball, derailing the 'we will control what you can do with AI' plans of western corporations.
>>107363062
It should, but the best is to have it run on a dedicated PC and access it from another.
>>107363091
No, that was only ever an issue with their desktop version. Just mirror their github and you'll be fine.
How do you prompt zit to NOT show text? I tried pasting in character profiles to make it generate a portrait for them but it ends up including all of that text into the image as well.
>>107363017those guys are fucking insane, and I'm glad they got the hate they deserve
>>107363122why can't shit just be a simple exe already. I am so sick of this fucking python juggling
>>107363062Yes, all AI workloads work better on Linux than Windows, all research and commercial use of AI is done on Linux.
>>107363114From what I understand, they check big sites like civitai and if there are things they don't like, they can send cease and desist, and civitai will comply.Overall, it's like having 2 moderations on a row, so fucking retarded.>>107363115Well if you look at api only image models, they usually train on nsfw content, they just censor it both in the prompt and the output image, which is the clever thing to do, instead of destroying anatomy understanding by mangling any nudity for "safety".
>>107363089Ew, how do I delete the left half of someone's image?
>>107363135waaah waaah
>>107363112translate that to chinese and it'll work>业余摄影作品,采用分屏构图:画面左侧展现一位拥有粉色秀发的韩国偶像女性,她端坐于办公室内,正专注地填写病历表。这位模特身着清爽的浅蓝色护士制服,完美诠释了职业的奉献精神与关怀态度。特写镜头捕捉她抬眼望向镜头时温暖而安心的微笑,同时用手比出“OK”手势。右侧采用第一人称视角,呈现她赤脚蜷缩桌下的画面,内裤松垮地垂至脚踝。指甲涂着淡粉色指甲油。
>>107363046I went gay because of these images.
Ok, I've reached my conclusion: Z-Image Turbo and Chroma HD Flash are both shit, but Chroma HD Flash is slightly better.
>>107363017holy fuck, what a waste of resources
>>107363089>1girl laying downkekanyway, zimage will be better but only when loras come out for zimage base and seed variance gets fixed
>>107363160>aijeets forced to learn chinesechina no
>>107362920Kek what did you use for this?
>>107363178So you agree chroma mogs Z-Image?
>>107363017>We maintain a reporting relationship with organizations such as the National Center for Missing and Exploited Children>"Good news: we found one of your missing exploited children, she's an image on the computer that someone generated"
>>107363168give me your chroma hd flash workflow
>>107363180
we'll get better prompt understanding with the base model since it's the one that can use CFG at all
https://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf
>>107363002wtf delete that shit
>>107363198>it'll be 0.01 better bro that's A BIG DEAL
>>107363160DeepL translation didn't work. Guess I have to use an LLM
>>107363211this is the difference between 0.917 >>107363112and 0.926 >>107363160
>>107363168cute chart uwu :3
>>107363168The lack of variation in turbo is depressing. They basically all look the same.
>>107363231arr rook same
>>107363231You can thank RLHF for that
>>107363222Use a Qwen.
>>107363188
>So you agree chroma mogs Z-Image?
Depends on what one wants. If you're a vramlet then zimage is a no brainer
If you want more unique looking women instead of the samey ones zimage produces, if you want better nsfw, if you want milfs, then chroma v48/hd
If you want to find a good prompt and then gen a few hundred images to go through, to get a lot of good unique images without the effort of constantly changing the prompt, then chroma is the only option.
Overall chroma is better for someone who knows what he wants, as zimage is too specialized for a specific look that is good ootb, but gets boring. We need base loras and seed variance fix.
But there is no reason to not use both for their own strengths anyway.
>>107363249
>If you're a vramlet then zimage is a no brainer
why are you pretending chroma is like 10x bigger than z-image? this is a fight between a 6b model and a 8.9b model, the difference isn't that big
>>107363249>If you're a vramlet then zimageChroma takes like 78 seconds to gen a single image on a 5090. Are you using a blackwell or something?
>>107363249this reminds me of easy fluff having a bit of time before sdxl had NSFW trained into it
>>107363194
It's just the Comfy example Z-Image workflow with the text encoder changed to t5xxl_fp16 with 1 min_padding and 77 min_length, and the model changed to Chroma HD Flash (Q4_0 GGUF). Same 9 steps euler simple, 1.0 cfg.
Would it be better if I could load it in bf16 for a proper apples-to-apples? Yeah maybe. I can try that if this really bothers you guys. Not sure if I can offload that much memory but I can try lol
z-image feels like a more advanced model than chroma desu, if it learns well it wins
Z-image works pretty well with neo forge. I used these settings for this image: DPM++ 2M/SGM Uniform 8 steps + hi.res fix+adetailer. This took about 1 min 30 secs. Not too shabby for rtx 3060.
>>107363249>If you're a vramlet then zimage is a no brainerChroma HD Flash GGUF should fit into a vramlet setup though.
It's literally one chroma guy. The Asian feet dude. No one else.
>>107363168>when the text is so unreadable that it actually changes into a featureless smudge in the thumbnail
>>107363320Also blurjeet
Reminder: you are using the gimped distilled version of Z. Base will be much better.
>>107363255
>he didnt actually gen with chroma
>>107363264
I get ~30-50s per image depending on settings but I gen 8 images in a batch for speedup, power limited 3090, no fp16 accumulation for quality improvement, 1280x768, 26-35 steps depending on prompt, Q8 chroma and fp16 t5
>>107363293>1 min 30 seconds for 1girl, asian, standingjust open google and have a stroke at that point, there's a trillion of these images already available
>>107363339This post is disingenuous and made with the sole purpose of causing Z-Image base to be received poorly.
>>107363346NTA but the point is not a random photo of an attractive asian girl, it is that there is an attractive asian girl inside my computer that joyfully does whatever I tell her to do. Twitter people posting themselves does not compare in any way.
I don't get it, why do some people have such an autistic fixation on specific models? I try out most models but when something better comes along I'll use that one, I don't get it.
>>107363264
>takes like 78 seconds
Unless you state resolution and steps, that value is meaningless.
>on a 5090. Are you using a blackwell or something?
The 5090 is Blackwell architecture.
>>107363320
>It's literally one chroma guy. The Asian feet dude.
not gonna lie he's probably one of the worst anons on this general, his shilling is obnoxious, he's so desperate for his shit model to be recognized as good, but you don't beg for praise, you just show your images and let others judge, like the Z-image developers did
>>107363373where do you think you are?
>>107363378>he's probably one of the worst anons on this generalYou are the worst anon in this general
>>107363264
Takes me 60s per image on a 4070 with Chroma, significantly less if I do low-step DEIS.
But I'm ok with resolutions like 640x1152, so that's the difference.
>>107363368I mean, I assume Z-Image base will have the same problems all base models do. It'll come out, people will post "IT'S OVER" and "KEK" and what have you, and then a few months from then we'll have serious finetunes that work the way you'd hope and all the early doomposting console war stuff will be quietly forgotten.
I'm having trouble getting extreme angles.Like the camera pointing straight up.
>>107363373it's obviously a bit
>>107363390you don't know who I am chromakek
>>107363378Yeah out of 5 actual schizophrenic nogen complainers that are here 24/7/365 the worst is another guy who likes a particular model and posts asian 1girls itt, most sane nogen
>>107363255Chroma is literally 5-6x slower though, even if you're using the flash version it's still >2x slower. And it wouldn't be a problem if it was slower for a significantly better result, but it's slower for an often worse result...
>>107363368>>107363399I will be the one to convince anon that it's a good model like I did with NoobAI. Don't worry.
>>107363198what happens at 1.000?
>>107363418the generated pixels materialize into mustard gas through your screen
>>107363373>why do some people have such an autistic fixation>autistic fixation>autisticYou answered your own question anon
>>107363403Here's an old wan gen of something closer to what I want.
>>107363399>a few months from then we'll have serious finetunesWhere are the 'serious finetunes' Chroma was expected to get?
>>107363411
>another guy who likes a particular model and posts asian 1girls itt
you conveniently forgot the part where he starts to make walls of text to explain how his image full of oversaturated colors and nonsensical anatomy is actually valid and that you're the one who doesn't know what real life looks like! this guy is insufferable
>>107363418d̶̛̛̻̫̦̩̗̤̬̩̹͍̥͔͚̮̟̙̣̬̬̟̥͈̟̠͔͎̘͈̦͔͈͖̰̮̯̗̜́̑̐̽̎͊̀̔͌̊̈̽̏̈́̎͑̿̆̑̆͐͌̿̎̈̐͐̾̎̇̽̓͑͊͐̒̇͐͑͂̂͑̀̈͊̈́̋̒̌̄̈̓̋͒͐͛͆̈́͐͆̇̈́̔̇̽͋̈́͊̋̒̓͑̀̎͘̚͘͘̚͘̚̕͜͝͠͠͝ͅe̶̢̧̢̢̡̨̢̧̛̛͙̩̞̮̮̜͇̤̠̱̦̼͇͍̻͓̜͖̙̹̗͖̟͇̥͓̱̺͇̼̤̮͎̖̜̘̥̬̦̩̝̘̪̦̜̻̩̼͇̤̘̫͖̝̮̬̳̗̘̖͙̼̝̞̰͔̜̖̳̜̦̣̮̟͇̙͚̟̼̣̤͙͈̠̟̫̪̹̻̱͙̊͂̉́͒̎̿͋͐́͂̽̃̓͌̂̐̽̉͐̑̈̓͊̀̇͌̑̉̌̀̏̒́͒͗̑̈́̂̆̄̄̓͂̆͂̃̚̚̕͝͝ͅͅͅͅͅa̵̧̨̛̛͉̙̖̩̪͚͔̫͖̖͓̣̜̠͉̝̤̺̼̠̖̺̞͚̻̼̳͚̿̒̓͛̃͛͛͌̽̆̾̔̀̈́̈́̓̍̔̈̉̀̑̏̈͂̈̀͆̉̎̀̔̎̀͂̓̀̈̓̆̃́̐̄̈́̐̍̏̈́̈͐̐̄̆̐͋͑̇̀͗̈́̎̏̃̈́̂̇̇̒̕̕̕͜͠͝͝͝͝͠͝͠͝t̸̨̨̡̧̛̯̺̪̯͕͇͓̙͎͈̱̦̥͕͍̟̞̮̫̜̝̮͎̩̘͚̳͍̻̫̝̺̥͈̟͈̪̤̯̝̗̼͔͈͍̙͔̭̯̟͈̹̼̰͓͐̊̂̊̆́̎̇̈́̋̈̾̓̄̚͜͝͝ḩ̵̡̧̧̢̡̧̧̛̛̛̳̞̖͚̺̰̯̺̝̟̝̝̣͓̙̩͉̟̠̻͕̗̪̥̦̩̝̹͕͚̫̟̞͙̭̹͉̼̱̹̳͙̭̜̝̪̹̰͔̹̜͎̺̰̹̪̱̠̫̖̗̯̹͎̮̪͖̯̮̠̦̜͇̩̘̱͕̈́̊̃̿̿̈́́̍͒͌̒͋͌̓͆̓̈́̔̊̑̆͗̓̒͑̓̂̈̾͗̆̐̾̓̊͆̍͌̍̈́͊̍̈́̃͆̂̏͆̀̍͋̽̐̈́͒̈́̔̏̊̎́̎͒͑͆̃̓͌̐͒̋͆͘̕͘̚͜͜͜͜͠͝
>>107363418
>what happens at 1.000?
the mememark will be deemed too easy and they'll go for something harder, like it happened several times in the LLM ecosystem
seems like 1500 steps is the sweet spot for training rn
>>107363447Sorry I haven't been following. How long does it take and on what card? Also, how much would this actually translate to the base model?
don't find Chinese girls as appealing as JAV stars preening
>>107363434i dont remember him saying that, i agree that his oversaturated settings are bad but his defense of chroma overall was right most of the time, there are many more retards that cried about chroma for no reason all the time despite the model being good for a lot of things since v3x anyway, even though there are multiple bigger problems with it
don't find Mexican bodybuilders as appealing as JAV stars preening
>>107363461>i dont remember him saying thatwhy are you talking about yourself in the third person
after 50 gens i finally managed to get z-image to do a sex prone bone position
do we know the size of nano banana (the first one) or DALLE3?
>>107363447
Using steps as a measurement makes very little sense since it is tied to the number of images.
If you train 20 images, 1500 steps is likely enough; if you train 200 images, 1500 steps is almost certainly too few.
Better to use epochs as your measurement, as in how many times every image has been trained.
>>107363399>Image base will have the same problems all base models dothis will be compounded in the eyes of anon because his euler a simple 9 step config will look like ass kek
>>107362868>Can anyone qrd training 16-32-64 channel lora differences?anyone?
>>107362920you cocksucker
>>107363505No
>>107362920>https://files.catbox.moe/bwv1fc.mp3this is beautiful lmao
What is the native landscape resolution for Z Img? 1024x1024 is boring
>>107363435i remember zalgo
>>107363525
It works with whatever resolution you put in so long as each side is <2048
>>107363525go for 1920x1080
>>107363491yeah, anon mentioned something as low as 18 images. of course you don't need 3k+ steps for that.
>>107363491NTA you have that backwards. Step count is more concrete than epoch because epochs are based on number of images among other things.
>>107363430
SDXL finetunes took 1-3 years to emerge, it hasn't even been 6 months since Chroma finished training.
That said, the smaller the model the more likely you will see finetunes earlier (since training is faster). SDXL is very small compared to Chroma, but Z-Image Base will likely be smaller than Chroma at least.
>>107363539>>107363537thank you, kind anons
>>107363202this is amazing, what model?
>>107363249
>We need base loras and seed variance fix.
Someone claims that using the ddim_uniform scheduler fixes the seed variance
https://xcancel.com/Machinedelusion/status/1994531413744652336#m
>>107363577Z-image turbo obviously
>>107363578
>fixes seed variance
>mp4 unrelated
>>107363293would
>>107363596What do you mean?
>>107363539>>107363537which sampler and scheduler for realism?euler is boring too
>>107363578gen 8 "office girl, buttoned up shirt"
>>107363293>Not too shabby for rtx 3060.8GB or 12
>>107363609learn to experiment anon
>>107363546
No, step count is overall worthless unless you are always talking about a specific number of images.
1 epoch = every image has been trained once
10 epochs = every image has been trained 10 times
1500 steps = means nothing unless you know the number of images; if you have 1700 images, 1500 steps at batch size 1 won't even have trained every image once
Steps are a stupid target measurement for ai training, only useful to know where in an epoch you are.
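The relationship argued over above is simple arithmetic, sketched here in Python. The helper names and the `batch_size` and `repeats` parameters are assumptions for illustration and do not correspond to any particular trainer's options; gradient accumulation is ignored.

```python
import math


def steps_per_epoch(num_images, batch_size=1, repeats=1):
    """Optimizer steps needed to show every image `repeats` times (hypothetical helper)."""
    return math.ceil(num_images * repeats / batch_size)


def epochs_from_steps(steps, num_images, batch_size=1, repeats=1):
    """How many passes over the dataset a given step count amounts to."""
    return steps / steps_per_epoch(num_images, batch_size, repeats)


def steps_for_epochs(epochs, num_images, batch_size=1, repeats=1):
    """Step count required to reach a target number of epochs."""
    return epochs * steps_per_epoch(num_images, batch_size, repeats)


if __name__ == "__main__":
    # The dataset sizes mentioned in the thread, all at 1500 steps and batch size 1.
    for n in (18, 20, 200, 1700):
        print(f"{n} images: {epochs_from_steps(1500, n):.2f} epochs")
```

Under these assumptions 1500 steps is 75 passes over a 20-image set but less than one pass over 1700 images, which is why a bare step count only means something alongside the dataset size and batch size.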
should i be upscaling with the same sampler/scheduler and steps as the original image when using chroma flash?
>>107363608There's barely any difference in the images even with the different scheduler.
>>107363608He wants 1.5-tier variance. I also want that, but I understand why we don't have it.
>100% of the threads images are Z-image, because it just released>retard anon:>w0w wut modal u usin????>
>>107363543can we call Nogens Noggers from now on?
>>107363491shut the fuck up, no need advice from some loser anon like you, people here give the worst advice/take on stuff, if it looks good to me thats enough
>>107363168Is Z-Image actually this bad at doing huge breasts? That's really concerning
>>107363622I do, I really doI remember res_3m / bong was a thing back then
>>107363630kek
>>107363642Soon we will have bottom on Z-image.
>>107363578Something that might be worth trying is values below 1 on aura flow shift, or setting cfg below 1. Really though the solution is probably similar to what you had to do with Flux: shuffling input images with a denoising value lower than 1.
>>107363652
I'll allow myself the luxury of hope once the base model drops
>>107363616>gen 8 "office girl, buttoned up shirt"lul
>>107363652
SDXL was capable of NSFW from the get-go, isn't Z-image made for the exact opposite? (as stated by the developers or something)
>>107363668i guess its getting fucked by the aspect
>>107363578>men playing basketball>get exactly what you prompt>omg why every image looks the same. muh seed variationfucking ai jeets are so dumb, if you want something different, why don't you prompt something a little more complex? hell there are literally thousands of llm bots that can help you with that
>>107363630omg does comfyui work with it?
>>107363058>realismhttps://files.catbox.moe/06i4j7.mp3
VRAM is overrated. I am doing great with 10. Why do I feel pressure to upgrade because "scarcity of RAM" or GPUs are gonna go thru the roof or something's gonna happen. It's all a lie
>>107363002Comfyui does prompt switching natively now?
>>107363642>>107363664i dont even see the fucking point of all these faggot models. they cant do shit without a billion loras.chroma and noob is all you need.
>>107363687you can't do video gens without having terrible nuked to shit quality.
>>107363633Stop hurting my feelings you insensitive clod
>>107363681I think it's cool to get different images that all follow the prompt exactly because ultimately there's a lot of ways to describe the same thing, that makes it fun
>>107363688no lmao
>>107363685don't tell it vibevoice
>>10736361912gb. To be fair, I was using fp8 checkpoint and gguf text encoder, so it doesn't take million years to make one pic.
>>107363630Well to be fair, the image I was referring to is in the style of a classical painting. Besides, I'm a tourist, only dropping by only once every while or so.How am I supposed to know what new model you guys are jerking off around in the current year?
>>107363720
>How am I supposed to know what new model you guys are jerking off around in the current year?
are you seriously pretending you're oblivious to the hype around Z-image? I don't believe you
>>107363720>Besides, I'm a tourist, only dropping by only once every while or so.being a tourist doesnt stop you from reading the thread before posting your retarded fucking question.
>>107363017they are probably tonguing the anus of the government and the billion worthless ngos
nearest-exact or lanczos?
:^)
>>107363688
No, which is fucking insane.
So much basic functionality you need to install third party nodes for, sad state.
>>107363681Forcing us to fill in every single gap with prompting, using language, is not only incredibly tedious, it is actually impossible and strains the limit of the model's ability to understand the prompt. YOU are easily impressed.Anglo/Saxon/Nord/German btw. Bit of a mutt but nobody can argue I'm not white
>>107363165it's getting better> https://files.catbox.moe/43crqc.jpg
>>107363767it still looks terrible desu, I'm still a faggot until you finished the training!
>>10736374870% of workflows are custom nodes. it's fucking annoying and sad because comfy doesn't fucking care about having fun anymore
there's one way to fix the seed variation: let the instruct model rewrite your prompt every time. each rewrite will be different and you'll get different settings each time >>107358856
>>107363731>>107363733saw z-image for the first time in this thread. looks pretty neat. I think I'll definitely check it out.
Prompt adherence (i.e. comprehension and knowledge of concepts) is the problem, not seed variance. An ideal model would have perfect prompt adherence and zero seed variance.
Can someone try to make a pic in fallout 1 style? Or fallout in general. With Z-Image. Im curious if I even wanna bother downloading it. Something like this >>107363427
face it, auto1111 losing was the worst timeline. everything is reddit now thanks to cumfart
>>107363747Imagine if when they released stable diffusion 3 it was as good as z-image-turbo
>>107363789>looks pretty neat. I think I'll definitely check it out.have fun anon, this model is really good, and don't forget your daily kneeling to our savior xi jinping!
>>107363675Post side by side NSFW base XL and Z (you won't)
is there a sane way to install comfyui with uv now? last time I checked, torch was a pain in the ass
and do I need comfy ui manager?
>change CFG from 1 to 1.5
>gen goes from 55 seconds to 104
what the fuck
dpmpp_sde + ddim_uniform actually gives pretty realistic output but holy shit it's slow as hell, 8s/it on an a10g
>>107363685kekbut its very simple, i just like big booba bimbos that chroma does well
>>107363814normal
>>107363812there is nothing sabe about installing spyware to your machine. also neoforge has zit
Just look at all the newfags. Isn't it beautiful, anon?
>>107363814
Any CFG other than 1 requires the calculation of a positive step and a negative step. CFG==1 calculates only a positive step.
>>107363803if you cant be bothered to download a few files and drag the default workflow to your comfyui, then why would i bother with your request, retard.
>>107363794>An ideal model would have perfect prompt adherence and zero seed variance.You're a moron, that would mean you would get zero image variations out of a promptDo you even know what the seed does you absolute mong ?
>>107363814
cfg actually makes 2 predictions per step, a positive one and a negative one, then it does some subtraction math shit, so that's why it's 2x slower
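The doubling described in the replies above follows from how classifier-free guidance combines two model outputs. Below is a toy Python sketch, not ComfyUI's actual code: `CountingModel` is a hypothetical stand-in that just scales a list of numbers, and skipping the negative pass when cfg is exactly 1.0 is written in as an assumption based on the posts above.

```python
class CountingModel:
    """Toy stand-in for a denoiser: scales the latent and counts forward passes."""

    def __init__(self):
        self.calls = 0

    def __call__(self, latent, conditioning):
        self.calls += 1
        return [x * conditioning for x in latent]


def cfg_step(model, latent, positive, negative, cfg):
    """One guided prediction: negative + cfg * (positive - negative)."""
    cond = model(latent, positive)
    if cfg == 1.0:
        # The formula collapses to `cond`, so the negative pass can be skipped
        # (assumed optimization, matching the behaviour described in the thread).
        return cond
    uncond = model(latent, negative)
    return [u + cfg * (c - u) for c, u in zip(cond, uncond)]


if __name__ == "__main__":
    for cfg in (1.0, 1.5):
        model = CountingModel()
        out = cfg_step(model, [1.0, 2.0], positive=1.0, negative=0.0, cfg=cfg)
        print(f"cfg={cfg}: output={out}, forward passes={model.calls}")
```

Two forward passes per step instead of one is consistent with the reported jump from 55 to 104 seconds when going from CFG 1 to 1.5.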
>>107363814Turbo models only work turbo-ly with CFG = 1.0
>>107363339True. I guess we will see when the full model comes out how it fares.
>>107363794>zero seed variance.that's retarded, there's always different ways to make an image off a prompt
>>107363829Yes, I do, it's a starting point in a search for a local maximum.An ideal model would give you exactly what you asked for, no more and no less. If you want something different then ask for it. If you're too stupid to ask for something different then use an LLM to make up some random shit for you.
>>107363819Chroma naturally does any description of woman better due to its seed variance and NSFW tuning.
>>107363855>An ideal model would give you exactly what you asked for, no more and no less.don't feed the troll
>>107363828>what is a requestOk subhuman, have fun with your vramlet model
im using heun and BONG sampler for chromaflash, is there a better combination you guys would recommend?
>>107363828what else do you expect from mikuniggers
>>107363794although variance should be maximal for unspecified factors. For instance, if your prompt is "Miggu in a catsuit" you should see Miggu in a catsuit across a vast array of locations and poses across generations. If Miggu appears in a catsuit in the same boring pose and location the model is bad imo.
>>107363825im not a newfag, im just retarded and curious.
So that's all what the chromakeks have left huh? Seed variance?
do you think anon will ever stop comparing tunes to base models like a retard
>>107363876mikuniggu and asaniggu no less
>>107363880And big boobs >>107363819
>>107363863Chroma skin texture looks kinda unnatural, like plastic wax
>>107363690Which noob btw? I mean 1.0 or 1.1?
>>107363880prompt adherence toobase is going to suck, you know, all this turbo can do is make passable 1girls
the only issue with chroma is how fucking it long it takes to generate an image. you HAVE to be edging constantly or you'll cum before the image finishes.make it faster NOW lodestone.
>>107363884That would require him to actually understand the difference which is impossible.
>>107363913I'm not a chroma shill but that's a wallet issue.
>>107363894
>Chroma skin texture looks kinda unnatural, like plastic wax
Chroma's skin texture used to look good, and then lodestone decided to make the model run at lower speeds to make the vramlets happy and in consequence it got more slopped with the subsequent epochs
>>107363890Are all of you Z-iggers just ex NetaYume copers?Happy that you can finally run something similar to chroma but not quite?
>>107363919i have a 5090
>>107363906
>prompt adherence too
Chroma can use CFG, Z-Image turbo cannot, we'll have to wait for base to see how much better at prompt adherence it's gonna get
>>107363913>the only issue with chroma is how fucking it long it takesno not at allit's absolutely horrible with hands & feet.
>>107363913the trick is to have to previous image you genned opened in another tab while you wait
>>107363922>he cheaped out and didn't get the 6000ngmi
>>107363906
>all this turbo can do is make passable 1girls
No it does more but also yeah they specifically designed it that way, read the paper.
>>107363921
No I'm still mildly sad that Neta will probably become obscure but some of the Z team worked on it so it's whatever.
>>107363921NetaYumers got bullied so hard they defaulted to Z-image, yet it cant do NSFW making it a useless toy lol
>>107363855You're so stupid. Being forced to make a prompt adjustment in order to get an image variation is absolutely retarded if you know HOW image generation works which you clearly don't.But you can do that RIGHT NOW if you want, just fucking lock the SEED value and you get EXACTLY what you describe, seriously how fucking dumb are you ?By your logic EVERY model is already perfect.
>>107363936kek, fuck you. fair enough. i'll buy the 7000 when the 60 series comes out.
>>107363766Idiot thats fine and dandy but look at the prompt of the people who are complaining, I think you can do better than "men playing basketball"
>>107363870Very experimental sampler, dropped it but back when I used I tried linear/heun3s seemed to work fine
>>107363320Sometimes I think that guy is actually just a chroma hater baiting with bad gens, cause no one can be that blind....
downloading models in comfyui is a chore. would be better if the workflows had magnet links and it had a torrent client built in and it just downloaded them.can any of you fags make this happen?
There are two types of models: 'it just works' models, and grail models.

The first type do what you ask. They do it as well as you can expect, with minimal error. They are highly predictable, and produce serviceable content with ease. They are "stable".

The second type are unstable, unreliable, and will give you a heaping pile of garbage. But the garbage is interesting. It is very diverse garbage. In many of these failed and worthless gens you see hints of something eerily real. You sense that if it ever managed by sheer luck to get everything right, it could be incredible. And not only this, but you suspect that lurking somewhere out there are gens of unknown transformative power; gens that can redeem all this wasted time. Once in ten thousand gens you may find gens that are really beautiful, and a comforting consolation for the frustrations; but even those aren't the point.

Which of these two you prefer is mostly a question of what sort of person you are. Women will never be interested in the second kind of genning, and most men won't either.
>>1073639871.5 kinda did both?
>>107363920
Have you tried mixing v26 with the HD Flash delta weights anon-kun? Pic rel is the result you get after mixing with v50, it starts looking natural again.
>>107363975>cause no one can be that blind....either it's a falseflag or someone who got lost in the sunk cost
>muh chroma only has seed variance>muh shit realism>skin texture looks kinda unnatural, like plastic w-ACKSkill status: issueI don't have to prompt every atom in the image like with Z Image in order to get big boobs and distorted nipples, while also being able to continue genning unique images instead of the same 3 already similar looking women that are state mandated by Xi himself to be connected to that specific prompt and rotate in your computer.https://files.catbox.moe/lvv0ab.png
>>107363995>everything is bright and white>natural
>>107363995>>107363969does Z only do disgusting ching chong whores?
>>107364006>does Z only do disgusting ching chong whores?those are chroma images anon
>>107363963>>107363987There's no reason why a model shouldn't be able to generate thousands of distinct and coherent pictures from a three word prompt.>>107363999Is this supposed to be a picture with skin texture lmao?
>>107364006thats not Z, you MONGOLOID
>>107364006that's Pooma
>>107363999>distortedoh now the chromakek knows what distorted means
fuck it, im not going to upscale my chromagens, they take too long.
>>107364012>>107364019>nogens malding already
Hands vs Dogs :v
>>107363999Z-iggers BTFO!Trips of truth
what the fuck does T5TokenizerOptions do? i read the description and it sounds like snake oil
>>107364029no gen no onion
>>107364029I don't need a gen to comment on your airbrushed image.
>>107364031>two guys making out in a forest in the rain, OP style
Can Z image do this or nah?
>>107364048give me the prompt I'll try it
>>107364035if it sounds like snake oil, it probably is
>>107363630What is z-image?
>>107364048BBC sisters just discovered Z-image>it's over
I want to train a Z-Image Turbo Lora, how many steps do I need? is 1500 enough? or do I need more? I remember for a Flux Lora I needed like 5000.
>>107364054shit I dont have PNG inspector on this PC
>>107364083Just set 8k steps and train until you like the style nigger
>>107364083holy shit, you needed 5000 images per lora? what kind of a shit model needs 5000 images for a lora? that can't be real.
Need to get back on my Chroma game to make some edits for the Zissies
>>107363403
>>107363428
>I'm having trouble getting extreme angles.
>Like the camera pointing straight up.
that makes sense given the fact that zimage turbo is a portrait-tuned version of the base model
try worms-eye-view? otherwise wait until Sunday for base
>>107363633
>if it looks good to me thats enough
based as fuck. this reminds me of that nutritionist girl who was upset people were listening to a high schooler who can bench 300lbs instead of her
>>107363812
>is there a sane way to install comfyui with uv now? last time I checked, torch was a pain in the ass
i'd recommend anaconda/miniconda for doing stuff with cuda since you can install cuda-toolkit and have all the deps you'll ever need
>and do I need comfy ui manager?
no but it helps, especially when you grab someone elses workflow and you need to install all the custom nodes they're using
>>107364102>you needed 5000 images per lora?I said STEPS nigga, can you read?
>>107364118you just got deeb'd and it was so obvious too
>>107364118okay my bad. My response is the same btw. just reread my post and replace the word images with steps in every instance.
>>107363812i'm running comfy with uv, i just ran claude code in the directory and said "install this repo with uv">>107364111lmao conda/miniconda in 2026, get with the times unc
>>107363987very few synthographers here m8
>>107364003Looks fine to me. I mean, I am prompting for a light setting outdoors in that case.
All the cute chinese girls be hanging with BBChroma when you turn off your Z-image model
>>107364137Very deboesque image.
>>107363981
Making actual torrent clients with all the features and configuration they need is also a surprisingly huge chore.
Maybe, if comfyanon can be convinced that models are at risk, someone could ask for a feature in base comfyui where workflows save enough of the hashes that comfyui can generate magnets for torrent v1/v2, unless it's turned off? At least that feature doesn't need a huge amount of maintenance. Of course it also doesn't put and keep the torrents online on its own.
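For anyone wondering what "enough of the hashes" would mean in practice, here is a minimal sketch for the torrent v1 case only. A v1 magnet needs the infohash, which is the SHA-1 of the bencoded info dictionary, not a plain hash of the file. The function names (`bencode`, `v1_info_hash`, `magnet_link`) are made up for illustration and are not part of ComfyUI; the data is passed in as bytes for brevity, a real model file would be read from disk in pieces. v2 uses SHA-256 merkle trees and is not covered here.

```python
import hashlib
from urllib.parse import quote


def bencode(value):
    """Encode ints, bytes, str and dicts in BitTorrent's bencode format."""
    if isinstance(value, int):
        return b"i%de" % value
    if isinstance(value, bytes):
        return b"%d:%s" % (len(value), value)
    if isinstance(value, str):
        return bencode(value.encode("utf-8"))
    if isinstance(value, dict):
        # Dictionary keys are byte strings and must appear in sorted order.
        items = sorted((k.encode("utf-8"), v) for k, v in value.items())
        return b"d" + b"".join(bencode(k) + bencode(v) for k, v in items) + b"e"
    raise TypeError("unsupported type: %r" % type(value))


def v1_info_hash(name, data, piece_length=2 ** 18):
    """Return the hex v1 infohash for a single-file torrent built from data."""
    # "pieces" is the concatenation of the SHA-1 digest of every piece.
    pieces = b"".join(
        hashlib.sha1(data[i:i + piece_length]).digest()
        for i in range(0, len(data), piece_length)
    )
    info = {
        "length": len(data),
        "name": name,
        "piece length": piece_length,
        "pieces": pieces,
    }
    return hashlib.sha1(bencode(info)).hexdigest()


def magnet_link(name, data, piece_length=2 ** 18):
    """Build a v1 magnet URI (no trackers) for the given file contents."""
    info_hash = v1_info_hash(name, data, piece_length)
    return "magnet:?xt=urn:btih:%s&dn=%s" % (info_hash, quote(name))
```

Note that the infohash changes with the file name and the piece length, so a workflow would have to store the infohash itself (or the name and piece length along with the piece hashes), not just a checksum of the model.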
oh shit i got a context after waiting for an hour lets go
>>107364128
>lmao conda/miniconda in 2026, get with the times unc
there is literally no reason for me to switch from what works. nothing uv offers me gives me a reason to switch. claude code works just fine with conda/conda run as well. conda lets you install non-python packages as well which is useful for not dealing with any CUDA headaches ever
also I guarantee I am at least 10 years younger than you, and you are the unc in this dialogue
>>107364147>Looks fine to methat's the problem, it doesn't look natural at all, and you seem to be the only one to not notice that and be like "huhhh? why Z-image managed to blow in popularity but not my heckin wholesome model??"
>>107364162
>Making actual torrent clients with all the features and configuration they need also is a surprisingly huge chore.
no you could just use a python wrapper for libtorrent and thats part of your requirements.txt for the github repo, but no one would actually use it. hf-transfer is better for sharing anything that's not illegal, and if it's actually illegal torrents are bad opsec so it makes no sense
how the fuck do i do different angles in chroma holy SHIT
i tried
>worm's eye view. shot taken from ground level. shot from the side. image taken from below.
nothing works. is there a template for perspectives?
>>107364152
>>107364160no gen no onion
trying to train an iphone photo z-image lora and it's turning the guy more hispanic with every epoch
Can Z-image do good text placement like this?
>>107364194the guy on the bottom looks like the dude on american pie lol
>>107364012Actually there are reasons, but it suffices to say: show me this model. (It doesn't exist)
>Z natively understands russian too...huh
>>107364218qwen 3 knows a lot of language, it's at its best on chinese and english though
Chroma can do very sexy stuff out of the box, the level of NSFW depends entirely on your prompt
>>107363994Rose-tinted glasses, mate. 1.5 was definitely a type 2 and failed miserably at being a type 1. (I loved SD1.5 don't get me wrong)
>>107364229why is there a nigger in every of your pictures?
>>107364233He is probably* Indian, like most of /ldg/. He's right about Chroma though.*Not necessarily ofc.
>>107364175
that needs an entire torrent client's worth of configuration and diagnostics in both the WebUI and the CLI, for either mode of using ComfyUI. it'll be more work than you think, but good luck.
i'd personally just add the hashes and magnets so it can get used if someone wants to use it, without the entire torrent client attached.
>>107364233There's always one in his mirror too lmao
china keeps on winning
>>107364245Its what chinese rice bunnies lust after
>>107364229bro looks like a 3d-rendered fortnite skin
>>107363089>ask for Japanese>AI delivers Koreansbwahahahahano wonder Japanese hate the AI
>>107364200heres some text placement
>>107364276I can't tell the difference anyway.
Hmm, at 500 steps batch size=2 it's already picking up a style pretty well. It's looking better than expected for training a distilled model.
>>107364279hilldawg knows?
>>107362886It's Mr Oinkers Wife
Can Z-image do spidermen or is it another failbake like Yume?
>>107364297lmao this fag got doxed to his family and i think at some point he still came back to post
is there a way to use the already loaded qwen model for LLM inference (I want to translate to chinese) directly in comfy?
>>107364283>I can't tell the difference anyway.is it accurate?
So what games are you guys currently playing?
>flux 2 devverdict?
>>107364276arrr rook same, arr rice bunnies
>>107364321China looks good, japan looks half-chinese, korea looks pretty good too.
How did Ostris manage to include lora training for Z-image when the training scripts aren't even released yet?
>>107364327im playing with my pp
>>107364302If it can do Spiderman it means he's real.
@107364327debo
>>107364327I was trying out Timesplitters Rewind earlier. Pretty fun but still waiting for the team to release more of the story mode.
>>107363778>ComfyUnless you zoom in and notice all their right hands are amputated.
Okay that's cool and all but 1girl standing can only keep me interested for so long
>>107364342thats BBChroma tho
>>107364318
There's some qwen node. Google it or check a couple of threads back. Or was it in /lmg/? I don't remember.
If you REALLY want seed variety just inject noise into the conditioning. But really, you should just learn to prompt (skill issue)
>prompt: a girl jogging
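For reference, "inject noise into the conditioning" boils down to something like the sketch below: take the text embedding the sampler is conditioned on and add a small seeded Gaussian perturbation before sampling. This is a toy version on plain nested lists, assuming the conditioning is a tokens x channels grid of floats. `add_conditioning_noise` and `strength` are hypothetical names, and this is not a claim about how any particular ComfyUI node implements it.

```python
import random


def add_conditioning_noise(embedding, strength, seed):
    """Return a copy of a tokens x channels embedding with Gaussian noise added.

    embedding: list of token vectors, each a list of floats (assumed layout)
    strength:  standard deviation of the added noise; 0.0 leaves values unchanged
    seed:      seed for the noise, kept separate from the sampler seed
    """
    rng = random.Random(seed)
    return [
        [x + strength * rng.gauss(0.0, 1.0) for x in token]
        for token in embedding
    ]


# Toy stand-in for an encoded prompt: 3 tokens with 4 channels each.
conditioning = [[0.1, -0.2, 0.3, 0.0] for _ in range(3)]
variant_a = add_conditioning_noise(conditioning, strength=0.05, seed=1)
variant_b = add_conditioning_noise(conditioning, strength=0.05, seed=2)
```

Each noise seed gives a slightly different conditioning for the same prompt, so the output can vary even when the model collapses to near-identical images across sampler seeds. Too much strength will presumably drift away from the prompt, so it would need tuning per model.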
>>107364172
It is just mimicking real smartphone photographs. Chroma is only less popular because it is heavier to run and most vramlets don't know about HD Flash.
>>107364365>just inject noise into the conditioninghow do you do that?
>>107364359Totally irrelevant and out of context post
>>107364369ConDelta
>>107364327None. I don't have time. Too much content to make
>>107364366>Chroma is only not as popular due to heavier constraints to runbullshit, Flux got popular, Wan got popular and both are bigger than Chroma, you're coping hard
>>107364365>sneed variety-anon posts again
>>107364262If women want to fuck niggers so badly why do interracial porn sites always need to pay the actresses extra?
>>107364332It's not bad, but not many people are bothering to test it because their only need is efficient 1girl production and z has met this.
>>107364387Shut the fuck up retard. Learn to read posts next time.
>>107364402>buttmad
>>107364387I CANT SEED
I love human civilization and social media
I just want to know can z replace chroma?
>>107364374ummm... okay
>>107364390Because they all have big penis, so it takes a bigger toll on their tiny Asian vaginas. Imagine 10 inch BBC stretching them to the limit
>>107364200choking doesn't seem to work too well on z-image-turbo, at least in english. you can have a shirt with text.
>>107364390its husband compensation for the stretched pussy
>>107364339
aaah yes that old game that everybody keeps on going back to.
>>107364352
I don't see Timesplitters on Steam? is it console?
>>107364410twitter is really a cesspool, like you lose your sanity if you keep reading those retarded takes (but it was meant to, the more they ragebait the more they get money)
Everyone is obsessed with the seed variation or what? even on leddit they can't stop talking about it lol
https://www.reddit.com/r/StableDiffusion/comments/1p99t7g/improving_zimage_turbo_variation/
go pornspam elsewhere
>>107364455are people retarded or something? this is a product of distillation
>>107364361
>Google
It's really hard to google this considering there are already two qwen imagegen models for comfyUI.
The only things I found involved running inference in a separate program and just using comfy as a frontend.
>>107364431It's a free fanmade remake of the original games but it's not completed yet. Google Timesplitters Rewind and you should find the download
>>107364366>HD Flashthis shit keeps randomly doing anime like 50% of the time so i dropped it
Why is nothing happening?
The ape let's out deep aggressive grunts as he splurges gallons of thick slimy semen in the 5ft tall tiny asian. Her toes curl up as she relieves his raw love potion not meant for humans
>>107364466No it's not fucktard. It's a product of deliberate supervised fine tuning on a subset of high quality imagesDon't respond to me if you haven't read the paper
>>107364475we don't see shit nigger
>>107364481I wiped my butt with the paper so ai'm qualified
>>107364365Got a workflow? Seems worth a try
>>107364495Just try this >>107364455
I just opened reddit and saw an actually good idea for better image variance.
Run a few inference steps with no prompt and then run the rest with the prompt.
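A rough sketch of that step split, just to make the idea concrete. It only computes which step ranges run unprompted and which run with the prompt; the sampling itself is left out. `split_steps` and `prompt_schedule` are hypothetical helper names. In ComfyUI the two ranges could presumably be wired as two sampler passes using the start/end step inputs on KSampler (Advanced), with the first pass given an empty prompt.

```python
def split_steps(total_steps, unprompted_steps):
    """Return (start, end) step ranges for the unprompted and prompted passes."""
    if not 0 <= unprompted_steps <= total_steps:
        raise ValueError("unprompted_steps must be between 0 and total_steps")
    return (0, unprompted_steps), (unprompted_steps, total_steps)


def prompt_schedule(prompt, total_steps, unprompted_steps):
    """Return the prompt used at each step: empty first, then the real prompt."""
    first, second = split_steps(total_steps, unprompted_steps)
    return [""] * (first[1] - first[0]) + [prompt] * (second[1] - second[0])


# 8 steps total, the first 2 run with an empty prompt.
schedule = prompt_schedule("a girl jogging", total_steps=8, unprompted_steps=2)
```

The unprompted steps let the seed decide the rough composition before the prompt takes over, which is why more of them should mean more variety but weaker prompt adherence.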
>>107364510>>107364455Yeah these two are similar ideas
>>107364437
>Vito CorleoneLook how they massacred my boy...
>>107364369KSampler with Variations
>>107362825
get any low-parameter qwen llm model
ask it to translate it to chinese characters
>>107364548>>107364548>>107364548>>107364548
How did you anons squeeze vintage photographs out of Zprompt for picrel>Retro 80s expedition photo with faded colors and film grain, wide shot of jungle river scene, three voluptuous Brazilian women with caramel skin and long black hair wearing grass bikinis that barely cover their massive breasts and huge buttocks, gold hoop earrings catching sunlight, bathing in shallow river water, water dripping down their curvaceous bodies, arched backs and posed provocatively, washing each other's hair and bodies, tropical waterfall in background, vintage color palette with warm tones, caption in yellow retro font 'Bathing rituals of the indigenous people', 1980s adventure magazine aesthetic>>107364250>needs an entire torrent client of configuration and diagnostics WebUI and CLI, for either mode of using ComfyUI.no it really doesn't, but there's no point arguing with you because it doesn't matter either way because no one will use it for the reasons I mentioned
>>107364394x-img
>>107364550I noticed it struggles with a lot of male celebrities. It seems to just generate a vague combination of a bunch of other celebrities mashed together.
>>107364660
i tried star trek and star wars, gets 90% of them wrong.
shame.
>>107362979
>>107364365These results are underwhelming
>>107364468cool, very nice. Maybe it's time for some Timesplitters Rewind.