[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107360388

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
I HATE SPAGHETTI
>>
>>107362800
What local model can I use for this? Is there a comfy workflow incorporating it?
>>
File: file.png (821 B, 704x18)
821 B
821 B PNG
>just one more 3gb + 3gb + 3gb pytorch blob bro
>>
>>107362825
api nodes
>>
Can anyone qrd training 16-32-64 channel lora differences?
>>
Blessed thread of frenship
>>
File: file.png (101 KB, 874x574)
101 KB
101 KB PNG
>>107362825
I used GLM 4.6 and it's pretty good. It can even crack a joke while thinking.
>>
Bruh?
>>
>>107362868
no
>>
File: hog-buff.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>
I have a 4070 and each training step takes 18 sec on 256 res. Is this normal? I need my big ass lora.
>>
I rate Chroma, glad only a few know how to use it
>>
>>107362886
I agree..thank the gods only a few use it...
>>
File: 1755218130534024.png (55 KB, 907x538)
55 KB
55 KB PNG
anything better than this for automatic aspect ratio?
>>
File: 1737259997556317.png (1.93 MB, 1024x1496)
1.93 MB
1.93 MB PNG
SHUT THE FUCK ABOUT CHROMA

https://files.catbox.moe/bwv1fc.mp3
>>
>>107362885
Ok nevermind after the first save it speeds up considerably. TIME FOR BIG ASSES YES!!!
>>
File: flux2_bf16_c-0051.jpg (254 KB, 1600x1600)
254 KB
254 KB JPG
>>
File: milkdong.png (1.27 MB, 1328x452)
1.27 MB
1.27 MB PNG
>>
>>107362886
both z and chroma are installed anyway, kek
>>
>>107362916
Ask a chatbot to write an aspect ratio node that takes into account the final res being 1MP so you'd only have to define width and height as a ratio and not "primary_dim". If you want to scale it larger, use a math node to multiply the sides by whatever factor
>>
>>
>>107362978
can you speak english man
>>
>>107362979
Tell me more about your sampler/scheduler/steps setup please?
>>
>>107362886
i heard that Res4lyf is causing memory leaks, are you having any issues when using it with chroma?
>>
File: 4ch.png (1.75 MB, 2105x1126)
1.75 MB
1.75 MB PNG
>>107362989
>>
>>107362916
Depends on what you mean by better, but I am using Flux Resolution Calc by controlaltai. Only learned about it last night, but it seems functional?
>>
File: 1734880429555338.png (167 KB, 1563x1028)
167 KB
167 KB PNG
>monitoring
>takedown notices
Does BFL think they're the fucking police?
What the hell
>>
File: 1761614714682446.png (130 KB, 637x358)
130 KB
130 KB PNG
>>107362920
kek
>>
File: ComfyUI_09127_.png (1.77 MB, 1152x1152)
1.77 MB
1.77 MB PNG
>>107362842
>Amateur photograph of a sexy Japanese female cop, going up stairs outdoors, as viewed from behind and below
>>
>>107363017
they call the internet watch cops foundation
>>
File: ComfyUI_09123_.png (1.8 MB, 1152x1152)
1.8 MB
1.8 MB PNG
>>
>>107362878
which one? zimage?
I expect breasts, ass, body fat, penis size, age slider loras to be a thing anyway, if civitai doesn't ban them to oblivion of course
>>
>>107363022
thx :3
>>
>Click "Stop Job"
>Nothing happens
>Have to edit the database entry to change it's status to "stopped"
I was expecting better.
>>
>>107362985
Arguably the safest resolutions to work with are based on 1MP i.e. the total pixel size of a 1024x1024px gen. Your node should take that into account when calculating a specific resolution based on an arbitrary ratio i.e. 2:3. The prompt you'd give to a chatbot would look something like "give me a custom comfy node that will take two variables, width and height as ratio, and output the width and height of an image that respects 1MP sizes."
I would share mine but I don't have access right now so I apologize that you have to parse the meaning of my shitty explanation. I hope this makes sense.
>>
File: zimg_0067.png (1.8 MB, 1344x1728)
1.8 MB
1.8 MB PNG
>1500 steps, almost puss
https://files.catbox.moe/vcfa0o.png
https://files.catbox.moe/sxkrh3.png
>>
>>107362978
>>107363014
I guess I can go with 4MP (2048x2048) for z-image and add a ratio to it if that's possible, thanks for the idea anon
>>
>>107362920
>tranimefag seething about chromakino from a guy who made ramtorch which enable ostris to get us to train qwen image on 24gb of vram easily along with many future models
its ok if youre a promptlet who doesnt care about realism
>>
File: girl.jpg (196 KB, 768x512)
196 KB
196 KB JPG
Does ComfyUI work better on Linux than Windows 11?
>>
any lora masters here? what should i be going for with dataset sizes? 20 images? 100? i mostly do illu/noob/chroma
>>
>>107363055
>>107363042
Yeah, just tell the chatbot that you want sizes like 4MP or whatever but you only want to input just the ratio and have the node calculate exact size based on that. At least that's what I have.
>>
>>107363002
Thanks!
>>
File: are you for real?.png (3.08 MB, 2394x2018)
3.08 MB
3.08 MB PNG
>>107363058
>who doesnt care about realism
you call that realism chromakek?
>>
>>107363062
it might delete your entire output and model directories on update if you are on linux
>>
>>107363034
>if civitai doesn't ban them to oblivion of course
hopefully people will share them here, the age and breasts one was amazing to do "timelapse" of a character
>>
File: ComfyUI_09131_.png (1.42 MB, 1152x1152)
1.42 MB
1.42 MB PNG
>>107363022
Z-Image attempt. Same prompt. Even with the best of my prompt engineering I can't get it to do this.

>Amateur photograph, split view of a beautiful Korean idol woman with pink hair who is seated in her office, diligently working on patient charts. She is a model dressed in a crisp, light blue nurse uniform, embodying a sense of dedication and care in her role. On the left side, a close-up shows her face as she looks up with a warm, reassuring smile, making an "okay" sign with her hand. The right side shows POV, first person view of her sitting with her bare feet underneath the desk, with her panties loosely rolled down around the ankles. Her fingernails are painted light pink.
>>
>>107363017
>tfw banned from using Flux
>>
>>107363017
They're sucking up to big tech, the only chance BFL has from going bankrupt is being bought up by big tech.

Big tech wants censorship in local since avoiding SAAS censorship is a huge reason to go local.

All western AI companies have gone all in on 'safety' for their open models, intentionally crippling them in some cases, like BFL with Flux, and setting licensing terms which means they can just terminate you right to use the model if they feel you are doing something they don't like.

Thankfully China has come along and thrown a massive curve ball, derailing the 'we will control what you can do with AI' plans of western corporations.
>>
>>107363062
It should, but the best is to have it run in a dedicated PC and access it from another.

>>107363091
No, that only ever was an issue with their desktop version. Just mirror their github and you'll be fine.
>>
How do you prompt zit to NOT show text? I tried pasting in character profiles to make it generate a portrait for them but it ends up including all of that text into the image as well.
>>
>>107363017
those guys are fucking insane, and I'm glad they got the hate they deserve
>>
>>107363122
why can't shit just be a simple exe already. I am so sick of this fucking python juggling
>>
>>107363062
Yes, all AI workloads work better on Linux than Windows, all research and commercial use of AI is done on Linux.
>>
>>107363114
From what I understand, they check big sites like civitai and if there are things they don't like, they can send cease and desist, and civitai will comply.
Overall, it's like having 2 moderations on a row, so fucking retarded.

>>107363115
Well if you look at api only image models, they usually train on nsfw content, they just censor it both in the prompt and the output image, which is the clever thing to do, instead of destroying anatomy understanding by mangling any nudity for "safety".
>>
>>107363089
Ew, how do I delete the left half of someone's image?
>>
>>107363135
waaah waaah
>>
File: Z-image turbo.png (2.13 MB, 1280x1280)
2.13 MB
2.13 MB PNG
>>107363112
translate that to chinese and it'll work
>业余摄影作品,采用分屏构图:画面左侧展现一位拥有粉色秀发的韩国偶像女性,她端坐于办公室内,正专注地填写病历表。这位模特身着清爽的浅蓝色护士制服,完美诠释了职业的奉献精神与关怀态度。特写镜头捕捉她抬眼望向镜头时温暖而安心的微笑,同时用手比出“OK”手势。右侧采用第一人称视角,呈现她赤脚蜷缩桌下的画面,内裤松垮地垂至脚踝。指甲涂着淡粉色指甲油。
>>
>>107363046
I went gay because of these images.
>>
Ok, I've reached my conclusion: Z-Image Turbo and Chroma HD Flash are both shit, but Chroma HD Flash is slightly better.
>>
>>107363017
holy fuck, what a waste of resources
>>
>>107363089
>1girl laying down
kek
anyway, zimage will be better but only when loras come out for zimage base and seed variance gets fixed
>>
>>107363160
>aijeets forced to learn chinese
china no
>>
>>107362920
Kek what did you use for this?
>>
>>107363178
So you agree chroma mogs Z-Image?
>>
>>107363017
>We maintain a reporting relationship with organizations such as the National Center for Missing and Exploited Children
>"Good news: we found one of your missing exploited children, she's an image on the computer that someone generated"
>>
>>107363168
give me your chroma hd flash workflow
>>
File: this.png (174 KB, 2526x577)
174 KB
174 KB PNG
>>107363180
we'll get better prompt understanding with the base model since it's the one that can use CFG at all
https://github.com/Tongyi-MAI/Z-Image/blob/main/Z_Image_Report.pdf
>>
File: ComfyUI_00444_.png (1.89 MB, 1152x1152)
1.89 MB
1.89 MB PNG
>>
File: 836696060.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>
>>107363002
wtf delete that shit
>>
>>107363198
>it'll be 0.01 better bro that's A BIG DEAL
>>
>>107363160
DeepL translation didn't work. Guess I have to use an LLM
>>
>>107363211
this is the difference between 0.917 >>107363112
and 0.926 >>107363160
>>
>>107363168
cute chart uwu :3
>>
>>107363168
The lack of variation in turbo is depressing. They basically all look the same.
>>
>>107363231
arr rook same
>>
>>107363231
You can thank RLHF for that
>>
>>107363222
Use a Qwen.
>>
>>107363188
>So you agree chroma mogs Z-Image?
Depends on what one wants. If you're a vramlet then zimage is a no brainer
If you want more unique looking women instead of the samey ones zimage produces, if you want better nsfw, if you want milfs, then chroma v48/hd
If you want to find a good prompt and then gen a few hundred images to go through to get a lot of good unique images for no effort of constantly changing the prompt then chroma is the only option.
Overall chroma is better for someone who knows what he wants, as zimage is too specialized for a specific look that is good ootb, but gets boring. We need base loras and seed variance fix.

But there is no reason to not use both for their own strengths anyway.
>>
>>107363249
>If you're a vramlet then zimage is a no brainer
why are you pretenting chroma is like 10x bigger than z-image, this is a fight between a 6b model and a 8.9b model, the difference isn't that big
>>
>>107363249
>If you're a vramlet then zimage
Chroma takes like 78 seconds to gen a single image on a 5090. Are you using a blackwell or something?
>>
>>107363249
this reminds me of easy fluff having a bit of time before sdxl had NSFW trained into it
>>
>>107363194
It's just the Comfy example Z-Image workflow with the text encoder changed to t5xxl_fp16 with 1 min_padding and 77 min_length, and the model changed to Chroma HD Flash (Q4_0 GGUF). Same 9 steps euler simple, 1.0 cfg.

Would it be better if I could load it in bf16 for a proper apples-to-apples? Yeah maybe. I can try that if this really bothers you guys. Not sure if I can offload that much memory but I can try lol
>>
z-image feels like a more advanced model than chroma desu, if it learns well it wins
>>
File: 00005-3064468002.png (1.66 MB, 1160x1496)
1.66 MB
1.66 MB PNG
Z-image works pretty well with neo forge. I used these settings for this image: DPM++ 2M/SGM Uniform 8 steps + hi.res fix+adetailer. This took about 1 min 30 secs. Not too shabby for rtx 3060.
>>
>>107363249
>If you're a vramlet then zimage is a no brainer

Chroma HD Flash GGUF should fit into a vramlet setup though.
>>
It's literally one chroma guy. The Asian feet dude. No one else.
>>
>>107363168
>when the text is so unreadable that it actually changes into a featureless smudge in the thumbnail
>>
>>107363320
Also blurjeet
>>
Reminder: you are using the gimped distilled version of Z. Base will be much better.
>>
>>107363255
>he didnt actually gen with chroma
>>107363264
I get ~30-50s per image depending on settings but I gen 8 images in a batch for speedup, power limited 3090, no fp16 accumulation for quality improvement, 1280x768, 26-35 steps depending on prompt, Q8 chroma and fp16 t5
>>
>>107363293
>1 min 30 seconds for 1girl, asian, standing
just open google and have a stroke at that point, there's a trillion of these images already available
>>
>>107363339
This post is disingenuous and made with the sole purpose of causing Z-Image base to be received poorly.
>>
>>107363346
NTA but the point is not a random photo of an attractive asian girl, it is that there is an attractive asian girl inside my computer that joyfully does whatever I tell her to do. Twitter people posting themselves does not compare in any way.
>>
I don't get it, why do some people have such an autistic fixation on specific models? I try out most models but when something better comes along I'll use that one, I don't get it.
>>
>>107363264
>takes like 78 seconds
Unless you state resolution and steps, that value is meaningless

>on a 5090. Are you using a blackwell or something?
5090 is Blackwell architecture
>>
>>107363320
>It's literally one chroma guy. The Asian feet dude.
not gonna lie he's probably one of the worst anons on this general, his shilling is ubnoxious, he's so desperate for his shit model to be recognized as good, but you don't beg for praise, you just show your images and let other judge, like the Z-image developpers did
>>
>>107363373
where do you think you are?
>>
>>107363378
>he's probably one of the worst anons on this general
You are the worst anon in this general
>>
>>107363264
Takes me 60s per image on a 4070 with Chroma, significantly less if I do low-step DEIS.

But I'm ok with resolutions like 640x1152, so that's the difference.
>>
>>107363368
I mean, I assume Z-Image base will have the same problems all base models do. It'll come out, people will post "IT'S OVER" and "KEK" and what have you, and then a few months from then we'll have serious finetunes that work the way you'd hope and all the early doomposting console war stuff will be quietly forgotten.
>>
File: file.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
I'm having trouble getting extreme angles.
Like the camera pointing straight up.
>>
>>107363373
it's obviously a bit
>>
File: this is you.png (1.51 MB, 2000x1000)
1.51 MB
1.51 MB PNG
>>107363390
you don't know who I am chromakek
>>
>>107363378
Yeah out of 5 actual schizophrenic nogen complainers that are here 24/7/365 the worst is another guy who likes a particular model and posts asian 1girls itt, most sane nogen
>>
>>107363255
Chroma is literally 5-6x slower though, even if you're using the flash version it's still >2x slower. And it wouldn't be a problem if it was slower for a significantly better result, but it's slower for an often worse result...
>>
>>107363368
>>107363399
I will be the one to convince anon that it's a good model like I did with NoobAI. Don't worry.
>>
>>107363198
what happens at 1.000?
>>
>>107363418
the generated pixels materialize into mustard gas through your screen
>>
File: zimg_198_.png (2.2 MB, 1536x1536)
2.2 MB
2.2 MB PNG
>>107363373
>why do some people have such an autistic fixation
>autistic fixation
>autistic
You answered your own question anon
>>
File: file.mp4 (2.09 MB, 720x1280)
2.09 MB
2.09 MB MP4
>>107363403
Here's an old wan gen of something closer to what I want.
>>
>>107363399
>a few months from then we'll have serious finetunes
Where are the 'serious finetunes' Chroma was expected to get?
>>
>>107363411
>another guy who likes a particular model and posts asian 1girls itt
you convetiently forgot the part he starts to make walls of text to explain how his image full of oversaturated colors and nonsensical anatomy is actually valid and that you're the one who don't know how real life looks like! this guys is insufferable
>>
>>107363418
d̶̛̛̻̫̦̩̗̤̬̩̹͍̥͔͚̮̟̙̣̬̬̟̥͈̟̠͔͎̘͈̦͔͈͖̰̮̯̗̜́̑̐̽̎͊̀̔͌̊̈̽̏̈́̎͑̿̆̑̆͐͌̿̎̈̐͐̾̎̇̽̓͑͊͐̒̇͐͑͂̂͑̀̈͊̈́̋̒̌̄̈̓̋͒͐͛͆̈́͐͆̇̈́̔̇̽͋̈́͊̋̒̓͑̀̎͘̚͘͘̚͘̚̕͜͝͠͠͝ͅe̶̢̧̢̢̡̨̢̧̛̛͙̩̞̮̮̜͇̤̠̱̦̼͇͍̻͓̜͖̙̹̗͖̟͇̥͓̱̺͇̼̤̮͎̖̜̘̥̬̦̩̝̘̪̦̜̻̩̼͇̤̘̫͖̝̮̬̳̗̘̖͙̼̝̞̰͔̜̖̳̜̦̣̮̟͇̙͚̟̼̣̤͙͈̠̟̫̪̹̻̱͙̊͂̉́͒̎̿͋͐́͂̽̃̓͌̂̐̽̉͐̑̈̓͊̀̇͌̑̉̌̀̏̒́͒͗̑̈́̂̆̄̄̓͂̆͂̃̚̚̕͝͝ͅͅͅͅͅa̵̧̨̛̛͉̙̖̩̪͚͔̫͖̖͓̣̜̠͉̝̤̺̼̠̖̺̞͚̻̼̳͚̿̒̓͛̃͛͛͌̽̆̾̔̀̈́̈́̓̍̔̈̉̀̑̏̈͂̈̀͆̉̎̀̔̎̀͂̓̀̈̓̆̃́̐̄̈́̐̍̏̈́̈͐̐̄̆̐͋͑̇̀͗̈́̎̏̃̈́̂̇̇̒̕̕̕͜͠͝͝͝͝͠͝͠͝t̸̨̨̡̧̛̯̺̪̯͕͇͓̙͎͈̱̦̥͕͍̟̞̮̫̜̝̮͎̩̘͚̳͍̻̫̝̺̥͈̟͈̪̤̯̝̗̼͔͈͍̙͔̭̯̟͈̹̼̰͓͐̊̂̊̆́̎̇̈́̋̈̾̓̄̚͜͝͝ḩ̵̡̧̧̢̡̧̧̛̛̛̳̞̖͚̺̰̯̺̝̟̝̝̣͓̙̩͉̟̠̻͕̗̪̥̦̩̝̹͕͚̫̟̞͙̭̹͉̼̱̹̳͙̭̜̝̪̹̰͔̹̜͎̺̰̹̪̱̠̫̖̗̯̹͎̮̪͖̯̮̠̦̜͇̩̘̱͕̈́̊̃̿̿̈́́̍͒͌̒͋͌̓͆̓̈́̔̊̑̆͗̓̒͑̓̂̈̾͗̆̐̾̓̊͆̍͌̍̈́͊̍̈́̃͆̂̏͆̀̍͋̽̐̈́͒̈́̔̏̊̎́̎͒͑͆̃̓͌̐͒̋͆͘̕͘̚͜͜͜͜͠͝
>>
>>107363418
>what happens at 1.000?
the mememark will be deemed too easy and they'll go for something harder, like it happened several times on the LLM ecosystem
>>
seems like 1500 steps is the sweet spot for training rn
>>
>>107363447
Sorry I haven't been following. How long does it take and on what card? Also, how much would this actually translate to the base model?
>>
don't find Chinese girls as appealing as JAV stars preening
>>
>>107363434
i dont remember him saying that, i agree that his oversaturated settings are bad but his defense of chroma overall was right most of the time, there are many more retards that cried about chroma for no reason all the time despite the model being good for a lot of things since v3x anyway, even though there are multiple bigger problems with it
>>
don't find Mexican bodybuilders as appealing as JAV stars preening
>>
>>107363461
>i dont remember him saying that
why are you talking about yourself in the third person
>>
after 50 gens i finally managed to get z-image to do a sex prone bone position
>>
do we know the size of nano banana (the first one) or DALLE3?
>>
>>107363447
Using steps as a measurement makes very little sense since it is tied to the amount of images

If you train 20 images, 1500 steps is likely enough, if you train 200 images, 1500 steps is almost certainly too few

Better to use epochs as your measurement, as in how many times every image has been trained
>>
>>107363399
>Image base will have the same problems all base models do
this will be compounded in the eyes of anon because his euler a simple 9 step config will look like ass kek
>>
>>107362868
>Can anyone qrd training 16-32-64 channel lora differences?
anyone?
>>
>>107362920
you cocksucker
>>
>>107363505
No
>>
>>107362920
>https://files.catbox.moe/bwv1fc.mp3
this is beautiful lmao
>>
File: serious Pepe.png (359 KB, 728x793)
359 KB
359 KB PNG
What is the native landscape resolution for Z Img? 1024x1024 is boring
>>
>>107363435
i remember zalgo
>>
>>
>>107363525
It works with whatever resolution you put in so long as each size is <2048
>>
>>107363525
go for 1920x1080
>>
File: ComfyUI_04272_.png (1.31 MB, 864x1152)
1.31 MB
1.31 MB PNG
>>
>>107363491
yeah, anon mentioned something as low as 18 images. of course you don't need 3k+ steps for that.
>>
>>107363491
NTA you have that backwards. Step count is more concrete than epoch because epochs are based on number of images among other things.
>>
>>107363430
SDXL finetunes took 1-3 years to emerge, it isn't even 6 months since Chroma finished training

That said the smaller the model the more likely you will see finetunes earlier (since training is faster), SDXL is very small compared to Chroma, but Z-Image Base will likely be smaller than Chroma at least
>>
>>107363539
>>107363537

thank you, kind anons
>>
>>107363202
this is amazing, what model?
>>
File: 1744009349650592.mp4 (2 MB, 1578x1038)
2 MB
2 MB MP4
>>107363249
>We need base loras and seed variance fix.
Someone claims that using the ddim_uniform scheduler fixes the seed variance
https://xcancel.com/Machinedelusion/status/1994531413744652336#m
>>
>>107363577
Z-image turbo obviously
>>
>>107363578
>fixes seed varaince
>mp4 unrelated
>>
>>107363293
would
>>
>>107363596
What do you mean?
>>
>>107363539
>>107363537

which sampler and scheduler for realism?
euler is boring too
>>
>>107363578
gen 8 "office girl, buttoned up shirt"
>>
>>107363293
>Not too shabby for rtx 3060.
8GB or 12
>>
>>107363609
learn to experiment anon
>>
>>107363546
No, step count is overall worthless unless you are always talking about a specific amount of images.

1 epoch = every image has been trained once
10 epochs = every image has been trained 10 times

1500 steps = means nothing unless you know the number of images, if you have 1700 images, 1500 steps won't even have trained every image once

steps are a stupid target measurement for ai training, only useful to know where in an epoch you are
>>
should i be upscaling with the same sampler/scheduler and steps as the original image when using chroma flash?
>>
>>107363608
There's barely any difference in the images even with the different scheduler.
>>
>>107363608
He wants 1.5-tier variance. I also want that, but I understand why we don't have it.
>>
File: 1734894660741551.png (104 KB, 594x594)
104 KB
104 KB PNG
>100% of the threads images are Z-image, because it just released
>retard anon:
>w0w wut modal u usin????
>
>>
>>107363543
can we call Nogens Noggers from now on?
>>
>>107363491
shut the fuck up, no need advice from some loser anon like you, people here give the worst advice/take on stuff, if it looks good to me thats enough
>>
>>107363168
Is Z-Image actually this bad at doing huge breasts? That's really concerning
>>
>>107363622
I do, I really do

I remember res_3m / bong was a thing back then
>>
>>107363630
kek
>>
>>
>>107363642
Soon we will have bottom on Z-image.
>>
>>107363578
Something that might be worth trying is values below 1 on aura flow shift, or setting cfg below 1. Really though the solution is probably similar to what you had to do with Flux: shuffling input images with a denoising value lower than 1.
>>
>>107363652
I'll alow myself the luxury of hope once the base model dropped
>>
File: 1742973148802649.jpg (731 KB, 2383x1606)
731 KB
731 KB JPG
>>107363616
>gen 8 "office girl, buttoned up shirt"
lul
>>
>>107363652
SDXL was capable of NSFW from the getgo, isnt Z-image made for the exact opposite? (as stated by the developers o algo)
>>
>>107363668
i guess its getting fucked by the aspect
>>
File: 1764337931717772.jpg (284 KB, 900x817)
284 KB
284 KB JPG
>>107363578
>men playing basketball
>get exactly what you prompt
>omg why every image looks the same. muh seed variation

fucking ai jeets are so dumb, if you want something different, why don't you prompt something a little more complex? hell there are literally thousands of llm bots that can help you with that
>>
>>107363630
omg does comfyui work with it?
>>
>>107363058
>realism
https://files.catbox.moe/06i4j7.mp3
>>
VRAM is overrated. I am doing great with 10. Why do I feel pressure to upgrade because "scarcity of RAM" or GPUs are gonna go thru the roof or something's gonna happen. It's all a lie
>>
>>107363002
Comfyui does prompt switching natively now?
>>
>>107363642
>>107363664
i dont even see the fucking point of all these faggot models. they cant do shit without a billion loras.

chroma and noob is all you need.
>>
>>107363687
you can't do video gens without having terrible nuked to shit quality.
>>
>>107363633
Stop hurting my feelings you insensitive clod
>>
File: what do you think?.png (56 KB, 301x331)
56 KB
56 KB PNG
>>107363681
I think it's cool to get different images that all follow the prompt exactly because ultimately there's a lot of ways to describe the same thing, that makes it fun
>>
>>107363688
no lmao
>>
>>107363685
don't tell it vibevoice
>>
>>107363619
12gb. To be fair, I was using fp8 checkpoint and gguf text encoder, so it doesn't take million years to make one pic.
>>
>>107363630
Well to be fair, the image I was referring to is in the style of a classical painting. Besides, I'm a tourist, only dropping by only once every while or so.
How am I supposed to know what new model you guys are jerking off around in the current year?
>>
>>107363720
>How am I supposed to know what new model you guys are jerking off around in the current year?
are you seriously pretending you're obvious about the hype of Z-image? I don't believe you
>>
>>107363720
>Besides, I'm a tourist, only dropping by only once every while or so.
being a tourist doesnt stop you from reading the thread before posting your retarded fucking question.
>>
>>107363017
they are probably tonguing the anus of the government and the billion worthless ngos
>>
nearest-exact or lanczos?
>>
File: 1738877003998157.png (2.59 MB, 2671x750)
2.59 MB
2.59 MB PNG
:^)
>>
>>107363688
No, which is fucking insane.

So much basic functionality you need to install third party nodes for, sad state.
>>
>>107363681
Forcing us to fill in every single gap with prompting, using language, is not only incredibly tedious, it is actually impossible and strains the limit of the model's ability to understand the prompt. YOU are easily impressed.

Anglo/Saxon/Nord/German btw. Bit of a mutt but nobody can argue I'm not white
>>
>>107363165
it's getting better

> https://files.catbox.moe/43crqc.jpg
>>
>>107363767
it still looks terrible desu, I'm still a faggot until you finished the training!
>>
File: ComfyUI_19401_.png (2.71 MB, 1920x1088)
2.71 MB
2.71 MB PNG
>>
>>107363748
70% of workflows are custom nodes. it's fucking annoying and sad because comfy doesn't fucking care about having fun anymore
>>
there's one way to fix the seed variation is to let the instruct model rewrite your prompt everytime, each rewrite will be different and you'll get different settings each time >>107358856
>>
>>107363731
>>107363733
saw z-image for the first time in this thread. looks pretty neat. I think I'll definitely check it out.
>>
Prompt adherence (i.e. comprehension and knowledge of concepts) is the problem, not seed variance. An ideal model would have perfect prompt adherence and zero seed variance.
>>
File: 00100-3998502889.png (1.98 MB, 1216x1496)
1.98 MB
1.98 MB PNG
Can someone try to make a pic in fallout 1 style? Or fallout in general. With Z-Image. Im curious if I even wanna bother downloading it. Something like this >>107363427
>>
face it, auto1111 losing was the worst timeline. everything is reddit now thanks to cumfart
>>
>>107363747
Imagine if when they released stable diffusion 3 it was as good as z-image-turbo
>>
File: Z-image turbo.png (2.95 MB, 1920x1080)
2.95 MB
2.95 MB PNG
>>107363789
>looks pretty neat. I think I'll definitely check it out.
have fun anon, this model is really good, and don't forget your daily kneeling to our savior xi jinping!
>>
>>107363675
Post side by side NSFW base XL and Z (you won't)
>>
is there a sane way to install comfyui with uv now? last time I checked, torch was a pain in the ass

and do I need comfy ui manager?
>>
>change CFG from 1 to 1.5
>gen goes from 55 seconds to 104
what the fuck
>>
dpmpp_sde + ddim_uniform actually gives pretty realistic output but holy shit it's slow as hell, 8s/it on an a10g
>>
File: 1739171950999518.png (1.47 MB, 768x1280)
1.47 MB
1.47 MB PNG
>>107363685
kek
but its very simple, i just like big booba bimbos that chroma does well
>>
>>107363814
normal
>>
>>107363812
there is nothing sabe about installing spyware to your machine. also neoforge has zit
>>
Just look at all the newfags. Isn't it beautiful, anon?
>>
>>107363814
Any CFG other than 1 requires the calculation of a positive step and a negative step. CFG==1 calculates only a positive step.
>>
>>107363803
if you cant be bothered to download a few files and drag the default workflow to your comfyui, then why would i bother with your request, retard.
>>
>>107363794
>An ideal model would have perfect prompt adherence and zero seed variance.
You're a moron, that would mean you would get zero image variations out of a prompt

Do you even know what the seed does you absolute mong ?
>>
>>107363814
cfg actually makes 2 images, a positive image and a negative image, then it does some substraction math shit, so that's why its 2x slower
>>
>>107363814
Turbo models only work turbo-ly with CFG = 1.0
>>
File: ComfyUI_09137_.png (1.94 MB, 1152x1152)
1.94 MB
1.94 MB PNG
>>107363339
True. I guess we will see when the full model comes out how it fares.
>>
>>107363794
>zero seed variance.
that's retarded, there's always different ways to make an image off a prompt
>>
>>107363829
Yes, I do, it's a starting point in a search for a local maximum.
An ideal model would give you exactly what you asked for, no more and no less. If you want something different then ask for it. If you're too stupid to ask for something different then use an LLM to make up some random shit for you.
>>
File: ComfyUI_09139_.png (1.81 MB, 1152x1152)
1.81 MB
1.81 MB PNG
>>107363819
Chroma naturally does any description of woman better due to its seed variance and NSFW tuning.
>>
>>107363855
>An ideal model would give you exactly what you asked for, no more and no less.
don't feed the troll
>>
>>107363828
>what is a request
Ok subhuman, have fun with your vramlet model
>>
im using heun and BONG sampler for chromaflash, is there a better combination you guys would recommend?
>>
>>107363828
what else do you expect from mikuniggers
>>
>>107363794
although variance should be maximal for unspecified factors. For instance, if your prompt is "Miggu in a catsuit" you should see Miggu in a catsuit across a vast array of locations and poses across generations. If Miggu appears in a catsuit in the same boring pose and location the model is bad imo.
>>
>>107363825
im not a newfag, im just retarded and curious.
>>
File: I mean...png (415 KB, 500x564)
415 KB
415 KB PNG
So that's all what the chromakeks have left huh? Seed variance?
>>
do you think anon will ever stop comparing tunes to base models like a retard
>>
>>107363876
mikuniggu and asaniggu no less
>>
>>107363880
And big boobs >>107363819
>>
>>107363863
Chroma skin texture looks kinda unnatural, like plastic wax
>>
File: firstcrusade.png (1.9 MB, 1536x864)
1.9 MB
1.9 MB PNG
>>
>>107363690
Which noob btw? I mean 1.0 or 1.1?
>>
>>107363880
prompt adherence too
base is going to suck, you know, all this turbo can do is make passable 1girls
>>
the only issue with chroma is how fucking it long it takes to generate an image. you HAVE to be edging constantly or you'll cum before the image finishes.

make it faster NOW lodestone.
>>
>>107363884
That would require him to actually understand the difference which is impossible.
>>
>>107363913
I'm not a chroma shill but that's a wallet issue.
>>
File: this.png (1.22 MB, 1080x906)
1.22 MB
1.22 MB PNG
>>107363894
>Chroma skin texture looks kinda unnatural, like plastic wax
Chroma's skin texture used to look good, and then lodestone decided to make the model run at lower speeds to make the vramlets happy and in consquence it got more slopped with the subsequent epochs
>>
File: images.jpg (8 KB, 230x220)
8 KB
8 KB JPG
>>107363890
Are all of you Z-iggers just ex NetaYume copers?
Happy that you can finally run something similar to chroma but not quite?
>>
>>107363919
i have a 5090
>>
>>107363906
>prompt adherence too
Chroma can use CFG, Z-Image turbo cannot, we'll have to wait for base to see how better at prompt adherence it's gonna get
>>
>>107363913
>the only issue with chroma is how fucking it long it takes
no
not at all
it's absolutely horrible with hands & feet.
>>
>>107363913
the trick is to have to previous image you genned opened in another tab while you wait
>>
>>107363922
>he cheaped out and didn't get the 6000
ngmi
>>
>>107363906
>all this turbo can do is make passable 1girls
No it does more but also yeah they specifically designed it that way, read the paper.
>>107363921
No I'm still mildly sad that Neta will probably become obscure but some of the Z team worked on it so it's whatever.
>>
>>107363921
NetaYumers got bullied so hard they defaulted to Z-image, yet it cant do NSFW making it a useless toy lol
>>
>>107363855
You're so stupid. Being forced to make a prompt adjustment in order to get an image variation is absolutely retarded if you know HOW image generation works which you clearly don't.

But you can do that RIGHT NOW if you want, just fucking lock the SEED value and you get EXACTLY what you describe, seriously how fucking dumb are you ?

By your logic EVERY model is already perfect.
>>
>>107363936
kek, fuck you. fair enough. i'll buy the 7000 when the 60 series comes out.
>>
>>107363766
Idiot thats fine and dandy but look at the prompt of the people who are complaining, I think you can do better than "men playing basketball"
>>
File: ComfyUI_09144_.png (2.39 MB, 1152x1152)
2.39 MB
2.39 MB PNG
>>107363870
Very experimental sampler, dropped it but back when I used I tried linear/heun3s seemed to work fine
>>
>>107363320
Sometimes I think that guy is actually just a chroma hater baiting with bad gens, cause no one can be that blind....
>>
downloading models in comfyui is a chore. would be better if the workflows had magnet links and it had a torrent client built in and it just downloaded them.

can any of you fags make this happen?
>>
There are two types of models: 'it just works' models, and grail models.

The first type do what you ask. They do it as well as you can expect, with minimal error. They are highly predictable, and produce serviceable content with ease. They are "stable".

The second type are unstable, unreliable, and will give you a heaping pile of garbage. But the garbage is interesting. It is very diverse garbage. In many of these failed and worthless gens you see hints of something eerily real. You sense that if it ever managed by sheer luck to get everything right, it could be incredible. And not only this, but you suspect that lurking somewhere out there are gens of unknown transformative power; gens that can redeem all this wasted time. Once in ten thousand gens you may find gens that are really beautiful, and a comforting consolation for the frustrations; but even those aren't the point.

Which of these two you prefer is mostly a question of what sort of person you are. Women will never be interested in the second kind of genning, and most men won't either.
>>
>>107363987
1.5 kinda did both?
>>
File: ComfyUI_08750_.jpg (993 KB, 2048x2048)
993 KB
993 KB JPG
>>107363920
Have you tried mixing v26 with the HD Flash delta weights anon-kun? Pic rel is result you get after mix with v50, starts looking natural again.
>>
File: many such cases.png (569 KB, 1500x1000)
569 KB
569 KB PNG
>>107363975
>cause no one can be that blind....
either it's a falseflag or someone who got lost in the sunk cost
>>
>muh chroma only has seed variance
>muh shit realism
>skin texture looks kinda unnatural, like plastic w-ACK
Skill status: issue
I don't have to prompt every atom in the image like with Z Image in order to get big boobs and distorted nipples, while also being able to continue genning unique images instead of the same 3 already similar looking women that are state mandated by Xi himself to be connected to that specific prompt and rotate in your computer.

https://files.catbox.moe/lvv0ab.png
>>
>>107363995
>everything is bright and white
>natural
>>
>>107363995
>>107363969
does Z only do disgusting ching chong whores?
>>
>>107364006
>does Z only do disgusting ching chong whores?
those are chroma images anon
>>
>>107363963
>>107363987
There's no reason why a model shouldn't be able to generate thousands of distinct and coherent pictures from a three word prompt.

>>107363999
Is this supposed to be a picture with skin texture lmao?
>>
>>107364006
thats not Z, you MONGOLOID
>>
>>107364006
that's Pooma
>>
File: the nerve.png (466 KB, 720x720)
466 KB
466 KB PNG
>>107363999
>distorted
oh now the chromakek knows what distorted means
>>
fuck it, im not going to upscale my chromagens, they take too long.
>>
>>107364012
>>107364019
>nogens malding already
>>
File: ComfyUI_temp_pxrcs_00021_.jpg (766 KB, 1768x1408)
766 KB
766 KB JPG
Hands vs Dogs :v
>>
File: lTU3eup.jpg (29 KB, 640x480)
29 KB
29 KB JPG
>>107363999
Z-iggers BTFO!
Trips of truth
>>
what the fuck does T5TokenizerOptions do? i read the description and it sounds like snake oil
>>
>>107364029
no gen no onion
>>
>>107364029
I don't need a gen to comment on your airbrushed image.
>>
File: Z-image turbo.png (1.19 MB, 1024x768)
1.19 MB
1.19 MB PNG
>>
>>107364031
>two guys making out in a forest in the rain, OP style
>>
File: 1762204981281.png (1.49 MB, 1280x1120)
1.49 MB
1.49 MB PNG
Can Z image do this or nah?
>>
>>107364048
give me the prompt I'll try it
>>
>>107364035
if it sounds like snake oil, it probably is
>>
File: ComfyUI_07959_.png (1.43 MB, 944x1280)
1.43 MB
1.43 MB PNG
>>
>>107363630
What is z-image?
>>
>>107364048
BBC sisters just discovered Z-image
>it's over
>>
I want to train a Z-Image Turbo Lora, how many steps do I need? is 1500 enough? or do I need more? I remember for a Flux Lora I needed like 5000.
>>
File: 1762205327714.png (1.01 MB, 1280x1120)
1.01 MB
1.01 MB PNG
>>107364054
shit I dont have PNG inspector on this PC
>>
>>107364083
Just set 8k steps and train until you like the style nigger
>>
File: ComfyUI_07935_.png (1.29 MB, 944x1280)
1.29 MB
1.29 MB PNG
>>
>>107364083
holy shit, you needed 5000 images per lora?
what kind of a shit model needs 5000 images for a lora? that can't be real.
>>
File: 1762383313438.png (916 KB, 1280x1120)
916 KB
916 KB PNG
Need to get back on my Chroma game to make some edits for the Zissies
>>
>>107363403
>>107363428
>I'm having trouble getting extreme angles.
>Like the camera pointing straight up.
that makes sense given the fact that zimage turbo is a portrait-tuned version of the base model

try worms-eye-view? otherwise wait until Sunday for base

>>107363633
>if it looks good to me thats enough
based as fuck. this reminds me of that nutritionist girl who was upset people were listening to a high schooler who can bench 300lbs instead of her

>>107363812
>is there a sane way to install comfyui with uv now? last time I checked, torch was a pain in the ass
i'd recommend anaconda/miniconda for doing stuff with cuda since you can install cuda-toolkit and have all the deps you'll ever need
>and do I need comfy ui manager?
no but it helps, especially when you grab someone elses workflow and you need to install all the custom nodes they're using
>>
>>107364102
>you needed 5000 images per lora?
I said STEPS nigga, can you read?
>>
>>107364118
you just got deeb'd and it was so obvious too
>>
>>107364118
okay my bad.
My response is the same btw. just reread my post and replace the word images with steps in every instance.
>>
>>107363812
i'm running comfy with uv, i just ran claude code in the directory and said "install this repo with uv"

>>107364111
lmao conda/miniconda in 2026, get with the times unc
>>
File: nbp-ws.jpg (2.05 MB, 2400x1309)
2.05 MB
2.05 MB JPG
>>107363987
very few synthographers here m8
>>
File: ComfyUI_09147_.png (1.82 MB, 1152x1152)
1.82 MB
1.82 MB PNG
>>107364003
Looks fine to me. I mean, I am prompting for a light setting outdoors in that case.
>>
File: 1762203729035.png (1.18 MB, 1280x1120)
1.18 MB
1.18 MB PNG
All the cute chinese girls be hanging with BBChroma when you turn off your Z-image model
>>
>>107364137
Very deboesque image.
>>
File: z-i-t.jpg (451 KB, 2048x2048)
451 KB
451 KB JPG
>>107363981
Making actual torrent clients with all the features and configuration they need also is a surprisingly huge chore.

Maybe -if comfyanon can be convinced that models are at risk- someone could ask for a feature in base comfyui where workflows save enough of the hashes that comfyui can generate magnets for torrent v1/torrentv2 unless it's turned off? At least that feature doesn't need a huge amount of maintenance. Of course it also doesn't put and keep the torrents online on its own.
>>
oh shit i got a context after waiting for an hour lets go

>>107364128
>lmao conda/miniconda in 2026, get with the times unc
there is literally no reason for me to switch from what works. nothing uv offers me gives me a reason to switch. claude code works just fine with conda/conda run as well. conda lets you install non-python packages as well which is useful for not dealing with any CUDA headaches ever

also I guarantee I am at least 10 years younger than you, and you are the unc in this dialogue
>>
>>107364147
>Looks fine to me
that's the problem, it doesn't look natural at all, and you seem to be the only one to not notice that and be like "huhhh? why Z-image managed to blow in popularity but not my heckin wholesome model??"
>>
>>107364162
>Making actual torrent clients with all the features and configuration they need also is a surprisingly huge chore.
no you could just use a python wrapper for libtorrent and thats part of your requirements.txt for the github repo, but no one would actually use it. hf-transfer is better for sharing anything that's not illegal, and if it's actually illegal torrents are bad opsec so it makes no sense
>>
how the fuck do i do different angles in chroma holy SHIT

i tried
>worm's eye view. shot taken from ground level. shot from the side. image taken from below.

nothing works. is there a template for perspectives?
>>
File: 1748170945533007.jpg (12 KB, 228x221)
12 KB
12 KB JPG
>>107364152
>>
>>107364160
no gen no onion
>>
File: grid_output.jpg (636 KB, 1024x4096)
636 KB
636 KB JPG
trying to train an iphone photo z-image lora and it's turning the guy more hispanic with every epoch
>>
File: 1762289727471.png (1.11 MB, 1280x1120)
1.11 MB
1.11 MB PNG
Can Z-image do good text placement like this?
>>
File: famous last words.png (684 KB, 749x499)
684 KB
684 KB PNG
>>107364194
the guy on the bottom looks like the dude on american pie lol
>>
>>107364012
Actually there are reasons, but it suffices to say: show me this model. (It doesn't exist)
>>
File: 1745333493079913.jpg (1.81 MB, 1536x2048)
1.81 MB
1.81 MB JPG
>Z natively understands russian too
...huh
>>
>>107364218
qwen 3 knows a lot of language, it's at its best on chinese and english though
>>
File: 1762285673906.png (1.25 MB, 1280x898)
1.25 MB
1.25 MB PNG
Chroma can do very sexy stuff out of the box, the level of NSFW depends entirely on your prompt
>>
>>107363994
Rose-tinted glasses, mate. 1.5 was definitely a type 2 and failed miserably at being a type 1. (I loved SD1.5 don't get me wrong)
>>
>>107364229
why is there a nigger in every of your pictures?
>>
>>107364233
He is probably* Indian, like most of /ldg/. He's right about Chroma though.

*Not necessarily ofc.
>>
>>107364175
needs an entire torrent client of configuration and diagnostics WebUI and CLI, for either mode of using ComfyUI. it'll be more work than you think, but good luck.

i'd personally just add the hashes and magnets so it can get used if someone wants to use it, without the entire torrent client attached.
>>
>>107364233
There's always one in his mirror too lmao
>>
china keeps on winning
>>
File: 1762208064858.png (1.09 MB, 1280x1120)
1.09 MB
1.09 MB PNG
>>107364245
Its what chinese rice bunnies lust after
>>
>>107364229
bro looks like a 3d-rendered fortnite skin
>>
>>107363089
>ask for Japanese
>AI delivers Koreans
bwahahahaha
no wonder Japanese hate the AI
>>
File: Z-Image turbo.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>
File: 1757436568205547.jpg (70 KB, 990x936)
70 KB
70 KB JPG
>>107364200
heres some text placement
>>
>>107364276
I can't tell the difference anyway.
>>
File: 1764368698870.png (1.5 MB, 2048x1024)
1.5 MB
1.5 MB PNG
Hmm, at 500 steps batch size=2 it's already picking up a style pretty well. It's looking better than expected for training a distilled model.
>>
File: bog1515425247448.jpg (81 KB, 840x506)
81 KB
81 KB JPG
>>107364279
hilldawg knows?
>>
File: 1746071891649.jpg (15 KB, 506x395)
15 KB
15 KB JPG
>>107362886
It's Mr Oinkers Wife
>>
File: 1762201078751.png (1.37 MB, 1216x1408)
1.37 MB
1.37 MB PNG
Can Z-image do spidermen or is it another failbake like Yume?
>>
>>107364297
lmao this fag got doxed to his family and i think at some point he still came back to post
>>
is there a way to use the already loaded qwen model for LLM inference (I want to translate to chinese) directly in comfy?
>>
File: Z-image turbo.png (2.7 MB, 1920x1080)
2.7 MB
2.7 MB PNG
>>107364283
>I can't tell the difference anyway.
is it accurate?
>>
So what games are you guys currently playing?
>>
>flux 2 dev
verdict?
>>
File: 1762292913018.png (1.07 MB, 1280x1120)
1.07 MB
1.07 MB PNG
>>107364276
arrr rook same, arr rice bunnies
>>
>>107364321
China looks good, japan looks half-chinese, korea looks pretty good too.
>>
How did Ostris manage to include lora training for Z-image when the training scripts aren't even released yet?
>>
>>107364327
im playing with my pp
>>
>>107364302
If it can do Spiderman it means he's real.
>>
@107364327
debo
>>
>>107364327
I was trying out Timesplitters Rewind earlier. Pretty fun but still waiting for the team to release more of the story mode.
>>
>>107363778
>Comfy
Unless you zoom in and notice all their right hands are amputated.
>>
Okay that's cool and all but 1girl standing can only keep me interested for so long
>>
File: 1762203259725.png (1.21 MB, 1280x1120)
1.21 MB
1.21 MB PNG
>>107364342
thats BBChroma tho
>>
>>107364318
There's some qwen node. Google or check couple of threads back or was it in /lmg/ don't remember
>>
File: 1763590063276877.png (2.99 MB, 1215x1651)
2.99 MB
2.99 MB PNG
If you REALLY want seed variety just inject noise into the conditioning. But really, you should just learn to prompt (skill issue)
>prompt: a girl jogging
>>
File: ComfyUI_09151_.png (1.79 MB, 1152x1152)
1.79 MB
1.79 MB PNG
>>107364172
It is just mimicking real smartphone photographs. Chroma is only not as popular due to heavier constraints to run and most vramlets not knowing about HD Flash.
>>
>>107364365
>just inject noise into the conditioning
how do you do that?
>>
>>107364359
Totally irrelevant and out of context post
>>
>>107364369
ConDelta
>>
>>107364327
None. I don't have time. Too much content to make
>>
>>107364366
>Chroma is only not as popular due to heavier constraints to run
bullshit, Flux got popular, Wan got popular and both are bigger than Chroma, you're coping hard
>>
>>107364365
>sneed variety-anon posts again
>>
>>107364262
If women want to fuck niggers so badly why do interracial porn sites always need to pay the actresses extra?
>>
File: flux2_bf16_c-0102.jpg (409 KB, 1600x1600)
409 KB
409 KB JPG
>>107364332
It's not bad, but not many people are bothering to test it because their only need is efficient 1girl production and z has met this.
>>
>>107364387
Shut the fuck up retard. Learn to read posts next time.
>>
>>107364402
>buttmad
>>
>>107364387
I CANT SEED
>>
File: 1741418536102617.jpg (676 KB, 1054x9185)
676 KB
676 KB JPG
I love human civilization and social media
>>
I just want to know can z replace chroma?
>>
File: 1762392770311.png (1.99 MB, 1280x1120)
1.99 MB
1.99 MB PNG
>>107364374
ummm... okay
>>
>>107364390
Because they all have big penis, so it takes a bigger toll on their tiny Asian vaginas. Imagine 10 inch BBC stretching them to the limit
>>
File: rchroma.png (942 KB, 832x1488)
942 KB
942 KB PNG
>>107364200
choking doesn't seem to work too well on z-image-turbo, at least in english. you can have a shirt with text.
>>
>>107364390
its husband compensation for the stretched pussy
>>
>>107364339
aaah yes that old game that everybody keeps on going back to.
>>107364352
I don't see Timesplitters on Steam? is it console?
>>
>>107364410
twitter is really a cesspool, like you lose your sanity if you keep reading those retarded takes (but it was meant to, the more they ragebait the more they get money)
>>
File: ComfyUI_09154_.png (1.9 MB, 1152x1152)
1.9 MB
1.9 MB PNG
>>
Everyone is obsessed by the seed variation or what? even on leddit they can't stop talking about it lol
https://www.reddit.com/r/StableDiffusion/comments/1p99t7g/improving_zimage_turbo_variation/
>>
go pornspam elsewhere
>>
>>107364455
are people retarded or something? this is a product of distillation
>>
>>107364361
>Google
It's really hard to google this considering there are already two qwen imagegen models for comfyUI.

The only things I found involved running inference in a separate program and just using comfy as a frontend.
>>
>>107364431
It's a free fanmade remake of the original games but it's not completed yet. Google Timesplitters Rewind and you should find the download
>>
>>107364366
>HD Flash
this shit keeps randomly doing anime like 50% of the time so i dropped it
>>
File: Wan garbage.jpg (39 KB, 1308x686)
39 KB
39 KB JPG
Why is nothing happening?
>>
The ape let's out deep aggressive grunts as he splurges gallons of thick slimy semen in the 5ft tall tiny asian. Her toes curl up as she relieves his raw love potion not meant for humans
>>
>>107364466
No it's not fucktard. It's a product of deliberate supervised fine tuning on a subset of high quality images
Don't respond to me if you haven't read the paper
>>
>>107364475
we don't see shit nigger
>>
>>107364481
I wiped my butt with the paper so ai'm qualified
>>
>>107364365
Got a workflow? Seems worth a try
>>
>>107364495
Just try this >>107364455
>>
I just opened reddit and saw an actually good idea for better image variance.
Run a few inference steps with no prompt and then run the rest with the prompt.
>>
File: 1762212273648.png (842 KB, 1280x1120)
842 KB
842 KB PNG
>>
>>107364510
>>107364455
Yeah these two are similar ideas
>>
File: 1762393361879.png (1.15 MB, 1280x1120)
1.15 MB
1.15 MB PNG
>>107364437
>>
File: 1742142469422105.png (2.86 MB, 1920x1080)
2.86 MB
2.86 MB PNG
>Vito Corleone
Look how they massacred my boy...
>>
>>107364369
KSampler with Variations
>>
>>107362825
get any low parameter qwen llm model
ask it to translate it to chinese symbols
>>
>>107364548
>>107364548
>>107364548
>>107364548
>>
File: ComfyUI_00009_.png (3.97 MB, 2048x1280)
3.97 MB
3.97 MB PNG
How did you anons squeeze vintage photographs out of Z

prompt for picrel
>Retro 80s expedition photo with faded colors and film grain, wide shot of jungle river scene, three voluptuous Brazilian women with caramel skin and long black hair wearing grass bikinis that barely cover their massive breasts and huge buttocks, gold hoop earrings catching sunlight, bathing in shallow river water, water dripping down their curvaceous bodies, arched backs and posed provocatively, washing each other's hair and bodies, tropical waterfall in background, vintage color palette with warm tones, caption in yellow retro font 'Bathing rituals of the indigenous people', 1980s adventure magazine aesthetic

>>107364250
>needs an entire torrent client of configuration and diagnostics WebUI and CLI, for either mode of using ComfyUI.
no it really doesn't, but there's no point arguing with you because it doesn't matter either way because no one will use it for the reasons I mentioned
>>
File: ComfyUI_00074_.jpg (1.5 MB, 3072x2446)
1.5 MB
1.5 MB JPG
>>107364394
x-img
>>
File: Photo of Henry Cavill.png (1.93 MB, 1536x1152)
1.93 MB
1.93 MB PNG
>>107364550
I noticed it struggles with a lot of male celebrities. It seems to just generate a vague combination of a bunch of other celebrities mashed together.
>>
>>107364660
i tried star trek and star wars, gets 90% of them wrong.
shame.
>>
File: 1747592723989620.mp4 (2.22 MB, 720x720)
2.22 MB
2.22 MB MP4
>>107362979
>>
>>107364365
These results are underwhelming
>>
>>107364468
cool, very nice. Maybe it's time for some Timesplitters Rewind.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.