[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor application acceptance emails are being sent out. Please remember to check your spam box!


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107342183

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/Comfy-Org/z_image_turbo

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
seedream will not be forgotten... we will have our revenge
>>
File: 1031640599408064.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
File: 1764227536329679.png (92 KB, 821x375)
92 KB
92 KB PNG
>>107343661
Just as a reminder for the big news.
>>
gifted threed of frenship
>>
Why does Z have such horrible case of sameface by default?
Sure you can give it celebrity names but I want random people that don't exist.
>>
>>107344176
>she was a doll all along
rei bros... not like this!!!!!
>>
>>107344182
you gotta prompt face features/race. it's the price of consistency
>>
don't care, my finetune will be better

no, actually, i will steal-i mean, use the design of the architecture as base inspiration to create my second own new architecture and surpass this new Z-image thingy everyone is talking about
just be patient, don't worry
>>
>>107344182
tons of recent models have overfitting/seed variance issues. it's not just z image, but i don't know what's causing it
>>
File: 1737229920179534.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
A large group of indian men who are barefoot running down the street in mumbai. The indian men are holding a sign saying "DO NOT REDEEM SAAR!".
>>
>>107344194
You could generate any 1girl or anything else you want and yet you choose to generate ugly jeets. Why are you gay?
>>
File: 1745092054645674.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>107344194
>>
You want more variety? Sure, return TO CLIP then. Mongrels.
>>
File: 1760967812858588.png (997 KB, 1024x1024)
997 KB
997 KB PNG
>>107344199
it's just a test

also I think they had insider flux 2 info and released it the same day on purpose.
>>
>>107344182
If it's Asian, I suspect their dataset is primarily portrait photography from Chinese TikTok/Douyin cosplayers and internet celebrities with the typical heavy beauty face filters.
>>
>>107344191
Overfitting due to overtraining in order to make them perform as intended by the creators, like in the LLM world.
>>
>>107344194
it wants to gen 6 fingers, but doesn't do it
>>
File: 229605598304919.png (1.1 MB, 768x1344)
1.1 MB
1.1 MB PNG
>>107344183
wcyd
>>
I thought you fucks said Qwen image was fast why it taking over 10 minutes just to shit out a megapixel?
>>
How will the non-turbo version be?
>>
File: file.png (2.08 MB, 1152x1152)
2.08 MB
2.08 MB PNG
>>107344202
>>
File: ComfyUI_07688_.png (1.46 MB, 944x1280)
1.46 MB
1.46 MB PNG
>>
Why is qwenvl taking barely any resources while it is reading an image? My system is very capable.
Model is already downloaded.
>>
File: 1758757359956127.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
a sexy Japanese woman is at the beach sitting in a chair, reading a book. The title of the book is "how to beat flux 2".
>>
>>107344220
just use the nunchaku version with lightning lora, blazing fast
>>
is z-image actually good or is it just the latest toy to shill?
>>
>>107344245
it makes non plastic people and is much faster, so yes it's good.
>>
>>107344187
fake kurumuz. give us Director Mode first
>>
File: 1740846313656378.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>107344247
like if you made this in flux she'd have a plastic doll sheen.
>>
>>107344252
Make it in flux
>>
almost forgor we can modify a prompt without ending into an horror movie... Z is killing it
>>
File: ComfyUI_07693_.png (1.52 MB, 944x1280)
1.52 MB
1.52 MB PNG
>>
Ummm okay yeah it's looking pretty god but

How the fuck are we supposed to tag images for loras (if trainers ever finds out how to train one)? Like, I can't imagine that right now, honestly
Good ole danbooru-based autotaggers won't save us now, seeing that the model is perfectly capable of genning readable text
And what about sex and stuff? Is it already over?
>>
>>107344245
Flux 2 is good but everyone is spamming z-slop because its so much faster
>>
>>107344262
flux2 at q4 (already an ultra cope quant for imagen) is 20gb, so go figure
>>
>make qwenvl describe my nsfw gen and then gen it with zimage

That bundle of sticks, lold.

https://files.catbox.moe/fnj6cg.jpg NSFW
>>
>>107344272
safe realistic model can't into danbooru degeneracy whaaat?
>>
File: 1747400783519336.png (1008 KB, 1024x1024)
1008 KB
1008 KB PNG
see, it's more natural vs flux which has the plastic/doll look.
>>
>>107344262
Got some flux slop for us to see? Sometimes the juice isn't worth the squeeze.
>>
>>107344281
is this flux? fucked up feet and the hand seems weird
>>
>>107344272
man stop with the normie frieren shit, PLEASE
>>
>>107344260
>And what about sex and stuff? Is it already over?
Just one more finetune and it will fix everything, don't worry, just like lumina 2 and auraflow
>>
File: 1739352531682674.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
So does it run, per step, faster than SDXL?
For me it's running a little bit more than twice as slow as SDXL.
But I am a 12gb VRAMlet, so I suspect CPU offloading plays a role.
Can any 24+gb VRAMkings here share how the speed compares to SDXL?
>>
File: z_00357_.png (457 KB, 998x998)
457 KB
457 KB PNG
>>
>>107344153
Is it more common to use local models for image generation? Because in LLM world almost everyone just consumes through API services, most of the things you can run are garbage with no context. I mean running local is the only way to get nsfw anyway right? I'm going for anime style
>>
File: Flux2_00008_.png (1.46 MB, 1344x752)
1.46 MB
1.46 MB PNG
>>107344285
posted this earlier but I like how it does horse armor
>>
>>107344325
tell the horse to wear pants
>>
>>107344280
It's fun to try, anon.

>>107344298
That's an old image. I'm doing quad amputee fleshlights now.
>>
File: 1760669877689846.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
>>107344258
What model and what hoops do I need to jump through to gen like this?
>>
>>107344322
if you want to do image only perchance.org will give you unlimited nsfw. it's based on chroma and good enough.
>>
How do you guys sleep at night knowing you use GGUF models?
>>
File: file.png (2.12 MB, 1152x1152)
2.12 MB
2.12 MB PNG
>>107344332
>quad amputee
nugget bros...
>>
File: 1743594304889588.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
I don't get why you guys are hyping up these bloated trash models that manage to be a step below Chroma HD somehow
>>
File: ComfyUI_temp_xqsnc_00002_.png (3.11 MB, 1088x1600)
3.11 MB
3.11 MB PNG
Some fucking genshintron
>>
File: ComfyUI_07698_.png (1.41 MB, 944x1280)
1.41 MB
1.41 MB PNG
>>107344336
just run Z image turbo locally.
>>
>>107344360
Okay, booting it into Fooocus UI now, fingers crossed
>>
>>107344360
does it need nvdia cards?
>>
>>107344338
I'm already reading guides to understand this shit, I'm gonna try running everything local, I downloaded comfyUI and I've downloaded checkpoints from SDXL like WAI-illustrious. Do you have any tips for prompting.
I'm gonna use anime models now (it seems they are also lighter or something, I only have 12GB vram) but in the future I would like some realistic model to make 2D girls into 3D for fun
>>
>>107344364
If you're genning and not using a Nvidia card than stick to online sites bud
>>
File: ComfyUI_temp_zyroa_00102_.png (2.94 MB, 1440x1152)
2.94 MB
2.94 MB PNG
People already complaining even when it's this good, small and fast before the inevitable finetunes. As long as they release a solid base model it's going to be the new standard (for coomers).
>>
File: zi-1764234342.png (1.18 MB, 1152x896)
1.18 MB
1.18 MB PNG
>>
>>107344371
Small is definitely not what it is retard.
>>
>comfy must be dragged into the streets and shot
>>
File: Flux2_Edit_00003_.png (1.24 MB, 1192x1024)
1.24 MB
1.24 MB PNG
I am testing Flux2 edit since there are no LoRa training for ZIT nor ZITedit. Better than QwenEdit at anatomy, but getting that stupid Flux blur effect.
>>
>>107344369
is it really that bad? isn't this a monopoly?
>>
>>107344367
perchance isn't local, it's a website. with chroma try using natural language sentences to describe one aspect of an object or person, then end the sentence with comma separated tokens. like anime girl has blue hair in braids, long, flowing, neatly plaited. don't try and describe a lot of different aspects in a single sentence, it seems to be how you get body horrors.
>>
so is 9 steps the standard/ideal?
>>
File: 859367836459148.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: zi-1764234604.png (1.2 MB, 1152x896)
1.2 MB
1.2 MB PNG
>>
File: 1742693502193870.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
>>107344393
it would be if any of the other companies made something remotely competitive
>>
>>107344395
Does anyone have an actual example of it benefiting from more than 8?
Most of the time, it changes little.
>>
>>107344375
kys vramlet
>>
>>107344375
Comfortably fits on a 5 years old consumer gpu. I'd say that counts as small.
>>
>>107344395
yes, actual inference will start after 1st step
>>
>>107344393
I've never used it but you can try zluda
>>
>>107344178
no FUCKING WAY
>>
File: zi-1764234920.png (1.26 MB, 1152x896)
1.26 MB
1.26 MB PNG
>>
>>107344388
What the fuck is that derp face kek.
>>
File: Flux2_Edit_00005_.png (1.13 MB, 1365x1024)
1.13 MB
1.13 MB PNG
This one came out better than expected.
>>
File: 1037510596691205.png (1.05 MB, 832x1216)
1.05 MB
1.05 MB PNG
>he doesn't have a fridge full of beer in his bathroom
why even live
>>
>>107344430
Dude not cool. She's brave putting herself out there after the stroke.
>>
Does the Zimmage work on 8gb or should I an hero
>>
>>107344454
It's literally built with 8GB as the target
>>
>>107344178
big if true, local is saved
>>
>>107344178
Every day I thank God for President Xi and the Chinese people. May the Jews never do to that country what they did to us
>>
lightning Z lora waiting room
>>
>>107344454
turbo prompts in ~20s on 4060
>>
>>107344469
2 bit quant z image is what we're all really waiting for
>>
>>107344458
>>107344474
Okay thanks
description said 'fits comfortably" on 16gb so I assumed that was the baseline
>>
>>107344468
the jew fears the 100 acre wood
>>
>>107344454
I mean the worst you would have to do is go down 8bit?
But it should work with some offloading. (Which it seems to automatically do when needed.)
>>
>>107344469
plus teacache
>>
>>107344469
Is your computer made by Tiger Electronics?
>>
doesn't know any characters therefore irrelevant until a big finetune
>>
File: zi-1764235491.png (1001 KB, 1152x896)
1001 KB
1001 KB PNG
>>
File: file.png (21 KB, 798x300)
21 KB
21 KB PNG
how the hell do you increment a value or randomize it with the new comfyui style?
>>
>>107344191
It's the only way to remove anatomy issues, you basically overtrain on poses, gestures etc, so that you while you have very little seed variation, you will have correct anatomy.

There is sadly no magic to solve this, Chroma for example has a lot more variation between seeds, but it also has more anatomy issues. And it's not a model size issue either, GPT-4o etc have practically zero variation between seeds.
>>
>>107344482
>ANON'S COMPUTER?
>FOUR GIGA BYYYYTTEESS
>>
Z can't even do NSFW right if at all how is it being shilled this hard as the next big thing?
>>
File: ComfyUI_07738_.png (3.54 MB, 2048x1280)
3.54 MB
3.54 MB PNG
>>
>>107344500
KEK what model is this?
>>
>>107344507
Flux
>>
File: negro hand.png (67 KB, 165x145)
67 KB
67 KB PNG
>>107344509
ah you know what i should've guessed that. thanks thoughbeit.
>>
>>107344497
Memes are moar important than NSFW
You would know that if your brain wasn't full of coom
>>
File: ComfyUI_07718_.png (1.46 MB, 944x1280)
1.46 MB
1.46 MB PNG
>>107344507
Z-Image Turbo
>>
File: file.png (10 KB, 281x295)
10 KB
10 KB PNG
>>107344492
Go back to the good nodes.
>>
>>107344497
just think of it as a bigger and better sdxl waiting for its noob finetune
>>
>>107344176
People are claiming Z is uncensored but this is barbie doll, or can you just not expect for it to get explicit unless you explicit prompt for genitals in a base model?
>>
File: 1740874904605791.jpg (1.44 MB, 1536x2048)
1.44 MB
1.44 MB JPG
>>107344497
training nsfw loras or finetunes for a model of this size is pathetically easy
it's literally qwen but ten times faster and with a less pronounced neutral bias, this shit is insane
>>
>>107344520
I guess it's broken then, ok
>>
>Update ST to check the new model
>The UI has been shuffled around again for no reason.

Why do they keep doing this.
>>
File: 845645412454.png (45 KB, 698x496)
45 KB
45 KB PNG
Can't load Flux.2 by offloading anymore. Anyone know why?
>>
>>107344538
>Can't load Flux.2
good
>>
File: ComfyUI_04684_.png (1.38 MB, 832x1216)
1.38 MB
1.38 MB PNG
How often do you guys get anatomy errors with z-image?
>>
File: ComfyUI_temp_fikel_00015_.png (3.48 MB, 1080x1920)
3.48 MB
3.48 MB PNG
>>
Honestly all i can think now is i wanna pause all my genning until base releases, knowing it'll be trained with noob's dataset, it'll 100% replace illustrious/noob/pony for me permanently.. woah..

>>107344552
not super often. only times i can think of it happening were because of my prompt.
>>
File: Flux2_Edit_00007_.png (1.65 MB, 1258x1024)
1.65 MB
1.65 MB PNG
>>
File: 1739361123663404.jpg (138 KB, 1024x1024)
138 KB
138 KB JPG
BUT CAN IT DO PEEPEE?
>>
File: 1757276279740432.png (3.84 MB, 1920x1440)
3.84 MB
3.84 MB PNG
do we have z-image loras yet?
>>
nb4 base is paywalled
nb4 "but they said they wouldnt"
>>
>>107344538
Use Z Image instead.
>>
File: 1738996288240974.jpg (1.81 MB, 1536x2048)
1.81 MB
1.81 MB JPG
Absurd. This shit is so good for its size. Not as good at different art styles as chroma obviously, but still insane nonetheless.
>>
>>107344563
Already have at least 300 different ones last i checked.
>>
>>107344418
The more complex your prompt the more a higher step count will do
>>
>>107344393
im almost certain you can use an amd card on linux with comfyui pretty smoothly
>>
>>107344578
so there's a benefit to higher step counts? thought anything over 9 was negligible.
>>
>Negative prompts have no effect
>Incorrect model sampling node
Why is the workflow comfy shared so bad?
>>
>qwenvl is literally bugged and takes 500+s to read an image
>swap to joycaption

Now we're talking.
>>
>>107344588
>so there's a benefit to higher step counts?
Yes, for the reason described in the post you replied to
>>
>>107344563
It can read all SDXL and Flux branches fine somehow so you got countless too choose from
>>
>>107344603
no way, are you epically trolling?
>>
>>107344603
you're saying my flux loras work? no way
>>
>>107344603
lol
>>
File: 1752746617005296.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>
I can't even get it to work in SwarmUI (says missing backend text encoders) and Comfy is too confusing and enraging for my smooth brain so I will just go back to crying in the cry corner like I have been I guess.
>>
Reminder Pony V7 is literally right here if you want a competent realism model that does NSFW really well.
>>
File: 1504068089660.jpg (54 KB, 599x450)
54 KB
54 KB JPG
Bruh..

So I use joycaption to describe my nsfw images, been spamming a few just copy pasting the prompt.
It just straight up genned cp with zimage..
>>
File: Flux2_Edit_00011_.png (1.59 MB, 1213x1024)
1.59 MB
1.59 MB PNG
ZIT edit has to beat Flux2 when it comes out.
>>
>>107344662
amazing humor, /hdg/ tumor
>>
>>107344659
use comfy's workflow and switch back to swarm's ui with it
>>
>>107344669
Anon, we need to investigate your hard drive
>>
>>107344669
THAT'S HIM OFFICAH
>>
>>107344669
...and?
>>
File: 1748863700442814.jpg (41 KB, 562x575)
41 KB
41 KB JPG
>>107344669
Why don't you take a seat?
>>
File: file.png (3.5 MB, 1408x2064)
3.5 MB
3.5 MB PNG
>>107344669
stop right there
>>
>>107344360
Does it do lewds? What about the edit variant?
>>
File: 5454515454484.jpg (365 KB, 2138x1664)
365 KB
365 KB JPG
>>107344493
>Chroma for example has a lot more variation between seeds, but it also has more anatomy issues.

less*
It's a skill issue at this point desu. I mean in this case does the Z side look like good anatomy to you? Let's just say, there are situation where Z is unable to do what is being asked, and there's no way to fix the bad anatomy because it's hard baked. I won't lie that Chroma v40 in this case didn't take several tries to get it right, or gets the number of toes wrong now and then. But these are minor issues that are fixed if I keep regenning due to the gift of seed variety (even within the same seed). Or now with Chroma HD Flash, I get 10x less anatomy issues that I had on non-Flash versions, so it would probably get it first or second try.
>>
>>107344705
KINO
>>
>>107344709
fewer**
>>
>>107344709
>>107344723
pingas***
>>
the coherent backgrounds Z is able to pull off makes my dick rock hard
GOD imagine how that noob dataset is gonna look with good backgrounds
>>
>>107344737
>GOD imagine how that noob dataset is gonna look with good backgrounds
That's the issue. If they overtrain on it it will nuke the background quality because you're training on images with godawful backgrounds for the most part. Can't really have your cake and eat it too.
>>
>>107344745
Just merge the datasets??????????
>>
File: 1747611164961941.png (118 KB, 2039x464)
118 KB
118 KB PNG
>>107344178
This is gonna be for the base model right? you add this shit with the reasoning capabiliti (that shit is what's making nano banana so great) and we'll end up with the best model ever, holy shit god bless china I love communism now!
https://xcancel.com/srameojin/status/1993793896397320193#m
>>
File: ComfyUI_00081_.png (2.18 MB, 1920x1088)
2.18 MB
2.18 MB PNG
This is looking real comfy.
>>
File: ComfyUI_temp_0067_.jpg (629 KB, 896x1600)
629 KB
629 KB JPG
>>
>>107344754
>I love communism now!

>>107344745
i trust the plan either way. the fact they didn't just release it as is, means they actually care. they're not benchmaxxing.
>>
>>107344754
>Prompt enhancer with z-image-turbo might be better . System prompt is on its way!
what does that even mean?
>>
>>107344500
>>107344507
>>107344516
More like Z-image Negro am I right??
>>
It's pronounced "zimmage"
>>
holy shit Z yes COOK

>>107344774
haha lol
>>
File: 1737682606564548.png (2.24 MB, 2391x969)
2.24 MB
2.24 MB PNG
>>107344765
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>
>>107344789
do young goku instead
>>
>>107344569
that looks great
>>
File: ZImg_00244_.jpg (1.08 MB, 1440x2240)
1.08 MB
1.08 MB JPG
take it down
NOW
>>
File: 1757670600426430.png (1.02 MB, 1028x878)
1.02 MB
1.02 MB PNG
>>107344789
>mfw I don't have to download a 50gb model to get something slopped and subpar
>>
>>107344791
very very interesting. i wonder how censored it's gonna be.
>>
File: 1757061533357064.png (190 KB, 640x584)
190 KB
190 KB PNG
>>107344569
those chinks showed that you can get great quality at normal size, this is what I always said, there's still a lot of room for improvement, we're just at the begining at this shit, that's why I find it sad that bfl and tencent went for the layerMaxxing thing, they implicitly admit they don't know how to improve their training process and go the easy way out
>>
>>107344794
it really cannot do kid goku sadly
>>
File: z-star.jpg (338 KB, 1024x1024)
338 KB
338 KB JPG
hmm
>>
>>107344765
>>107344791
https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py
>>
File: 1750138447522500.png (245 KB, 457x487)
245 KB
245 KB PNG
it knows how to do pasties
>>
Holy shit how can it load and dump 32gb of vram so fast repeatedly? That can't be healthy.
>>
File: file.png (3.23 MB, 1408x2064)
3.23 MB
3.23 MB PNG
>>107344709
you could at least have tried
>>
>>107344827
That's what VRAM is for dude
>>
>>107344706
>Does it do lewds?
It does boobs
>What about the edit variant?
Not released
>>
File: 1741205822167678.png (123 KB, 480x461)
123 KB
123 KB PNG
>>107344821
>it made r2d2 chinese
LMAO
>>
File: gigachad pureblood.gif (2.66 MB, 361x498)
2.66 MB
2.66 MB GIF
>>107344826
>it knows how to do pasties
another lora for flux i can safely delete
>>
File: Nano Banana Pro.png (2.08 MB, 1408x768)
2.08 MB
2.08 MB PNG
>>107344754
>prompt enhancer is on its way!
I'm more waiting for the reasoning personally, that's the most important part, with that your prompt adherence will be off the charts and it'll be able to make comics/manga pages with very vague prompts like on Nano Banana Pro
>>
>>107344830
It's chromajeet, his behavior is so fucking predictable and pathetic >>107333063
>>
>>107344851
it's really sad desu, I gave him the benefit of the doubt when he was defending chroma so hard (his arguments can hold up, it's a model with the best skin texture and can do NSFW out of the box), but when Z-image got released and showed how much superior it is and how good it it at rendering asian women I thought he would be thrilled by that, I think he's on sunk cost fallacy mode, he shilled chroma too hard to abandon it, many such cases
>>
File: file.png (825 KB, 832x1216)
825 KB
825 KB PNG
>>107344851
kek
>>
Can Z-Image pass the "girl lying on grass, upside down" test?
>>
File: 1763772104790821.png (346 KB, 2072x1900)
346 KB
346 KB PNG
>>107344822
>https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py
pretty based system prompt if you ask me
>>
File: ComfyUI_00089_.png (2.31 MB, 1920x1088)
2.31 MB
2.31 MB PNG
The uncanny valley with zimage doing images from 2d is scary.
>>
File: 1755087633251553.jpg (1.67 MB, 1536x2048)
1.67 MB
1.67 MB JPG
>>
>>107344890
try to go for "a woman disguised as [insert anime character]"
>>
>>107344886
that is one hell of a system prompt. wish i could do that with llm's without inflating gen time.

>>107344890
how are you guys img2img'ing zimage? never done it in cumfart before.
>>
File: 10405994080005.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>107344524
I actually prompted for it to be censored, because I wanted to post it on this christian board. Otherwise it will do nudity no problem https://files.catbox.moe/hpla5v.jpg
>>
is 7s/it on ancient vramlet gpu (2070 super 8gb vram) about what i should expect for ZIT, or is something wrong with my setup?
>>
>>107344830
Sure, you can engineer it to give you what Chroma gave me from the same prompt, but that's not the point. It has inferior prompt understanding.
>>
File: 1744511825182206.jpg (105 KB, 1600x900)
105 KB
105 KB JPG
>>107344821
Ohhhh, dis vewy goowood
>>
File: file.jpg (603 KB, 1408x2064)
603 KB
603 KB JPG
>>107344870
of course it does
but does it pass the "girl lying on grass, upside down and pointing a gun straight at the viewer?"
>>
>>107344918
this is actually really impressive not gonna lie, not a lot of models can nail that
>>
so can this new shit run on anistudio?
>>
File: 1750051416398245.png (2.32 MB, 1280x1280)
2.32 MB
2.32 MB PNG
>>107344821
https://www.youtube.com/watch?v=lPm890OX_Dc
>>
File: ZImg_00263_.png (1.8 MB, 1440x1152)
1.8 MB
1.8 MB PNG
>>
>>107344927
You have alerted the poop dick schizo
>>
File: ComfyUI_00091_.png (2.02 MB, 1920x1088)
2.02 MB
2.02 MB PNG
Can joycaption recognize people/characters and enter their names?

>>107344898
Interesting.

>>107344903
There was a link to a page with a few workflows earlier.
>>
>>107344927
ani said support is going to be added soon. stopped using comfy a long time ago so i'll just wait for ani to implement it, no way im launching comfyui ever again
>>
>>107344918
bad anatomy on the left hand but very impressive
>>
>>107344946
>no way im launching comfyui ever again
we both know that you will
>>
>>107344563
>do we have z-image loras yet?
for the moment no but civitai has added z-image to their list, damn that was fast
https://civitai.com/models/2169035/z-image-turbo-workflow?modelVersionId=2442551
>>
>>107344954
nope. not giving schizo the satisfaction. ani did a great job on his interface, i have no reason to ever use anything made by spiteful schizo comfy
>>
>>107344946
I was tolerating comfy these past few months but had to update it to get Z-Image working, somehow they made the UI even worse.
Quite an achievement.
>>
>i have no reason to ever use anything made by spiteful schizo comfy
reminder that ani forgot to take his trip off while schizoposting
>>
File: (((ani))).jpg (205 KB, 800x800)
205 KB
205 KB JPG
>>107344960
>>107344946
>>107344927
can tell the exact moment Jewlien logs in
>>
>>107344954
There's also forge forks, they'll also get Z-Image support eventually
No one should have to subject themselves to spaghetti
>>
>>107344851
>>107344864
>>107344868
Samefag anti-Chroma troll. The argument was that Chroma has worse anatomy, which would mean that Z somehow is perfect, which I showed it is not, nor is it able to handle the changing complexity of those prompts in particular.
>>
>>107344974
why do you keep posting this photo schizo
>>
gottem
>>
File: angry kitten.jpg (336 KB, 750x764)
336 KB
336 KB JPG
ignore the fucking schizoposting fellas, someone just woke up because /sdg/ is getting it at the exact same time too.
>>
>>107344984
its debo
>>
chroma lost, hard.
though it has a chance to win considering lodestone is open to working with alibaba on a noob+chroma finetune
>>
>>107344992
ITS ACTUALLY JULIEN DICKTARD
>>
>>107344994
>chroma lost, hard.
it lost very hard, Z-image will soon have all the anime character in human history, we're getting close to the perfect model >>107344178
>>
>>107344974
Literally fucking obsessed with a guy who did nothing wrong. He's literally making one of the best UIs for the community.
>>
File: tests_resized.jpg (3.91 MB, 3017x4893)
3.91 MB
3.91 MB JPG
Hey hey Anon, Anon here.
Bringing the usual. Euler only as of yet but I'll fuck around some more. Comprehensive style plot later today.
Thank you, China. This is pretty exciting.

Full sized box (jpg because of filesize):
https://files.catbox.moe/2xg6gg.jpg
>>
>>107345001
you fucking legendary cum god, thank you for your service.
>>
>>107345000
put the trip back on please
>>
>>107345001
what ui
>>
File: Z-image turbo.png (2.51 MB, 1280x1280)
2.51 MB
2.51 MB PNG
>>
File: ComfyUI_00027_.png (3.05 MB, 1360x2048)
3.05 MB
3.05 MB PNG
fp8 is faster but I prefer bf16
>>
>>107345024
seems like he has time to trollbake though? https://desuarchive.org/g/search/username/Ani/tripcode/0gRLTHrqN2/type/posts/
>>
OneTrainer update waiting room

>>107345001
Thanks for testing. Which one was your favorite?
>>
>>107345023
>ComfyUI
is it really worth using this shit
wait a week so proper uis implement z-model anon
>>
Comfy should be dragged out on the street and shot
>>
>>107345028
that's not ani
>>
File: ComfyUI_00097_.png (2.43 MB, 1920x1088)
2.43 MB
2.43 MB PNG
Oh im raffin so hard.
>>
>>107345034
>wait a week
that's too long, I don't want to waste any time having fun with this SOTA model
>>
>>107345001
how do you do these? some script that reads metadata, or are you doing it with nodes?
>>
>>107344669
Hello
I would like you to guide me step by step how to install that model and reproduce the same output, thanks
>>
>>107345044
embarrassing addict
>>
>>107345044
based addict
>>
>>107345044
help ani with anistudio if you want features added faster
https://github.com/FizzleDorf/AniStudio
>>
>>107345034
Eh, who cares? Just use whichever uis you want, anon. Though, I prefer forge neo, it's easier integration for my game later
>>
File: 1733394348184407.png (257 KB, 500x890)
257 KB
257 KB PNG
>>107345051
>>
>>107345058
supporting comfy is simply unethical, he's just not a decent person and his interface sucks. anons should know about the alternatives
>>
>>107344994
>though it has a chance to win considering lodestone is open to working with alibaba on a noob+chroma finetune

I would welcome a Z Chroma/bigASP tune, but as of now Chroma ain't going anywhere as it's still the best photoreal model. Since Z is overfit to give good results, we don't know what a tune would do, is it going to break the coherence of the model for stuff outside the training data like Qwen? If so, what is the point of that over Chroma?
>>
>>107345066
i'll take comfyanonymous over the guy who trollbakes and schizoposts in this thread all day
>>
>>107345066
>muhh decent person
who cares, I separate the art and the artist
https://www.youtube.com/watch?v=3OV4VaNW4FU
>>
>>107345064
go back to ifunny faggot
>>
>>107345001
i can't use my eyes, what's the best?
>>
oh, turning the normal txt2img workflow into an img2img is the easiest thing in the world and its working just fine at 0.3 denoise

jesus people really do blow comfy's complexity out of proportion huh.
>>
>>107345068
it's comfy false flagging as ani. ani did nothing wrong, he deserves our support desu, he's based and hard-working but he doesn't have infinite time due to job and dating obligations. and ani hasn't made any ldg threads for months
>>
File: 1740463182600519.png (226 KB, 1130x386)
226 KB
226 KB PNG
>>107344822
wait, that prompt enhancer thing is gonna be API only? OHNONONONO
>>
>>107345066
Sure. I think sd.next already supports z-image, but its UI is bloated as hell.
>>
>>107345067
>Chroma ain't going anywhere as it's still the best photoreal model
Chroma's main gimmick is not photorealism, it's a vast library of mediums and art styles.
>>
>>107345011
Comfy.
Forgot to add: It's all 9 steps, bf16.
>>107345032
Simple seems to work just fine, but I'm really surprised by ddim_uniform, I kinda dig it. Usually I like bong_tangent for a lot of models but it's real noisy here.
I'll wait with the final judgement, but simple and ddim_uniform seem worth to play around with.
>>107345087
If it's just another (V)LLM in the middle like all those other enhancers, all we need is the system prompts they're using. Pretty confident we can get that.
>>
>>107345085
>ani hasn't made any ldg threads for months
And how do you know that, ``anon''? Sounds to me like you got caught with your pants down and are now trying to damage control.
>>
File: file.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
>>107344912
prompt engineering goes both ways
chroma is extremely finicky with some tags to the point of changing the whole image style just by typing the wrong one

also the original argument was about anatomy
>>
>>107345108
daily routine at this point
>>
>>107345096
Yes, but Chroma has the handicap of T5 so it can't properly lock on to a style. Plus Z Edit if anything could potentially bridge any gap in styles.
>>
Comfyui won, majorly. Spoke to thousands of webui users, pretty much all of them are switching to comfy for zimage.
>>
>i can take the semi-real nova animal images i've been generating for months and make them much more realistic with a quick 8 step img2img now
holy. fuck. god if it didn't have that jpeg noise filter this would be insanely overpowered. that base checkpoint will officially light the scene on fire.
>>
File: file.png (3.75 MB, 1408x2064)
3.75 MB
3.75 MB PNG
>>107345129
based
>>
>>107345124
T5 is like 6 years old at this point, it's such a terrible and obsolete choice for a text encoder
>>
>>107345124
>the handicap of T5
I wonder what it exactly is, isn't t5 pretty damn robust on paper atleast
>>
>>107345110
Be honest. You get the same image over and over again. That is boring. I can get that girl in 5 variations of that pose with Chroma. That is its power.
>>
we're eating well, maybe qwen edit v3 will be out today too.
>>
File: 1736010914473808.jpg (904 KB, 2780x1617)
904 KB
904 KB JPG
You can really appreciate how realizic Z-image is when you compare it against Flux desu
>>
>>107345144
Q: what's with the no seed variance?
>>
File: 1737133092380602.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
>>107345155
oh that's right, i should send my gens from early yesterday i did in flux to z. thanks for reminding me

kek how fucked is that? feeding gens from a model way fucking bigger because that model is somehow less realistic.
>>
>less parameters
>generates faster
>generates better results
quite an achievement desu, it's so refreshing to get SDXL speed gens.
>>
so how many months until the base is released?
>>
I just woke up and I had to catch up to 3 threads, this is going so fast, even during the release of Wan it wasn't like that, damn
>>
>>107345178
do we have info on how long it took start to finish for them to get the turbo model?
shit, i don't even know how big the noob dataset is.
>>
>>107345155
Yep, or compare to Qwen or Flux 2 Pro (which is about same realism). Good stuff in that case.
>>
>>107345129
God, I love Carol!
>>
Trying comfy for the first time and I'm able to generate 1 image, but getting decode something out of memory error when trying to generate a second one, without even changing prompts or anything, on batch 1. How does this shit work?
8 vram 16 ramlet btw
>>
>it's time to shill the current thing
>>
File: file.png (354 KB, 500x500)
354 KB
354 KB PNG
>>107345155
God bless the chinks
>>
>>107345202
try using clean vram nodes.
>>
Can I use joycaption directly on my workflow? I'm tired of copy pasting to a web page.
>>
>>107345199
same brother. my brain can't fully process the shit i'm making right now.
i can integrate turbo z into my illustrious/noob realism workflow as the final step to make it properly realistic, barring the jpg noise issue. nuts.

>>107345202
the solution's in your statement, you're a fucking ramlet 'arry
>>
>>107345205
>the current thing
but Flux 2 is a current thing and no one is shilling it, they're all making fun of it
>>
>>107345202
>but getting decode something out of memory error
use tilted vae node instead
>>
File: 1750855603573640.jpg (1.53 MB, 1536x2048)
1.53 MB
1.53 MB JPG
Fucking hell, no way this model can be 3 times lighter than qwen, it generates considerably better results.
>>
>>107345231
yep, not only it looks realistic but the anatomy and details are on point, something that Chroma failed to do
>>
>>107345213
>i can integrate turbo z into my illustrious/noob realism workflow as the final step to make it properly realistic, barring the jpg noise issue. nuts.
post the workflow, carol-anon
>>
>>107345124
>so it can't properly lock on to a style
There's no problem training styles on Chroma, Civitai is full of Chroma style loras

The Chroma base model simply didn't caption many styles when training so everything became 'generalized' which is how ai training works unless you separate concepts with specific captions

For example, if you train a ton of images of women captioned only as 'woman', the model will generalise all these women into a single 'look', which is why you need eye color, hair color, skin color, freckles, full lips, thin lips, ethnicity etc to instruct the model not to generalise all these concepts
>>
>>107345183
>even during the release of Wan it wasn't like that, damn
it's a good model and it can run fast on any modern card with 12GB+ vram
>>
>>107345245
workflow was probably a bad word, i meant like "my usual genning in sd forge, THEN img2img in comfyui".
>>
>>107345250
how do you even train chroma loras, i keep getting errors when testing the lora on comfy.
>>
so how much vram to run all this goon sheet
>>
>>107345263
i use sd-scripts. works very well for training characters
>>
>>107345250
Speaking of Civitai, is there any good alternative or an archive? Their website is laggy and slow
>>
Imagine if they used nemo instead of qwen 3b.
>>
File: Z-Image Turbo.png (703 KB, 1080x623)
703 KB
703 KB PNG
>>
>>107345263
Diffusion-Pipe and OneTrainer work fine at least as I've used both to train Chroma loras and both work in Comfy, OneTrainer loras doesn't work in Forge though while Diffusion-Pipe loras do.

OneTrainer has Chroma presets and Diffusion-Pipe has a config example.
>>
>>107345271
ill look into it, i tried a bunch of times with the onetrainer preset for chroma and keep getting header json issues when loading it to comfy
>>
>>107345281
>Imagine if they used nemo instead of qwen 3b.
I have no doubt this model could be even better, what if they went for a 15b model instead, this shit would be Nano Banana Pro tier, and I'm not exagerating
>>
Is there an abliterated/uncensored joycaption somewhere?
>>
>>107345294
which preset did you use in onetrainer for chroma? im using the 16gb preset.
>>
>>107345278
I wish, but there really isn't. It's easily the worst site of its kind I've ever come across in terms of UI navigation and bloat, but it is THE place for lora / finetunes, and seemingly the last AI site that allows NSFW sharing, they had to drop celebrities though else the (((payment processors))) would ban them.
>>
>in less than 12 hours i went from "i cant wait to delete all my flux loras" to "i can't wait to improve all my flux gens"
god damn what a model
>>
File: oh my gadd.png (1.95 MB, 1715x848)
1.95 MB
1.95 MB PNG
>See people shill this model hard
>"It can't be that good right? I'll try it by myself and see if the realism is..."
oh my... I apologize to the chinks, I wasn't familiar to their game
>>
>>107345299
theres a bunch of resources on the chroma discord
>>
File: file.png (3.47 MB, 1408x2064)
3.47 MB
3.47 MB PNG
>>107345144
I can talk positively about SD1.5 if I keep moving the goal posts too. But I'm not interested in doing so or arguing with a person who does that.

>>107345310
Joycaption is pretty much uncensored.
>>
>>107345205
no shilling, it's super fast, and makes better gens than the 35 gig model.
>>
File: 157567326333935.png (949 KB, 832x1216)
949 KB
949 KB PNG
>>
>>107345325
So... there's no private/public tracker for AI models and lora?
>>
>>107345314
16gb preset as well, if you get an error you should report it, Chroma loras have been working fine since support was officially merged.

Are you sure you have an updated OneTrainer ?
>>
Complete retard here, how are the requirements compared to SDXL?
>>
File: 265540357882318.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
>>
>>107345337
Not that I'm aware of. Would be cool if there was.
>>
>>107345129
Can you share a img2img wf anon?
>>
File: 355201584048663.png (1.37 MB, 768x1344)
1.37 MB
1.37 MB PNG
>>107345339
bout 2x
>>
okay. hear me out. Flux 3. and its 160gb
>>
>>107345337
torrent is lost technology for anyone born after 2000
>>
File: 1759861335752854.png (72 KB, 986x596)
72 KB
72 KB PNG
>>107345339
if you go for bf16 you need a bit more than 12gb of vram, you can offload to the ram though
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>107345347
>add load image load
>add vae encde
>connect load image node to vae encode
>connect output of vae encode to latent image
ezpz. adjust as needed if you have less than 16gb of vram though. can't guarantee you won't OOM.
>>
>>107345339
Uses about 13.5GB vram for me, runs great, 40 seconds for a 2048x1536 image. Faster if you go smaller obviously.
>>
>>107345303
NBP is probably powered by SOTA thinking models like Kimi K2 Thinking. Gemini 2.5 Flash was like trillions of params.
>>
>>107345349
>>107345353
>>107345355
>tfw 3060 vramlet
It's over for the little guy. maybe i'll try it still
>>
>>107345352
still great for apps and games, I use 1337x or cs.rin.ru torrents.
>>
>>107345331
I was half joking. I just don't want to update CumUI but I guess I need to do it.
Would be interesting to try some bit more artistic gens and different mediums and see if it bends or not.
>>
>>107345358
you really don't need that much parameters just to reason and rewrite prompts though, like c'mon, we're asking the model to render 1girl, not resolve the Navier Stokes equation
>>
>>107345362
like I said, offload 2 or 3 gb to the ram and you're good to go >>107345353
>>
>>107345325
Jesus, what about just index all the models on Civitai and link directly to their model pages? All I need is just a search and filtering without the autoplay gifs and bloated ui
>>
>>107345371
offloading usually meant awful speed. also never used comfy, i'm a reforge boomer. but yeah i'll try it
>>
File: 787888149894151.png (1019 KB, 832x1216)
1019 KB
1019 KB PNG
>>107345362
8GB I assume? It'll still work, just gotta offload more to RAM.
>>
>>107345338
I did a clean reinstall of OneTrainer, then hit the bat file to update and then redownloaded the lodestone repo. I keep getting this error:

>Error while deserializing header: invalid JSON in header: EOF while parsing a value at line 1 column 0

I made sure that the dataset is in place as well.
>>
>>107344994
>>107344999
>more anime slop

ew. anyway, chroma is still king for being able to do more weird and interesting shit. I can deform bodies (intentionally and unintentionally) in chroma where as in z it tries to correct it. but maybe that's what chroma needs, correction on unintended limb horror so a z and chroma merge would be legendary.

>z becomes more flexible with weird shit
>chroma becomes less body horror

dis gon b good
>>
How better quality wise is the BF16 compared to the FP8? worth the extra niggabytes?
>>
>>107345377
>offloading usually meant awful speed.
not if you offload less than 20% of the total size, and the offloading method has improved a lot recently, the model is already so fast you won't notice a lot desu lol
>>
>>107345329
>>107345136
can someone explain to me why does this child looks appealing to me WHAT THE FUCK, HELP
>>
>>107345385
If you have an RTX 3000, maybe. BF16 is usually the better option.
>>
>>107345385
nah stay on bf16, fp8 is a pretty bad quant in my opinion, if you're really desperate for something smaller go for Q8
>>
>>107345400
>>107345403
thanks, i guess most of us are on the fp8 then huh kek was wondering why some of the gens here are higher quality. the fp8 must be the one with the really bad artifacts. grabbin the bf16 then.
>>
where's the actual download I hate this huggingnigger website so much
>>
>>107345362
It runs pretty well even on 8GB
>>
File: 1755330151009480.jpg (2.04 MB, 7961x2897)
2.04 MB
2.04 MB JPG
>>107345410
>the fp8 must be the one with the really bad artifacts.
it's not that bad but the difference is noticable compared to the real deal
>>
>2070 super
>8 sec per step when loading BF16 in fp8 and offloading a small amount of the model to RAM
>Try Q4_K_M
>11 sec per step even though model is fully in VRAM
Jesus didn't realize it would be this bad on an older GPU. Well, at least it works.
>>
>>107345397
She's white
>>
>>107345417
what s/it?
>>
File: 1744513174031939.png (2.18 MB, 1280x1280)
2.18 MB
2.18 MB PNG
https://www.youtube.com/watch?v=ZEcqHA7dbwM
>>
>>107345397
I got news for you, that means you're a pedo
>>
>>107344493
Might consistency of the transformation also be a benefit to their edit model

It seems to me less an error and more where image generation models are moving. Whether or not that's a good or bad thing is subjective
>>
Should I be downloading the fp16 or the gguf Q8 quaint?
>>
>>107345426
my man... q8 is 7s/it
>>
>>107345397
How compelling, now say that in public
>>
>>107345444
>>107345424
>>107345403
>>
File: troll.png (63 KB, 168x300)
63 KB
63 KB PNG
>>107345445
>my man... q8 is 7s/it
>>
>>107345397
She's of age
You're a perfect biological man
>>
>>107345354
thanks anon
>>
>>107345364
it's worth it. you can actually make flux/qwen type gens in seconds, no speed lora needed. also, people look real not plastic.
>>
File: 1762225238788165.png (2.14 MB, 1280x1280)
2.14 MB
2.14 MB PNG
>>107345430
>>
>>107345329
What are you even trying to argue?
>>
File: file.png (3.92 MB, 1408x2064)
3.92 MB
3.92 MB PNG
>>107345397
stop right there
>>
>>107345463
Yeah, in any case it's better experience than Chroma already.
>>
Why does Comfy say I have 5068 MB available when it loads models. I have 8 GB VRAM and I made sure only a small amount was in use before launching Comfy. And CLIP is on CPU.
>>
File: ComfyUI_00110_.jpg (753 KB, 2048x2048)
753 KB
753 KB JPG
In a near future.
>>
>>107344178
What's the point of this fake screen? Give hope to some people, and then make them despair when nothing come out of this? That's it?
If they really were ready to do h, their model would be able to do dicks, like hunyuan
>>
>>107344791
So this is why it comes bundled with the full LLM and not just the encoder. Makes sense. Really we should be able to do this in Comfy anyway, no reason we couldn't just load the LLM and ask it to enhance the prompt, it isn't a super technical thing or something that hasn't been done before
>>
File: 1737570109998251.jpg (739 KB, 2560x1396)
739 KB
739 KB JPG
I think I understand why it's so noisy and pixelated, the shift isn't high enough, it's at 3 (default), you can increase that value with the ModelSamplingAuraFlow node
>>
>>107345397
cute ok obviously, appealing, nope
>>
File: 329846967345949.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>
>>107345491
Wait, that was a shitpost?
>>
>>107345494
Wait, you have to use model sampling? I thought it was optional and could be bypassed.
>>
>>107345470
MUH FUCKING DICK FUCK
>>
>>107345397
Welcome to the club
>>
>>107345503
it is technically optional (when you don't use it it sets the value at 3) but you can use it to increase its value if it's beneficial to you
>>
File: 577538126128902.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>
>>107344791
If it comes with a reasoning llm in it, does that mean something like, say, sillytavern could take advantage of it AND use it for image gen at the same time?
>>
HOLY im gonna pre
also BF16 is as fast as the fp8 lmao
>>
>>107345491
fuck
>>
why not working saars
>>
>>107345534
>BF16 is as fast as the fp8 lmao
depends on the gpu generation
>>
>>107344709
Why you are an absolute retard? this is a base model, Chroma is a finetune, is better because has more porn data set with many poses, but if the furryfag did that with flux schnell, an old and ugly model, distilled, with this when we have the base model without any retard distillation, you can have the same but 10 times better in a future finetune of chroma2.
>>
>>107345539
What error have you encountered?
>>
File: ZgEfU9sFlD.jpg (16 KB, 457x418)
16 KB
16 KB JPG
>>
File: 813549904026901.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
>>
>>107345555
woah holy tamoli what'd you prompt for this?
>>
File: 1753791452046883.jpg (1.01 MB, 3840x1396)
1.01 MB
1.01 MB JPG
>>107345494
here's another example, look at the wall there's no noise patterns anymore
>>
>>107345546
only considering that lodestone actually will do it, as he ran out of money long ago
I hope he does, z image is also smaller than schnell
>>
File: 1039393987666304.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>
>>107345541
niggerwell kek i'm so glad i bought this card. made the literal moon jump from the gtx 1080 too.

>>107345494
>>107345560
holy shit really? it was that easy?
>>
Does ComfyUI sideload to ram automatically? I imported that workflow, have 12GB vram and it still generated
>>
>>107345560
Nice, try adding some film grain using shift=7. Will that work?
>>
>>107345467
I'm saying that you moved goal posts many times already, and I can do the same. That profits nobody.

Just enjoy chroma, its alright.
>>
File: aeghbaegbheazghbea.png (41 KB, 903x312)
41 KB
41 KB PNG
>IT WAS THE FUCKING SHIFT THE WHOLE TIME

COOOOMFFFYYY GET YOUR ASS IN HERE AND EXPLAIN THIS SHIT IN YOUR DEFAULT WORKFLOW
WHY WAS IT NOT TESTED CONNECTED?
>>
>>107345511
Catbox the uncensored version.
>>
>>107345588
Nigga you can gen your own easily
>>
>>107345582
you mean it was bypassed by default?
>>
>>107345592
bypassed+not connected to begin with
>>
>>107345582
>>107345592
>WHY WAS IT NOT TESTED CONNECTED?
if the node is being bypassed it means the shift is at 3, there will always be a shift, but the default one might be too low
>>
>>107345599
Not connected you shouldn't even be able to gen anything since it's inline with the loading of the model.

>>107345600
I see, I'll try 7.
>>
File: 33736650852082.png (1.01 MB, 832x1216)
1.01 MB
1.01 MB PNG
>>107345559
>Polaroid SX-70 manipulation photograph
>>
File: notfake.png (65 KB, 576x450)
65 KB
65 KB PNG
>>107345491
Not fake nigger, the messages wew just rearranged for convenience.
>>
>>107345600
shift 7 seems like the best spot, this is my img2img workflow at 6 steps.
>>
>>107345591
>Nigga you can gen your own easily
I want that image uncensored.
>>
File: ComfyUI_00123_.jpg (301 KB, 1269x2048)
301 KB
301 KB JPG
Zimage seems very good at glowing/aura effects.
>>
anyone testing zimage with cfg > 1?
I'm trying 2.5 with some negatives
>>
File: 205057699999437.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>107345588
>>107345615
https://files.catbox.moe/a2s7pe.jpg
>>
How was Z-Image-Turbo trained from Z-Image-Base
>>
>>107345610
>6 steps.
why 6? it was trained at 8 steps lol
>>
>>107345642
too many steps turns the image into that taylor swift as an 80 year old gen from the last OP
but ill bring it back to 8 and also try 16 given i'm still testing.
>>
>>107345634
Doesn't seem to response to negatives at all, and if you change cfg from 1.0 it literally doubles the generation time
>>
Is there a forbidden lora trained on underage pussy or how do we go bout this dog
>>
>>107345668
for me it drastically enhances prompt adherance, but it makes the image worse looking
I'll take a look at tricks like skimmed cfg, maybe that can help
I don't care about speed I'm using a 5090 with batch 4, I'm so used to long gens this is nothing
>>
>>107345668
>if you change cfg from 1.0 it literally doubles the generation time
That applies to literally any model. After realizing that I never went back to being a cfg>1 cuck. You can still use negative prompts with NAG, but it probably doesn't work with this new model yet. https://github.com/ChenDarYen/ComfyUI-NAG
>>
File: ComfyUI_00127_.jpg (951 KB, 2048x2048)
951 KB
951 KB JPG
Sick.
>>
File: 1749433715924139.png (343 KB, 1080x357)
343 KB
343 KB PNG
>>107345668
>>107345683
if we had NAG for Z-Image we would definitely get some prompt adherence improvement
>>
>>107345637
Thanks.
>>
File: Arkveld by HatiDraw.jpg (284 KB, 960x1200)
284 KB
284 KB JPG
>>107345687
Same energy.
>>
>>107345693
kek i like that puu turned into a design on her shirt
>>
>>107345693
what model is that? qwen-edit?
>>
File: 1746394464210710.jpg (765 KB, 2560x1396)
765 KB
765 KB JPG
>>107345560
yep, I think this definitely fixes the noise
>>
>>107345715
it was kontext dev
>>
>>107345682
this model will never be fully uncensored because otherwise it would be able to gen cheese pizza insanely easy
the only way this model can be fully uncensored is to lobotomize the absolute shit out of it so it forgets even the smallest glimpse of realistic data so it only prints 2D
and i have absolutely no fucking idea on how are chinks going to achieve that
>>
File: 37054808830872.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
Just needs a liiiiitle finetuning on some explicit material
>>
>>107345494
>>107345560
>>107345722
Kek, and I thought that issue was linked to the model itself, once again it's Comfy's fault :(
>>
>>107345741
>literally cumfy's fault because he didn't connect the node in his workflow
>>
>>107345745
>>107345745
>>
>>107345733
>it would be able to gen cheese pizza insanely easy
I mean, it's kinda doing that already, just some proportions are bad
>>
File: file.png (20 KB, 443x159)
20 KB
20 KB PNG
>>107345741
Well, 3.0 shift is in their own scheduler config, that's kinda what you'd use first.
>>
>>107345759
>FlowMatchEulerDiscreteScheduler
is that the simple scheduler though?
>>
wonder if zimage base will be even better?
>>
File: 1747014050209321.png (3 MB, 1536x1536)
3 MB
3 MB PNG
>>
>>107345798
Probably need to be finetuned before it can be considered production quality
>>
>>107345608
Well, we will see then, but they won't use the noob dataset as it is, I can see them cut a lot of the h from it. And I don't know how they will tackle artists mix with natural language, or the threshold for a character/artist to appear.
That and the noob dataset is at least one year old, so retraining one year of loras will be painful, more if it still can't do nsfw poses
Hell it's not even a deal they will do an anime finetune, but if they do it's not going to be anytime soon and I can see ppl waiting for this before doing any big finetune of the base model
>>
File: zit-cus.jpg (617 KB, 1280x1280)
617 KB
617 KB JPG
>>
File: zimg_0035.png (1.19 MB, 832x1216)
1.19 MB
1.19 MB PNG
yesterday i did some testing with the fucking model shift but i only tested from 0-5, here's a bigger range.

https://files.catbox.moe/7n0qwl.png
>>
>>107346254
You fucked something up because those are all identical.
>>
File: zimg_0036.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
another few, i guess it's very much prompt dependent when shift will make a difference:

euler a
> https://files.catbox.moe/scw8lo.png
> https://files.catbox.moe/w1xxdu.png

euler
> https://files.catbox.moe/bz7bm4.png

prompts:
>an analog film photo of a man holding a beer while sitting in the driver's seat of an old truck, a woman in a white bikini sits on the hood laughing
>an amateur photo, an irish woman wearing lacy black top, purple hair, black bangs, posing in the middle of moshpit, high-angle selfie
>>
File: zimg_0037.png (1.43 MB, 832x1216)
1.43 MB
1.43 MB PNG
>>107346305
yeah the noodles had shift set up to the sampler and not scheduler, i fixed it here: >>107346323

but also i redid that one:
> https://files.catbox.moe/qo3gy7.png
>>
>>107345382
of course the chromashitter is antianime
it's hilarious how they out themselves
>>
File: zimg_0038.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
since my coffee is hitting, here's a clip-skip grid:
https://files.catbox.moe/gow552.png



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.