/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/ldg/ - Local Diffusion Genera(...) 11/26/25(Wed)20:19:11 No.107341067

File: highlights_g_107339853_17(...).jpg (2.42 MB, 3126x3521)

2.42 MB JPG

/ldg/ - Local Diffusion General Anonymous 11/26/25(Wed)20:19:11 No.107341067 Archived

I'd Rather You Start Schizoposting Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107339853

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/Comfy-Org/z_image_turbo

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo

Anonymous
11/26/25(Wed)20:20:19 No.107341078

Anonymous 11/26/25(Wed)20:20:19 No.107341078

>>107341058

Anonymous
11/26/25(Wed)20:20:27 No.107341080

Anonymous 11/26/25(Wed)20:20:27 No.107341080

First for z-image sucks.

Anonymous
11/26/25(Wed)20:20:28 No.107341081

Anonymous 11/26/25(Wed)20:20:28 No.107341081

why the fuck is the Nvidia offloading on by default?

Anonymous
11/26/25(Wed)20:20:55 No.107341088

Anonymous 11/26/25(Wed)20:20:55 No.107341088

total bloatmodel death

Anonymous
11/26/25(Wed)20:20:59 No.107341090

Anonymous 11/26/25(Wed)20:20:59 No.107341090

>>107341081
how do you turn it off

Anonymous
11/26/25(Wed)20:21:29 No.107341095

Anonymous 11/26/25(Wed)20:21:29 No.107341095

>>107341081
nobody needs anything else other than flux2

Anonymous
11/26/25(Wed)20:21:41 No.107341096

Anonymous 11/26/25(Wed)20:21:41 No.107341096

>>107341081
i guess for tranny vidya games?

Anonymous
11/26/25(Wed)20:22:07 No.107341101

Anonymous 11/26/25(Wed)20:22:07 No.107341101

File: 1764206337705971.png (742 KB, 1503x1664)

742 KB PNG

Why is the official ComfyUI page shilling APIShit and not mentioning z-image at all?
>>107341064
I thought Comfy was local-first?

Anonymous
11/26/25(Wed)20:22:20 No.107341103

Anonymous 11/26/25(Wed)20:22:20 No.107341103

Is this new z image model open to finetunes, including nsfw ones, or is it doa like bfl models outside of flux schnell?

Anonymous
11/26/25(Wed)20:22:38 No.107341107

Anonymous 11/26/25(Wed)20:22:38 No.107341107

File: ComfyUI_08887_.png (1.42 MB, 1152x1152)

1.42 MB PNG

Anonymous
11/26/25(Wed)20:22:46 No.107341109

Anonymous 11/26/25(Wed)20:22:46 No.107341109

reminder that paid cuckmodel shills are itt right now

Anonymous
11/26/25(Wed)20:23:20 No.107341113

Anonymous 11/26/25(Wed)20:23:20 No.107341113

>>107341103
>flux schnell
That was also DOA

Anonymous
11/26/25(Wed)20:23:26 No.107341114

Anonymous 11/26/25(Wed)20:23:26 No.107341114

>>107341081
>teehee your entire computer just slowed to a crawl and you don't know why
>teehee

Anonymous
11/26/25(Wed)20:23:27 No.107341115

Anonymous 11/26/25(Wed)20:23:27 No.107341115

>>107341101
Maybe because the last piece of news was 2 days ago, before the release of z image?

Anonymous
11/26/25(Wed)20:23:48 No.107341119

Anonymous 11/26/25(Wed)20:23:48 No.107341119

>>107341101
for comfyorg
flux2 > zimage turbo

Anonymous
11/26/25(Wed)20:23:58 No.107341122

Anonymous 11/26/25(Wed)20:23:58 No.107341122

>>107341113
I meant license wise.

Anonymous
11/26/25(Wed)20:24:17 No.107341125

Anonymous 11/26/25(Wed)20:24:17 No.107341125

File: ComfyUI-euler-1.0-9-2025-(...).png (2.6 MB, 1536x1536)

2.6 MB PNG

..And after the premiere of Von Braun, we also have a new retelling of Schindler's List!

Anonymous
11/26/25(Wed)20:24:41 No.107341127

Anonymous 11/26/25(Wed)20:24:41 No.107341127

File: ComfyUI_276397_.png (1.08 MB, 1280x720)

1.08 MB PNG

>>107341101
People are on vacation for thanksgiving so the blog post might be up Monday.

Anonymous
11/26/25(Wed)20:24:42 No.107341129

Anonymous 11/26/25(Wed)20:24:42 No.107341129

File: zimage_c-20-0036.jpg (449 KB, 2048x2048)

449 KB JPG

Z image artist knowlege is poor. It only knows the most popular, and for those it doesn't know it has the same fallback as Qwen where it uses the name as a reference for the ethnicity of the subject, or for a style bucket (ie Italian name -> generic renaissance style approximation)

Anonymous
11/26/25(Wed)20:25:09 No.107341135

Anonymous 11/26/25(Wed)20:25:09 No.107341135

>>107341107
as a clean feet lover, this is some good shit, especially with a cute girl

Anonymous
11/26/25(Wed)20:25:09 No.107341136

Anonymous 11/26/25(Wed)20:25:09 No.107341136

>>107341125
shaniqua's list

Anonymous
11/26/25(Wed)20:25:46 No.107341140

Anonymous 11/26/25(Wed)20:25:46 No.107341140

>>107341125
Can you make an obese guy in a concentration camp with a Judenstern saying "FLUX 2" on his clothing?

Anonymous
11/26/25(Wed)20:25:54 No.107341142

Anonymous 11/26/25(Wed)20:25:54 No.107341142

>>107341127
why aren't you commenting on all the complaints racking up? why do you still make the absolute shittiest templates?

Anonymous
11/26/25(Wed)20:26:05 No.107341146

Anonymous 11/26/25(Wed)20:26:05 No.107341146

>>107341129
>cake the image in a layer of ugly noise instead of actual brushwork
new meta just dropped

Anonymous
11/26/25(Wed)20:26:42 No.107341150

Anonymous 11/26/25(Wed)20:26:42 No.107341150

File: ComfyUI_08893_.png (1.45 MB, 1152x1152)

1.45 MB PNG

Anonymous
11/26/25(Wed)20:26:44 No.107341152

Anonymous 11/26/25(Wed)20:26:44 No.107341152

>>107341146
100% wf issue

Anonymous
11/26/25(Wed)20:28:00 No.107341162

Anonymous 11/26/25(Wed)20:28:00 No.107341162

>>107341150
is it also good at non asian girls?

Anonymous
11/26/25(Wed)20:28:51 No.107341168

Anonymous 11/26/25(Wed)20:28:51 No.107341168

>>107341152
Is this the part where you blame comfy instead of the model?

Anonymous
11/26/25(Wed)20:28:58 No.107341170

Anonymous 11/26/25(Wed)20:28:58 No.107341170

File: ComfyUI_07550_.png (1.53 MB, 944x1280)

1.53 MB PNG

Anonymous
11/26/25(Wed)20:29:02 No.107341171

Anonymous 11/26/25(Wed)20:29:02 No.107341171

>>107341162
who cares

Anonymous
11/26/25(Wed)20:29:06 No.107341172

Anonymous 11/26/25(Wed)20:29:06 No.107341172

File: ComfyUI-euler-1.0-9-2025-(...).png (2.67 MB, 1536x1536)

2.67 MB PNG

>>107341136
sides caved in reading this. thank you.

Anonymous
11/26/25(Wed)20:29:47 No.107341175

Anonymous 11/26/25(Wed)20:29:47 No.107341175

>>107341172
ded

Anonymous
11/26/25(Wed)20:30:30 No.107341180

Anonymous 11/26/25(Wed)20:30:30 No.107341180

I did not lose respect for comfy when he started associating himself with avatarfags.
I did not lose respect for comfy when he started enshittifying the UI.
I did not lose respect for comfy when he started adding api nodes.
I lost respect for comfy when someone posted that promo video and we learned that he is a fat fuck irl.

Anonymous
11/26/25(Wed)20:30:31 No.107341181

Anonymous 11/26/25(Wed)20:30:31 No.107341181

z-image danbooru finetune might be the best thing for /hdg/ troglodytes

Anonymous
11/26/25(Wed)20:30:38 No.107341182

Anonymous 11/26/25(Wed)20:30:38 No.107341182

>>107341172
Can you get rid of the blur using prompts?

Anonymous
11/26/25(Wed)20:31:14 No.107341186

Anonymous 11/26/25(Wed)20:31:14 No.107341186

>>107341127

Reminder ComfyUI still:
>doesn't remember queue if you crash/quit
>can't stop the queue if one gen OOMs, it will go through all the queued gens and OOM on them all instantly, forgetting all sent workflows in case you don't have each saved
>can't be scheduled so it only gens within a particular time or at least begin at a set time
>doesn't have "precompute text encodings of all queued gens and throw the encoder out of vram forever" toggle, speeding up bigger models by double digit % if genning multiple images

These wouldn't be as bad if this wasn't trivial to add for them who know the codebase well at least in a dirty way before implementing it properly and it wouldn't be that bad if there werent literal dozens of memory leaks and bad memory allocation code that no matter what OOM you for video gen every once in a while despite unloading all models for every single gen, having 24gb vram 128gb ram and dynamically managed pagefile that gets filled to 170+gb sometimes.

Even the basic feature of just being able to stop the gen mid way through a step instead of having to wait multiple minutes for it to finish for high quality video gens wasnt implemented until a few days ago and ONLY after some guy created a node to do it first, proving that it's obviously possible. Only after that did comfy write what were essentially two unique lines of code that added this basic feature.

Anonymous
11/26/25(Wed)20:31:23 No.107341187

Anonymous 11/26/25(Wed)20:31:23 No.107341187

>>107341181
/hdg/ would be too busy irony posting to even care

Anonymous
11/26/25(Wed)20:31:23 No.107341188

Anonymous 11/26/25(Wed)20:31:23 No.107341188

This is a major upscale for Seedream 5, I just know Bytedance won't let themselves be beat by open sores

Anonymous
11/26/25(Wed)20:31:41 No.107341189

Anonymous 11/26/25(Wed)20:31:41 No.107341189

>>107341180
I never respected comfy and was a spaghetti hater since day 1
Total spaghetti death

Anonymous
11/26/25(Wed)20:32:08 No.107341193

Anonymous 11/26/25(Wed)20:32:08 No.107341193

>>107341182
the blur *is* part of the prompt famalam.

A dramatic movie poster for 'Shaniqua's List'. A serious-looking, very fat overweight African American woman with curly hair, wearing a black nazi uniform with a red armband featuring a white circle and black swastika on her left arm, stands prominently in the foreground. In the blurry background, numerous people in striped prisoner uniforms are visible in a somber, industrial setting with dark, muddy ground. The top left corner shows the NETFLIX logo in red, followed by 'Presents' in white. The title 'Shaniqua's List' is at the bottom, in a classic, slightly distressed white serif font. The lighting is dim and dramatic, casting a serious tone.

Anonymous
11/26/25(Wed)20:32:30 No.107341195

Anonymous 11/26/25(Wed)20:32:30 No.107341195

>>107341168
wheres the jpg artifacting in images like >>107341150

Anonymous
11/26/25(Wed)20:32:36 No.107341196

Anonymous 11/26/25(Wed)20:32:36 No.107341196

File: ComfyUI_08895_.jpg (443 KB, 2048x2048)

443 KB JPG

>>107341089
Kek, forgot to set it to 2k

Anonymous
11/26/25(Wed)20:32:41 No.107341197

Anonymous 11/26/25(Wed)20:32:41 No.107341197

>>107341193
oh ok I thought it was forced

Anonymous
11/26/25(Wed)20:34:04 No.107341210

Anonymous 11/26/25(Wed)20:34:04 No.107341210

>>107341186
>stop the gen mid way through a step
That issue stems from ai researches usually being dogshit programmers and pasting reference code as is into the node is much easier that fixing it.

Anonymous
11/26/25(Wed)20:34:14 No.107341214

Anonymous 11/26/25(Wed)20:34:14 No.107341214

Reminder than comfyui sends all your data to their cloud for analysis and can format your hard drive anytime if they want.

Anonymous
11/26/25(Wed)20:34:44 No.107341218

Anonymous 11/26/25(Wed)20:34:44 No.107341218

>>107341186 (me)
>Even the basic feature of just being able to stop the gen mid way through a step instead of having to wait multiple minutes for it to finish for high quality video gens wasnt implemented until a few days ago and ONLY after some guy created a node to do it first
I copied this from my previous posting of this complaint, and this part is no longer true, this happened weeks ago by now instead of few days.

Anonymous
11/26/25(Wed)20:34:46 No.107341219

Anonymous 11/26/25(Wed)20:34:46 No.107341219

File: ComfyUI-euler-1.0-9-2025-(...).png (1.9 MB, 1536x1536)

1.9 MB PNG

>>107341214
cool i hope the glowniggers see this one that's cracking me up

Anonymous
11/26/25(Wed)20:34:52 No.107341222

Anonymous 11/26/25(Wed)20:34:52 No.107341222

>>107341195
most obvious at the ends of the hair, but whole picture looks like it has noise reduction at 200%

Anonymous
11/26/25(Wed)20:35:53 No.107341227

Anonymous 11/26/25(Wed)20:35:53 No.107341227

File: zimage_c-20-0040.jpg (160 KB, 1328x2048)

160 KB JPG

>1girl, sculpted by michelangelo

Anonymous
11/26/25(Wed)20:36:03 No.107341230

Anonymous 11/26/25(Wed)20:36:03 No.107341230

WHERE IS THE LORA SUPPORT???

Anonymous
11/26/25(Wed)20:36:31 No.107341235

Anonymous 11/26/25(Wed)20:36:31 No.107341235

File: ComfyUI_23498_.png (3.1 MB, 1280x2048)

3.1 MB PNG

Anonymous
11/26/25(Wed)20:36:36 No.107341237

Anonymous 11/26/25(Wed)20:36:36 No.107341237

You have to update comfy to use z, but o remember just a few days ago someone showed that the latest update deliberately made the up worse and hide a bunch of shit. For those of you who not the bullet and updated, how bad is it.

Anonymous
11/26/25(Wed)20:37:06 No.107341242

Anonymous 11/26/25(Wed)20:37:06 No.107341242

File: Z-Image-Turbo_00022_.png (3.56 MB, 1280x2048)

3.56 MB PNG

Anonymous
11/26/25(Wed)20:38:04 No.107341251

Anonymous 11/26/25(Wed)20:38:04 No.107341251

>>107341242
Hagbros eating good

Anonymous
11/26/25(Wed)20:38:19 No.107341254

Anonymous 11/26/25(Wed)20:38:19 No.107341254

>>107341237
Made the UI worse*

Anonymous
11/26/25(Wed)20:38:22 No.107341255

Anonymous 11/26/25(Wed)20:38:22 No.107341255

>>107341237
ComfyUI is terrible, rendering nodes tanks it to 20FPS. And it's not a model loading issue because if I just move the viewport away from the nodes it spikes up to 240 FPS.

Anonymous
11/26/25(Wed)20:38:44 No.107341260

Anonymous 11/26/25(Wed)20:38:44 No.107341260

File: ComfyUI_08896_.png (1.67 MB, 1152x1152)

1.67 MB PNG

>>107341129
Shame, it does do stylized photos quite well. Flux.2 however does seem to know every artist style I threw at it, but I have some hopium that this is due to distillation and when they give us base model it will be better since Z knows a few of the ones I tested from Flux.2.

Anonymous
11/26/25(Wed)20:38:53 No.107341261

Anonymous 11/26/25(Wed)20:38:53 No.107341261

File: ComfyUI_01590_.png (1.18 MB, 1024x1024)

1.18 MB PNG

Anonymous
11/26/25(Wed)20:39:34 No.107341267

Anonymous 11/26/25(Wed)20:39:34 No.107341267

Comfy should be dragged out on the street and shot

Anonymous
11/26/25(Wed)20:40:04 No.107341272

Anonymous 11/26/25(Wed)20:40:04 No.107341272

>>107341255
when previewing images, resize them down to like 32px
it's some retarded approach to rendering texture arrays.

Anonymous
11/26/25(Wed)20:40:32 No.107341275

Anonymous 11/26/25(Wed)20:40:32 No.107341275

File: ComfyUI_08897_.png (1.82 MB, 1152x1152)

1.82 MB PNG

Anonymous
11/26/25(Wed)20:41:39 No.107341283

Anonymous 11/26/25(Wed)20:41:39 No.107341283

Can Z-Image do upside down faces?

Anonymous
11/26/25(Wed)20:41:43 No.107341285

Anonymous 11/26/25(Wed)20:41:43 No.107341285

Has anyone else noticed that every big SAAS model is starting to look the same? I feel like there was more variety in a batch of Dall-E 3 gens than in Flux 2, Seedream, Nano Banana Pro, any of this shit.

Anonymous
11/26/25(Wed)20:42:02 No.107341290

Anonymous 11/26/25(Wed)20:42:02 No.107341290

File: ComfyUI_08898_.png (1.69 MB, 1152x1152)

1.69 MB PNG

Anonymous
11/26/25(Wed)20:42:41 No.107341298

Anonymous 11/26/25(Wed)20:42:41 No.107341298

>>107341283
the famous "upside down on grass" test?

Anonymous
11/26/25(Wed)20:42:50 No.107341301

Anonymous 11/26/25(Wed)20:42:50 No.107341301

>>107341285
They're probably being trained on each other's slop

Anonymous
11/26/25(Wed)20:43:18 No.107341303

Anonymous 11/26/25(Wed)20:43:18 No.107341303

>>107341301
I think they're just plagiarizing each others' methods

Anonymous
11/26/25(Wed)20:43:31 No.107341306

Anonymous 11/26/25(Wed)20:43:31 No.107341306

>>107341285
The money being put into AI wants it to be replicating a certain thing instead of doing it's own unique thing like what a creative type would want

Anonymous
11/26/25(Wed)20:43:52 No.107341311

Anonymous 11/26/25(Wed)20:43:52 No.107341311

Hello everyone, how have threads been?
Is the poop dick schizo still at large?

Anonymous
11/26/25(Wed)20:43:53 No.107341312

Anonymous 11/26/25(Wed)20:43:53 No.107341312

File: Z-Image-Turbo_00024_.png (3.52 MB, 1440x2048)

3.52 MB PNG

Anonymous
11/26/25(Wed)20:44:01 No.107341313

Anonymous 11/26/25(Wed)20:44:01 No.107341313

File: ComfyUI_08902_.png (1.22 MB, 1152x1152)

1.22 MB PNG

Anonymous
11/26/25(Wed)20:44:23 No.107341317

Anonymous 11/26/25(Wed)20:44:23 No.107341317

>>107341303
Why not both

Anonymous
11/26/25(Wed)20:45:28 No.107341322

Anonymous 11/26/25(Wed)20:45:28 No.107341322

File: zimg_0131.png (1.33 MB, 832x1216)

1.33 MB PNG

Anonymous
11/26/25(Wed)20:45:42 No.107341326

Anonymous 11/26/25(Wed)20:45:42 No.107341326

>>107341313
Feet look like rubber, very bonerkilling

Anonymous
11/26/25(Wed)20:45:43 No.107341327

Anonymous 11/26/25(Wed)20:45:43 No.107341327

File: ComfyUI_08903_.png (1.83 MB, 1152x1152)

1.83 MB PNG

Anonymous
11/26/25(Wed)20:45:44 No.107341328

Anonymous 11/26/25(Wed)20:45:44 No.107341328

Lumina2 + few step distil + realism lora
chinese revolution

Anonymous
11/26/25(Wed)20:46:43 No.107341332

Anonymous 11/26/25(Wed)20:46:43 No.107341332

>>107341312
Very unsafe gen

Anonymous
11/26/25(Wed)20:46:57 No.107341334

Anonymous 11/26/25(Wed)20:46:57 No.107341334

File: ComfyUI-euler-1.0-9-2025-(...).png (2.79 MB, 1536x1536)

2.79 MB PNG

..And we're back with LDG news. Breaking story, PoopDickSchizo is back, and this time he's like a hydra with many unkillable heads.

Anonymous
11/26/25(Wed)20:47:00 No.107341335

Anonymous 11/26/25(Wed)20:47:00 No.107341335

File: wan22__00003.mp4 (238 KB, 480x480)

238 KB MP4

Anonymous
11/26/25(Wed)20:47:03 No.107341336

Anonymous 11/26/25(Wed)20:47:03 No.107341336

File: ComfyUI_276420_.png (2.31 MB, 1536x1536)

2.31 MB PNG

>>107341186
frontend issues like queue stuff will probably get fixed at some point.

If you have a workflow that is OOMing go make an issue with it on the repo.

Anonymous
11/26/25(Wed)20:47:07 No.107341337

Anonymous 11/26/25(Wed)20:47:07 No.107341337

File: ComfyUI_08904_.png (1.52 MB, 1152x1152)

1.52 MB PNG

Anonymous
11/26/25(Wed)20:48:08 No.107341350

Anonymous 11/26/25(Wed)20:48:08 No.107341350

File: ComfyUI_08905_.png (1.4 MB, 1152x1152)

1.4 MB PNG

Anonymous
11/26/25(Wed)20:48:15 No.107341354

Anonymous 11/26/25(Wed)20:48:15 No.107341354

>>107341285
The more data samples, the closer the results will look between models, even if the datasets are different.

Similar to how a live poll's results will swing back and forth with the initial votes, but then 3,000 votes in and the percentages barely move anymore. They stabilize around a particular result.

Anonymous
11/26/25(Wed)20:49:15 No.107341363

Anonymous 11/26/25(Wed)20:49:15 No.107341363

>>107341354
I don't buy it.

Anonymous
11/26/25(Wed)20:49:24 No.107341366

Anonymous 11/26/25(Wed)20:49:24 No.107341366

So will it be possible to train loras for turbo? It was still possible with Flux dev and schnell despite being distilled. Though the loras all wrecked the anatomy

Anonymous
11/26/25(Wed)20:49:29 No.107341368

Anonymous 11/26/25(Wed)20:49:29 No.107341368

File: wan22__00004.mp4 (506 KB, 480x480)

506 KB MP4

Anonymous
11/26/25(Wed)20:49:29 No.107341369

Anonymous 11/26/25(Wed)20:49:29 No.107341369

File: 1761139341590710.jpg (267 KB, 1013x1449)

267 KB JPG

added sdxl, so basically z image is almost twice as big as sdxl

Anonymous
11/26/25(Wed)20:50:24 No.107341378

Anonymous 11/26/25(Wed)20:50:24 No.107341378

>>107341336
is that z image?

Anonymous
11/26/25(Wed)20:50:30 No.107341380

Anonymous 11/26/25(Wed)20:50:30 No.107341380

>>107341354
unless they're all training on the same shit that doesn't make any sense

Anonymous
11/26/25(Wed)20:51:15 No.107341385

Anonymous 11/26/25(Wed)20:51:15 No.107341385

fucking great model. based alibaba

Anonymous
11/26/25(Wed)20:51:59 No.107341393

Anonymous 11/26/25(Wed)20:51:59 No.107341393

anyone tried the z image edit model? is it even out?

Anonymous
11/26/25(Wed)20:52:04 No.107341395

Anonymous 11/26/25(Wed)20:52:04 No.107341395

>>107341369
this chart is something you made?

Anonymous
11/26/25(Wed)20:52:15 No.107341397

Anonymous 11/26/25(Wed)20:52:15 No.107341397

Man I must be tripping because this model feels like every other chinese model.
Like I've already seen these images before.

Anonymous
11/26/25(Wed)20:52:25 No.107341398

Anonymous 11/26/25(Wed)20:52:25 No.107341398

File: ComfyUI_01629_.png (2.84 MB, 1280x1920)

2.84 MB PNG

step right up, step right up

Anonymous
11/26/25(Wed)20:53:00 No.107341402

Anonymous 11/26/25(Wed)20:53:00 No.107341402

>>107341395
used perplexity because I was annoyed at never remembering what model was what size

Anonymous
11/26/25(Wed)20:53:15 No.107341405

Anonymous 11/26/25(Wed)20:53:15 No.107341405

File: ComfyUI-euler-1.0-9-2025-(...).png (2.22 MB, 1536x1536)

2.22 MB PNG

>>107341369
thank you based chartautismo

>>107341385
it's fantastic, way better than i could have expected.

>>107341397
might be a flux trauma symptom, many such cases.

>>107341398
kino

Anonymous
11/26/25(Wed)20:53:36 No.107341410

Anonymous 11/26/25(Wed)20:53:36 No.107341410

>>107341369
Z-image runs fullsize on 12 gigs perfectly fiine. 4070S here.

Anonymous
11/26/25(Wed)20:53:46 No.107341412

Anonymous 11/26/25(Wed)20:53:46 No.107341412

>>107341380
I think he is treating "different data" as different large samples from the same set of data, that set of data being "all the data that exists"

Anonymous
11/26/25(Wed)20:54:00 No.107341416

Anonymous 11/26/25(Wed)20:54:00 No.107341416

>>107341336

Other ComfyUI QoL things:
>doesn't have a native "fuzzy match model names in loader nodes" feature that automatically resolve paths to model files that were moved since last usage, or, god forbid, just find those models anywhere they might be by their unique hash
>doesn't have "widget control mode: before" as default seed changing behaviour, which is much more intuitive
>goes to first workflow tab when closing the currently active one instead of going to the one that was last used or at least the one next to it, helping people with many active tabs
>doesn't allow you to drag and drop a workflow tab anywhere on the bar to the right but instead you have to drop a workflow on top of another
>doesn't have a quick swap button for hight/width of image resolutions on nodes
>can't foward image dimensions from load image node to nodes that use dimensions
>when loading a workflow and running it, even if you have seed set to randomize, the first time will silently use the seed in the workflow instead of randomizing it
>workflows doesn't have fuzzy search or anything similar, if you search "wan lora 1" it wont find your "wan 2.2 lora 1" workflow

Anonymous
11/26/25(Wed)20:54:54 No.107341421

Anonymous 11/26/25(Wed)20:54:54 No.107341421

>>107341369
>2x as big as SDXL
>2x as slow as SDXL
>2x the resolution of SDXL
>4x res VAE compared to SDXL
>at least 10x better than base SDXL
this is the scaling that we need. not shit like hidream with 4 text encoders that's 4x bigger and 2x slower for images not even 10% better than flux 1.

Anonymous
11/26/25(Wed)20:55:08 No.107341425

Anonymous 11/26/25(Wed)20:55:08 No.107341425

>>107341402
would be neat to see with a bunch of models going back to SD1 and before. i wonder how accurate perplexity is with shit like sana or other obscure models
>never remembering what model was what size
same

Anonymous
11/26/25(Wed)20:55:11 No.107341426

Anonymous 11/26/25(Wed)20:55:11 No.107341426

Drag and shot the comfy

Anonymous
11/26/25(Wed)20:55:18 No.107341430

Anonymous 11/26/25(Wed)20:55:18 No.107341430

>>107341369
Turns out license wise, the only time bfl ever cared was with Schnell.

Anonymous
11/26/25(Wed)20:55:33 No.107341432

Anonymous 11/26/25(Wed)20:55:33 No.107341432

>>107341412
bahaha.. uh ok.. well that also doesn't make sense but sure

Anonymous
11/26/25(Wed)20:56:11 No.107341439

Anonymous 11/26/25(Wed)20:56:11 No.107341439

>>107341363
>>107341380
Imagine if you took one trillion completely random photographs in the world, for one model. Then another trillion random photographs in the world, for another model. They would start to start to look the same, the randomization ends up being less random the more data samples there are.

You can scale up this thought experiment:
Imagine if you took infinite photographs of the world (essentially building a simulation of earth) for one model. Then took infinite photographs of the world for another model. The two models end up being the same.

You can scale down this thought experiment:
Imagine if you took 1000 photographs of the world for one model and 1000 photographs of the world for another model, the two models would be much more randomized than the trillion version is.

Anonymous
11/26/25(Wed)20:56:21 No.107341440

Anonymous 11/26/25(Wed)20:56:21 No.107341440

File: ComfyUI_08906_.png (1.57 MB, 1152x1152)

1.57 MB PNG

>>107341326
Yh unfortunately only a few come out non slopped enough for my tastes.

Anonymous
11/26/25(Wed)20:56:55 No.107341444

Anonymous 11/26/25(Wed)20:56:55 No.107341444

File: 1569698720378.jpg (99 KB, 448x537)

99 KB JPG

a WAI-style nsfw tune of this gonna go hard.

Anonymous
11/26/25(Wed)20:57:13 No.107341446

Anonymous 11/26/25(Wed)20:57:13 No.107341446

>>107341425
give me a few and I can try
sd1.5?

Anonymous
11/26/25(Wed)20:57:25 No.107341449

Anonymous 11/26/25(Wed)20:57:25 No.107341449

File: ComfyUI_08907_.png (1.74 MB, 1152x1152)

1.74 MB PNG

Anonymous
11/26/25(Wed)20:57:43 No.107341451

Anonymous 11/26/25(Wed)20:57:43 No.107341451

File: Z-Image-Turbo_00025_.jpg (1.75 MB, 2048x2048)

1.75 MB JPG

Anonymous
11/26/25(Wed)20:57:44 No.107341452

Anonymous 11/26/25(Wed)20:57:44 No.107341452

>>107341440
top of feet, so nice

Anonymous
11/26/25(Wed)20:57:49 No.107341453

Anonymous 11/26/25(Wed)20:57:49 No.107341453

File: ComfyUI-euler-1.0-9-2025-(...).png (2.41 MB, 1536x1536)

2.41 MB PNG

BLACK FOREST LABS POOL CLOSED

Anonymous
11/26/25(Wed)20:58:36 No.107341462

Anonymous 11/26/25(Wed)20:58:36 No.107341462

>>107341430
they never cared. schnell was a pseudo-local release designed to bait 'developers' into thinking the model could be salvaged. meanwhile it was giga-slopped, hyper-distilled, and sabotaged to the point where it wasn't finetuneable. chodestone spent $100k of his budget de-distilling it and rebaking from scratch only to be left with a mess of melted limbs and nonsense artifacts.
harsh lesson to never attempt serious development on anything BFL releases, it's bloated dated garbage designed to shill API. flux 1 was only relevant because it was the first natural language local model, though scholars at the time correctly pointed out that it was miles behind dall-e 3.

Anonymous
11/26/25(Wed)20:58:48 No.107341465

Anonymous 11/26/25(Wed)20:58:48 No.107341465

File: uni_pc_z-image__00003_.png (1.81 MB, 1024x1024)

1.81 MB PNG

Anonymous
11/26/25(Wed)20:58:55 No.107341466

Anonymous 11/26/25(Wed)20:58:55 No.107341466

This model really changed my mind that bigger models was the only way forward. I am a "size don't matter" believer now

Anonymous
11/26/25(Wed)20:59:27 No.107341471

Anonymous 11/26/25(Wed)20:59:27 No.107341471

File: ComfyUI_08908_.png (1.65 MB, 1152x1152)

1.65 MB PNG

Anonymous
11/26/25(Wed)20:59:56 No.107341476

Anonymous 11/26/25(Wed)20:59:56 No.107341476

>>107341466
any improvement that happens with a small model will be better on a bigger one, hardware will be the bottleneck for years to come until we get a small model that will be able to do everything

Anonymous
11/26/25(Wed)21:00:18 No.107341478

Anonymous 11/26/25(Wed)21:00:18 No.107341478

File: 1752087299450009.png (27 KB, 150x150)

27 KB PNG

>>107341449

Anonymous
11/26/25(Wed)21:00:28 No.107341479

Anonymous 11/26/25(Wed)21:00:28 No.107341479

File: ComfyUI_08911_.png (1.62 MB, 1152x1152)

1.62 MB PNG

Anonymous
11/26/25(Wed)21:00:29 No.107341480

Anonymous 11/26/25(Wed)21:00:29 No.107341480

>>107341453
A bunch of
Blacks
Farting
Logs
is the best evropa can do now... really makes you think.

Anonymous
11/26/25(Wed)21:00:52 No.107341484

Anonymous 11/26/25(Wed)21:00:52 No.107341484

>>107341446
1.5, 1.4, VGAN+CLIP, Pixart Sigma, Lumina 2, both Schnell and Dev Flux, uh.... fuck i swear there are more

Anonymous
11/26/25(Wed)21:01:07 No.107341486

Anonymous 11/26/25(Wed)21:01:07 No.107341486

>>107341444
>wai
stop it, slopped
wai is just a shitmerge
now noob finetune on other hand

Anonymous
11/26/25(Wed)21:01:11 No.107341488

Anonymous 11/26/25(Wed)21:01:11 No.107341488

>>107341462
I guess you're right, especially concerning how hard it was to finetune schnell.
Kind of sad, but it seems they don't really care about the competition and just do their own thing.

Anonymous
11/26/25(Wed)21:01:13 No.107341489

Anonymous 11/26/25(Wed)21:01:13 No.107341489

Is Zimage comaptible with any of the speedtraining snakeoil methods?

Anonymous
11/26/25(Wed)21:01:30 No.107341492

Anonymous 11/26/25(Wed)21:01:30 No.107341492

File: ComfyUI_08912_.png (1.66 MB, 1152x1152)

1.66 MB PNG

Anonymous
11/26/25(Wed)21:02:10 No.107341498

Anonymous 11/26/25(Wed)21:02:10 No.107341498

>>107341444
>WAI-style nsfw tune
Once again, not a tune.

Anonymous
11/26/25(Wed)21:02:27 No.107341500

Anonymous 11/26/25(Wed)21:02:27 No.107341500

>>107341466
>I am a "size don't matter" believer now
it's more that there was plenty optimizations to have at any size, so this shows the 20B+ models could look even better if they did the same

Anonymous
11/26/25(Wed)21:02:32 No.107341501

Anonymous 11/26/25(Wed)21:02:32 No.107341501

File: ComfyUI_08913_.png (1.61 MB, 1152x1152)

1.61 MB PNG

Anonymous
11/26/25(Wed)21:02:40 No.107341502

Anonymous 11/26/25(Wed)21:02:40 No.107341502

>>107341127
Hi comfyanon, can you stop bloating up the frontend? It's dropping frames baka
It's supposed to litegraph, but now it's bloatgraph

Anonymous
11/26/25(Wed)21:03:02 No.107341506

Anonymous 11/26/25(Wed)21:03:02 No.107341506

File: dpmpp_3m_sde_gpu_z-image_(...).png (1.63 MB, 1024x1024)

1.63 MB PNG

Anonymous
11/26/25(Wed)21:03:08 No.107341507

Anonymous 11/26/25(Wed)21:03:08 No.107341507

File: ComfyUI-euler-1.0-9-2025-(...).png (3.12 MB, 1536x1536)

3.12 MB PNG

>It doesn't know natalie portman
welp, at least it got every detail right anyway kek

Anonymous
11/26/25(Wed)21:03:10 No.107341508

Anonymous 11/26/25(Wed)21:03:10 No.107341508

>Comfy 3.75
Why does middle mouse wheel now clone the entire fucking work flow and how do I un-keybind this shit.

Anonymous
11/26/25(Wed)21:03:24 No.107341511

Anonymous 11/26/25(Wed)21:03:24 No.107341511

>>107341451
I've got canal paths near me that look just like this. Majestic.

Anonymous
11/26/25(Wed)21:03:28 No.107341512

Anonymous 11/26/25(Wed)21:03:28 No.107341512

>>107341484
ok

Anonymous
11/26/25(Wed)21:03:33 No.107341514

Anonymous 11/26/25(Wed)21:03:33 No.107341514

File: ComfyUI_08914_.png (2.04 MB, 1152x1152)

2.04 MB PNG

Anonymous
11/26/25(Wed)21:04:03 No.107341516

Anonymous 11/26/25(Wed)21:04:03 No.107341516

>>107341508
how many mouse wheels you got?

Anonymous
11/26/25(Wed)21:04:36 No.107341520

Anonymous 11/26/25(Wed)21:04:36 No.107341520

File: ComfyUI_08915_.png (1.7 MB, 1152x1152)

1.7 MB PNG

Anonymous
11/26/25(Wed)21:04:41 No.107341522

Anonymous 11/26/25(Wed)21:04:41 No.107341522

>>107341466
Big models were always memes because big=static, you can't/barely can modify them with local hardware. That's why XL is so popular, every retard can play around with the models, meaning lots of interesting things get done. Bloatmodels might be able to do more out of the box but once you run into a limitation it's basically over because the model is static for all intents and purposes.

Anonymous
11/26/25(Wed)21:05:03 No.107341526

Anonymous 11/26/25(Wed)21:05:03 No.107341526

File: dpmpp_3m_sde_gpu_z-image_(...).png (1.48 MB, 1024x1024)

1.48 MB PNG

Anonymous
11/26/25(Wed)21:05:31 No.107341528

Anonymous 11/26/25(Wed)21:05:31 No.107341528

>>107341476
>hardware will be the bottleneck for years to come until we get a small model that will be able to do everything
Not if we're still using attention/transformers. In 4 years the perspective will change and a "small model" will be one that fits onto an entry level 48gb GPU

>>107341488
Kek what competition? They are the weights-available leaders in the West

Anonymous
11/26/25(Wed)21:05:38 No.107341531

Anonymous 11/26/25(Wed)21:05:38 No.107341531

File: ComfyUI_08916_.png (1.46 MB, 1152x1152)

1.46 MB PNG

Anonymous
11/26/25(Wed)21:05:44 No.107341532

Anonymous 11/26/25(Wed)21:05:44 No.107341532

File: ComfyUI-euler-1.0-9-2025-(...).png (2.82 MB, 1536x1536)

2.82 MB PNG

To our new golden age. Cheers fellas.

Anonymous
11/26/25(Wed)21:06:01 No.107341534

Anonymous 11/26/25(Wed)21:06:01 No.107341534

How good is zimage in following a complex prompt?

Example, the one used for qwen :

A vibrant, warm neon-lit street scene in Hong Kong at the afternoon, with a mix of colorful Chinese and English signs glowing brightly. The atmosphere is lively, cinematic, and rain-washed with reflections on the pavement. The colors are vivid, full of pink, blue, red, and green hues. Crowded buildings with overlapping neon signs. 1980s Hong Kong style. Signs include:
"龍鳳冰室" "金華燒臘" "HAPPY HAIR" "鴻運茶餐廳" "EASY BAR" "永發魚蛋粉" "添記粥麵" "SUNSHINE MOTEL" "美都餐室" "富記糖水" "太平館" "雅芳髮型屋" "STAR KTV" "銀河娛樂城" "百樂門舞廳" "BUBBLE CAFE" "萬豪麻雀館" "CITY LIGHTS BAR" "瑞祥香燭莊" "文記文具" "GOLDEN JADE HOTEL" "LOVELY BEAUTY" "合興百貨" "興旺電器" And the background is warm yellow street and with all stores' lights on.

Anonymous
11/26/25(Wed)21:06:35 No.107341539

Anonymous 11/26/25(Wed)21:06:35 No.107341539

>>107341520
>>107341531
What's the surefire way to get rid of the psudocompression? These look good.

Anonymous
11/26/25(Wed)21:06:37 No.107341540

Anonymous 11/26/25(Wed)21:06:37 No.107341540

>>107341520
>uuuuohhh husbant...

Anonymous
11/26/25(Wed)21:06:46 No.107341542

Anonymous 11/26/25(Wed)21:06:46 No.107341542

>>107341528
>Kek what competition? They are the weights-available leaders in the West
Competition in general, it's not like it matters if the weights are Chinese, German or whatever.

Anonymous
11/26/25(Wed)21:06:58 No.107341543

Anonymous 11/26/25(Wed)21:06:58 No.107341543

File: Z-Image-Turbo_00027_.jpg (1.68 MB, 2048x2048)

1.68 MB JPG

>>107341511
>>107341451(You)
>I've got canal paths near me that look just like this. Majestic.
Lucky. My canal paths have gators

Anonymous
11/26/25(Wed)21:07:05 No.107341545

Anonymous 11/26/25(Wed)21:07:05 No.107341545

File: euler_z-image__00004_.png (1.5 MB, 1024x1024)

1.5 MB PNG

Anonymous
11/26/25(Wed)21:07:39 No.107341550

Anonymous 11/26/25(Wed)21:07:39 No.107341550

>>107341528
>4 years
>48GB entry level GPU
What are you smoking

Anonymous
11/26/25(Wed)21:07:57 No.107341552

Anonymous 11/26/25(Wed)21:07:57 No.107341552

https://www.reddit.com/r/LocalLLaMA/comments/1p4urm7/we_are_considering_removing_the_epstein_files/

Anonymous
11/26/25(Wed)21:08:06 No.107341553

Anonymous 11/26/25(Wed)21:08:06 No.107341553

>>107341545
i have a subway near me that looks just like this!

Anonymous
11/26/25(Wed)21:09:08 No.107341559

Anonymous 11/26/25(Wed)21:09:08 No.107341559

>>107341421
>>2x as slow as SDXL
this is with extreme distillation. Even FLUX1 could run in 3s on a 3090 with distillation.

Anonymous
11/26/25(Wed)21:09:13 No.107341560

Anonymous 11/26/25(Wed)21:09:13 No.107341560

If this is what z image turbo can do I can't wait to see what the full model does
https://files.catbox.moe/7nvqib.png

Anonymous
11/26/25(Wed)21:10:11 No.107341565

Anonymous 11/26/25(Wed)21:10:11 No.107341565

File: ComfyUI-euler-1.0-9-2025-(...).png (3.61 MB, 1536x1536)

3.61 MB PNG

>>107341534
that certainly pushes its limit

Anonymous
11/26/25(Wed)21:10:37 No.107341569

Anonymous 11/26/25(Wed)21:10:37 No.107341569

were there genuinely retards itt who thought the bloated slow crap like flux/chroma/qwen/hidream/neta would actually take off? Nobody wants to wait 30+ seconds for subpar 1024x gens on a 5090. there is a reason SDXL remained winning for so long. z-image is the first model since SDXL with the actual potential to both dethrone it while being a full upgrade, not a sidegrade.

Anonymous
11/26/25(Wed)21:10:57 No.107341573

Anonymous 11/26/25(Wed)21:10:57 No.107341573

>>107341560
whoooaaaa...it can do 1girl...standing there?!?!

Anonymous
11/26/25(Wed)21:11:11 No.107341576

Anonymous 11/26/25(Wed)21:11:11 No.107341576

File: ComfyUI_08917_.png (1.46 MB, 1152x1152)

1.46 MB PNG

Anonymous
11/26/25(Wed)21:11:41 No.107341579

Anonymous 11/26/25(Wed)21:11:41 No.107341579

>>107341542
If the weights were german, the model would be worse for cooking /ldg/ asian 1girls, baka

Anonymous
11/26/25(Wed)21:11:41 No.107341580

Anonymous 11/26/25(Wed)21:11:41 No.107341580

>>107341573
I see you are new here but this is a base model. Even sd 1.4 wasn't this uncensored to start

Anonymous
11/26/25(Wed)21:12:20 No.107341589

Anonymous 11/26/25(Wed)21:12:20 No.107341589

>>107341560
apparently the full model is worse currently

Anonymous
11/26/25(Wed)21:12:31 No.107341591

Anonymous 11/26/25(Wed)21:12:31 No.107341591

>>107341573
sd3 couldn't kek

Anonymous
11/26/25(Wed)21:12:37 No.107341592

Anonymous 11/26/25(Wed)21:12:37 No.107341592

>>107341542
>Competition in general
BFL is German, 100% they're trying to get contracts where the fact they're not a Chinese company is relevant here

And even without adjusting the goalposts I'd say the local competition is all equally in a state of "don't use this for actual work ever unless you have a really good reason like privacy or compliance" right now in terms of generative AI

>>107341550
>What are you smoking
Mostly distillate with terpenes but I'm thinking of splurging on one of those gimmicky disposable vapes with two strains in one, it's half live diamonds and the other half live resin

But this is just a napkin math guess based on current scaling + china catching up + the fact that 40gb A100s should be cheapish by then

I should have said "entry level AI GPU" because I also think the future of inference is dedicated discrete cards

Anonymous
11/26/25(Wed)21:12:42 No.107341594

Anonymous 11/26/25(Wed)21:12:42 No.107341594

>>107341573
and its prompt following is insane if that is what you are after, that has already been shown off, and its skin detail is the best from any base model as well

Anonymous
11/26/25(Wed)21:12:47 No.107341595

Anonymous 11/26/25(Wed)21:12:47 No.107341595

>>107341067
Wtf Bruce Lee? How could you hurt migu?

Anonymous
11/26/25(Wed)21:13:22 No.107341599

Anonymous 11/26/25(Wed)21:13:22 No.107341599

File: file.jpg (813 KB, 2048x2048)

813 KB JPG

>>107341565
Actually better than what I expected, see what Qwen does.

Anonymous
11/26/25(Wed)21:13:43 No.107341604

Anonymous 11/26/25(Wed)21:13:43 No.107341604

>>107341589
its not out yet so what insider knowledge are you claiming?

Anonymous
11/26/25(Wed)21:13:49 No.107341607

Anonymous 11/26/25(Wed)21:13:49 No.107341607

>>107341285
I think it's because of how they all clean up your prompt with LLMs now.

Anonymous
11/26/25(Wed)21:14:02 No.107341609

Anonymous 11/26/25(Wed)21:14:02 No.107341609

>>107341569
It was mostly anons flexing that they can run them I guess, it was obvious they won't take off given their sizes

Anonymous
11/26/25(Wed)21:14:03 No.107341611

Anonymous 11/26/25(Wed)21:14:03 No.107341611

>>107341589
probably not worse but not good enough for what they want as its still training
the guy who leaks shit on twtter who said that speaks through a language barrier

Anonymous
11/26/25(Wed)21:14:26 No.107341615

Anonymous 11/26/25(Wed)21:14:26 No.107341615

File: ComfyUI_08923_.png (1.33 MB, 1152x1152)

1.33 MB PNG

Seed variety is really bad on some prompts, like it locks in like Qwen.

Anonymous
11/26/25(Wed)21:14:39 No.107341618

Anonymous 11/26/25(Wed)21:14:39 No.107341618

>>107341592
>the future of inference is dedicated discrete cards
I've been hearing that since 2022.

Anonymous
11/26/25(Wed)21:14:51 No.107341620

Anonymous 11/26/25(Wed)21:14:51 No.107341620

File: z_image_turbo_bf16.safete(...).png (1.75 MB, 1440x1120)

1.75 MB PNG

Anonymous
11/26/25(Wed)21:15:00 No.107341624

Anonymous 11/26/25(Wed)21:15:00 No.107341624

File: dpmpp_3m_sde_gpu_z-image_(...).png (1.55 MB, 1024x1024)

1.55 MB PNG

Anonymous
11/26/25(Wed)21:15:29 No.107341633

Anonymous 11/26/25(Wed)21:15:29 No.107341633

>>107341615
yea, that is the trade off with strong prompt adherence, but fine tuning for more creativity instead will be easy

Anonymous
11/26/25(Wed)21:16:05 No.107341636

Anonymous 11/26/25(Wed)21:16:05 No.107341636

File: 1738215698424927.png (165 KB, 2040x893)

165 KB PNG

>>107341604

Anonymous
11/26/25(Wed)21:16:32 No.107341639

Anonymous 11/26/25(Wed)21:16:32 No.107341639

So Z Image is done by Alibaba but not by the Qwen team? Why do they have multiple teams on this

Anonymous
11/26/25(Wed)21:16:33 No.107341640

Anonymous 11/26/25(Wed)21:16:33 No.107341640

File: z_image_turbo_bf16.safete(...).png (1.66 MB, 1120x1440)

1.66 MB PNG

>Prompt enhancer with z-image-turbo might be better . System prompt is on its way!
https://xcancel.com/srameojin/status/1993793896397320193#m

THIS ISN'T EVEN THEIR FINAL FORM

Anonymous
11/26/25(Wed)21:16:46 No.107341643

Anonymous 11/26/25(Wed)21:16:46 No.107341643

>>107341636
sounds like a bad translation if anything to me

Anonymous
11/26/25(Wed)21:16:56 No.107341646

Anonymous 11/26/25(Wed)21:16:56 No.107341646

>>107341636
it's a bit weird that the distilled is better

Anonymous
11/26/25(Wed)21:17:02 No.107341647

Anonymous 11/26/25(Wed)21:17:02 No.107341647

File: ComfyUI-euler-1.0-9-2025-(...).png (2.56 MB, 1536x1536)

2.56 MB PNG

>>107341599
phew. wow. that's Qwen huh? there really isn't a model left out there that Z isn't assraping.

Anonymous
11/26/25(Wed)21:17:48 No.107341654

Anonymous 11/26/25(Wed)21:17:48 No.107341654

>>107341569
Z and neta run at a similar speed on my system so

Anonymous
11/26/25(Wed)21:17:55 No.107341656

Anonymous 11/26/25(Wed)21:17:55 No.107341656

File: lcm_z-image__00007_.png (1.49 MB, 1024x1024)

1.49 MB PNG

Anonymous
11/26/25(Wed)21:18:01 No.107341657

Anonymous 11/26/25(Wed)21:18:01 No.107341657

>>107341560
>actually proper genitalia
that's already so much better than flux nonsense on this

Anonymous
11/26/25(Wed)21:18:06 No.107341658

Anonymous 11/26/25(Wed)21:18:06 No.107341658

>>107341543
>Lucky. My canal paths have gators
Exotic.

Anonymous
11/26/25(Wed)21:18:08 No.107341659

Anonymous 11/26/25(Wed)21:18:08 No.107341659

>>107341633
>fine tuning for more creativity
is that a thing? any finetuning just removes the creativity iirc.

Anonymous
11/26/25(Wed)21:18:29 No.107341660

Anonymous 11/26/25(Wed)21:18:29 No.107341660

>>107341639
So they can decide which one gets promoted to API-only

Anonymous
11/26/25(Wed)21:18:39 No.107341661

Anonymous 11/26/25(Wed)21:18:39 No.107341661

Maybe it's because it's Turbo but this model seems very deterministic. Using the same prompt generating 4 images with different seeds results in almost identical images a lot of the time. Even if the prompt is very vague like "A monster under the bed" which could look like fucking anything.

Anonymous
11/26/25(Wed)21:19:07 No.107341667

Anonymous 11/26/25(Wed)21:19:07 No.107341667

>>107341657
i asked it for a penis and got a disgusting mass of flesh.

Anonymous
11/26/25(Wed)21:19:29 No.107341669

Anonymous 11/26/25(Wed)21:19:29 No.107341669

>>107341640
>Prompt enhancer with z-image-turbo might be better . System prompt is on its way!
care to explain what are these?
system prompt is for LLMs, so what does it have to do with zimage?
and prompt enhancer?

Anonymous
11/26/25(Wed)21:19:59 No.107341674

Anonymous 11/26/25(Wed)21:19:59 No.107341674

>>107341647
Qwen FP16 yeah.

Anonymous
11/26/25(Wed)21:20:13 No.107341676

Anonymous 11/26/25(Wed)21:20:13 No.107341676

>>107341659
No, finetuning is just changing parts of the weights, its already been shown that this works for qwen in making it less deterministic.

Also there are samplers that inject more noise between steps that helps too

Anonymous
11/26/25(Wed)21:20:13 No.107341677

Anonymous 11/26/25(Wed)21:20:13 No.107341677

>>107341661
it's an issue for me as well. i believe qwen was like this too

Anonymous
11/26/25(Wed)21:20:57 No.107341683

Anonymous 11/26/25(Wed)21:20:57 No.107341683

File: snake codec 1.gif (146 KB, 256x438)

146 KB GIF

>>107341669
>care to explain what a system prompt would do with an image gen model?
>prompt enhancer? now you're really not making any sense.

Anonymous
11/26/25(Wed)21:21:40 No.107341688

Anonymous 11/26/25(Wed)21:21:40 No.107341688

>>107341661
will be fixed by loras when they release the base model. Even just a lora that is mostly noise would do wonders. For now just use a prompt enhance or add random stuff to prompt

Anonymous
11/26/25(Wed)21:22:12 No.107341692

Anonymous 11/26/25(Wed)21:22:12 No.107341692

>>107341661
For quite a while now I've been of the opinion that no one should ever use a base-model, loras are ALWAYS essential (that's the human-input that makes your outputs look different than someone else's outputs). It's way too easy for any two people's prompts to be the same.

Anonymous
11/26/25(Wed)21:22:33 No.107341695

Anonymous 11/26/25(Wed)21:22:33 No.107341695

File: ComfyUI_00049_[1].png (1.59 MB, 1024x1024)

1.59 MB PNG

OMG it Migu!
The watercolor effect on the background is decent but on Miku herself it's a little iffy. Looks more like normal anime style with some artifacts but could definitely be worse.

Anonymous
11/26/25(Wed)21:22:52 No.107341699

Anonymous 11/26/25(Wed)21:22:52 No.107341699

>>107341661
You have to manually prompt the variation and be specific
If you don't want straight-on photos just tell it you want photo from the side or from behind etc.

Anonymous
11/26/25(Wed)21:23:59 No.107341702

Anonymous 11/26/25(Wed)21:23:59 No.107341702

File: 1736000191955432.jpg (993 KB, 2048x2048)

993 KB JPG

>>107341439
makes sense to me

Anonymous
11/26/25(Wed)21:24:20 No.107341707

Anonymous 11/26/25(Wed)21:24:20 No.107341707

File: seeds_2_z-image__00003_.png (1.42 MB, 1024x1024)

1.42 MB PNG

>>107341702
im hard now

Anonymous
11/26/25(Wed)21:24:22 No.107341708

Anonymous 11/26/25(Wed)21:24:22 No.107341708

>>107341514
that monogatari live action looking good

Anonymous
11/26/25(Wed)21:24:30 No.107341709

Anonymous 11/26/25(Wed)21:24:30 No.107341709

Can Z-Image do XI Jinping or Winnie the Pooh?

Anonymous
11/26/25(Wed)21:25:05 No.107341712

Anonymous 11/26/25(Wed)21:25:05 No.107341712

>>107341618
>I've been hearing that since 2022.
You don't need to hear it, just look at the inference speeds you can get on custom hardware with stuff like Groq. Also both Nvidia and AMD are dedicating more resources to discrete NPUs etc

Here's another perspective: The gaming GPU is almost dead in favour of the APU at this point. Consumers have developed the learned helplessness about not affording high end GPUs and have always enjoyed consoles. Couple this with the fact that Nvidia and AMD make buckets on data center compared to consumer, the fact that consumer GPUs have already reached that point e.g. the 5090 is just a VRAM gimped RTX 6000 Pro Blackwell, and the fact that eventually you have to cave and do unified memory/soldered memory to get the higher bandwidth speeds you need like Apple Silicon or DGX clusters do, I really don't see discrete GPGPU being a thing for very much longer

Anonymous
11/26/25(Wed)21:25:52 No.107341715

Anonymous 11/26/25(Wed)21:25:52 No.107341715

>>107341709
Yes >>107341620

Anonymous
11/26/25(Wed)21:26:34 No.107341722

Anonymous 11/26/25(Wed)21:26:34 No.107341722

File: z_image_turbo_bf16.safete(...).png (2.47 MB, 1440x1120)

2.47 MB PNG

>>107341709
yeah

Anonymous
11/26/25(Wed)21:27:59 No.107341735

Anonymous 11/26/25(Wed)21:27:59 No.107341735

File: ComfyUI-euler-1.0-9-2025-(...).png (2.72 MB, 1536x1536)

2.72 MB PNG

>>107341709
you got it boss
https://youtu.be/PGa3xmdVvMM?si=xM9T_UzurJx1nh2y

Anonymous
11/26/25(Wed)21:28:05 No.107341736

Anonymous 11/26/25(Wed)21:28:05 No.107341736

>>107341722
fuck yeah

Anonymous
11/26/25(Wed)21:28:22 No.107341738

Anonymous 11/26/25(Wed)21:28:22 No.107341738

>>107341661
I wonder if this will help making an animation workflow later on.

Anonymous
11/26/25(Wed)21:28:34 No.107341740

Anonymous 11/26/25(Wed)21:28:34 No.107341740

File: ComfyUI_temp_trrch_00001_.png (1.89 MB, 1280x1024)

1.89 MB PNG

Anonymous
11/26/25(Wed)21:28:52 No.107341741

Anonymous 11/26/25(Wed)21:28:52 No.107341741

now we just need LTXV-2 to release so we can be freed from the shackles of WAN

Anonymous
11/26/25(Wed)21:29:04 No.107341742

Anonymous 11/26/25(Wed)21:29:04 No.107341742

>>107341692
>(that's the human-input that makes your outputs look different than someone else's outputs)
>t. promptlet

Anonymous
11/26/25(Wed)21:29:13 No.107341744

Anonymous 11/26/25(Wed)21:29:13 No.107341744

File: ComfyUI_08928_.png (1.51 MB, 1152x1152)

1.51 MB PNG

Anonymous
11/26/25(Wed)21:29:56 No.107341748

Anonymous 11/26/25(Wed)21:29:56 No.107341748

>>107341741
kandinsky 20B already did that but its fucking fat and comfy still has not merged kaji's implementation cause comfyui hates torch compie which it needs apparently

Anonymous
11/26/25(Wed)21:30:14 No.107341751

Anonymous 11/26/25(Wed)21:30:14 No.107341751

File: ComfyUI_08927_.png (1.49 MB, 1152x1152)

1.49 MB PNG

Anonymous
11/26/25(Wed)21:31:14 No.107341759

Anonymous 11/26/25(Wed)21:31:14 No.107341759

>>107341741
No, we need ltx2 so alibaba will release wan 2.5 out of spite

Anonymous
11/26/25(Wed)21:31:38 No.107341764

Anonymous 11/26/25(Wed)21:31:38 No.107341764

File: ComfyUI_08931_.png (1.73 MB, 1152x1152)

1.73 MB PNG

Anonymous
11/26/25(Wed)21:32:29 No.107341767

Anonymous 11/26/25(Wed)21:32:29 No.107341767

>>107341748
>kandinsky 20B
link?

Anonymous
11/26/25(Wed)21:32:40 No.107341769

Anonymous 11/26/25(Wed)21:32:40 No.107341769

File: ComfyUI_08934_.png (1.8 MB, 1152x1152)

1.8 MB PNG

Anonymous
11/26/25(Wed)21:33:04 No.107341772

Anonymous 11/26/25(Wed)21:33:04 No.107341772

https://files.catbox.moe/m3zlcd.png
Naked Frieren standing in a fantasy forest setting, her breasts and vulva exposed, vulva, pussy, 2d, anime screenshot, masterpiece, high resolution, very aesthetic

Anonymous
11/26/25(Wed)21:33:18 No.107341773

Anonymous 11/26/25(Wed)21:33:18 No.107341773

>>107341742
It's not really about me, it's about the million other users who only type ONE sentence for their prompt. There's only so many ways a person can type one sentence, and they're all colliding, all producing the same result and feeling like a retard when they see their album cover as a video game asset in someone else's product. This is going to be sadly common in the future unless everyone adopts standards like using loras for the sake of uniqueness.

Anonymous
11/26/25(Wed)21:33:39 No.107341774

Anonymous 11/26/25(Wed)21:33:39 No.107341774

2 Years ago I would not have questioned if these amateur photos are real. It's so over.

Anonymous
11/26/25(Wed)21:33:49 No.107341776

Anonymous 11/26/25(Wed)21:33:49 No.107341776

File: ComfyUI_08937_.png (1.46 MB, 1152x1152)

1.46 MB PNG

Anonymous
11/26/25(Wed)21:34:02 No.107341778

Anonymous 11/26/25(Wed)21:34:02 No.107341778

File: vbrobsdehb301.jpg (28 KB, 499x373)

28 KB JPG

>>107341764

Anonymous
11/26/25(Wed)21:34:09 No.107341781

Anonymous 11/26/25(Wed)21:34:09 No.107341781

>>107341769
fuckin hot

Anonymous
11/26/25(Wed)21:34:23 No.107341784

Anonymous 11/26/25(Wed)21:34:23 No.107341784

>It's not X, it's Y
stop letting your chatbots in here

Anonymous
11/26/25(Wed)21:34:39 No.107341786

Anonymous 11/26/25(Wed)21:34:39 No.107341786

File: 1746800384476897.jpg (443 KB, 2069x1141)

443 KB JPG

>>107341484
man sd1.5 was so small

Anonymous
11/26/25(Wed)21:34:41 No.107341787

Anonymous 11/26/25(Wed)21:34:41 No.107341787

>>107341767
https://huggingface.co/collections/kandinskylab/kandinsky-50-video-pro

https://github.com/kijai/ComfyUI/tree/kandinsky5

20B does porn out of the box btw. But its big and slow as fuck

Anonymous
11/26/25(Wed)21:34:51 No.107341789

Anonymous 11/26/25(Wed)21:34:51 No.107341789

File: ComfyUI_08938_.png (1.42 MB, 1152x1152)

1.42 MB PNG

Anonymous
11/26/25(Wed)21:36:18 No.107341803

Anonymous 11/26/25(Wed)21:36:18 No.107341803

File: uni_pc_z-image__00009_.png (1.36 MB, 1024x1024)

1.36 MB PNG

Anonymous
11/26/25(Wed)21:36:30 No.107341805

Anonymous 11/26/25(Wed)21:36:30 No.107341805

File: ComfyUI_08939_.png (1.02 MB, 1152x1152)

1.02 MB PNG

Anonymous
11/26/25(Wed)21:36:31 No.107341806

Anonymous 11/26/25(Wed)21:36:31 No.107341806

>>107341784
nice feminine energy, next time quote me bitch.

Anonymous
11/26/25(Wed)21:36:58 No.107341811

Anonymous 11/26/25(Wed)21:36:58 No.107341811

the step-by-step inference on Z Image doesn't seem very fast at all to me DESU, so far. 8 steps Z doesn't seem any faster than 25 steps of NetaYume at the same res

Anonymous
11/26/25(Wed)21:37:00 No.107341812

Anonymous 11/26/25(Wed)21:37:00 No.107341812

>>107341805
nice OL

Anonymous
11/26/25(Wed)21:37:30 No.107341815

Anonymous 11/26/25(Wed)21:37:30 No.107341815

>>107341784
You are absolutely right!

Anonymous
11/26/25(Wed)21:37:32 No.107341817

Anonymous 11/26/25(Wed)21:37:32 No.107341817

File: ComfyUI_08936_.png (1.32 MB, 1152x1152)

1.32 MB PNG

Anonymous
11/26/25(Wed)21:37:57 No.107341819

Anonymous 11/26/25(Wed)21:37:57 No.107341819

>>107341787
>20B does porn out of the box btw. But its big and slow as fuck
Yeah I won't be running that shit unfortunately. Sad to hear that something like that out of the box already exists but is that big

Anonymous
11/26/25(Wed)21:38:19 No.107341822

Anonymous 11/26/25(Wed)21:38:19 No.107341822

>>107341774
>It's so over.
Alternatively, intellectual property is dead and the children (think of them!) have been saved

>>107341773
Why do people forget that we already were in a slop lack-of-creativity attention economy culture war doomspiral before AI

Also, your point about millions of NPC prompting the same thing is another point for SaaS, because those same NPCs don't delete or turn off conversation history sharing so the Service can adjust the prompt slightly based on your preferences inferred from your past conversations

Anonymous
11/26/25(Wed)21:38:36 No.107341823

Anonymous 11/26/25(Wed)21:38:36 No.107341823

File: ComfyUI_08940_.png (1.76 MB, 1152x1152)

1.76 MB PNG

>>107341748
>Hunyuan 1.5*
We just need its NSFW tune.

Anonymous
11/26/25(Wed)21:39:14 No.107341828

Anonymous 11/26/25(Wed)21:39:14 No.107341828

z image does 2k res perfect btw, you are not limited to 512 x 512 - 1024 x 1024, 1440x1920

>>107341823
nah, sorry to say hunyuan 1.5 is way more censored

Anonymous
11/26/25(Wed)21:40:58 No.107341838

Anonymous 11/26/25(Wed)21:40:58 No.107341838

this is kandinsky 20B but at super low res / steps so it does not take 2 hours
https://files.catbox.moe/6pdai4.webp

Anonymous
11/26/25(Wed)21:41:30 No.107341840

Anonymous 11/26/25(Wed)21:41:30 No.107341840

>>107341786
It gets some of the details incorrect. VG+C ran on the CPU, couldn't do videos (unless you count animations of it generating), 1.5 could be run on 4GB cards same with XL, and there's probably some other stuff that I don't realize because I'm retarded and don't wanna read papers right now. A chart like this with even more models, like Kandinsky and any others anon can think of, would be a really cool resource to have.

Anonymous
11/26/25(Wed)21:42:05 No.107341844

Anonymous 11/26/25(Wed)21:42:05 No.107341844

>>107341838
howd you run it?

Anonymous
11/26/25(Wed)21:42:38 No.107341847

Anonymous 11/26/25(Wed)21:42:38 No.107341847

>>107341844
>>107341787

Anonymous
11/26/25(Wed)21:42:59 No.107341849

Anonymous 11/26/25(Wed)21:42:59 No.107341849

>>107341844
OFFLOADING
F
F
L
O
A
D
I
N
G

Anonymous
11/26/25(Wed)21:43:13 No.107341851

Anonymous 11/26/25(Wed)21:43:13 No.107341851

>>107341838
The clit isn't supposed to be at this place

Anonymous
11/26/25(Wed)21:43:48 No.107341854

Anonymous 11/26/25(Wed)21:43:48 No.107341854

>>107341838
Any workflows for this model? May fuck around and try to run it

Anonymous
11/26/25(Wed)21:44:05 No.107341857

Anonymous 11/26/25(Wed)21:44:05 No.107341857

>>107341851
yea no shit
here is this as well btw
https://huggingface.co/Ada321/Kandinsky-5.0-T2V-Pro-sft-5s-Q8.gguf

https://github.com/Ada123-a/ComfyUI-Kandinsky

Anonymous
11/26/25(Wed)21:44:48 No.107341863

Anonymous 11/26/25(Wed)21:44:48 No.107341863

File: heunpp2_z-image__00006_.png (2.92 MB, 1536x1536)

2.92 MB PNG

Anonymous
11/26/25(Wed)21:45:06 No.107341864

Anonymous 11/26/25(Wed)21:45:06 No.107341864

>>107341854
>>107341857
16GB vram is minimum btw, even with offloading the latent is about 8GB ish

Anonymous
11/26/25(Wed)21:45:16 No.107341866

Anonymous 11/26/25(Wed)21:45:16 No.107341866

File: ComfyUI_08943_.png (1.75 MB, 1152x1152)

1.75 MB PNG

Anonymous
11/26/25(Wed)21:46:20 No.107341873

Anonymous 11/26/25(Wed)21:46:20 No.107341873

File: ComfyUI_08944_.png (1.81 MB, 1152x1152)

1.81 MB PNG

Anonymous
11/26/25(Wed)21:46:42 No.107341878

Anonymous 11/26/25(Wed)21:46:42 No.107341878

>>107341787
How slow we talking? Depending on how good it is, id be willing to wait up to 10 minutes with a 3090.

Anonymous
11/26/25(Wed)21:47:15 No.107341882

Anonymous 11/26/25(Wed)21:47:15 No.107341882

What's actually "so over" is that this board's traffic when /ldg/ is popular is just "/ldg/ and friends".

>>107341786
>Z image is 6B
Huh guess I'll get off the couch and try it out. In my opinion Z.ai have been the most consistent Chinese lab of 2025, they feel like a Chinese version of Anthropic

>>107341851
>The clit isn't supposed to be at this place
That is a very, very reasonable mistake that models make (like double assholes or turning pussy lips into testicles or vice versa) and that example actually makes me more hyped because you need to have an understanding of the anatomy of the vagina and have seen enough clits to make a mistake like that

>>107341864
I am assuming that 64gb of ram is the minimum with 16gb of vram because I couldn't even run the q4_ks with just 32gb. The best I could do was a one frame 128x64 image that took 5 minutes to generate.

Anonymous
11/26/25(Wed)21:47:32 No.107341885

Anonymous 11/26/25(Wed)21:47:32 No.107341885

>>107341878
lol, lmao even.
a hour on a 5090 using cache

Anonymous
11/26/25(Wed)21:47:45 No.107341888

Anonymous 11/26/25(Wed)21:47:45 No.107341888

File: ComfyUI_08946_.png (1.47 MB, 1152x1152)

1.47 MB PNG

Body horror when prompting for certain yoga poses that Chroma HD Flash nails first try.

Anonymous
11/26/25(Wed)21:48:23 No.107341892

Anonymous 11/26/25(Wed)21:48:23 No.107341892

File: ComfyUI_07488_.png (2.84 MB, 2048x1280)

2.84 MB PNG

Anonymous
11/26/25(Wed)21:49:00 No.107341900

Anonymous 11/26/25(Wed)21:49:00 No.107341900

File: 1750450262792550.png (2.34 MB, 1120x1440)

2.34 MB PNG

z image struggles to mix 2d and IRL, qwen beats it at this (for now)

>>107341811
it's not really faster, the distill is just way higher quality than we're used to. With CFG, it's similar speed to chroma for me too.

The answer is we need to ditch CFG and use something like this.
https://github.com/AMAP-ML/S2-Guidance

Anonymous
11/26/25(Wed)21:49:19 No.107341902

Anonymous 11/26/25(Wed)21:49:19 No.107341902

File: ComfyUI_08949_.png (1.64 MB, 1152x1152)

1.64 MB PNG

Anonymous
11/26/25(Wed)21:49:37 No.107341903

Anonymous 11/26/25(Wed)21:49:37 No.107341903

File: ComfyUI_07569_.png (3.16 MB, 2048x1280)

3.16 MB PNG

Anonymous
11/26/25(Wed)21:50:21 No.107341909

Anonymous 11/26/25(Wed)21:50:21 No.107341909

File: ComfyUI_08954_.png (1.62 MB, 1152x1152)

1.62 MB PNG

Anonymous
11/26/25(Wed)21:51:11 No.107341912

Anonymous 11/26/25(Wed)21:51:11 No.107341912

>>107341885
thing requires a fucking h100 cluster then lmao im good. surely we'll get something as uncensored as it that ISNT fuck-you big

Anonymous
11/26/25(Wed)21:51:29 No.107341916

Anonymous 11/26/25(Wed)21:51:29 No.107341916

>>107341885
Why the fuck can’t we have anything nice. Imagine z-image prompt adherence but for i2v. I’m tired of tard wrangling the hell out of wan and still not getting anywhere close to what I want.

Anonymous
11/26/25(Wed)21:52:16 No.107341925

Anonymous 11/26/25(Wed)21:52:16 No.107341925

File: ComfyUI_08957_.png (1.73 MB, 1152x1152)

1.73 MB PNG

Anonymous
11/26/25(Wed)21:52:28 No.107341928

Anonymous 11/26/25(Wed)21:52:28 No.107341928

>>107341912
>>107341916
with a good 4 step distill it would prob get down to like 5 mins

Anonymous
11/26/25(Wed)21:53:30 No.107341934

Anonymous 11/26/25(Wed)21:53:30 No.107341934

>>107341882
>In my opinion Z.ai have been the most consistent Chinese lab of 2025, they feel like a Chinese version of Anthropic
What else should anon know them from?

Anonymous
11/26/25(Wed)21:54:05 No.107341937

Anonymous 11/26/25(Wed)21:54:05 No.107341937

>>107341882
It's not Z.ai btw, this is still alibaba

Anonymous
11/26/25(Wed)21:54:05 No.107341938

Anonymous 11/26/25(Wed)21:54:05 No.107341938

File: heunpp2_z-image__00008_.png (2.71 MB, 1536x1536)

2.71 MB PNG

Anonymous
11/26/25(Wed)21:55:06 No.107341945

Anonymous 11/26/25(Wed)21:55:06 No.107341945

>>107341882
dumbass kid talking out of his ass lol

Anonymous
11/26/25(Wed)21:55:13 No.107341947

Anonymous 11/26/25(Wed)21:55:13 No.107341947

please care about flux 2

Anonymous
11/26/25(Wed)21:55:23 No.107341950

Anonymous 11/26/25(Wed)21:55:23 No.107341950

File: ComfyUI_07608_.png (1.31 MB, 944x1280)

1.31 MB PNG

Anonymous
11/26/25(Wed)21:55:40 No.107341952

Anonymous 11/26/25(Wed)21:55:40 No.107341952

>>107341950
would

Anonymous
11/26/25(Wed)21:55:49 No.107341954

Anonymous 11/26/25(Wed)21:55:49 No.107341954

File: ComfyUI_08960_.png (1.74 MB, 1152x1152)

1.74 MB PNG

Anonymous
11/26/25(Wed)21:56:14 No.107341957

Anonymous 11/26/25(Wed)21:56:14 No.107341957

>>107341950
can u fill the box with cum plox

Anonymous
11/26/25(Wed)21:56:17 No.107341958

Anonymous 11/26/25(Wed)21:56:17 No.107341958

File: tay.png (3.11 MB, 1824x1248)

3.11 MB PNG

Flux 2 had better artistic potential while the China model is cheap yet functional slop machine spewing out cheap gimmicky memeslop clogging up all the threads, how fitting

Anonymous
11/26/25(Wed)21:56:39 No.107341959

Anonymous 11/26/25(Wed)21:56:39 No.107341959

>>107341885
>an hour for 5 seconds
bro..

Anonymous
11/26/25(Wed)21:58:28 No.107341967

Anonymous 11/26/25(Wed)21:58:28 No.107341967

File: Screenshot 2025-11-27 025809.jpg (88 KB, 648x506)

88 KB JPG

is this option depracated or just hidden now

Anonymous
11/26/25(Wed)21:58:38 No.107341969

Anonymous 11/26/25(Wed)21:58:38 No.107341969

File: res_multistep_z-image__00001_.png (2.17 MB, 1536x1536)

2.17 MB PNG

Anonymous
11/26/25(Wed)22:00:12 No.107341983

Anonymous 11/26/25(Wed)22:00:12 No.107341983

>>107341967
it's "streamlined"

Anonymous
11/26/25(Wed)22:01:02 No.107341987

Anonymous 11/26/25(Wed)22:01:02 No.107341987

please care about flux 2

Anonymous
11/26/25(Wed)22:01:33 No.107341991

Anonymous 11/26/25(Wed)22:01:33 No.107341991

File: ComfyUI_08962_.png (1.34 MB, 1152x1152)

1.34 MB PNG

Kek, doesn't understand fellatio that well unfortunately, or maybe it's skill issue on my part.

https://files.catbox.moe/kair61.png

Anonymous
11/26/25(Wed)22:01:39 No.107341993

Anonymous 11/26/25(Wed)22:01:39 No.107341993

I don't get it. It's just slop. The whole model is a slop machine. It's basically the same shit as flux 1.

Anonymous
11/26/25(Wed)22:01:51 No.107341995

Anonymous 11/26/25(Wed)22:01:51 No.107341995

>it can't do the part were her body is made out of the pancake syrup but nails literally everything including perfect anatomy
flux would just fuck up the anatomy constantly, same with chromasome. wow.
https://files.catbox.moe/9nan6k.png
https://files.catbox.moe/c4mktw.png

Anonymous
11/26/25(Wed)22:01:53 No.107341997

Anonymous 11/26/25(Wed)22:01:53 No.107341997

>>107341959
I’d probably be willing to wait even that long so long as it had god tier prompt adherence which it probably doesn’t. Imagine waiting for an hour excited for your 5 second goon gen, and then something is glitchy or your prompt was almost ignored cause you were just slightly too undescriptive. Long gen times make experimentation hard.

Anonymous
11/26/25(Wed)22:03:23 No.107342011

Anonymous 11/26/25(Wed)22:03:23 No.107342011

File: ComfyUI_08963_.png (1.46 MB, 1152x1152)

1.46 MB PNG

Anonymous
11/26/25(Wed)22:03:58 No.107342017

Anonymous 11/26/25(Wed)22:03:58 No.107342017

>>107341997
Rent a 8xH100 cluster and get those 2 min gen times buddy

Anonymous
11/26/25(Wed)22:04:05 No.107342018

Anonymous 11/26/25(Wed)22:04:05 No.107342018

z-image sucks in the same way SDXL sucks

Anonymous
11/26/25(Wed)22:05:13 No.107342026

Anonymous 11/26/25(Wed)22:05:13 No.107342026

File: ComfyUI_01690_.png (2.82 MB, 1280x1920)

2.82 MB PNG

Anonymous
11/26/25(Wed)22:07:18 No.107342044

Anonymous 11/26/25(Wed)22:07:18 No.107342044

>>107342018
it feels like SDXL-2 in a lot of ways

Anonymous
11/26/25(Wed)22:07:35 No.107342046

Anonymous 11/26/25(Wed)22:07:35 No.107342046

>>107342018
I remember being blown the fuck away when sdxl came out but could barely run it on my 1080 ti.

Anonymous
11/26/25(Wed)22:08:23 No.107342052

Anonymous 11/26/25(Wed)22:08:23 No.107342052

File: ComfyUI_08964_.png (1.62 MB, 1152x1152)

1.62 MB PNG

>>107341995
Yeah, Chroma has substantially better prompt following, NSFW concept and anatomical knowledge, in addition to higher res photos, way less slop and more variety. It's just that this is obviously a 6B Turbo model, and for what it can do out of the box (about 60% of what Chroma can) it's impressive. We'll see how its base model with full prompt following fairs, plus tunes (especially a large scale tune) will bring the best out of it.

Anonymous
11/26/25(Wed)22:09:06 No.107342058

Anonymous 11/26/25(Wed)22:09:06 No.107342058

>>107342018
no, sdxl was safetyslopped when it first came out

Anonymous
11/26/25(Wed)22:09:27 No.107342061

Anonymous 11/26/25(Wed)22:09:27 No.107342061

File: z-turbo_00041_.png (985 KB, 720x1280)

985 KB PNG

Anonymous
11/26/25(Wed)22:09:28 No.107342062

Anonymous 11/26/25(Wed)22:09:28 No.107342062

File: 1740597350809949.png (2.03 MB, 1120x1440)

2.03 MB PNG

ok this works, I just have to start the prompt with the description of the 2d character.

Anonymous
11/26/25(Wed)22:10:55 No.107342073

Anonymous 11/26/25(Wed)22:10:55 No.107342073

>>107342018
>>107341945

Anonymous
11/26/25(Wed)22:11:43 No.107342082

Anonymous 11/26/25(Wed)22:11:43 No.107342082

File: ComfyUI_08966_.png (970 KB, 1152x1152)

970 KB PNG

Anonymous
11/26/25(Wed)22:12:21 No.107342088

Anonymous 11/26/25(Wed)22:12:21 No.107342088

Need a local audio model that can do NSFW shit and dialogue too

Anonymous
11/26/25(Wed)22:12:52 No.107342094

Anonymous 11/26/25(Wed)22:12:52 No.107342094

File: z-turbo_00043_.png (1.05 MB, 896x1152)

1.05 MB PNG

>>107342082
thats enough asian beauties for today, anon

Anonymous
11/26/25(Wed)22:13:08 No.107342097

Anonymous 11/26/25(Wed)22:13:08 No.107342097

File: ComfyUI_07622_.png (3.02 MB, 2048x1280)

3.02 MB PNG

Anonymous
11/26/25(Wed)22:13:36 No.107342100

Anonymous 11/26/25(Wed)22:13:36 No.107342100

>>107342088
vibe voice not good enough for you?

Anonymous
11/26/25(Wed)22:14:15 No.107342108

Anonymous 11/26/25(Wed)22:14:15 No.107342108

>>107342100
no anything outside of dialogue is very gacha

Anonymous
11/26/25(Wed)22:15:48 No.107342114

Anonymous 11/26/25(Wed)22:15:48 No.107342114

File: ComfyUI_07597_.png (1.89 MB, 944x1280)

1.89 MB PNG

Anonymous
11/26/25(Wed)22:15:51 No.107342115

Anonymous 11/26/25(Wed)22:15:51 No.107342115

>>107342088
we need something where you can clone specific plap sounds

Anonymous
11/26/25(Wed)22:16:14 No.107342119

Anonymous 11/26/25(Wed)22:16:14 No.107342119

File: 1737523168720720.png (1.06 MB, 832x1248)

1.06 MB PNG

Anonymous
11/26/25(Wed)22:16:49 No.107342124

Anonymous 11/26/25(Wed)22:16:49 No.107342124

File: 1759972772319413.jpg (413 KB, 904x904)

413 KB JPG

>>107341934
>What else should anon know them from?
Their only other stuff is /lmg/ , the GLM series is considered a cheaper, but genuine alternative to Claude. We have Claude 3.25 at home because of them

>>107341937
>It's not Z.ai btw, this is still alibaba
... Okay in that case Alibaba is the best across all modalities for local, text image video a complete and total moggening

>>107341945
>dumbass kid talking out of his ass lol
I am probably younger than you, that is true, but I have an idea of what's going on in the industry. The actual "talking out of the ass" comes from the fact that who knows what decisions these companies would make if DRAM and compute weren't so bottlenecked. Pretty much every chip that gets released always has some SKU defined in the spec with double the VRAM

Anonymous
11/26/25(Wed)22:18:15 No.107342130

Anonymous 11/26/25(Wed)22:18:15 No.107342130

File: ComfyUI_08968_.png (1.66 MB, 1152x1152)

1.66 MB PNG

Anonymous
11/26/25(Wed)22:19:16 No.107342139

Anonymous 11/26/25(Wed)22:19:16 No.107342139

File: ComfyUI_08969_.png (2.19 MB, 1152x1152)

2.19 MB PNG

Anonymous
11/26/25(Wed)22:20:14 No.107342143

Anonymous 11/26/25(Wed)22:20:14 No.107342143

>>107342052
I haven't tested this model yet (I am on a trip) , but from the outputs i've seen so far, z-image has less mangled anatomy problems than Chroma, and this is coming from a guy that defended Chroma for a long time. I think the base model with loras will look just as good as Chroma for photorealism, considering the base for z-image is already devoid of the plastic skin and buttchinfest Flux had

Anonymous
11/26/25(Wed)22:20:20 No.107342144

Anonymous 11/26/25(Wed)22:20:20 No.107342144

File: ComfyUI_08970_.png (1.59 MB, 1152x1152)

1.59 MB PNG

Anonymous
11/26/25(Wed)22:20:26 No.107342145

Anonymous 11/26/25(Wed)22:20:26 No.107342145

Z Image seems to work with Qwen3-4B 2507 Instruct too. Haven't tried Thinking yet. Possible gains to be had there anyways since real finetunes of Qwen actually exist, unlike for T5-XXL.

Anonymous
11/26/25(Wed)22:21:54 No.107342156

Anonymous 11/26/25(Wed)22:21:54 No.107342156

File: ComfyUI_01600_.png (1010 KB, 1024x1024)

1010 KB PNG

Anonymous
11/26/25(Wed)22:22:04 No.107342158

Anonymous 11/26/25(Wed)22:22:04 No.107342158

>>107342143
i can't fully illustrate how happy i am knowing i don't need to put (cleft chin) in my negatives anymore thanks to the chinese.

Anonymous
11/26/25(Wed)22:22:16 No.107342161

Anonymous 11/26/25(Wed)22:22:16 No.107342161

File: 1748558057824913.png (1.77 MB, 1120x1440)

1.77 MB PNG

>>107341995
chroma, especially spark chroma definitely beats it at this prompt. pic is as far as I can get it with proompting. maybe the prompt enhancer can fix this when they release it.

z image is better than chroma overall, but won't replace it yet. ironically, chroma has better survivability against Z than qwen or flux because it has NSFW and more flexibility.
If the SPARK chroma guy finishes his work and then we get a Chroma distill with the same quality as Z, chroma will still be competitive.

Anonymous
11/26/25(Wed)22:23:05 No.107342173

Anonymous 11/26/25(Wed)22:23:05 No.107342173

File: ComfyUI_00040_.png (1.3 MB, 1024x1024)

1.3 MB PNG

>>107342161
not even remotely what the fuck is this kek, the body is supposed to be like picrel.

Anonymous
11/26/25(Wed)22:23:31 No.107342177

Anonymous 11/26/25(Wed)22:23:31 No.107342177

File: ComfyUI_01602_.png (1.15 MB, 1024x1024)

1.15 MB PNG

Anonymous
11/26/25(Wed)22:23:37 No.107342179

Anonymous 11/26/25(Wed)22:23:37 No.107342179

File: ComfyUI_08971_.png (1.71 MB, 1152x1152)

1.71 MB PNG

>>107342143
>less mangled anatomy problems than Chroma
Chroma HD Flash fixes all problems from original Chroma. This model however has very little seed variety (similar to Qwen) and can do less than Chroma out of the box. I would know because I've prompted all of these things in Chroma (though some of them are not first try on Chroma, there's still more seed variety). If a model cheats by beating its seed variety then it's not really better anatomy than Chroma imo.

Anonymous
11/26/25(Wed)22:23:41 No.107342181

Anonymous 11/26/25(Wed)22:23:41 No.107342181

>>107342145
I need to look into how these instruct models work with the new gen of Chinese models. If you give it too illegal of a prompt, won't the instruct model just write "I'm sorry, I can't assist with that" instead of enhancing/encoding your prompt properly?

Anonymous
11/26/25(Wed)22:24:35 No.107342189

Anonymous 11/26/25(Wed)22:24:35 No.107342189

Fresh

>>107342183
>>107342183
>>107342183

Anonymous
11/26/25(Wed)22:25:26 No.107342194

Anonymous 11/26/25(Wed)22:25:26 No.107342194

File: ComfyUI_08972_.png (1.95 MB, 1152x1152)

1.95 MB PNG

>>107342179
Also I'm paying very close attention to the prompt following, and it's also not up to par. But this is just the Turbo model with reasoning turned off after all.

Anonymous
11/26/25(Wed)22:27:17 No.107342207

Anonymous 11/26/25(Wed)22:27:17 No.107342207

>>107341840
Yeah I didn't run a second pass but I probably should to check sources.

Anonymous
11/26/25(Wed)22:27:50 No.107342214

Anonymous 11/26/25(Wed)22:27:50 No.107342214

>>107342181
no, that's not how it works, none of these models are being used in a way where it's possible for them to "refuse" a response, they only activate the "model understanding" state layers. There's no typical chat context going on at all

Anonymous
11/27/25(Thu)02:49:29 No.107343864

Anonymous 11/27/25(Thu)02:49:29 No.107343864

Chroma still produces monstrosities more than often... anyone saying otherwise is just a fag

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.