/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/ldg/ - Local Diffusion Genera(...) 11/26/25(Wed)10:59:54 No.107334502

File: highlights_g_107332452_17(...).jpg (1.96 MB, 3139x2680)

1.96 MB JPG

/ldg/ - Local Diffusion General Anonymous 11/26/25(Wed)10:59:54 No.107334502 Archived

6b Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107332452

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo

Anonymous
11/26/25(Wed)11:00:42 No.107334524

Anonymous 11/26/25(Wed)11:00:42 No.107334524

6B for life

Anonymous
11/26/25(Wed)11:01:30 No.107334540

Anonymous 11/26/25(Wed)11:01:30 No.107334540

yumebros.... our days our numberd?

Anonymous
11/26/25(Wed)11:01:50 No.107334542

Anonymous 11/26/25(Wed)11:01:50 No.107334542

>SexB

Anonymous
11/26/25(Wed)11:01:51 No.107334544

Anonymous 11/26/25(Wed)11:01:51 No.107334544

>>107334502
does anistudio support z image yet?

Anonymous
11/26/25(Wed)11:01:53 No.107334545

Anonymous 11/26/25(Wed)11:01:53 No.107334545

File: dead.png (347 KB, 655x455)

347 KB PNG

>RuntimeError: CUDA error: HIPBLAS_STATUS_ALLOC_FAILED when calling `hipblasCreate(handle)`
NEVER UPDATE COMFYUI
FUCK

Anonymous
11/26/25(Wed)11:02:28 No.107334550

Anonymous 11/26/25(Wed)11:02:28 No.107334550

File: 1734869171115090.png (2.67 MB, 1728x1221)

2.67 MB PNG

Prompt:
人物特征:东亚年轻女性,齐肩中长发,发尾微内扣,深棕色头发,带有空气刘海;佩戴黑色大圆框眼镜,镜腿有蓝色细节;化淡妆,唇色为淡粉色,左耳戴一颗小巧耳钉,颈间有细链项链;穿着米白色针织上衣。
风格氛围:清新日常的自拍风格,光线柔和,人物面带自然微笑,神态亲切。
背景细节:后方有带有 “L NREODNAZ SOVEK ALLERHAO” 字样的圆形标识,背景墙是大理石纹理与木质材质的组合,带有暖光照明。

Translated to english:
An East Asian young woman with medium-length, shoulder-length hair, the ends slightly curled inward. Her hair is a deep brown, styled with wispy, airy bangs. She wears black oversized round-frame glasses with blue detailing on the temples. Her makeup is light, with lips in a soft pink shade. A small stud earring adorns her left ear, and a delicate chain necklace rests at her neck. She is dressed in a cream-colored knit top.
Style & Atmosphere:
A fresh, everyday selfie style. The lighting is soft, and she smiles naturally, her expression warm and approachful.
Background Details:
Behind her, there is a circular sign with the text “L NREODONAZ SOVEK ALLERHAO.” The wall features a combination of marble texture and wood grain, illuminated by warm lighting.

Anonymous
11/26/25(Wed)11:02:33 No.107334553

Anonymous 11/26/25(Wed)11:02:33 No.107334553

File: ComfyUI_temp_iubdp_00030_.png (2.54 MB, 2304x960)

2.54 MB PNG

>>107334491
WAN seems to avoid the "AI Slop" look, but it's hard to get it to follow lighting prompt (i.e blue light, etc) and the text in the image is weird.
Qwen + Wan seems to be the best combo: Qwen for the prompt adherence and good text and Wan to remove the "AI Slop" look
---
https://files.catbox.moe/24i01w.png

Anonymous
11/26/25(Wed)11:02:49 No.107334556

Anonymous 11/26/25(Wed)11:02:49 No.107334556

File: Z-image turbo.png (846 KB, 720x1280)

846 KB PNG

Remember that china has won (but seriously though, when will they release it? I'm getting tired of that edging)
https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo/picture

Anonymous
11/26/25(Wed)11:03:04 No.107334562

Anonymous 11/26/25(Wed)11:03:04 No.107334562

nb4 z is a total flop and the anon who pretended to hype it the most spends days trolling like "AHAHAHA LOCALKEKS WERE SO EXCITED BUT I KNEW IT WOULD BE SHIT"

Anonymous
11/26/25(Wed)11:03:25 No.107334566

Anonymous 11/26/25(Wed)11:03:25 No.107334566

File: Screenshot 2025-11-26 at (...).png (11 KB, 425x130)

11 KB PNG

IT'S COMING

Anonymous
11/26/25(Wed)11:03:43 No.107334574

Anonymous 11/26/25(Wed)11:03:43 No.107334574

>>107334550
flux bro. its over.

Anonymous
11/26/25(Wed)11:04:00 No.107334575

Anonymous 11/26/25(Wed)11:04:00 No.107334575

>>107334550
sovless vs sovl

Anonymous
11/26/25(Wed)11:04:26 No.107334579

Anonymous 11/26/25(Wed)11:04:26 No.107334579

File: YUSUKE psycho scream.png (613 KB, 736x736)

613 KB PNG

>>107334550
Black Forest Labs? ... Nah.

More like Big Fucking Losers.

Anonymous
11/26/25(Wed)11:04:36 No.107334581

Anonymous 11/26/25(Wed)11:04:36 No.107334581

HAIL ALIBABA
HAIL CHINA
HAIL XI JINPING THOUGHT

Anonymous
11/26/25(Wed)11:05:09 No.107334586

Anonymous 11/26/25(Wed)11:05:09 No.107334586

>>107334562
This will most certainly happen.

Anonymous
11/26/25(Wed)11:05:28 No.107334589

Anonymous 11/26/25(Wed)11:05:28 No.107334589

File: ComfyUI_temp_qmfoy_00051_.png (3.43 MB, 1824x1248)

3.43 MB PNG

>>107334574
It never began. Eurocucks can't stop losing
>>107334575
truthnvke
>>107334581
all my niggas chicoms
https://files.catbox.moe/lhf9on.png

Anonymous
11/26/25(Wed)11:05:29 No.107334590

Anonymous 11/26/25(Wed)11:05:29 No.107334590

>>107334566
Niggel prease let this be true!

Anonymous
11/26/25(Wed)11:06:06 No.107334595

Anonymous 11/26/25(Wed)11:06:06 No.107334595

>Decoupled-DMD: The Acceleration Magic Behind Z-Image
>DMDR: Fusing DMD with Reinforcement Learning

this is big, DMD was fucking great for SDXL

Anonymous
11/26/25(Wed)11:06:59 No.107334605

Anonymous 11/26/25(Wed)11:06:59 No.107334605

File: 1733893251833551.png (62 KB, 220x211)

62 KB PNG

>>107334566
AIEEEEEEE LETS GOOOOOO

Anonymous
11/26/25(Wed)11:07:37 No.107334608

Anonymous 11/26/25(Wed)11:07:37 No.107334608

Retard here.
What exactly does turbo mean? Can the turbo version be finetuned or do we need to wait for the base one for that?

Anonymous
11/26/25(Wed)11:07:56 No.107334614

Anonymous 11/26/25(Wed)11:07:56 No.107334614

You guys get your hopes up way too high, every, single, time. Just wait till it releases.

>>107334545
>he pulled?

Anonymous
11/26/25(Wed)11:07:59 No.107334616

Anonymous 11/26/25(Wed)11:07:59 No.107334616

>>107334562
>>107334586
why would it happen, we can see non cherry picked images on the site (made by regular users) and those images all look good
https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo/picture

Anonymous
11/26/25(Wed)11:08:26 No.107334618

Anonymous 11/26/25(Wed)11:08:26 No.107334618

File: enter-the-dragon-5-1080x675.png (788 KB, 1080x675)

788 KB PNG

我爱北京天安门,
天安门上太阳升;
伟大领袖毛主席,
指引我们向前进。

Anonymous
11/26/25(Wed)11:09:01 No.107334620

Anonymous 11/26/25(Wed)11:09:01 No.107334620

File: 1744501439305219.png (287 KB, 2234x1252)

287 KB PNG

>>107334608
>What exactly does turbo mean?
they explained everything here
https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo/summary?version=master

Anonymous
11/26/25(Wed)11:10:04 No.107334629

Anonymous 11/26/25(Wed)11:10:04 No.107334629

>>107334608
>Can the turbo version be finetuned or do we need to wait for the base one for that?
it's really hard to undistill a model, look how much we struggled with flux dev and flux schnell, we'll wait the base model for that

Anonymous
11/26/25(Wed)11:10:06 No.107334630

Anonymous 11/26/25(Wed)11:10:06 No.107334630

File: Flux2_00047_ (1).jpg (1.41 MB, 2400x3200)

1.41 MB JPG

>>107331027
Flux2 scheduler glues sigmas to the ceiling at extreme resolutions and that seems to be part of the problem. Genning with regular beta 0.9/0.8 helps, though dithering isn't gone for good.
(I'm not repeating that experiment, though)

Anonymous
11/26/25(Wed)11:10:36 No.107334635

Anonymous 11/26/25(Wed)11:10:36 No.107334635

File: 1744190522407245.jpg (1.24 MB, 2016x1844)

1.24 MB JPG

alternative collage for last thread

Anonymous
11/26/25(Wed)11:10:56 No.107334638

Anonymous 11/26/25(Wed)11:10:56 No.107334638

>>107334620
>>107334626
>>107334629
Thanks! Guess we're waiting a bit longer until shit gets real.

Anonymous
11/26/25(Wed)11:11:04 No.107334640

Anonymous 11/26/25(Wed)11:11:04 No.107334640

>>107334595
those 2 methods are those being used to make 8 steps good quality right? it's not linked to the base model?

Anonymous
11/26/25(Wed)11:11:04 No.107334641

Anonymous 11/26/25(Wed)11:11:04 No.107334641

have you started learning chinese to prompt yet?

Anonymous
11/26/25(Wed)11:11:07 No.107334642

Anonymous 11/26/25(Wed)11:11:07 No.107334642

>>107334579
a particularly malicious individual might opt for an alternative, semi-rhyming substitute for "lab", but i'm glad we have class here in /ldg/

Anonymous
11/26/25(Wed)11:12:05 No.107334654

Anonymous 11/26/25(Wed)11:12:05 No.107334654

File: file.png (64 KB, 250x250)

64 KB PNG

>>107334566
https://huggingface.co/collections/Tongyi-MAI/z-image

Anonymous
11/26/25(Wed)11:12:07 No.107334655

Anonymous 11/26/25(Wed)11:12:07 No.107334655

>>107334614
>You guys
These are Chinese bots, mostly. They always swarm on release. Some of their models turn out legitimately good, though.

Anonymous
11/26/25(Wed)11:12:12 No.107334657

Anonymous 11/26/25(Wed)11:12:12 No.107334657

>>107334630
incredible research anon

Anonymous
11/26/25(Wed)11:12:13 No.107334658

Anonymous 11/26/25(Wed)11:12:13 No.107334658

>>107334642
bfc

Anonymous
11/26/25(Wed)11:12:15 No.107334660

Anonymous 11/26/25(Wed)11:12:15 No.107334660

BFL on suicide watch

Anonymous
11/26/25(Wed)11:13:07 No.107334666

Anonymous 11/26/25(Wed)11:13:07 No.107334666

>>107334660
>BFL on suicide watch
good, they fucking deserved it

Anonymous
11/26/25(Wed)11:13:21 No.107334670

Anonymous 11/26/25(Wed)11:13:21 No.107334670

File: just-a-few-z-image-turbo-(...).png (1.09 MB, 1080x1080)

1.09 MB PNG

--ALERT ALERT--
A REDDITOR HAS SUCCESSFULLY PROMPTED AMERICANA THOT
https://www.reddit.com/r/StableDiffusion/comments/1p7b016/just_a_few_zimageturbo_shots/

JESUS that's detailed for a turbo model and low resolution.

Anonymous
11/26/25(Wed)11:13:59 No.107334677

Anonymous 11/26/25(Wed)11:13:59 No.107334677

>one of the z-image devs liked the Neta repository

LUMINABROS WE ARE AVENGED

Anonymous
11/26/25(Wed)11:14:45 No.107334687

Anonymous 11/26/25(Wed)11:14:45 No.107334687

File: 1748033940674231.png (562 KB, 607x1203)

562 KB PNG

>Flux 2 is released
>all people are talking about on r/Stablediffusion is the upcoming Z-Image model
AIIIIEEE SLOPMAN SAVE MEEEE

Anonymous
11/26/25(Wed)11:15:34 No.107334696

Anonymous 11/26/25(Wed)11:15:34 No.107334696

>>107334670
look at the hair, it's like a realplayer screengrab

Anonymous
11/26/25(Wed)11:15:46 No.107334699

Anonymous 11/26/25(Wed)11:15:46 No.107334699

>>107334677
really? give me his twitter account plz

Anonymous
11/26/25(Wed)11:16:25 No.107334705

Anonymous 11/26/25(Wed)11:16:25 No.107334705

>>107334660
When was you when Flux 2 was kill ?

Anonymous
11/26/25(Wed)11:19:32 No.107334731

Anonymous 11/26/25(Wed)11:19:32 No.107334731

>>107334556
cute boy but what's with the bra?

Anonymous
11/26/25(Wed)11:19:52 No.107334735

Anonymous 11/26/25(Wed)11:19:52 No.107334735

File: 1755700785413481.jpg (48 KB, 686x386)

48 KB JPG

>>107334620
>fits comfortably on a 16GB GPU
That's they key to success, it's in the roughly same spot as XL when it came out when 8GB only were starting to become the norm. 12GB should be able to run and train loras with it without too much problems too, meaning that alot of people will be able to use the model right away without unholy jewish tricks, unlike the fuckhueg novelty models that kept dropping and failing.

Anonymous
11/26/25(Wed)11:20:46 No.107334744

Anonymous 11/26/25(Wed)11:20:46 No.107334744

File: chinkmodelhype.png (10 KB, 555x193)

10 KB PNG

god i hope they can get it out by lunchtime

Anonymous
11/26/25(Wed)11:20:49 No.107334745

Anonymous 11/26/25(Wed)11:20:49 No.107334745

File: flux2__00047_.png (1.63 MB, 832x1216)

1.63 MB PNG

>>107334550
turn the flux guidance down

Anonymous
11/26/25(Wed)11:20:52 No.107334748

Anonymous 11/26/25(Wed)11:20:52 No.107334748

>>107334660
BFL announced their upcoming video model, Wan is released, BFL quietly cancels their video model

BFL announces Flux 2, Z-image is released...

China is so far ahead at this point, only thing the west is better at is crippling censoring

Anonymous
11/26/25(Wed)11:20:58 No.107334749

Anonymous 11/26/25(Wed)11:20:58 No.107334749

File: flux2_bf16_c_00037_.png (3.7 MB, 1200x1600)

3.7 MB PNG

>>107334550
Eh

Anonymous
11/26/25(Wed)11:21:10 No.107334752

Anonymous 11/26/25(Wed)11:21:10 No.107334752

File: Screenshot 2025-11-26 at (...).png (56 KB, 639x455)

56 KB PNG

>>107334699
actually three of the huggingspace team members also did the Lumina paper

Anonymous
11/26/25(Wed)11:22:25 No.107334766

Anonymous 11/26/25(Wed)11:22:25 No.107334766

>>107334745
>>107334749
Not even a pajeet would believe these are real

Anonymous
11/26/25(Wed)11:22:31 No.107334768

Anonymous 11/26/25(Wed)11:22:31 No.107334768

>>107334745
to what?

Anonymous
11/26/25(Wed)11:22:36 No.107334770

Anonymous 11/26/25(Wed)11:22:36 No.107334770

>>107334589
the eurocuck model was cencored (like all of yuropooria) so who cares lol

Anonymous
11/26/25(Wed)11:22:42 No.107334771

Anonymous 11/26/25(Wed)11:22:42 No.107334771

>>107334748
Don't forget that one
>BFL releases Flux kontext dev, gets BTFO by Qwen Image Edit

Alibaba 3 - BFL 0

Anonymous
11/26/25(Wed)11:23:09 No.107334773

Anonymous 11/26/25(Wed)11:23:09 No.107334773

>>107334766
If I resave it as jpg 40 times, will you be convinced?

Anonymous
11/26/25(Wed)11:23:24 No.107334777

Anonymous 11/26/25(Wed)11:23:24 No.107334777

File: 1763920020886770.mp4 (1.3 MB, 832x480)

1.3 MB MP4

Anonymous
11/26/25(Wed)11:23:38 No.107334779

Anonymous 11/26/25(Wed)11:23:38 No.107334779

>>107334771

isnt bfl funded by musk?

Anonymous
11/26/25(Wed)11:23:43 No.107334781

Anonymous 11/26/25(Wed)11:23:43 No.107334781

>>107334769
>But they are already available.
where?

Anonymous
11/26/25(Wed)11:24:22 No.107334787

Anonymous 11/26/25(Wed)11:24:22 No.107334787

>>107334502
>>107334635
Good collages with many images means healthy bread :-)

Anonymous
11/26/25(Wed)11:24:47 No.107334792

Anonymous 11/26/25(Wed)11:24:47 No.107334792

File: 1748489905716629.png (576 KB, 1920x1080)

576 KB PNG

>>107334752
>Why do you never give up
>Because the lumina devs never did
Bigma is finally real lmao

Anonymous
11/26/25(Wed)11:24:51 No.107334794

Anonymous 11/26/25(Wed)11:24:51 No.107334794

>>107334768
1.9 gets interesting but you start to get artifacts, start at 2.5 and go where your heart desires.

Anonymous
11/26/25(Wed)11:25:20 No.107334799

Anonymous 11/26/25(Wed)11:25:20 No.107334799

>>107334781
I am retard, sorry.

Anonymous
11/26/25(Wed)11:29:52 No.107334845

Anonymous 11/26/25(Wed)11:29:52 No.107334845

File: Z-Image turbo.png (1.4 MB, 1024x1024)

1.4 MB PNG

>Husbando, you buy too much ram to run Flux 2 dev, now we are homeress

Anonymous
11/26/25(Wed)11:31:48 No.107334864

Anonymous 11/26/25(Wed)11:31:48 No.107334864

File: Z-Image turbo.png (1.28 MB, 1024x1280)

1.28 MB PNG

I tried Rachel Green from Friends, desu it's way closer than I expected

Anonymous
11/26/25(Wed)11:32:10 No.107334867

Anonymous 11/26/25(Wed)11:32:10 No.107334867

>>107334845
>don't worry honey, i was one of the original rammaxers back in llama 1 times

Anonymous
11/26/25(Wed)11:32:23 No.107334869

Anonymous 11/26/25(Wed)11:32:23 No.107334869

>>107334773
It's not that it's too clean, heck you can see lots of macro blocks on the bangs as is, it's how the lightning affects the skin, you can lower the image quality as much as you want, it will always look uncanny and is a clear result of them training primarily on synthetic data

Anonymous
11/26/25(Wed)11:32:31 No.107334871

Anonymous 11/26/25(Wed)11:32:31 No.107334871

Temper your expectations.

Anonymous
11/26/25(Wed)11:33:12 No.107334876

Anonymous 11/26/25(Wed)11:33:12 No.107334876

>>107334871
>Tempers your nuts on a forge

Anonymous
11/26/25(Wed)11:34:57 No.107334888

Anonymous 11/26/25(Wed)11:34:57 No.107334888

>>107334869
>it's how the lightning affects the skin
this, those slopped models can't help but to make the skin too smoth and shiny, only Chroma (and now Z-Image) managed to surpass that uncanny valey

Anonymous
11/26/25(Wed)11:35:10 No.107334891

Anonymous 11/26/25(Wed)11:35:10 No.107334891

>>107334779
IIRC X used their image service initially, but that was like a year ago, now everything runs on the Grok model

Anonymous
11/26/25(Wed)11:35:30 No.107334893

Anonymous 11/26/25(Wed)11:35:30 No.107334893

>>107334864
>sort by color
>tide
surprinsingly coherent text. especially the "sort by color" sign, since it's out of focus

Anonymous
11/26/25(Wed)11:35:43 No.107334896

Anonymous 11/26/25(Wed)11:35:43 No.107334896

>>107334752
Based

Anonymous
11/26/25(Wed)11:36:05 No.107334901

Anonymous 11/26/25(Wed)11:36:05 No.107334901

>>107334869
>training primarily on synthetic data

zero facts, all schizo posting

Anonymous
11/26/25(Wed)11:36:13 No.107334903

Anonymous 11/26/25(Wed)11:36:13 No.107334903

File: chroma_00015_.png (2.5 MB, 1024x1536)

2.5 MB PNG

>>107334550

Anonymous
11/26/25(Wed)11:36:48 No.107334913

Anonymous 11/26/25(Wed)11:36:48 No.107334913

>>107334845
Hmm. So the secret trick to making it look realistic is simply to slap heavy jpg compression on it?
It looks fine in the mini preview, but you shouldn't open it lol kek

Anonymous
11/26/25(Wed)11:36:50 No.107334915

Anonymous 11/26/25(Wed)11:36:50 No.107334915

>>107334901
then explain why that 32b model is so slopped and that 6b isn't

Anonymous
11/26/25(Wed)11:37:06 No.107334919

Anonymous 11/26/25(Wed)11:37:06 No.107334919

>>107334888
>model not even released yet
>absolutely deepthroating the devs

Anonymous
11/26/25(Wed)11:37:51 No.107334925

Anonymous 11/26/25(Wed)11:37:51 No.107334925

>>107334773
>If I resave it as jpg 40 times, will you be convinced?
>>107334913
>So the secret trick to making it look realistic is simply to slap heavy jpg compression on it?
no need to say the same argument twice we saw it debo

Anonymous
11/26/25(Wed)11:38:07 No.107334930

Anonymous 11/26/25(Wed)11:38:07 No.107334930

>>107334915
define slopped

Anonymous
11/26/25(Wed)11:38:51 No.107334942

Anonymous 11/26/25(Wed)11:38:51 No.107334942

>>107334919
>t. seething bfl employee
get back to the lab and make a better model than this bloated shit nigger

Anonymous
11/26/25(Wed)11:39:32 No.107334946

Anonymous 11/26/25(Wed)11:39:32 No.107334946

>>107334871
Impossible, did you see ldg when wan2.5 released? Oh it was full blown kicking, screaming, pants shitting meltdown when they announced api only, kek.

Anonymous
11/26/25(Wed)11:40:27 No.107334955

Anonymous 11/26/25(Wed)11:40:27 No.107334955

File: file.png (1 KB, 458x63)

1 KB PNG

Is that bad

it still lets me slop

Anonymous
11/26/25(Wed)11:40:41 No.107334957

Anonymous 11/26/25(Wed)11:40:41 No.107334957

>state of the art western model is released
>chink shills and poorfags flood the thread to do damage control for their master Xi
kek

Anonymous
11/26/25(Wed)11:40:43 No.107334959

Anonymous 11/26/25(Wed)11:40:43 No.107334959

File: 1748295002480293.png (306 KB, 500x500)

306 KB PNG

>>107334946
>Oh it was full blown kicking, screaming, pants shitting meltdown when they announced api

Anonymous
11/26/25(Wed)11:40:52 No.107334967

Anonymous 11/26/25(Wed)11:40:52 No.107334967

>>107334925
It's nice that others notice it too. You can't recognize anything in the pictures, not even the teeth are visible because of all the artifacts.
You chink vram poor bot
But I'm happy for all third-worlder that they also have something to play with.

Anonymous
11/26/25(Wed)11:41:48 No.107334972

Anonymous 11/26/25(Wed)11:41:48 No.107334972

File: is this nigga serious?.png (29 KB, 200x200)

29 KB PNG

>>107334957
>state of the art western model is released

Anonymous
11/26/25(Wed)11:42:30 No.107334978

Anonymous 11/26/25(Wed)11:42:30 No.107334978

>>107334919
I hope it's a good model too but this much hype before we even get our hands on it is insane. What a fucking madhouse.

Anonymous
11/26/25(Wed)11:42:54 No.107334981

Anonymous 11/26/25(Wed)11:42:54 No.107334981

>>107334957
link?

Anonymous
11/26/25(Wed)11:43:39 No.107334986

Anonymous 11/26/25(Wed)11:43:39 No.107334986

>reddit reposting general

Anonymous
11/26/25(Wed)11:43:42 No.107334987

Anonymous 11/26/25(Wed)11:43:42 No.107334987

>>107334946
>t. the guy who was shitting his pants when 2.5 announced API only >>107334562

Anonymous
11/26/25(Wed)11:43:48 No.107334988

Anonymous 11/26/25(Wed)11:43:48 No.107334988

File: 93a3c822-5a80-4183-a214-5(...).jpg (127 KB, 864x1152)

127 KB JPG

A natural-light museum exhibit photographed with a neutral documentary tone. The image shows a detailed diorama behind glass, lit primarily by soft overhead museum lighting and ambient daylight leaking from nearby windows. In the center of the display is a lifelike model of a fictional prehistoric animal labeled “VRAMLET.” The creature resembles a medium-sized mammal with a long, droopy nose similar to a tapir’s trunk but slimmer, sagging downward in a comical way. It wears oversized, thick-rimmed nerd-style glasses that sit awkwardly on its snout.

In front of the VRAMLET is a retro beige CRT computer on an old desk surface. The monitor is bulky, with a curved screen and visible ventilation slits. Gray smoke billows from the computer’s vents and keyboard, curling upward toward the diorama’s lighting. The VRAMLET model is posed as if actively using the malfunctioning machine—forelimbs resting on the keyboard, head tilted forward as if concentrating.

Museum placards and descriptive text sit off to one side, slightly out of focus. The background includes typical exhibit elements: faux foliage, painted prehistoric landscape mural, and textured ground materials like resin dirt and artificial rocks. The glass barrier in front reflects faint silhouettes of visitors, preserving the natural candid feel. The overall style is realistic, mildly humorous due to the glasses and smoking CRT, and consistent with standard museum photography.

Anonymous
11/26/25(Wed)11:43:53 No.107334992

Anonymous 11/26/25(Wed)11:43:53 No.107334992

>>107334978
I mean, the images do look good, so if this what we'll get from ComfyUi then we are definitely saved

Anonymous
11/26/25(Wed)11:45:06 No.107335004

Anonymous 11/26/25(Wed)11:45:06 No.107335004

File: Wan_00078.mp4 (2.19 MB, 480x480)

2.19 MB MP4

Didn't realize it would work to slot in color match, figure'd it'd break the merges. Very very nice.

Anonymous
11/26/25(Wed)11:45:07 No.107335005

Anonymous 11/26/25(Wed)11:45:07 No.107335005

File: May I see it.jpg (28 KB, 500x378)

28 KB JPG

>>107334957
>state of the art western model is released

Anonymous
11/26/25(Wed)11:47:08 No.107335024

Anonymous 11/26/25(Wed)11:47:08 No.107335024

File: No refunds.png (93 KB, 225x225)

93 KB PNG

>>107334967
>Nooooo, small models can't be good! You have to stack moar layers to get better results! Why did I pay for an RTX 5090 then???
No refund.

Anonymous
11/26/25(Wed)11:47:08 No.107335025

Anonymous 11/26/25(Wed)11:47:08 No.107335025

File: 1764107953073766.png (104 KB, 770x2051)

104 KB PNG

>>107335005
Agree to the terms and conditions goy, and we MAY let you use it to generate pictures of kittens

Anonymous
11/26/25(Wed)11:47:32 No.107335031

Anonymous 11/26/25(Wed)11:47:32 No.107335031

File: 1755384622148825.jpg (1004 KB, 2880x1349)

1004 KB JPG

Prompt:
在画面中央,一位年轻女子,有着长长的、飘逸的棕色头发和柔软的刘海,温柔地凝视着观众。她的表情平静而沉思,精致的五官和眼睛似乎蕴含着安静的惊奇。她穿着一件轻便的无袖服装,与飘逸的环境巧妙地融为一体,让她的皮肤和头发捕捉到周围的光芒。她的姿势很放松,几乎就好像她是场景的一部分,而不是观察它。

这里的环境似乎是一个梦幻般的冬季花园,花朵被闪闪发光的冰晶包裹着,营造出超凡脱俗的景观。背景柔和模糊,充满冷蓝色调和散景亮点,让人联想到冰冻的花朵或发光的霜。前景还以这些冰冷的花朵形式为特色,有些部分处于失焦状态,增加了深度并强化了神奇的氛围。整个环境感觉时间暂停了,仿佛大自然在冬季的魔力下暂停了。

灯光在塑造气氛方面起着至关重要的作用,在女人的脸上和肩膀上投射出温暖的金色光芒,而周围的世界仍然沐浴在凉爽的蓝色和白色中。

Anonymous
11/26/25(Wed)11:48:05 No.107335041

Anonymous 11/26/25(Wed)11:48:05 No.107335041

>>107334992
If they still look so badly compressed locally, it's DOA

Anonymous
11/26/25(Wed)11:48:10 No.107335044

Anonymous 11/26/25(Wed)11:48:10 No.107335044

File: Flux2_Output_26626.jpg (1.48 MB, 2048x2048)

1.48 MB JPG

Z-Image looks interesting. I somewhat doubt it's more lightweight architecturally than Lumina 2.0 in practice though meaning it's even less likely to be an "immediate SDXL" killer I'd say. Flux 2 is decent also, a bit huge though. Looks way better than Flux 1 for photographic gens out of the box is any case.

Anonymous
11/26/25(Wed)11:48:56 No.107335058

Anonymous 11/26/25(Wed)11:48:56 No.107335058

>gen image assets and day dream about that perfect weg you gonna make with them
I don't even feel the need to play video games at this point that much, this is just too good.

Anonymous
11/26/25(Wed)11:49:23 No.107335064

Anonymous 11/26/25(Wed)11:49:23 No.107335064

white synthographers crave big chang cock

Anonymous
11/26/25(Wed)11:50:36 No.107335079

Anonymous 11/26/25(Wed)11:50:36 No.107335079

>>107334986
It's just him, he does it all the time.

Anonymous
11/26/25(Wed)11:50:48 No.107335082

Anonymous 11/26/25(Wed)11:50:48 No.107335082

>>107335031
Flux 2
>Oversaturated colors
>plastic skin
>low details
Z-image
>actual kino skin
>subtle colors with a smooth color gradient
>really nice details especially the hair

32b vs 6b btw

Anonymous
11/26/25(Wed)11:50:54 No.107335083

Anonymous 11/26/25(Wed)11:50:54 No.107335083

>>107335064
>big chang cock
but alas, this doesn't exist.

Anonymous
11/26/25(Wed)11:51:52 No.107335096

Anonymous 11/26/25(Wed)11:51:52 No.107335096

>>107335079
>him
poopdickschizo?

Anonymous
11/26/25(Wed)11:52:10 No.107335100

Anonymous 11/26/25(Wed)11:52:10 No.107335100

>>107335031
how the FUCK do labs see something like the left most image and say "wow it looks so realistic... SOTA! :D" idgi

Anonymous
11/26/25(Wed)11:52:38 No.107335109

Anonymous 11/26/25(Wed)11:52:38 No.107335109

File: flux2__00057_.png (1.52 MB, 832x1216)

1.52 MB PNG

res_3m/linear_quadratic 28 steps, i guess you don't have to use the flux scheduler at all

Anonymous
11/26/25(Wed)11:53:06 No.107335110

Anonymous 11/26/25(Wed)11:53:06 No.107335110

File: 289472d6-c3f6-4391-8092-d(...).jpg (118 KB, 864x1152)

118 KB JPG

A documentary-style photograph taken inside an older subway station with dirty off-white tiled walls and overhead fluorescent lighting. The main subject is a large rectangular advertisement poster mounted along the platform wall. The original ad has been replaced with a clean, minimalist promo for Flux 2, the new AI image generator from Black Forest Labs. The design is corporate and sleek: large bold sans-serif title “Flux 2” centered on a bright white background, with a smaller tagline beneath such as “next-generation image synthesis” and a Black Forest Labs logo along the bottom edge.

The ad has been heavily vandalized with multiple layers of graffiti in different handwriting styles, all in the sloppy marker-pen aesthetic commonly found in real subway ads. Examples of the graffiti include:

“fluxbros… it’s over” scrawled unevenly across the top in fading blue marker.

“sovless” written diagonally near the center in aggressive, jagged lettering.

A block of political graffiti written in red marker, formatted like a chant:
“HAIL ALIBABA
HAIL CHINA
HAIL XI JINPING THOUGHT”
(clearly presented as vandalism, not part of the ad).

More random doodles, arrows, crossed-out text, and crude speech bubbles occupy the rest of the space, giving the scene a chaotic, defaced look. The lower edge of the poster shows wear, peeling corners, and grime. The composition is realistic, slightly wide-angle, capturing the gritty authenticity of subway-ad vandalism.

Anonymous
11/26/25(Wed)11:53:13 No.107335113

Anonymous 11/26/25(Wed)11:53:13 No.107335113

File: file.png (2 KB, 176x53)

2 KB PNG

cant imagine not having at least 96gb ram btw

Anonymous
11/26/25(Wed)11:53:40 No.107335119

Anonymous 11/26/25(Wed)11:53:40 No.107335119

File: img_00086_.jpg (532 KB, 1632x1264)

532 KB JPG

Anonymous
11/26/25(Wed)11:55:52 No.107335138

Anonymous 11/26/25(Wed)11:55:52 No.107335138

>>107335082
Z-Image isn't kino at all there IMO, it looks like it has the same JPEGmaxxed problem as Cosmos 2 and HiDream kinda

Anonymous
11/26/25(Wed)11:57:48 No.107335157

Anonymous 11/26/25(Wed)11:57:48 No.107335157

>>107335138
Because its been converted to jpeg on generation as the original image on the promo site is only jpeg, then it was encoded again into jpeg for that comparison. When generating you will be saving directly into png.

Anonymous
11/26/25(Wed)11:58:37 No.107335166

Anonymous 11/26/25(Wed)11:58:37 No.107335166

File: Detailed-woow.jpg (494 KB, 1079x1394)

494 KB JPG

> JESUS that's detailed
lol chinks

Anonymous
11/26/25(Wed)11:58:40 No.107335167

Anonymous 11/26/25(Wed)11:58:40 No.107335167

>>107335138
cope

Anonymous
11/26/25(Wed)11:59:02 No.107335172

Anonymous 11/26/25(Wed)11:59:02 No.107335172

File: flux2__00058_.png (1.58 MB, 832x1216)

1.58 MB PNG

>>107335157
anon you are making apologist explanations for an unreleased model, have some self respect

Anonymous
11/26/25(Wed)11:59:13 No.107335174

Anonymous 11/26/25(Wed)11:59:13 No.107335174

File: flux2_bf16_c_00047_.png (1.44 MB, 720x1328)

1.44 MB PNG

>>107335031

Anonymous
11/26/25(Wed)11:59:40 No.107335179

Anonymous 11/26/25(Wed)11:59:40 No.107335179

I would take faux jpeg compression over hyperslop 2.5d soulless bullshit any day. You are retarded.

Anonymous
11/26/25(Wed)12:00:35 No.107335191

Anonymous 11/26/25(Wed)12:00:35 No.107335191

>>107335157
i have seen this cope before

Anonymous
11/26/25(Wed)12:02:38 No.107335211

Anonymous 11/26/25(Wed)12:02:38 No.107335211

File: 592892e4-f123-4236-9392-d(...).jpg (125 KB, 864x1152)

125 KB JPG

Prompt: A gritty, documentary-style photograph inside an aging subway station with stained off-white tiles, metal grime, and cold fluorescent lighting. The main subject is a large rectangular advertisement poster mounted on the wall. The ad itself promotes Flux 2, the new AI image generator from Black Forest Labs. The design is sterile and corporate: stark white background, bold black sans-serif “Flux 2” headline, a small tagline like “next-generation image synthesis,” and a Black Forest Labs logo near the bottom. The poster has been aggressively vandalized with layers of graffiti made using different markers and handwriting styles, giving it a chaotic, real-world subway defacement look. The graffiti includes: “fluxbros… it’s over” scribbled in uneven blue marker across the top. “sovless” written diagonally in jagged red lettering. “cringe model desu senpai” scrawled across the lower center in rushed bubble-letter style. “>mfw they release a better model than mine’s the day I announced it” written in green marker, using an imageboard-style quote arrow. A crude graffiti drawing of a frog, low-effort and lopsided, in black marker beside the text. Additional random scratches, lines, scribbles, and half-erased tags clutter the edges. The poster’s corners peel slightly, and grime accumulates along its edges, reinforcing the worn, urban, photo-journalistic realism.
Sampling Steps: 9Sampler Method: eulerCFG Scale: 7.5Random Seed: 1451342554Size: 864x1152
Elapsed Time: 2.5 s

Anonymous
11/26/25(Wed)12:04:48 No.107335225

Anonymous 11/26/25(Wed)12:04:48 No.107335225

>>107335119
is this chroma?

Anonymous
11/26/25(Wed)12:04:52 No.107335226

Anonymous 11/26/25(Wed)12:04:52 No.107335226

>>107335179
This

Anonymous
11/26/25(Wed)12:07:12 No.107335242

Anonymous 11/26/25(Wed)12:07:12 No.107335242

>>107335172
anon, you are making libel for an unreleased model, have some self respect

Anonymous
11/26/25(Wed)12:08:12 No.107335254

Anonymous 11/26/25(Wed)12:08:12 No.107335254

>>107335179
>I would take faux jpeg compression over hyperslop 2.5d soulless bullshit any day. You are retarded.
100% this, artifacts >>>> plastic

Anonymous
11/26/25(Wed)12:08:47 No.107335258

Anonymous 11/26/25(Wed)12:08:47 No.107335258

>>107335166
> 6b bro
> turbo bro

Anonymous
11/26/25(Wed)12:10:27 No.107335270

Anonymous 11/26/25(Wed)12:10:27 No.107335270

File: flux2__00038_.png (1.71 MB, 1216x832)

1.71 MB PNG

>>107335242
i'm actually pretty pumped for the new model too, but it's not a panacea before it's even been released

Anonymous
11/26/25(Wed)12:10:34 No.107335272

Anonymous 11/26/25(Wed)12:10:34 No.107335272

File: Flux2_00041_.png (1.44 MB, 1024x1024)

1.44 MB PNG

Anonymous
11/26/25(Wed)12:11:27 No.107335280

Anonymous 11/26/25(Wed)12:11:27 No.107335280

File: 1752555027416494.png (165 KB, 400x205)

165 KB PNG

>>107335191

>>107335172
Heres the zoomed in corner of your own png image on the left compared to the encoded with the default ffmpeg jpeg preset on the right, retard

Anonymous
11/26/25(Wed)12:11:51 No.107335284

Anonymous 11/26/25(Wed)12:11:51 No.107335284

File: 1432498179182.png (296 KB, 722x768)

296 KB PNG

Does the Z-Image use any existing vae/TE or is everything new again?

Anonymous
11/26/25(Wed)12:12:20 No.107335288

Anonymous 11/26/25(Wed)12:12:20 No.107335288

>>107335270
fair enough

Anonymous
11/26/25(Wed)12:13:23 No.107335295

Anonymous 11/26/25(Wed)12:13:23 No.107335295

>>107335284
I guess you have to look at Comfy's code to see the answer
https://github.com/comfyanonymous/ComfyUI/commit/e9aae31fa241a6a63a368800146ea91629d4e8c2
(can't help you further I'm a codelet)

Anonymous
11/26/25(Wed)12:13:32 No.107335297

Anonymous 11/26/25(Wed)12:13:32 No.107335297

>>107335179
those who prefer the second one should not be taken seriously and they likely reside in their states custody in some kind of prison or mental facility

Anonymous
11/26/25(Wed)12:14:24 No.107335302

Anonymous 11/26/25(Wed)12:14:24 No.107335302

File: flux2dev q8.png (1.21 MB, 720x1280)

1.21 MB PNG

>prompt: woman
hagbros...

Anonymous
11/26/25(Wed)12:14:54 No.107335305

Anonymous 11/26/25(Wed)12:14:54 No.107335305

>>107335297
based

Anonymous
11/26/25(Wed)12:16:26 No.107335317

Anonymous 11/26/25(Wed)12:16:26 No.107335317

>>107335031
the blur is so beautiful to look at on Z-image, it gets stronger and stronger in a really smooth transition,

Anonymous
11/26/25(Wed)12:16:56 No.107335319

Anonymous 11/26/25(Wed)12:16:56 No.107335319

>>107335272
Doesn't look Indian though.

Anonymous
11/26/25(Wed)12:17:23 No.107335322

Anonymous 11/26/25(Wed)12:17:23 No.107335322

File: flux2__00059_.png (1.44 MB, 832x1216)

1.44 MB PNG

>>107335302
please do not mock my adult woman fetish

Anonymous
11/26/25(Wed)12:18:03 No.107335326

Anonymous 11/26/25(Wed)12:18:03 No.107335326

>>107335284
Text encoder is new. Just a few months old and afaik nothing else used it.
No clue about which vae it uses though.

Anonymous
11/26/25(Wed)12:18:08 No.107335327

Anonymous 11/26/25(Wed)12:18:08 No.107335327

File: 1759468755080390.png (285 KB, 1581x809)

285 KB PNG

>>107335280
Here's zoomed in more.

There is a reason why jpeg is so much smaller in size, retards.

Anonymous
11/26/25(Wed)12:18:32 No.107335329

Anonymous 11/26/25(Wed)12:18:32 No.107335329

>>107335272
>>107335319
yeah he looks like a spic

Anonymous
11/26/25(Wed)12:20:48 No.107335351

Anonymous 11/26/25(Wed)12:20:48 No.107335351

File: flux2__00022_.png (1.61 MB, 832x1216)

1.61 MB PNG

>having a melty posting pixels from jpegs to show how compression works
ishygddt

Anonymous
11/26/25(Wed)12:21:38 No.107335356

Anonymous 11/26/25(Wed)12:21:38 No.107335356

File: Another Comparison.png (978 KB, 1080x778)

978 KB PNG

Anonymous
11/26/25(Wed)12:23:08 No.107335372

Anonymous 11/26/25(Wed)12:23:08 No.107335372

>>107335356
>snakebite
At least give XL a fighting chance kek

Anonymous
11/26/25(Wed)12:23:20 No.107335375

Anonymous 11/26/25(Wed)12:23:20 No.107335375

>>107335356
>"The man is giving a thumbs up"
>Only Flux 2 pro doesn't respect that, it has a 24b text encoder btw
AIEEEEEEEEEE

Anonymous
11/26/25(Wed)12:23:34 No.107335379

Anonymous 11/26/25(Wed)12:23:34 No.107335379

File: 1744739307418768.png (792 KB, 1136x912)

792 KB PNG

qwen edit v3 where

2509 is fine but I wonder what is changed.

Anonymous
11/26/25(Wed)12:23:43 No.107335382

Anonymous 11/26/25(Wed)12:23:43 No.107335382

>>107335284
>>107335295
The code seems to indicate it's using the Qwen3 4B text encoder which makes sense. It's smaller than T5 thankfully

This is also fun because there are various sizes of TE in this architecture so maybe we could try projecting the smaller 0.8B output into it, for instance

Anonymous
11/26/25(Wed)12:23:55 No.107335383

Anonymous 11/26/25(Wed)12:23:55 No.107335383

File: 1756084949439544.jpg (58 KB, 600x1104)

58 KB JPG

>>107335351

Anonymous
11/26/25(Wed)12:24:25 No.107335387

Anonymous 11/26/25(Wed)12:24:25 No.107335387

>>107335356
sdxl: looks slightly like Ai-slop
Z-image: amateur photograph picture
Flux.2 pro: professional photograph picture with slightly too much post editing or filtering

Anonymous
11/26/25(Wed)12:24:51 No.107335393

Anonymous 11/26/25(Wed)12:24:51 No.107335393

>>107335382
*i mean it's smaller than t5-xxl, also i meant 0.6B

Anonymous
11/26/25(Wed)12:26:57 No.107335412

Anonymous 11/26/25(Wed)12:26:57 No.107335412

File: flux2__00060_.png (1.36 MB, 832x1216)

1.36 MB PNG

we're still scratching the surface i guess.i wired up the Q8 GGUF to the clownsharksampler and ran this with res_3m/kl_optimal and the detail boost node. 28 steps. it's still dog slow but it's fun to fuck around with. see y'all when z drops.

>a capture from an old tv show. a scene depicting a gothic woman as she stands on the set of a 90s sitcom. she wears a black high-waisted thong with skulls on it and a long sleeve very short crop top. she has heavy goth makeup her shirt says "Death" on it in gothic letters. she has wide full hips and a slender stomach. her hair is black with dark purple highlights. she looks as if she is about to say something, looking left off screen, mouth open. behind her are all the makings of a 90s living room, a beige sofa, a brass lamp on a lamp table, a bookcase full of spooky toys etc. on the wall is a pentagram themed hanging. beside stands a stuffed black goat, wearing a gold pentagram chain. the goat is almost as tall as she is, vhs artifacts are visible on the image

Anonymous
11/26/25(Wed)12:27:33 No.107335416

Anonymous 11/26/25(Wed)12:27:33 No.107335416

>>107335351
Did BFL take notes from Lodestone and train on troons?

Anonymous
11/26/25(Wed)12:28:31 No.107335423

Anonymous 11/26/25(Wed)12:28:31 No.107335423

>>107334540
a moment of silence for the finetuners' wasted compute
and for all the unreleased illustrious models kek

Anonymous
11/26/25(Wed)12:29:09 No.107335426

Anonymous 11/26/25(Wed)12:29:09 No.107335426

>>107335225
Yes

Anonymous
11/26/25(Wed)12:29:41 No.107335433

Anonymous 11/26/25(Wed)12:29:41 No.107335433

File: MewingMaxx.png (518 KB, 736x1015)

518 KB PNG

>>107335351
>that jawline

Anonymous
11/26/25(Wed)12:30:24 No.107335440

Anonymous 11/26/25(Wed)12:30:24 No.107335440

>>107334540
The examples from even Neta (not Yume) look better than what we've seen from Z but obviously we need to wait until release to do direct 1:1 comparisons.

Anonymous
11/26/25(Wed)12:30:44 No.107335445

Anonymous 11/26/25(Wed)12:30:44 No.107335445

>>107335356
>1080x778
I see how you're downsampling all the comparison images to hide how badly artifacted z is!

Anonymous
11/26/25(Wed)12:30:59 No.107335447

Anonymous 11/26/25(Wed)12:30:59 No.107335447

all models that get deprecated by another model indirectly contributed a large amount of pressue on the company to actually publish a model that is better than that previous model

Anonymous
11/26/25(Wed)12:32:06 No.107335451

Anonymous 11/26/25(Wed)12:32:06 No.107335451

>>107335387
>staircase in zimage
>not slopped out
lol

Anonymous
11/26/25(Wed)12:32:07 No.107335453

Anonymous 11/26/25(Wed)12:32:07 No.107335453

>>107335447
bro just learned what is competition

Anonymous
11/26/25(Wed)12:33:18 No.107335464

Anonymous 11/26/25(Wed)12:33:18 No.107335464

>>107335445
you understand that if that were the case then z-image would be even more impressive since its easier to create details in large pixel space and then compress them down than rawdog a locked resolution, right?
this is literally how supersampling in games works as an antialiasing method that is the best but the most costly.

Anonymous
11/26/25(Wed)12:33:45 No.107335466

Anonymous 11/26/25(Wed)12:33:45 No.107335466

>>107335356
I mean, for sdxl, that looks pretty decent. Might have to break out the old sdxl workflows.

Anonymous
11/26/25(Wed)12:35:11 No.107335482

Anonymous 11/26/25(Wed)12:35:11 No.107335482

>>107335453
im reminding the retards here that dont seem to understand it as they shit on specific models as completely worthless all the time while wondering why companies ARE realising specific models and then also wondering why they are not at other times

Anonymous
11/26/25(Wed)12:36:45 No.107335496

Anonymous 11/26/25(Wed)12:36:45 No.107335496

File: princess peach confused.png (556 KB, 1024x1024)

556 KB PNG

So while waiting for the actually relevant model, I tested Flux 2 vae a bit.
Decent improvement but nothing revolutionary or mind blowing.
It's fucking 128 channel but also compresses images 4 times higher than previous one. 16x16=256 times compressed instead of 8x8=64.
After Wan 2.2. 5B I was skeptical of whether stacking copious amount of latent channels can compensate for decreased resolution properly, I guess I am proven wrong.
It's the current SOTA for VAE quality.

Anonymous
11/26/25(Wed)12:37:15 No.107335500

Anonymous 11/26/25(Wed)12:37:15 No.107335500

>>107335447
It doesn't always work like that, though.

Anonymous
11/26/25(Wed)12:37:26 No.107335503

Anonymous 11/26/25(Wed)12:37:26 No.107335503

Z-Image can almost perfectly reproduce a lot of cars. Pretty damn impressive. It knows the difference between old & new model years, even

Anonymous
11/26/25(Wed)12:37:36 No.107335510

Anonymous 11/26/25(Wed)12:37:36 No.107335510

File: Z-image turbo.png (1.17 MB, 1152x864)

1.17 MB PNG

lul

Anonymous
11/26/25(Wed)12:38:14 No.107335517

Anonymous 11/26/25(Wed)12:38:14 No.107335517

>>107335464
that works for antialiasing in games, but it's not how sampling of photographs or illustrations works.

Anonymous
11/26/25(Wed)12:38:39 No.107335520

Anonymous 11/26/25(Wed)12:38:39 No.107335520

File: 1742609898176491.png (227 KB, 500x378)

227 KB PNG

>>107335503

Anonymous
11/26/25(Wed)12:40:12 No.107335534

Anonymous 11/26/25(Wed)12:40:12 No.107335534

File: ctywbjZyrAMn2-by1aZE4.png (1.05 MB, 1024x768)

1.05 MB PNG

>>107335520

Anonymous
11/26/25(Wed)12:40:38 No.107335538

Anonymous 11/26/25(Wed)12:40:38 No.107335538

>>107335517
if you gen an image at a higher res than those z-image generations, which are 1MP, and then you downscale everything to 1MP to match z-image, you are doing the exact thing that i described and those initially bigger images will ultimately have more detail

Anonymous
11/26/25(Wed)12:40:42 No.107335539

Anonymous 11/26/25(Wed)12:40:42 No.107335539

>>107335496
>16x16=256 times compressed instead of 8x8=64
They just moved the 2x2 patching into the VAE instead of at the input to the model. So the diffusion model input is 16x16 total compression, same as most other models (which do 8x8 in VAE then 2x2 patches separately at model input).

Anonymous
11/26/25(Wed)12:41:13 No.107335547

Anonymous 11/26/25(Wed)12:41:13 No.107335547

File: Z-image turbo.png (1.26 MB, 1152x864)

1.26 MB PNG

>>107335510
The realism is on point, even in the background the details are good

Anonymous
11/26/25(Wed)12:41:16 No.107335549

Anonymous 11/26/25(Wed)12:41:16 No.107335549

File: OEwp5xYUmelcqFF9Y9Na9.png (1015 KB, 1024x768)

1015 KB PNG

>>107335534
1998 -> 2025

Anonymous
11/26/25(Wed)12:41:30 No.107335550

Anonymous 11/26/25(Wed)12:41:30 No.107335550

modelscope won't let me login (not receiving the verification code), does anyone want to run this prompt?

>A drawing of hatsune miku with dreadlocks and light black skin skateboarding in New York at night. She is holding a smartphone on her left hand and a multicolored ball on her right hand, she has a red t-shirt with text on it that says: "MIGU". A pikachu can be seen on the top of her head. Her speech bubble says "Hard to keep me in style huh?", neons, 50's comic book style

Anonymous
11/26/25(Wed)12:42:12 No.107335556

Anonymous 11/26/25(Wed)12:42:12 No.107335556

>>107335538
>will ultimately have more detail
*compared to what those non-Z models would have if they were to gen something directly at 1MP, i mean

Anonymous
11/26/25(Wed)12:42:33 No.107335560

Anonymous 11/26/25(Wed)12:42:33 No.107335560

i'm retarded and don't know how to install flux

Anonymous
11/26/25(Wed)12:43:04 No.107335565

Anonymous 11/26/25(Wed)12:43:04 No.107335565

>>107335560
https://justgetflux.com/

Anonymous
11/26/25(Wed)12:43:14 No.107335566

Anonymous 11/26/25(Wed)12:43:14 No.107335566

>>107335560
just look at a youtube tutorial

Anonymous
11/26/25(Wed)12:43:37 No.107335568

Anonymous 11/26/25(Wed)12:43:37 No.107335568

>>107335534
>>107335549
Neato

Anonymous
11/26/25(Wed)12:44:11 No.107335574

Anonymous 11/26/25(Wed)12:44:11 No.107335574

https://catbox.to/3KkRlTrKEsIKJGf/preview

Anonymous
11/26/25(Wed)12:45:01 No.107335584

Anonymous 11/26/25(Wed)12:45:01 No.107335584

>>107335319
>>107335329
My bad. It missed the nationality, but got the skin color
>Right Panel: A candid, slightly blurry outdoor shot of a man with dark hair and a beard. He is wearing a blue long-sleeved shirt and is squatting on the ground in a wooded area with dry leaves and dirt. His pants are pulled down around his ankles, and he is looking back over his shoulder at the camera with a startled or distressed expression.

Anonymous
11/26/25(Wed)12:45:08 No.107335585

Anonymous 11/26/25(Wed)12:45:08 No.107335585

>>107335031
Flux 2 can never get realistic lighting and depth right. It always looks flat.

Anonymous
11/26/25(Wed)12:45:52 No.107335594

Anonymous 11/26/25(Wed)12:45:52 No.107335594

File: 1a4dc4ad59d6.png (1.66 MB, 2048x1536)

1.66 MB PNG

Hmm, neither is perfect, but considering the size difference Z-Image is pretty impressive.

Anonymous
11/26/25(Wed)12:46:27 No.107335596

Anonymous 11/26/25(Wed)12:46:27 No.107335596

>>107335539
Oh?
I just lazily read debug tensor shapes and drew conclusions about model structure from that, thanks for the correction.
Then it sounds like 128 channels is indeed overkill with limited returns.

Anonymous
11/26/25(Wed)12:46:41 No.107335600

Anonymous 11/26/25(Wed)12:46:41 No.107335600

uh oh Z-Image is pretty ghibli slopped

Anonymous
11/26/25(Wed)12:46:46 No.107335602

Anonymous 11/26/25(Wed)12:46:46 No.107335602

>>107335538
If you have a natively 12MP photo and a 2MP photo, and you downscale the 12MP photo to 2MP, you won't notice a difference in quality as long as you didn't do something stupid like nearest neighbor rounding. You won't get a supersampling benefit like with raster CG graphics. It won't be distinguishable that the formerly 12MP image was originally much more detailed.

Anonymous
11/26/25(Wed)12:47:26 No.107335606

Anonymous 11/26/25(Wed)12:47:26 No.107335606

>>107335594
even flux 2 pro looks less realistic than that 6b model, damn that's brutal, those chinks caught lightning in a bottle

Anonymous
11/26/25(Wed)12:48:17 No.107335613

Anonymous 11/26/25(Wed)12:48:17 No.107335613

File: Ellen Page.png (235 KB, 541x543)

235 KB PNG

>>107335566
>indian accent

Anonymous
11/26/25(Wed)12:48:28 No.107335616

Anonymous 11/26/25(Wed)12:48:28 No.107335616

>>107335600
may I see it?

Anonymous
11/26/25(Wed)12:50:09 No.107335630

Anonymous 11/26/25(Wed)12:50:09 No.107335630

It seems Z-Image is by people who were poached from Lumina 2.0

Anonymous
11/26/25(Wed)12:50:13 No.107335631

Anonymous 11/26/25(Wed)12:50:13 No.107335631

I got banned for posting hitler on a pony, are you fucking kidding me?

Anonymous
11/26/25(Wed)12:50:54 No.107335635

Anonymous 11/26/25(Wed)12:50:54 No.107335635

>>107335631
that's why you go local my nigga

Anonymous
11/26/25(Wed)12:52:20 No.107335645

Anonymous 11/26/25(Wed)12:52:20 No.107335645

https://xcancel.com/bdsqlsz/status/1993733868181246163#m
>wait it release tomorrow.
aww... :(

Anonymous
11/26/25(Wed)12:52:28 No.107335649

Anonymous 11/26/25(Wed)12:52:28 No.107335649

>>107335631
Previous threads would get insta nuked if one out of the two dozen gens in the collage featured a pony. They look for any reason to report because they hate the fact that this thread exists.

Anonymous
11/26/25(Wed)12:53:52 No.107335663

Anonymous 11/26/25(Wed)12:53:52 No.107335663

>>107335645
For those who can't test it out on the chink site you have fal now
https://fal.ai/models/fal-ai/z-image/turbo

Anonymous
11/26/25(Wed)12:54:19 No.107335666

Anonymous 11/26/25(Wed)12:54:19 No.107335666

do normalfags still believe that all ai has a piss filter and looks like ghibli or have they learned yet

Anonymous
11/26/25(Wed)12:55:24 No.107335680

Anonymous 11/26/25(Wed)12:55:24 No.107335680

File: nbp-dank.jpg (760 KB, 2000x1493)

760 KB JPG

Anonymous
11/26/25(Wed)12:55:35 No.107335685

Anonymous 11/26/25(Wed)12:55:35 No.107335685

>>107335631
Maybe post that edgy fetish stuff on the appropriate boards. That you are already ban evading to whine also tells it own story.

Anonymous
11/26/25(Wed)12:55:39 No.107335686

Anonymous 11/26/25(Wed)12:55:39 No.107335686

>>107335631
some anon must've reported you thats the only way to get banned here

Anonymous
11/26/25(Wed)12:56:56 No.107335697

Anonymous 11/26/25(Wed)12:56:56 No.107335697

>>107335686
>some anon must've reported you
it's this anon -> >>107335685

Anonymous
11/26/25(Wed)12:57:01 No.107335698

Anonymous 11/26/25(Wed)12:57:01 No.107335698

>>107335602
the difference is all ai models currently have a problem when they are generating at a set resolution, they are generating everything to look mostly right unless you get too close, for example skin details and small fingers on a large full body image, they can look good when you're looking at everything, but when you look closer with your eyes to the screen you will notice small things that dont make sense physically, things that would have been fixed if the model was generating in large resolution right away and then downscaled its output

in other words, its different because models when generating are basically searching for a solution in a limited pixel space, the bigger the pixel space, the more refined the solution they can find for everything (as long as they are trained to support those resolutions)

this isnt the case for real photos because real photos are not diffusing a solution from nothing to something they vaguely "remember", and thus creating a lot of small mistakes, instead real cameras have direct access to and are just compressing what is already "perfect", the real world, where all the physics have to be right from the get go

Anonymous
11/26/25(Wed)12:58:39 No.107335712

Anonymous 11/26/25(Wed)12:58:39 No.107335712

>>107335685
imagine being this much of a faggot

Anonymous
11/26/25(Wed)12:58:42 No.107335714

Anonymous 11/26/25(Wed)12:58:42 No.107335714

File: 1756118946093983.png (1018 KB, 1024x768)

1018 KB PNG

>>107335663
Thanks.
>A drawing of hatsune miku with dreadlocks and light black skin skateboarding in New York at night. She is holding a smartphone on her left hand and a multicolored ball on her right hand, she has a red t-shirt with text on it that says: "MIGU". A pikachu can be seen on the top of her head. Her speech bubble says "Hard to keep me in style huh?", neons, 50's comic book style

Anonymous
11/26/25(Wed)12:58:54 No.107335717

Anonymous 11/26/25(Wed)12:58:54 No.107335717

>>107335697
Why would that be the case?

Anonymous
11/26/25(Wed)12:59:30 No.107335721

Anonymous 11/26/25(Wed)12:59:30 No.107335721

>>107335714
now translate the prompt to chinese

Anonymous
11/26/25(Wed)12:59:37 No.107335722

Anonymous 11/26/25(Wed)12:59:37 No.107335722

>>107335680
This isn't a gen.

Anonymous
11/26/25(Wed)12:59:38 No.107335723

Anonymous 11/26/25(Wed)12:59:38 No.107335723

>>107335714
bruh where are the dreadlocks? :(

Anonymous
11/26/25(Wed)13:00:38 No.107335738

Anonymous 11/26/25(Wed)13:00:38 No.107335738

>>107335722
it is

Anonymous
11/26/25(Wed)13:00:54 No.107335740

Anonymous 11/26/25(Wed)13:00:54 No.107335740

>>107335712
>hurr durr i got banned and directly ban evaded
Bans have become the new slap on the wrist, actually it happened years ago, but here you whine around like you could finally post again after a 30 day ban. Fucking crybabies.

Anonymous
11/26/25(Wed)13:01:53 No.107335748

Anonymous 11/26/25(Wed)13:01:53 No.107335748

>>107335740
>Fucking crybabies.
says the faggot crying when he sees an edgy image btw

Anonymous
11/26/25(Wed)13:02:48 No.107335755

Anonymous 11/26/25(Wed)13:02:48 No.107335755

>>107335748
There is a time and a place, newfag.

Anonymous
11/26/25(Wed)13:02:57 No.107335756

Anonymous 11/26/25(Wed)13:02:57 No.107335756

File: 1755533280886903.png (1.22 MB, 1024x768)

1.22 MB PNG

>>107335663
>https://fal.ai/models/fal-ai/z-image/turbo
>A young white woman skateboarding in Tokyo, she is holding a Kasane Teto plush
noo it doesn't know teto!! (this shit is so fast though I got that image in less than a second)

Anonymous
11/26/25(Wed)13:03:06 No.107335757

Anonymous 11/26/25(Wed)13:03:06 No.107335757

so tomorrow is when 1asiangirlGODS win...

Anonymous
11/26/25(Wed)13:03:19 No.107335759

Anonymous 11/26/25(Wed)13:03:19 No.107335759

File: flux2_bf16_c_00049_.jpg (452 KB, 1600x1600)

452 KB JPG

Pleasantly surprised with basic flux 2 1girl results considering all the hate

Anonymous
11/26/25(Wed)13:05:06 No.107335779

Anonymous 11/26/25(Wed)13:05:06 No.107335779

>>107335759
its basically better qwen overall with no low seed variance issue, its just that it wont see much community support nor loras given the size

Anonymous
11/26/25(Wed)13:05:09 No.107335780

Anonymous 11/26/25(Wed)13:05:09 No.107335780

>>107335738
Prove it.

Anonymous
11/26/25(Wed)13:05:29 No.107335785

Anonymous 11/26/25(Wed)13:05:29 No.107335785

File: nbp-this.jpg (1.35 MB, 2000x1493)

1.35 MB JPG

>>107335722
checked

Anonymous
11/26/25(Wed)13:07:08 No.107335794

Anonymous 11/26/25(Wed)13:07:08 No.107335794

File: Z-image turbo (fal).png (1.57 MB, 1024x1024)

1.57 MB PNG

>>107335756

Anonymous
11/26/25(Wed)13:07:21 No.107335796

Anonymous 11/26/25(Wed)13:07:21 No.107335796

File: Flux 2 Q8 20 steps.jpg (290 KB, 1920x1088)

290 KB JPG

>>107335714

Anonymous
11/26/25(Wed)13:07:48 No.107335802

Anonymous 11/26/25(Wed)13:07:48 No.107335802

>>107335785
This also isn't a gen.

Anonymous
11/26/25(Wed)13:09:19 No.107335817

Anonymous 11/26/25(Wed)13:09:19 No.107335817

>>107335802
don't make me pull out the reddit
https://www.reddit.com/r/ChatGPT/comments/1p6lcj6/comment/nqx1l03/?context=1

Anonymous
11/26/25(Wed)13:11:23 No.107335829

Anonymous 11/26/25(Wed)13:11:23 No.107335829

>>107335817
Why are you posting cloud in the local thread? Are you dumb?

Anonymous
11/26/25(Wed)13:12:25 No.107335843

Anonymous 11/26/25(Wed)13:12:25 No.107335843

>>107335631
kek i also got banned once and muted the next time for posting an hatsune miku pony in lmg thats just the state of things when eunuchs are in court

Anonymous
11/26/25(Wed)13:12:45 No.107335847

Anonymous 11/26/25(Wed)13:12:45 No.107335847

>>107335829
No, just lonely :(

Anonymous
11/26/25(Wed)13:12:49 No.107335848

Anonymous 11/26/25(Wed)13:12:49 No.107335848

File: 1742181563983017.png (1.78 MB, 1024x1024)

1.78 MB PNG

>>107335794

Anonymous
11/26/25(Wed)13:13:19 No.107335851

Anonymous 11/26/25(Wed)13:13:19 No.107335851

File: nbp-ftank.jpg (1.22 MB, 1600x1195)

1.22 MB JPG

>>107335829
I'm just here to T you off (again) ;^)

Anonymous
11/26/25(Wed)13:16:36 No.107335881

Anonymous 11/26/25(Wed)13:16:36 No.107335881

File: 1747284844557338.png (981 KB, 1024x768)

981 KB PNG

>>107335756
an manga of Hatsune Miku with a speech bubble saying "no troons allowed!"

Anonymous
11/26/25(Wed)13:17:27 No.107335893

Anonymous 11/26/25(Wed)13:17:27 No.107335893

File: Z-image turbo.png (2.03 MB, 832x1280)

2.03 MB PNG

Anonymous
11/26/25(Wed)13:18:34 No.107335901

Anonymous 11/26/25(Wed)13:18:34 No.107335901

I've been flapping to a lot of AI porn lately, I may as well make my own. Is the guide in the OP still good? And am I doomed with amd gpu (6700xt)?

Anonymous
11/26/25(Wed)13:19:46 No.107335909

Anonymous 11/26/25(Wed)13:19:46 No.107335909

>>107335847
wanna be friends?

Anonymous
11/26/25(Wed)13:20:39 No.107335926

Anonymous 11/26/25(Wed)13:20:39 No.107335926

>>107335901
>6700xt
I think that's sufficient for generating 1girl with SDXL.

Anonymous
11/26/25(Wed)13:24:46 No.107335945

Anonymous 11/26/25(Wed)13:24:46 No.107335945

https://xcancel.com/LumaLabsAI/status/1993735476264481126#m
>Introducing Terminal Velocity Matching: a scalable, single-stage generative training method that delivers diffusion-level quality with a 25× fewer inference steps, now trained at 10B+ scale.
damn

Anonymous
11/26/25(Wed)13:26:45 No.107335962

Anonymous 11/26/25(Wed)13:26:45 No.107335962

ALIBABY WON
BLACKED FOREST LOST

Anonymous
11/26/25(Wed)13:28:01 No.107335969

Anonymous 11/26/25(Wed)13:28:01 No.107335969

>>107335901
>12gb vram
not great not terrible

Anonymous
11/26/25(Wed)13:28:51 No.107335973

Anonymous 11/26/25(Wed)13:28:51 No.107335973

>>107335945
I will care when I see a decent model using it.
An entire graveyard full of breakthrough methods that promise 10x quality at 100x speed.

Anonymous
11/26/25(Wed)13:30:16 No.107335993

Anonymous 11/26/25(Wed)13:30:16 No.107335993

>>107335901
SDXL is as far as you can push with comfy speeds with that card without heavy quantization.
Could have run nunchaku flux if nvidia.
Like you can run larger and newer models of course but the speed will suck.

Anonymous
11/26/25(Wed)13:33:28 No.107336032

Anonymous 11/26/25(Wed)13:33:28 No.107336032

>>107335901
You could get away with some small wan gens. As for the guide, well.... its a mess and slightly dated but its still usable. Definitely go for gguf/native instead of kijai so, thats Bullerwins quants for your models https://huggingface.co/bullerwins/Wan2.2-I2V-A14B-GGUF/tree/main, https://github.com/pollockjj/ComfyUI-MultiGPU for your wan/model nodes. Would definitely save up for stronger gpu for the long term

Anonymous
11/26/25(Wed)13:34:09 No.107336038

Anonymous 11/26/25(Wed)13:34:09 No.107336038

Kek so Z image is basically Flux 2 Pro but on 6B? We really are getting BTFO'd by Chinaman.

Anonymous
11/26/25(Wed)13:36:04 No.107336058

Anonymous 11/26/25(Wed)13:36:04 No.107336058

>>107335901
static: get illustrious (wainsfw v15 is the latest I think)

moving: wan 2.2 + lightx2v + nsfw loras

also, qwen edit can make lewds of people or change their clothes.

Anonymous
11/26/25(Wed)13:37:05 No.107336074

Anonymous 11/26/25(Wed)13:37:05 No.107336074

File: Flux 2 Q8 20 steps, 2.png (2.2 MB, 1024x1024)

2.2 MB PNG

>>107335714
>>107335796

Anonymous
11/26/25(Wed)13:37:28 No.107336076

Anonymous 11/26/25(Wed)13:37:28 No.107336076

>>107336038
And they will release Z-Image Base which will be larger and better quality than Z-Image Turbo, but obviously not as large as Flux 2

BFL is so done

Anonymous
11/26/25(Wed)13:37:55 No.107336078

Anonymous 11/26/25(Wed)13:37:55 No.107336078

>>107336058
>wainsfw v15
why would you set anon up for slop like that

Anonymous
11/26/25(Wed)13:38:45 No.107336085

Anonymous 11/26/25(Wed)13:38:45 No.107336085

>>107334957
Chroma is already z image tier at photorealism, nice to know China has caught up in one of their base models though. But they love to pretend Western models don't exist

Anonymous
11/26/25(Wed)13:39:39 No.107336094

Anonymous 11/26/25(Wed)13:39:39 No.107336094

>>107336078
for 2d anime it's waiNSFW or hassaku, knows almost every character

Anonymous
11/26/25(Wed)13:40:41 No.107336102

Anonymous 11/26/25(Wed)13:40:41 No.107336102

>>107336076
>larger
it's still 6b, the only difference between base and turbo is that turbo has a step distillation method, but they're the same size

Anonymous
11/26/25(Wed)13:42:22 No.107336118

Anonymous 11/26/25(Wed)13:42:22 No.107336118

>>107336094
And yet they make the most soulless plastic sloppa imaginable.
Use Noob V-pred 1.0.

Anonymous
11/26/25(Wed)13:42:51 No.107336121

Anonymous 11/26/25(Wed)13:42:51 No.107336121

>>107336038
>>107336076
ok chang

Anonymous
11/26/25(Wed)13:45:01 No.107336133

Anonymous 11/26/25(Wed)13:45:01 No.107336133

File: 00054-1010848574.png (1.51 MB, 896x1152)

1.51 MB PNG

>>107334550
Illustrious still on top!

Anonymous
11/26/25(Wed)13:46:39 No.107336150

Anonymous 11/26/25(Wed)13:46:39 No.107336150

File: 1738114697147240.jpg (84 KB, 784x844)

84 KB JPG

But anon... how can you prefer FAKE ai bug women instead of strong white women you can meet in real life?

Anonymous
11/26/25(Wed)13:46:51 No.107336153

Anonymous 11/26/25(Wed)13:46:51 No.107336153

File: Z-image turbo.png (1.03 MB, 576x1280)

1.03 MB PNG

>1boy,JoJo's Bizarre Adventur
it's this simple lmao

Anonymous
11/26/25(Wed)13:48:20 No.107336168

Anonymous 11/26/25(Wed)13:48:20 No.107336168

File: Z-image turbo.png (915 KB, 576x1280)

915 KB PNG

bbw bros we are so back

Anonymous
11/26/25(Wed)13:48:36 No.107336171

Anonymous 11/26/25(Wed)13:48:36 No.107336171

>>107336150
wut

Anonymous
11/26/25(Wed)13:49:01 No.107336176

Anonymous 11/26/25(Wed)13:49:01 No.107336176

>>107336094
>2d
You mean gross 2.5d and not even the semi respectable "realism" kind

Anonymous
11/26/25(Wed)13:50:39 No.107336185

Anonymous 11/26/25(Wed)13:50:39 No.107336185

>>107336168
>big bug woman

Anonymous
11/26/25(Wed)13:50:58 No.107336187

Anonymous 11/26/25(Wed)13:50:58 No.107336187

>>107335594
Can it do soles?

Anonymous
11/26/25(Wed)13:51:57 No.107336195

Anonymous 11/26/25(Wed)13:51:57 No.107336195

>>107336187
try it you can make 5 free images before it starts asking you for money
https://fal.ai/models/fal-ai/z-image/turbo

Anonymous
11/26/25(Wed)13:55:47 No.107336228

Anonymous 11/26/25(Wed)13:55:47 No.107336228

>>107335714
>>107335723
it's never been this over

Anonymous
11/26/25(Wed)13:57:34 No.107336243

Anonymous 11/26/25(Wed)13:57:34 No.107336243

>>107336228
we'll see if the prompt adherence is better on the base model though

Anonymous
11/26/25(Wed)13:58:36 No.107336253

Anonymous 11/26/25(Wed)13:58:36 No.107336253

>>107335723
They have a seperate reasoning model, don't think that anon is using it.

Anonymous
11/26/25(Wed)14:00:01 No.107336266

Anonymous 11/26/25(Wed)14:00:01 No.107336266

>>107335031
>32b looks worse than 6b
so this is the power of safety...

Anonymous
11/26/25(Wed)14:01:13 No.107336276

Anonymous 11/26/25(Wed)14:01:13 No.107336276

>>107336266
Just distillation. Always has been an issue.

Anonymous
11/26/25(Wed)14:02:31 No.107336285

Anonymous 11/26/25(Wed)14:02:31 No.107336285

>>107336276
>Just distillation.
Z-image turbo is distilled too lol

Anonymous
11/26/25(Wed)14:06:58 No.107336320

Anonymous 11/26/25(Wed)14:06:58 No.107336320

>>107336085
True, but it is slow, if you can get the same quality but MUCH faster then it's pure win

That said Chroma is THE best model for realistic NSFW, time will tell if Z-Image can be enhanced with lora/finetune to be as good, it certainly won't be out-of-the-box

Anonymous
11/26/25(Wed)14:07:53 No.107336326

Anonymous 11/26/25(Wed)14:07:53 No.107336326

>>107336085
>Chroma is already z image tier at photorealism
no it's not, chroma has bad details compared to z-image

Anonymous
11/26/25(Wed)14:11:00 No.107336354

Anonymous 11/26/25(Wed)14:11:00 No.107336354

>>107336102
Really ? I thought the Z-Image base would be larger, not just distilled.

Chroma trains relatively fast (much faster than Flux and Qwen), but if Z-Image Base (which is the one you will train on) is just 6b, that should theoretically bring training time down by half compared to Chroma.

Chinks be based!

Anonymous
11/26/25(Wed)14:11:34 No.107336362

Anonymous 11/26/25(Wed)14:11:34 No.107336362

>>107336320
where are these mythical chroma nsfw gens?

Anonymous
11/26/25(Wed)14:12:40 No.107336373

Anonymous 11/26/25(Wed)14:12:40 No.107336373

>>107336150
nose ring theory is pretty confirmed at this point...

Anonymous
11/26/25(Wed)14:13:36 No.107336380

Anonymous 11/26/25(Wed)14:13:36 No.107336380

>>107336085
Chroma is a deformation fest. I haven't tried Z image but if it can hold better coherence than Chroma while being faster, it has more potential.

Main factor is if training NSFW into Z image is more feasible than fixing Chroma.

Anonymous
11/26/25(Wed)14:14:25 No.107336390

Anonymous 11/26/25(Wed)14:14:25 No.107336390

>>107336326
It may also have worse prompt comprehension. We'll see.

Of course Chroma will also have better NSFW training. There's no way they put an equal amount of lewd into a Chinese model.

Anonymous
11/26/25(Wed)14:15:38 No.107336398

Anonymous 11/26/25(Wed)14:15:38 No.107336398

>>107335167
With what? In comparison to what? Wtf did this mean lmao?

Anonymous
11/26/25(Wed)14:17:34 No.107336414

Anonymous 11/26/25(Wed)14:17:34 No.107336414

>>107336373
it's funny because it makes them look like cattle.

Anonymous
11/26/25(Wed)14:19:28 No.107336426

Anonymous 11/26/25(Wed)14:19:28 No.107336426

>>107336362
Have you been living under a rock

Anonymous
11/26/25(Wed)14:19:45 No.107336432

Anonymous 11/26/25(Wed)14:19:45 No.107336432

>>107336228
>>107336243
>>107336253
We're so back

Anonymous
11/26/25(Wed)14:22:22 No.107336454

Anonymous 11/26/25(Wed)14:22:22 No.107336454

I fucked around with it a bit over FAL. First impressions:
Not super great but knows nipples.
Doesn't know genitals too well.
Can't conclude this with a high degree of confidence, but doesn't seem too poisoned? Will draw people in sex like positions when asked, just deformed genital-like blob between them. I am going to guess that this can probably be finetuned like how we beat genitals into SDXL. Unless the model is somehow completely unresponsive to training or requires flux levels of surgery.
Doesn't know too many celebs.
Of the celebs it knows, it doesn't mind adding boobs to them.
That's all for know, gonna try more artistic stuff, text, copyrighted characters
Oh and sadly it doesn't seem to respond too well to short prompts, it seems some word salad is needed. (Though I need more tests to conclude this confidently)

Anonymous
11/26/25(Wed)14:23:35 No.107336468

Anonymous 11/26/25(Wed)14:23:35 No.107336468

>>107336454
show some images nigga, I don't care about your wall of text I want to see the images with my own eyes

Anonymous
11/26/25(Wed)14:24:56 No.107336481

Anonymous 11/26/25(Wed)14:24:56 No.107336481

>>107336468
All the images included ponies so I can't post them.

Anonymous
11/26/25(Wed)14:25:44 No.107336490

Anonymous 11/26/25(Wed)14:25:44 No.107336490

>>107336426
He's just trying to be a contrarian-fag

Anonymous
11/26/25(Wed)14:26:10 No.107336500

Anonymous 11/26/25(Wed)14:26:10 No.107336500

such quick threads and not due to trolling or bot posting
nice

Anonymous
11/26/25(Wed)14:26:17 No.107336501

Anonymous 11/26/25(Wed)14:26:17 No.107336501

File: Z-image turbo.png (1.2 MB, 864x1152)

1.2 MB PNG

Anonymous
11/26/25(Wed)14:26:51 No.107336509

Anonymous 11/26/25(Wed)14:26:51 No.107336509

>>107336500
half of these posts have to be trolls, i refuse to accept anon is this retarded

Anonymous
11/26/25(Wed)14:27:35 No.107336518

Anonymous 11/26/25(Wed)14:27:35 No.107336518

File: radiance.png (2.23 MB, 832x1488)

2.23 MB PNG

the local z-image model still isn't relased yet, right? all the posts so far are from SaaS?

Anonymous
11/26/25(Wed)14:27:55 No.107336520

Anonymous 11/26/25(Wed)14:27:55 No.107336520

>>107336518
yea it releases locally tomorrow

Anonymous
11/26/25(Wed)14:28:15 No.107336525

Anonymous 11/26/25(Wed)14:28:15 No.107336525

>>107336468
Suck my balls dipshit but you can get one.
ibb DOT co SLASH V0wPTmc5

Anonymous
11/26/25(Wed)14:28:46 No.107336529

Anonymous 11/26/25(Wed)14:28:46 No.107336529

>>107336525
>dipshit
debo's favorite word

Anonymous
11/26/25(Wed)14:29:22 No.107336537

Anonymous 11/26/25(Wed)14:29:22 No.107336537

>>107336525
Hopefully the model isnt garbage to finetune, certainly has potential

Anonymous
11/26/25(Wed)14:29:28 No.107336538

Anonymous 11/26/25(Wed)14:29:28 No.107336538

File: SPARK.Chroma_preview.safe(...).png (2.38 MB, 1120x1440)

2.38 MB PNG

>>107336153
bruh even most dedicated anime models can't do jojo's style out of the box wtf. a booru finetune of z image would be insane

>>107336276
>>107336285
z-image makes small optimizations to model architecture and is already BTFOing the bloatmaxxed, benchmaxxed models. this shows that if model bakers would actually use some more of the fucking optimization papers that get published, we'll be able to improve the size/quality ratio even more.

it's almost over for bloatmaxxers, apicucks, and jews

Anonymous
11/26/25(Wed)14:29:46 No.107336542

Anonymous 11/26/25(Wed)14:29:46 No.107336542

>>107336518
oof, not gonna lie but it hurts my eyes to see the mess of chroma radiance after seeing so many Z-image kino today

Anonymous
11/26/25(Wed)14:30:55 No.107336560

Anonymous 11/26/25(Wed)14:30:55 No.107336560

>>107336153
damn impressive
did i waste time learning prompt-fu? wtf

Anonymous
11/26/25(Wed)14:31:53 No.107336568

Anonymous 11/26/25(Wed)14:31:53 No.107336568

>>107336538
>it's almost over for bloatmaxxers
based

Anonymous
11/26/25(Wed)14:32:24 No.107336573

Anonymous 11/26/25(Wed)14:32:24 No.107336573

>>107336518
>all the posts so far are from SaaS?
Characterizing them as SaaS is incorrect. It's normal for anon to showcase gens from new local models via demos hosted on the cloud. It's actually retarded for a lab to NOT host a demo to showcase their new shit.

Anonymous
11/26/25(Wed)14:32:29 No.107336575

Anonymous 11/26/25(Wed)14:32:29 No.107336575

>it'll come out within the hour!
>well, sometime today. get ready!
>actually, it'll be released tomorrow but for real this time!

Anonymous
11/26/25(Wed)14:33:25 No.107336583

Anonymous 11/26/25(Wed)14:33:25 No.107336583

>>107336575
Do you think Comfy would waste his energy implementing the inference code if he knew the model wouldn't be released? lol

Anonymous
11/26/25(Wed)14:34:26 No.107336592

Anonymous 11/26/25(Wed)14:34:26 No.107336592

>>107336575
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
AHAHAHAH, ITS HERE LMAOOOOOOOOO

Anonymous
11/26/25(Wed)14:34:50 No.107336595

Anonymous 11/26/25(Wed)14:34:50 No.107336595

File: its up.png (146 KB, 1366x768)

146 KB PNG

>>107336575
>>107336518
IT'S UP

https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

Anonymous
11/26/25(Wed)14:35:26 No.107336601

Anonymous 11/26/25(Wed)14:35:26 No.107336601

File: 2270556597.png (1.09 MB, 1216x832)

1.09 MB PNG

Anonymous
11/26/25(Wed)14:36:15 No.107336610

Anonymous 11/26/25(Wed)14:36:15 No.107336610

>>107336595
>U. G. Krishnamurti
hello sar

Anonymous
11/26/25(Wed)14:36:20 No.107336612

Anonymous 11/26/25(Wed)14:36:20 No.107336612

>>107336592
>>107336595
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo/tree/main/transformer
how to you fusion multiple safetensors models though?

Anonymous
11/26/25(Wed)14:36:45 No.107336617

Anonymous 11/26/25(Wed)14:36:45 No.107336617

>>107336610
please do the needful saar

Anonymous
11/26/25(Wed)14:37:09 No.107336619

Anonymous 11/26/25(Wed)14:37:09 No.107336619

>>107336612
WHY do the model bakers do this?? is it so fucking hard for them to just publish a single fucking file???

Anonymous
11/26/25(Wed)14:37:55 No.107336623

Anonymous 11/26/25(Wed)14:37:55 No.107336623

>>107336612
>how to you fusion multiple safetensors models though?
and this shit is on f32... bruh :(

Anonymous
11/26/25(Wed)14:38:24 No.107336629

Anonymous 11/26/25(Wed)14:38:24 No.107336629

Fresh when ready

>>107336625
>>107336625
>>107336625

Fresh when ready

Anonymous
11/26/25(Wed)14:40:54 No.107336655

Anonymous 11/26/25(Wed)14:40:54 No.107336655

File: file.png (2.92 MB, 832x1488)

2.92 MB PNG

>>107336520
ty. it does look interesting enough.

>>107336542
have there even been any comparable z-image images? to me it looked like the images posted were all realistic, not anime-comic 2d-3d 1girl

>>107336595 >>107336592
Great timing.

Anonymous
11/26/25(Wed)14:42:38 No.107336672

Anonymous 11/26/25(Wed)14:42:38 No.107336672

>>107336655
>to me it looked like the images posted were all realistic, not anime-comic 2d-3d 1girl
look the previous thread there's some anime images of z-image

Anonymous
11/26/25(Wed)14:43:47 No.107336681

Anonymous 11/26/25(Wed)14:43:47 No.107336681

>>107336655
wtf is that tiling

Anonymous
11/26/25(Wed)15:08:19 No.107336980

Anonymous 11/26/25(Wed)15:08:19 No.107336980

Question, if I may:

Why are y'all posting all this lame shit here, while the technology is way better these days? Look at what this dude posted (mildly NSFW): >>107336734

Anonymous
11/26/25(Wed)15:08:47 No.107336990

Anonymous 11/26/25(Wed)15:08:47 No.107336990

>>107336681
retard is using some shit lora or bad lightning lora x qwen combo

Anonymous
11/26/25(Wed)15:18:41 No.107337117

Anonymous 11/26/25(Wed)15:18:41 No.107337117

>>107336980
This thread is for GPU-poor weabus.

Anonymous
11/26/25(Wed)15:19:23 No.107337128

Anonymous 11/26/25(Wed)15:19:23 No.107337128

time

Anonymous
11/26/25(Wed)15:20:24 No.107337148

Anonymous 11/26/25(Wed)15:20:24 No.107337148

to

Anonymous
11/26/25(Wed)15:21:24 No.107337161

Anonymous 11/26/25(Wed)15:21:24 No.107337161

go

Anonymous
11/26/25(Wed)15:23:53 No.107337201

Anonymous 11/26/25(Wed)15:23:53 No.107337201

What's the new anime meta?

Anonymous
11/26/25(Wed)16:43:29 No.107338249

Anonymous 11/26/25(Wed)16:43:29 No.107338249

File: cock.jpg (55 KB, 1019x662)

55 KB JPG

>>107336525
y

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.