/g/ - Technology


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107529397

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>all these 1girls
WE WON!
>>
Wretched thread of mental illness
>>
Blessed thread of 1girl posting
>>
>>107536415
wow, this thread desperately wants to be /adt/ but sucks at making kino
>>
https://github.com/Tongyi-MAI/Z-Image?tab=readme-ov-file#-community-works
>SGLang-Diffusion brings SGLang's state-of-the-art performance to accelerate image and video generation for diffusion models, now supporting Z-Image.
more snake oil, or is it actually useful?
>>
>>107536436
>page 7
why don't you go there?
>>
>>107536443
sglang is just an engine, it's pretty popular with LLMs (for enterprises, like VLLM)
>>
>>107536443
Chinese culture. Anything but the base model.
>>
>>107536436
>kino
Is that a codeword for pedophilic images?
>>
Arousing thread of 1girl gooning
>>
>>
>>107536443
a z image gen takes like 10 seconds on modern cards. how fast do you need it to be?
>>
>>107536436
>oh no! the niche and hyper-specific thread of 3 troons feels threatened when someone posts anime in god's chosen general
Are you that insecure?
>>
>>107536469
we won't be using turbo forever; lots of steps + CFG (2x slower) will come back with base
>>
>>107536471
you say this but this thread was baked by a dramatic troon
>>
>>
File: z-image omni base.png (53 KB, 369x387)
Possibly posted here before, I don't check every thread but they seem to have updated their website
https://tongyi-mai.github.io/Z-Image-blog/
Unified t2i and i2i? Wtf does this mean?
I am inclined to believe that they aren't doing a complete rug pull and will release something, but I have no idea what that is shaping up to be.
>>
>>107536436
go back there, why are you here seething?
>>
>>107536498
I like how the team didn't do free advertisement for cumfart
>>
>>107536498
if I understand correctly, base will be able to do edits, while Z-Image-Edit will be a finetune that's really good at that?
>>
>>
File: WHEN???.png (94 KB, 224x224)
>>107536498
>the base model will be able to do edit as well
you guys have no idea how powerful this shit will be: an unslopped model that can make realistic shit and edit, Apache 2.0, small. This is literally the dream model, those fuckers brought the fire on me, WHEN RELEASE???
>>
>>107536498
So, if you finetune such a model, do you have to train it on both plain images and edit-pair examples?
>>
>>107536519
I am thinking it might mean something like that, but it doesn't make sense to beat that into the base model. Edit models have a different training loop, so why burn money doing that, degrading the finetuning and non-edit capabilities of the model, if you are already going to release a dedicated edit model?
>>
>>107536511

>107536471 (You)
>107536436
>Samefag btw

Too tired to take a screencap, and then you'd just claim I used inspect element in the console
>>
>>107536530
Hehe. You're about to be a victim of Chinese culture.
>>
>>107536498
that's the first time they've changed the readme to give some news about base; it's a big sign it'll be released soon
>>
>>107536537
If that's what they are implying then yes.
>>
>>107536538
maybe they found a way to not kill the edit capabilities if you only finetune on imagegen, I know that's naive wishful thinking but I like that approach
>>
>>
File: z-image_00156_.png (2.34 MB, 1408x1408)
Is it really worth it /g/?
>>
>>107536555
Is it powered by her piss or shit?
>>
>>107536562
yes
>>
File: Wanimate_00145.mp4 (903 KB, 544x960)
>>
>>107536391
>>107536426
>>107536477
>>107536511
actual samefag btw
>>
>>107536537
>>107536545
I guess it would be possible to finetune on just images, if you don't mind frying the edit capabilities away.
We have no idea what it actually is so it's just a guessing game at this point.
>>
>>
>>107536449
>Anything but the base model.
not the right time to say that kek >>107536498
>>
>>107536498
that's the first time I've heard that base is actually able to do edits; I thought you put the edit capability in a model through finetuning, not during pretraining
>>
>>107536582
I am quietly confident the model will not be coming. You can kekkaroo increasingly nervously as the weeks go by, but I just understand the culture better than you.
>>
>he spams reports on different ips
reminds me of that randall (the Jewish snitch) from recess
>>
File: comfydogfart.png (56 KB, 943x433)
He could've just posted "the inpaint part is currently missing and will be implemented later"

but nooo he had to add his petty opinions too
>>
>>
File: 31313.png (489 KB, 1587x888)
first time trying 3d. it made an object good enough for 3d printing on the first try, from only one reference image. very impressive
>>
File: 1761923227031455.png (74 KB, 279x181)
>>107536594
>You can kekkaroo increasingly nervously as the weeks go by
you're the one being nervous, a lot of signs point to an actual release, we're so back
>>
File: image.jpg (60 KB, 630x630)
>>107536622
Mind sharing workflow?
>>
is cumfartorg and this thread some kind of mental asylum that uses shock therapy to make you gay?
>>
>>107536622
>>107536633
desu, I just use the script from the GitHub. cumfart is too annoying when comfy breaks everything. workflows suck
>>
File: Doom guy.png (319 KB, 445x404)
>>107536498
>A foundation model designed for easy fine-tuning
That's Chinese-culture speak for "you'll only be able to finetune through the API", trust the doom.
>>
>>107536612
He's probably pissed at having to implement any new models. Just doing it to keep up appearances. Normal ZIT is fucked up too, genning times are all over the place and require a restart after a while.
Probably wishes they would all go API, much easier to handle.
>>
>>107536656
>He's probably pissed at having to implement any new models.
instead of hiring UI jeets who are obviously doing nothing but pretending to work by removing stop buttons, he should train them to implement new models
>>
>>107536498
Actually it seems they put this up 10 hours ago but no one noticed kek
https://github.com/Tongyi-MAI/Z-Image-blog/commit/e67bafb673fa19d301f903ac62de26c48b4cc1c4
If you scroll down, they added hints about the difference between the base model and the dedicated edit one? (It has better prompt adherence and it is more creative?)
>>
>>
File: 1755630714880557.png (67 KB, 997x865)
>>107536498
it's coming; they hadn't touched the blog for 2 weeks, the model is probably finished
>>
>>107536656
he just can't understand new models at all anymore. everything recently has been an improper implementation and actively ooms on dumb shit
>>
>>107536612
Slower = increased cloud cost = less profit
They need to do the needful and retrain it immediately
>>
I feel so conflicted. I want to believe base will be released. It's like being interested in a girl and getting mixed signals.
>>
>>107536436
>post some 1girl, anime in highlights
>entire general is now meaningless
>>
>>107536698
Embrace the understanding of Chinese culture.
>>
File: 1758643090553139.png (2.14 MB, 3202x1422)
>>107536666
>If you scroll down, they added hints about the difference between the base model and the dedicated edit one? (It has better prompt adherence and it is more creative?)
you can see it on the blog yeah
>>
>>107536703
Can you define this for me? I see people saying it all the time but I don't know what they mean.
>>
File: ComfyUI_00001_.png (1.15 MB, 1024x1024)
>>107536498
>>107536666
Anyway here is the 1girl of celebration.
Doomers on their last supply of copium.
>>
>>107536699
some lost /adt/ posts are the most prevalent images made itt
>>
>>107536710
it's not that deep, he's saying that chinese people are snakes and that "chinese culture" is actually the default way of acting for them (lying, cheating and so on)
>>
File: 4444444444.png (97 KB, 293x416)
>>107536633
I just used this template and dragged in the png. just wanted to try it and it worked much better than expected
>>
>>107536720
I see. Thank you for the explanation.
>>
>>107536721
does it do texture extraction? would be ultrakino
>>
>>107536720
Basically, but they will string you along as long as necessary to achieve their own ends at the lowest personal cost to themselves. This is why you need to leave no wiggle room for cheating when doing business. If they can go back on a condition of a deal and get away with it, they will.
The concept of good-faith business is a joke to them.
>>
File: Can I hope?.png (228 KB, 640x377)
>>107536498
>>107536666
>>107536707
I want to believe boys...
>>
You can test with control_refiner_layers for the noise_refiner hints, and it's only marginally faster; it's generally slower because of the increased control-embedder dimensions and the hints being added to more layers, and it also requires more steps. They probably ran an initial experiment with control_refiner_layers for noise_refiner, found it doesn't work as well as just using control_layers twice, then forgot to remove the code, so the untrained weights are in the released checkpoint.
Also, you have to concatenate zeros to fill the expected dimensions to make t2i work anyway, so not implementing inpaint doesn't even make sense, considering it's just concatenating the init image and mask instead of zeros.
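The zero-padding described above can be sketched generically. This is only an illustration: the channel counts and layout here are hypothetical, not the real Z-Image checkpoint's.

```python
import numpy as np

# Hypothetical channel counts for illustration only; the real
# model's latent/conditioning widths may differ.
LATENT_C = 16
COND_C = LATENT_C + 1  # init-image latent channels + 1 mask channel

def pack_model_input(latent, init_latent=None, mask=None):
    """Build the channel-concatenated input the post describes:
    zeros fill the conditioning slots for plain t2i, while inpaint
    concatenates the init-image latent and the mask instead."""
    b, _, h, w = latent.shape
    if init_latent is None:
        cond = np.zeros((b, COND_C, h, w), dtype=latent.dtype)  # t2i: pad with zeros
    else:
        cond = np.concatenate([init_latent, mask], axis=1)      # inpaint: image + mask
    return np.concatenate([latent, cond], axis=1)
```

Either way the model always sees latent + conditioning channels of the same total width, which is why skipping the inpaint branch saves essentially nothing.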
>>
File: ComfyUI_00003_.png (1005 KB, 1024x1024)
>>107536710
>>
>>107536498
Do you think they delayed the release because they wanted to make the base model unified? It's the first time they ever said base would be something like that
>>
>>107536718
Hope your next post is proof of what you're saying. Also, why are you shilling your general here when we don't set foot in yours?
>>
File: 1753105486026549.mp4 (1.45 MB, 720x1080)
>>107536744
>>
>>107536756
no I am just ashamed of the quality around here. allowing slopstyle is a farce
>>
>>107536588
did you forget that flux2 can do that? I don't blame you if you did, desu
>>
File: Wanimate_00146.mp4 (1.3 MB, 528x832)
>>
>>107536759
>instabitch makes a prediction
kek, now there's a 100% chance it'll be released
>>107536767
flux 2 isn't a base model though, it's a finetuned model, like Qwen Image Edit
>>
>>107536721
Thanks, will try some anime girl with white background
>>
File: 1750052244981548.png (86 KB, 1781x366)
>>107536666
>https://github.com/Tongyi-MAI/Z-Image-blog/commit/e67bafb673fa19d301f903ac62de26c48b4cc1c4
>Single-to-Single
as expected, Z-Image-Edit will only be able to handle one image input, but they could do what they did with Qwen Image Edit: finetune it further to make it able to edit multiple images
>>
>>107536763
And the photorealistic ones aren't 'slopstyle' too? Why the fixation with anime gens here? Also, if anons here post kino anime gens, wouldn't that take away the little sense your general has left?
>>
>>107536799
>plastic skin isn't slopstyle
>>
>>107536707
I meant it in the sense that they also changed the edit model's description; that's why I highlighted the commit
>>107536744
Enjoy the hopium!
>>107536754
Likely, yeah.
There aren't many good explanations for why you'd distill an unfinished model otherwise.
>>107536798
Actually single-to-single got deleted
That means they might already be finetuning the edit model to do that.
But that might be too much hopium.
>>
>>107536823
>There aren't many good explanations for why you'd distill an unfinished model otherwise.
since turbo can't do edit, it means it was distilled from an early version of base that couldn't do edit as well
>>
>>107536812
There's an entire thread waiting for you, you can go there.
>>
>>107536833
Anything to cope with the fact that base is done and they either can't or won't release it.
>>
>>
File: 1414.png (537 KB, 1582x1105)
>>107536731
don't see any texture. took four screenshots of a very complex model on sketchfab and it remade it pretty well. at least good enough as a base mesh, or to get proportions right in your modelling
>>
>>107536861
Now try an API img 2 3D model and die of hopelessness.
>>
>>107536861
Cool, which other anime girls can this model do? Can I export it to 3D rendering software? I need 4 different angles, right?
>>
File: Wanimate_00147.mp4 (1.14 MB, 960x544)
>>
File: 1736073357907263.png (437 KB, 3802x1518)
>>107536842
They literally changed the GitHub to say it's a base model made for finetuning and you still don't think it'll be local?
>>
>>107536861
Can it turn some random SDXL gen >>107536856 into 3d?
>>
>>107536707
No photorealistic tag on base? So will it be shit at realism or better at weebshit? Why would they drop this tag
>>
>>107536897
>and you still don't think it'll be local?
Yes. I do not think it will be released.
>>
>>
>>107536908
you don't expect a base model to look as good as a finetuned model, it's just not happening; if base models came out like that we wouldn't need finetunes in the first place
>>
>>107536889
>>107536902
you need four angles for the best results. find models on sketchfab, for instance, and just take screenshots. or you can make one image and use the template to get more views based on that image
>>
File: Z-image turbo.png (1.45 MB, 1280x720)
>>107536744
>I want to believe
billions must believe
>>
>107536912
i am very english too good morning
>>
who is paying ran overtime for being a dramafaggot 24/7?
>>
>>107536666
The training pipeline and model variants were already described like that in the technical report (https://arxiv.org/abs/2511.22699, section 4.3) from its first version in November. Omni pre-training covered both image generation and editing. Both Z-Image-Edit and Z-Image-Turbo (which is actually called "Z-Image" in some parts of the report) branch off from the base model after that stage. The editing variant had more pre-training specifically for editing (section 4.7).

This means there's a chance LoRAs trained on base will work on the editing model, but it's not guaranteed.
>>
>>107536957
esl
>>
I really wish base gets released and the porn-niggas that made biglust and lustify fine-tune it. The slop will be so good. I hope and wish.
>>
>>107536979
this. and chinese noob niggas
>>
>>107536951
How did you prompt this style?
>>107536979
For me it is the BigASP guy moving on to Z-Image Base
>>
>>107536979
>I really wish base gets released
I'm more hyped now knowing it can do edit as well, I thought I would have to wait additional months (and the release of z-image edit) before doing that lol
>>
>>107537006
>How did you prompt this style?
I got that style prompt from the LLM rewriter >>107531631
>>
>>107536958
Animanon = Debo
Ran = Debo
It's all the same faggot
No other pathetic fuck wakes up every day, sees his dead general with the same avatartroons he himself despises but pretends to like, then seethes and starts shitposting in other generals
>>
File: kewl.png (1.37 MB, 832x1216)
I'm mostly happy with how my latest lora turned out
>>
>>107537044
Poor guy, you have to understand him, he hit the jackpot with the worst avatarfags from /g/
>>
File: Wanimate_00149.mp4 (2.41 MB, 960x544)
>>
>>107537044
ani is a real person. comfyanon and catpissanon met him irl. I don't think he shits up the thread but mentally unhinged schizos try to make it appear otherwise
>>
>>107537061
What is the theme of the lora, retro? SDXL or ZiT? I want to use it!
>>
Is it possible to generate 16-bit PNGs in Comfy for massive dynamic range, or is the gen itself limited in a way?
>>
When will the onetrainer nigga update his shit
>>
File: 1737924715806510.jpg (525 KB, 2473x1452)
>>107536498
https://arxiv.org/pdf/2511.22699
I have a question though, what version of base will we get? the one on the far left?
>>
File: yallJEALOUS.png (65 KB, 671x287)
>>107537102
surrrre
>>
File: qwen_edit.png (645 KB, 857x655)
>>107537102
Sorry Debo, it must be rough watching time pass and seeing your justice league of avatarfags stay the same, and even worse, they don't improve, like they have some kind of mental illness, right?
>>
>>107537145
https://github.com/FizzleDorf
here is ani's GitHub. show us yours
>>
File: woman__.png (2.18 MB, 1024x1024)
>>
File: erm.png (2.11 MB, 832x1216)
>>107537103
It's based off of 100 illustrations by an artist named systemst91
The more I use it, the more I realize it's pretty flawed, and might need to be remade
I might not have the skill (or, let's be honest, the patience) to make a lora that is actually worth uploading anywhere
>>
>>107537061
link?
>>
>>107537188
bruh >>107537182
>>
>>107537182
What model is this? If it is Noob show me training settings and I might suggest some changes.
For SDXL especially, though, bad anatomy can also stem from "confusing" images in the dataset.
>>
>>107537182
You can always share your failed lora gens in the official /Stable DiffsuionTM general/ , I'm sure they'll be above average there.
Right? >>107536958
>>
File: 00037-2881979531.png (1.94 MB, 832x1216)
>>107537205
I'm probably going to get laughed out of the thread for using a model that is universally regarded as shitty, but...
I trained the lora on Illustrious Hassaku
>>
>>107537182
if you are using noob vpred it just looks like this. try Wai or plantmilk
>>
File: file.png (2 KB, 159x40)
Imagine being so far up your own ass that you add an entire thing called "broken" to your code instead of just not using the part you know isn't used
>>
>>107537225
>Hassaku
based, Ikena is an honest dev
>>
>>107537225
>Illustrious Hassaku
why? you are supposed to train on base illustrious then it's compatible with every other ill model
>>
File: ComfyUI_01773_.png (1.26 MB, 1440x816)
>>
>>107537225
Training on specific checkpoints can be good if you want to squeeze maximum quality from your lora at the expense of compatibility with other checkpoints, though training on shitmixes comes with the same caveats as using shitmixes.
You want to train on a base model like Illustrious XL v2 or, better, Noob v-pred v1 for SDXL anime.
That said, you likely fucked up some parameters or have overly weird images in the dataset to mangle anatomy that much.
>>
File: file.png (7 KB, 354x112)
Even if they retrained it, unless they changed something else, you'd have no way of toggling broken
>>
>>107537231
Ignore this troll.
>>
Ah, I understand now. I guess my stocking lora sucked cock because i stopped it at under 3,000 steps. This is my personal look for a (realistic) peach at 3,000 steps i let train while i slept.
last time i tried this, it failed to learn certain aspects of her attire. This time i went in with a more diverse dataset (still 20 images) and on top of that anon's suggestion of keeping all the training settings at default, it trained 100% of the character and i can change all of her attire.
oh and i can strip her nude because of the dataset, and the nudity accuracy is like 99% there.
yeah z-img turbo is really as good and trainable as everyone says. Damn. Base and edit are gonna light this scene on fire. 10/10 do recommend giving it a shot.

>>107537115
onetrainer is fucking DEAD nigga you're gonna have to get ai toolkit.
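As a sanity check on the numbers above, the steps-vs-dataset-size relationship converts to epochs like this (batch size 1 assumed, since that's the usual lora-training default):

```python
def epochs_seen(steps, dataset_size, batch_size=1):
    """How many full passes over the dataset a training run makes."""
    return steps * batch_size / dataset_size

# the run described above: 3000 steps over a 20-image dataset
print(epochs_seen(3000, 20))  # 150.0 passes, plenty for a character lora
```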
>>
>>
File: zimg_0007.png (2.09 MB, 1080x1440)
necroresponse: someone asked about training a lora at 512px. the likeness is not bad (zoey luna). it's kinda weird that i can get results at 750 steps
>>
>>107537166
ran doesn't do anything constructive to the thread or society so of course she doesn't have one
>>
File: ComfyUI_00005_.png (2.02 MB, 1024x1024)
>>107537284
I was gearing up to train on turbo but now I think I will just wait for base
>>
>>107537145
What is this screenshot supposed to be about?
>>
File: derp.png (1.61 MB, 832x1216)
>>107537238
cuz I'm a retard and didn't know that
>>
>>107537305
I would still wait for base, it's not perfect. Just because it's trainable doesn't mean it's as good as it can be. It would be great if I could get it to not force the style of the dataset, but that may be a flaw of training a distilled model.
>>
>>107537284
What training settings?
>>
>>107537234
Why are people hating comfy for his behavior here?

There are way too many fucking grifters trying to capitalize on the Z-Image hype at any cost. A different team inside Alibaba itself trained a controlnet for Z-Image, on the distilled model (?), not once but twice, released just a few days apart. The second version has a literal blatant typo that runs part of the model wrong, but of course, being an ML model, it will adapt to whatever you trained it with, even if suboptimal. It is clearly broken, and comfy just made the code handle that case explicitly and called it out.

Z-Image has an epidemic of shitty loras, shitty controlnets, shitty half-assed everything that people are rushing out because they want to jump on the hype train.
>>
>>
File: zimg_0010.png (2.04 MB, 1080x1440)
>>107537295
512px isn't terrible, just kinda flat on the detail i guess. 40 mins on a 3090 (1500 steps). i might push this to see if i can actually train a likeness lora in 750 steps at 512px in 20 mins
>>
>>107537295
>the likeness is not bad
Flux vae preserves details a lot better even at low res, so the model can actually learn the likeness.
It still looks desperate or Indian to train at 512p though. (Not that I should judge too much as a vramlet, but that doesn't make it untrue)
>>
>>107537307
ani is more respected and talented than tRan which is why she has frequent melties and spitebakes
>>
>>107537324
Forgot your avatar image
>>
File: gggsgweweg.jpg (2.54 MB, 5000x3042)
There's no alternative to seedvr2 is there? tiled upscaling with zit itself?
>>
File: pees.png (1.64 MB, 832x1216)
>>107537267
This is helpful to know!
So, I'm guessing that a small dataset with very clear images is probably going to produce much better results than a large dataset with a lot of clutter in each image?
>>
>>107537324
>Z-Image has an epidemic of shitty loras, shitty controlnets
calm down that model is less than 2 weeks old, let people master this architecture
>>
>>107537324
Newbie was a lumina tune and comfy should be labelling his own code broken considering people are still memory leaking zit
>>
>>107537323
defaults as i said.
-open ai toolkit
-change paths and lora settings as needed
-change steps as needed
-start.

>>107537295
>>107537328
yep that was me. Still blown away you could even train a lora at all with that resolution. Picrel was trained at 1024.
>>
File: 1738340391810037.png (1.84 MB, 1271x1347)
>>107537121
even the Z-image devs are shilling rewriting your prompts into a boomer prompt with LLMs
>>
>>
>>107537331
trvke
>>
>>107537342
For SDXL style loras, you typically want a lot of images.
The quality-over-quantity route works for characters, but SDXL struggles to learn a style without too much noise from a small dataset. (It can still be done with know-how and luck, but you don't have the former.) Your initial 100 mark is good enough. I guess you can remove some low-quality images, but don't remove more than a few.
>>
what happened to the vae replacements? how come the new models don't use them?
>>
File: ComfyUI_01774_.png (1.09 MB, 1360x768)
>>
>>
>>107537423
when are you releasing the Yakub ZiT lora?
>>
>>107537382
NTA I agree, 100 images is great for IL style loras. However I've had to use datasets that only had like 30 images and managed to get pretty decent results so don't be discouraged if your artist doesn't have a lot of art online or something.
>>
>>107537414
Do you refer to lodestone's claims about pixel-space diffusion?
His model is yet to (and not going to) converge into anything worth a damn to convince anyone outside of his discord.
And while not perfect flux vae is good enough in terms of quality.
>>
File: ComfyUI_01775_.png (1.04 MB, 1360x768)
>>
cumfartorg is simultaneously toxic positivity culture and toxic corpo culture that reached the boiling point with a garbage ui library they doubled down on. 2026 really is going to be the year it all falls apart
>>
>>107537453
Yeah, whatever sad schizo
>>
DRAG AND SHOT
>>
File: Cefurktuber.png (924 KB, 1011x889)
>>
>>107537447
no, just the papers that came out a while ago. supposedly the replacements are much lighter on vram, faster, and reduce noise in the output (higher quality)
>>
>>107537463
Inquiry if i may,
What if comfynigger drug and shot you?
>>
>>107537465
4h33m51s

proper documentation my furkan
>>
>>107537465
I hope this gets memed into reality. furk is a great storyteller I'd love to have my kids listen to for what these times were like
>>
File: cute couple.png (1.08 MB, 1024x1024)
>>
>>107537486
get tf outta here with that slop

sdg is that way
>>
>>107537472
>papers that came out a while ago
Well there a lot of these.
Link to which ones you are talking about?
>>
>>107537465
>"he made a deal with the Jewish devils"
>"big mistake"
>>
File: gggsgweweg2.jpg (3.1 MB, 8000x4866)
>>107537339
I don't know why I didn't think that tiled upscaling would work. Then just top it off with seedvr2.
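The tiled trick above generalizes: split the image into overlapping tiles, run the model on each tile, and blend the overlaps so seams average out. A model-agnostic sketch (plain averaging of overlaps; real workflows usually feather the overlap weights, and the tile/overlap values here are arbitrary):

```python
import numpy as np

def _starts(length, tile, step):
    # Tile start offsets that fully cover [0, length).
    s = list(range(0, max(length - tile, 0) + 1, step))
    if s[-1] + tile < length:
        s.append(length - tile)
    return s

def tiled_process(img, fn, tile=64, overlap=16):
    """Apply fn to overlapping tiles and average the overlaps.
    fn must return a tile of the same size; a real tiled upscaler
    would also scale the output coordinates, omitted for brevity."""
    h, w = img.shape[:2]
    tile = min(tile, h, w)
    step = max(tile - overlap, 1)
    acc = np.zeros(img.shape, dtype=np.float64)
    wgt = np.zeros((h, w) + (1,) * (img.ndim - 2), dtype=np.float64)
    for y in _starts(h, tile, step):
        for x in _starts(w, tile, step):
            acc[y:y + tile, x:x + tile] += fn(img[y:y + tile, x:x + tile])
            wgt[y:y + tile, x:x + tile] += 1.0
    return acc / wgt  # every pixel is covered at least once
```

With the identity function for `fn`, the blend reconstructs the input exactly, which is a handy way to verify the tiling math before plugging in a model.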
>>
>>107537486
>>>/g/adt/
>>
>>107537495
>>107537508
What's wrong with it
>>
>>107537496
there were quite a few but this one came out early
https://arxiv.org/html/2510.15301v1
there are a lot looking into it but I can't link them all
>>
>>
File: ComfyUI_01776_.png (1.12 MB, 1360x768)
>>
>>107537517
Maybe you are a newfag, but there is a specific anime general thread for that. It will be better received there than here.
>>
>me waiting for OneTrainer to implement ZiT
>>
why is he samefagging again
>>
>>
>>107537508
It's not anime though
>>107537518
Let me skim through.
Judging by its 17 Oct 2025 release date, though, I'd say it's too new even if correct and worthwhile. It would probably take a few more months before it gets used in any finished model.
>>
>>107537166
How does that prove that trAni is not an unhinged faggot and should fuck off forever?
>>
File: Z-image turbo.png (1.94 MB, 1280x720)
>>
>>107537545
it's interesting they tested on sdxl. we might see it get life support in some new ill-like model, which I would be fine with. unet is still a great arch that needs to be explored
>>
>>
File: ComfyUI_01777_.png (1.19 MB, 1360x768)
>>
>>107537557
ani works at contributing and sharing his work with others. you shit your diaper and screech in the thread every day. I wonder which anon people want around?
>>
>>107537583
you forgot to mention ran, sonic and ben10 "anon"
>>
>>107537583
trvth nvke
>>
File: random chinese woman.png (1.27 MB, 896x1152)
>>
>>
File: zimg_0012.png (2.07 MB, 1080x1440)
>>107537353
it's an interesting experiment, for higher concepts i think you need a ton of steps, i think refining something the model already knows is very fast. i couldn't get a new concept in there with the default settings, i had to really change the LR, etc
>>
>>107537583
ani jacks off to shota and waits for people to implement basic features in his wrapper more like
>>
File: ComfyUI_01778_.png (1.2 MB, 1360x768)
>>
NO you don't understand, everyone at alibaba's internal team and the reviewers on their public repo missed the TYPO, no way it was done on purpose, can't be, I'm right and EVERYONE ELSE IS WRONG
>>
File: Z-image turbo.png (1.53 MB, 1280x720)
>>
>>107537626
link to the shota collection?
>>
>>107537626 >>107537583 >>107537557
Did you know Ani from AniStudio (Ani from Anime) is a /adt/ regular? You should post and discuss his stuff there instead, it's more relevant to that thread!
>>
>>107537640
Kek
>>
>>107537626
ani sounds pretty based ngl
>>
File: demo2_00006_ copy.jpg (3.01 MB, 4480x6000)
How could flux blunder so hard? China nr 1.
>>
>>107537667
sorry but animanon is in the OP so it's relevant to the thread
>>
>>107537583
based
i wish schizo just stopped harassing anons who actually try to contribute. ani is the good guy here
>>
When will I be able to create 3D characters and 3D clothing/objects from generated images in a simple pipeline, then manage them in a DAZ3D-like editor and use these characters, clothing, backgrounds, and poses to auto create control nets + prompts for Z Image finetune?

I mean, can't we replace the writing somehow and make the generation process more playful?
>>
>>
>>107537632
are you 12? you seem to have a lot of free time
>>
>>107537694
bad idea since the topology is bad for anything that isn't a static object
>>
>>107537694
2 more years. 3D models are the final frontier of this tech to be honest; there are far more complexities going on there than with 2d.
which is why you never hear artists crying and pissing and shidding themselves about the 3d modeler jobs, no one cares about them.

(remember the industry hated them first)
>>
>>107537685
Sorry but his UI is focused on Anime, and there is a specific Anime general for that, you have to talk about him there.
>>
>>107537640
Comfy's ego is so high he genuinely believes that yeah
>>
>>
File: church.png (1.5 MB, 1216x832)
>>107537678
Baby making sex
>>
File: ComfyUI_00179_.png (2.21 MB, 1520x1040)
beta57 is alright
>>
>>107537708
I think you interpreted my thoughts differently than I intended.
Auto-remesh it down to a few hundred/thousand polygons instead of 1 million and you're done. I don't see how topology would play a role here in any case.
The 3D model is more for visual feedback, like what you currently have in the prompt, and it gives you a control net. You are also welcome to separate the character and background.
>>
>>107537771
thank you for convincing me that the dpm samplers are overhyped and not alight
>>
File: ComfyUI_01779_.png (1.28 MB, 1360x768)
>>
File: three women.png (1.41 MB, 1216x832)
>>107537771
I am a dpmpp_2m beta guy.
Though that causes issues for zit sometimes.
So I am using euler ancestral ddim uniform for now.
No idea what I will use when base releases because I hate ancestral samplers.
>>
File: lol lmao.jpg (439 KB, 1600x896)
>>107537794
>No idea what I will use when base releases
>>
File: Z-image turbo.png (1.63 MB, 3096x1527)
Way closer than I expected lol
>>
>>107537777
the poly count isn't the problem, it's how the topology works with skinning+rigging. generated 3d models are already retopo'd but it's just evenly spaced quads which will just look like clipping garbage when it's used in a rig. some edge cases might be static objects like a hair ornament or a belt buckle but for the character model itself it's terrible other than for 3d printing or maybe a 3d statue in a scene
>>
>>107537794
res2m seems to be the go to sampler, with beta/beta57 scheduler.
>>
>>107537803
Can you share the whole prompt for the options thingy?
>>
>>107537725
>3D models are the final frontier
world models are because it removes the need for 3d altogether
>>
File: img_00004_.jpg (558 KB, 1332x1776)
>>
File: ComfyUI_00030_.png (1.4 MB, 832x1216)
Convenient censorship when you didn't ask for it.
>>
File: Z-image turbo.png (1.38 MB, 1280x720)
>>107537815
Sure
https://files.catbox.moe/nvl4h1.txt
>>
File: ComfyUI_00185_.png (2.19 MB, 1520x1040)
>>107537794
>switched to dpmpp 2m without changing the prompt
>it removed "sde" from the text
bros?...
>>
>all this 1girls
Mistress /ldg/, can this worthless anon coom?
>>
>>107537803
Does it run local models or is it hardwired to HF repos? Fuck the cloud nodes.
>>107537834
Can you try i2i upscale if the beta scheduler is still blurring it?
>>
>>107537843
DON'T CUM!
>>
>>107537853
>Does it run local models is it hardwired to HF repos?
it only runs on local models, you put your gguf in a folder and you're good to go
https://github.com/BigStationW/ComfyUI-Prompt-Manager
>>
>>107537834
Lol
Also the excessive grain you see in the image is the problem I was referring to.
No idea why that happens with zit, it works fine on many other models.
>>
File: lel.png (529 KB, 3040x1255)
https://emma-umm.github.io/emma/
>We did it SAAR we beat Bagel!
Really? In front of my end of 2025?
>>
>>107537870
>No idea why that happens with zit, it works fine on many other models.
since they only trained on real data, there was probably a lot of compressed jpg images in there
>>
i have a text encoder in this format

qwen_2.5_vl_7b_fp8_scaled.safetensors
its a 9gb file

when i try to find abliterated versions of qwen 3 vl 2b or whatever, i cannot find a single file, instead there is the whole folder with configuration files and what not

what is the difference, and how do i make use of the version that has multiple files in a folder in comfy ui? right now i load the qwen_2.5_vl_7b_fp8_scaled.safetensors using the load clip node

i want to use it with qwen image edit
>>
File: img_00008_.jpg (935 KB, 1332x1776)
>>
Are you using the correct text encoder?
Why do you want abliterated or 2b? The latter won't work at all, and the former will give worse results, since the model wasn't trained on it.
>>
>>107537976
Cute!
>>
>>107537992
99% chance a coomer said it'll make better booba
>>
>>107537992
Meant to tag >>107537911
>>
File: samurai child.png (1.37 MB, 832x1216)
If ZIT can figure out that she has to close her eye because the scar is going through her eye, not some random part of her face, it would be great.
>>
>>107537992
>>107538007
yes better bob and vagene

so its not like ill get better results with say qwen 3 4b vl ect instead of qwen_2.5_vl_7b_fp8_scaled.safetensors? im new to local ai
>>
base more like a never-ending maze
>>
>>107538034
try prompting it better. like, boomer prompt. >>107537355
>>
>>107537257
>>107537423
>>107537448
>>107537525
>>107537632
>>107537787
agartha needzs your powa frien
>>
>>107537803
Is this simply zit? It understands regional prompting like that?
>>
>>107538037
No it doesn't give better bob and vagene sir.
The text encoder isn't censored, the diffusion model can't draw bob and vagene, because it simply wasn't trained on bob and vagene.
Qwen image (edit) isn't really a coom model. There is no coom edit model yet. There are some loras for flux kontext and qwen image edit that you might use (most got jannied so you need to use civarchive) but they don't work well. Might get something usable through seed lottery though.
There are some API models that can do okay bob, but that's outside the scope of this general.
>>
so it seems, from my testing, if you can train your character lora with some nudes, absolutely do it. it 100% fixes the lack of titty training in turbo provided you gave it enough steps to work with.
down to the color of the nipples trained with that character, even. very nice.
oh and pubes.
>>
File: img_00012_.jpg (787 KB, 1332x1776)
>>107538001
tyytyy
>>
>>107538078
it's using a visual LLM to rewrite your prompt and describe your characters from the image input, then with that prompt you put that on ZiT >>107537868
>>
File: zimg_0018.png (2.07 MB, 1080x1440)
20 minute loras aren't great, but they sure as hell aren't bad, no wonder civit is full of low effort crap
>>
>>107538078
Pretty much any semi-decent text encoder (t5, qwen) understands regional instructions. (Though even they will occasionally blend stuff.)
CLIP of SDXL and before days couldn't, because it is simply too retarded not to blend concepts from different regions together.
Flux, chroma, qwen image, etc. can all do this, nothing special to zit.
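The "regional instruction" here is nothing more than explicit text; a minimal sketch of assembling one (the phrasing is illustrative, there is no fixed syntax):

```python
# Build one explicit "boomer prompt" that spells out each region.
# Modern encoders (T5, Qwen) can usually keep the attributes apart;
# CLIP-era models tend to bleed them together.

def regional_prompt(scene: str, regions: dict[str, str]) -> str:
    """Join per-region descriptions into a single explicit prompt string."""
    parts = [scene]
    for position, description in regions.items():
        parts.append(f"On the {position} of the image, {description}.")
    return " ".join(parts)

prompt = regional_prompt(
    "Three women standing in a park.",
    {
        "left": "a woman with red hair wearing a green coat",
        "right": "a woman with black hair wearing a yellow dress",
    },
)
```

Whether "green coat" actually stays on the left woman is down to the encoder, not the string format.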
>>
>>107538111
20 minutes on which GPU? 5090?
>>
File: zimg_0023.png (2.18 MB, 1080x1440)
>>107538133
a 3090?
>>
>>107538108
>>107538122
I'm doing llm for zit already, but was unaware of the regional thing, cool.
I'm gonna have fun with that node, thanks.
>>
>>107538139
no fucking way you can train a lora in 20min even on a 3090... capppppp
>>
>>107538139
I am curious what shortcuts you used to get it converge into something halfway usable that fast on a 3090.
Mind sharing your training settings?
>>
>>107538167
20min is pretty low but still XL is really fucking slow, more modern models learn way faster
>>
>>107538182
wait thats an SDXL lora?
>>
>>107538139

bruh WTF my output of the IRL I trained on was so dogwater compared to your lora and gens.
>>
File: z-image_00824_.png (1.58 MB, 1152x2048)
>>107533983
use a black image and lower denoise a bit
>>
File: Untitled.png (98 KB, 1224x697)
>>107538167
this is literally the whole point of the experiment to see how shit my training can be

>>107538174
- ai-toolkit
- z-img default settings with training adapter v2
- rank 64
- 18 images cropped square (various resolutions)
- no captions
- trained at 512
- 750 steps
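Back-of-the-envelope on those numbers (the ~20 min figure is from the post being replied to):

```python
# sanity check: 750 steps in ~20 minutes on a 3090
steps = 750
minutes = 20
sec_per_step = minutes * 60 / steps  # ~1.6 s/step at 512px, rank 64
```

So nothing magic: ~1.6 s/step, which a 3090 can plausibly hit at 512px with no captions.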
>>
>>107538188
no, I meant in comparison: XL is really slow compared to newer models when it comes to training
>>
>>107537803
question, can this be done without hooking up a thinking LLM to my regional prompting workflow? what node do i use instead of that prompt generator?
>>
>>107538092
i was able to denude quite a few celebrities with 2509 though
although nothing amazing for coomerbrains, works for me (for now)

but i was just wondering what were the differences, theres a billion models out there
>>
>>107538205
Interesting. I thought it needed 3k steps or so.
>>
>>107538205
>356W
Just an idea if you want to experiment but I remember reading some studies that said you could undervolt your card by like 60% and get very minimal impact on inference speed.
>>
>>107538255
>theres a billion models out there
Irrelevant for you.
Diffusion models only work with the text encoders they were trained on.
>>
File: img_00018_.jpg (676 KB, 1332x1776)
>>
>>107538255
qwen will give generic parts but you can always feed the image back into a diffusion model for inpainting after.

SAM3 can detect tits easily and for a vagina you can say "mouth".
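The detect-then-inpaint patching described above reduces to mask compositing; a pure-Python sketch on tiny images (real pipelines do the same with numpy arrays, and the detector/inpainter outputs here are stand-ins):

```python
# After a detector (e.g. SAM) gives you a binary mask, pasting the
# inpainted region back into the original is a per-pixel select.

def composite(original, inpainted, mask):
    """Where mask is 1, take the inpainted pixel; elsewhere keep the original."""
    return [
        [inp if m else orig for orig, inp, m in zip(orow, irow, mrow)]
        for orow, irow, mrow in zip(original, inpainted, mask)
    ]

def dilate(mask):
    """Grow the mask by one pixel so the patch seam lands outside the region."""
    h, w = len(mask), len(mask[0])
    return [
        [1 if any(mask[y][x]
                  for y in range(max(0, j - 1), min(h, j + 2))
                  for x in range(max(0, i - 1), min(w, i + 2))) else 0
         for i in range(w)]
        for j in range(h)
    ]
```

Dilating before compositing is the usual trick to hide the seam between the two gens.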
>>
>>107538299
>and for a vagina you can say "mouth".
sounds about right.
>>
>>107538205
interesting. Im trying to make a body type lora, of this ferraira woman, but even at 3k steps, nada. fuckall results.
>>
>>107538299
lmao patchworking tits and vagene in
>>
Is there a node that uses Qwen 3 4b to refine the prompt before passing it to z-image? Seeing as you have to load Qwen 3 as the text encoder it seems like the slowdown wouldn't be too bad. I think it would work well for creating cohesive scenes with wildcards.
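Lacking a dedicated node, the same idea can be sketched against a locally served Qwen behind an OpenAI-compatible endpoint (the URL, port, and model name below are assumptions for illustration; only the chat payload shape is standard):

```python
import json
import urllib.request

# Hypothetical local server (llama.cpp / vLLM style); adjust to your setup.
ENDPOINT = "http://127.0.0.1:8000/v1/chat/completions"

def build_rewrite_request(user_prompt: str, model: str = "qwen3-4b") -> dict:
    """OpenAI-style chat payload asking the LLM to expand a terse/wildcard prompt."""
    return {
        "model": model,
        "messages": [
            {"role": "system",
             "content": "Rewrite the user's image prompt as one detailed, "
                        "coherent scene description. Output only the prompt."},
            {"role": "user", "content": user_prompt},
        ],
        "temperature": 0.7,
    }

def refine(user_prompt: str) -> str:
    """Send the payload and return the rewritten prompt text."""
    req = urllib.request.Request(
        ENDPOINT,
        data=json.dumps(build_rewrite_request(user_prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

The rewritten string then goes into the normal z-image text-encode node; since Qwen 3 is already loaded as the encoder, a shared server would avoid double VRAM.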
>>
File: Untitled-5ffffffffff.jpg (387 KB, 4800x1792)
>>107538197
It's either on or off for me. Left is .9 denoise, right is .91.
>>
File: ComfyUI_temp_lufha_00004_.png (2.93 MB, 1280x1600)
>>
>>107538343
lmao
>>
File: 1752428534985169.png (2.21 MB, 1120x1440)
>>
File: ComfyUI_temp_lufha_00005_.png (2.87 MB, 1280x1600)
>>107538343
LMAO, just realized that the model generated indian men pissing in the background, I didn't prompt that, based Xi, I kneel
>>
File: ComfyUI_temp_lufha_00006_.png (3.19 MB, 1280x1600)
>>
>>107538359
>>107538343
Needs more indians around her. Totally takes me out of the immersion with so few trying to take pictures and demand the bobs and vagene for the pay cards, saar.
>>
>>107537808
yeah res2m seems to work pretty damn well in zit .. i usually use uniform_pc for the scheduler with it
>>
>>107537803
this is very very cool

>>107538259
>>107538307
really depends how many images you are using for your dataset and the quality of them, along with your captions. if you're using default settings it shouldn't be absolutely nothing.
>>
File: zit_kmshiftest.jpg (285 KB, 3064x1024)
>>107537870
>excessive grain
if i understand this right, shift is essentially a control for when zit shifts from low noise to high noise. the "grain" is high noise, try lowering shift. I haven't extensively tested shift values against samplers and schedulers but it definitely has an impact on shitty skin texture graininess.
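For reference, the usual flow-matching shift is a remap of the whole sigma schedule rather than a hard low/high switch; my understanding of the formula (worth checking against the ComfyUI model-sampling node source):

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Flow-matching timestep shift: remaps sigma in [0, 1].
    shift > 1 bends the schedule toward higher noise, so the sampler
    spends more of its steps on composition and fewer on fine texture."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)
```

shift = 1 leaves the schedule untouched; the endpoints 0 and 1 are fixed either way, only the middle moves.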
>>
>be trani
>see MIT licenced work with over 40 contributors
>"wait, comfy became a winner when he made an ui!"
>lightbulb.png
>vibecode a wrapper that barely works and even misses trivial features
>"now is my time to shine thehe~"
>closes shota folder
>injects the last dose of hrt juice
>slap a commercial licence on top of the MIT licenced stuff
>"it's basically as if i build everything myself, i'm such a genius"
>"they will never make fun of me again thehehe~"
>release it and spam all threads for months
...
>no one cares
>>
File: img_00022_.jpg (893 KB, 1352x1776)
>>
>>107538421
maybe this trani person will stop doing whatever it is you don't like if you don't bring them up out of nowhere
>>
>>107538415
I'll tell u exactly how many and what kind and captions and shit.

40 images, head cropped out. insta size mostly. ie 1350.
captioned with body description.
>>
File: 4928.png (2.4 MB, 1310x1310)
>>
>>107538205
>- rank 64
Isn't that too strong for Zit? You already run loras at sub-0.50 because of the distortions when having 2+
>>
File: ComfyUI_temp_lufha_00018_.png (2.71 MB, 1088x1856)
>>
Please halp. My NVMe drive loads clip SLOWER than my SSD drive. It takes 5+ minutes to load wan, where on the SSD it's almost instant.

I have the same setup on my regular Crucial SSD and my NVMe M.2 drive (which is supposed to be faster): same speed boosts and models. The only difference is there are more nodes on my NVMe drive.
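One way to rule the drive in or out is a raw sequential-read timing on the same model file from each disk (paths are yours to fill in; stdlib only):

```python
import time

def read_throughput_mb_s(path: str, chunk_mb: int = 64) -> float:
    """Sequentially read a file and report MB/s; run once per drive and compare.
    Note: the OS page cache inflates repeat runs on the same file, so use a
    fresh boot or a file that hasn't been read recently."""
    chunk = chunk_mb * 1024 * 1024
    total = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while data := f.read(chunk):
            total += len(data)
    elapsed = time.perf_counter() - start
    return total / (1024 * 1024) / max(elapsed, 1e-9)

# example (hypothetical paths):
# print(read_throughput_mb_s("D:/models/clip/qwen_2.5_vl_7b_fp8_scaled.safetensors"))
```

If the NVMe reads fast here but loads slow in Comfy, the bottleneck is elsewhere (antivirus scanning, a dying link/driver, or nodes doing extra work), not the disk.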
>>
File: ZiMG_01375_.jpg (611 KB, 1344x1728)
gday fellas
>>
>>107538441
don't caption the body, caption everything except the body. captioning the body basically tells the trainer to learn everything else instead.

ie. if i want to train a body i want to caption the setting, the clothes, the jewelry so it learns only the body and not that stuff.

try again with no captions.
>>
>>107537803
Welp.
>>
>>107538435
>maybe this trani person will stop doing whatever it is you don't like
if you mean existing sign me up champ!
>>
>>107538508
you have to put the mmproj next to the gguf you tardburglar
>>
>>107538497
Hm ok, ill try once more with no captions at all.
>>
Cute wannabe butthurt schizo
>>
>>107538517
Pretend I'm retarded.
>>
>>107536612
Well, he is correct. Not surprising, since this Z-Image control net is not from the guys who make Z-Image.
>>
>>107538529
Now check the console. The mmproj filename must start the same as the model itself and end with "mmproj"
>>
File: ZiMG_01382_.jpg (514 KB, 1344x1728)
>>107538490
>>
>>107538477
maybe your nvme is kill
>>
fresh bread
>>107538552
>>107538552
>>107538552
>>
>>107538541
damn that's great, gj anon
>>
>>107536897
Also they say it's a community model over and over; the Flux 2 shills are desperately trying to pretend this won't be released.

What's even more hilarious is that the chinks will release this great undistilled model before BFL can get their shitty small distilled Flux Klein model out.

BFL is dead, they peaked at Flux dev which was ok for art styles but 100% slopped humans and super censored.

Thanks for playing.
>>
>>107538541
kino gen
prompt?
>>
>>107536908
A base model that is primarily made to be further trained on should have as little aesthetic and caption bias as possible.

It should have strong foundational knowledge of practically every concept, and then people can finetune the model to be extremely good at particular concepts, anime, NSFW, art styles, etc.
>>
>>107538508
you have to read this
https://github.com/BigStationW/ComfyUI-Prompt-Manager?tab=readme-ov-file#image-inputs
>>
>>107538107
catbox/prompt for this style please?
>>
>>107538312
>patchworking tits and vagene in
It's actually a good technique when you want a specific boob shape. it's really hard to prompt for a slim body with massive tits or a fat body with tiny tits.

And for some reason boob tags massively affect the way a face will look.


