/g/ - Technology

File: 1749766932296056.jpg (1.89 MB, 2954x2552)
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107687569 (Cross-thread)

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
>>107693072
third time's the charm kej
>>
blessed thread of frenship
>>
>>107693072
Why is AniStudio not in OP?
>>
File: collage.jpg (3 MB, 4607x3095)
>>
Are there any good detailer loras for z-image?
>>
>>107693142
>xbox one
>>
cozy bread
>>
>>107693354
No one knows, yes still king, and very well
>>
>>107693354
>What happens to Z image base
its inference code PR got merged 4 days ago; for the new version of Qwen Image Edit, the PR got merged 2 weeks before they released the model, so make of that what you will
https://github.com/huggingface/diffusers/pull/12857
>>
File: 1741413748506572.jpg (804 KB, 1920x1080)
>>
File: 1741752823950154.jpg (349 KB, 1344x1240)
>>
File: 1759027773787847.jpg (366 KB, 848x1280)
>>107693479
not interested in your schizophrenic drama. just posting zit gens.
>>
File: 1738071918997894.png (92 KB, 666x307)
lol
lmao
>>
>>107693469
cute migu!
>>
File: file.jpg (348 KB, 1344x1056)
>>
i tried all the z-image lora training toolboxes to get training running on my shitty 8gb card, and only this one
https://github.com/shootthesound/comfyUI-Realtime-Lora
makes it possible for me to train at ~4gb vram with good results. what did he do differently?
I tried to replicate the config with onetrainer but i always end up with higher vram use and massively worse results.
ai-toolkit was the worst
>>
what's the difference between kijai's wan2.2 unets and the regular ones?
>>
I vaguely remember BFL announcing a video model, or at least hinting at it.
Am I misremembering, or are they actually working on something?
>>
>>107693691
they did at some point but i think they gave up after the chinese models like Wan got released
>>
anyone have benchmarks for sdcpp and how it compares to comfy?
>>
>>107693072
Based.
>>
>>107693469
great gen
>>
Do I retrain a previous lora (because last time it wasn't nearly enough steps), or do I train on a fresh new exciting dataset, hmm...
>>
File: 1756415429576962.png (888 KB, 2432x1664)
>>
>>107693918
>it wasnt nearly enough steps)
just continue from the last epoch
>>
File: rin.jpg (491 KB, 958x1400)
>>
>>107693962
catbox?
>>
File: ZiMG_01652_.png (2.75 MB, 1344x1728)
>>
Am I just retarded, or how are you supposed to prompt these things?
I've tried using qwen 2511 and flux 2 dev, and neither seems capable of this edit. Also tried inpainting on qwen and couldn't get it to do anything to the hair clip at all.
>>
File: ZiMG_01663_.png (3.2 MB, 1344x1728)
>>107694039
>>
>>107693115
agreed, I only use AniStudio because the dev understands what he's doing and it doesn't use shitty python
>>
>>107694139
>conan isn't using python
>>
File: zimg_0179.png (1.51 MB, 848x1280)
>>107693551
not for nothing anon you can read the documentation yourself and see that this uses musubi tuner under the hood; you can probably just run musubi tuner with these settings:
https://github.com/shootthesound/comfyUI-Realtime-Lora/blob/main/musubi_zimage_config_template.py
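if you want something to compare against your onetrainer config, here's a rough sketch of the kind of low-VRAM knobs a template like that typically sets. the names and values below are illustrative assumptions, not copied from the linked file, so check it for the real settings:
[code]
# illustrative low-VRAM training settings, NOT the actual template values;
# see musubi_zimage_config_template.py in the repo for what it really passes.
low_vram_overrides = {
    "blocks_to_swap": 20,              # offload this many transformer blocks to system RAM
    "gradient_checkpointing": True,    # recompute activations instead of storing them
    "mixed_precision": "bf16",
    "network_dim": 16,                 # small LoRA rank keeps optimizer state tiny
    "gradient_accumulation_steps": 4,  # trade wall-clock time for effective batch size
}
[/code]
diffing settings like these against what the node actually generates is the fastest way to find out what it does differently.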
>>
File: ZiMG_01676_.png (2.83 MB, 1344x1728)
>>107694090
>>
File: 1737139411746294.jpg (1017 KB, 1536x1536)
>>
>>107694204
yeah I also think musubi might be the key difference here. honestly i only tried to replicate the settings with the musubi gui which is lacking quite a lot of options.
i think it might be the "blocks_to_swap" option, haven't seen that anywhere else so far.
>>
File: 1761242797327290.jpg (1.39 MB, 1248x1824)
>>
File: dmmg_00029.png (1.66 MB, 960x1280)
>>107694281
that could very well be it, as that would be directly tied to how many blocks to swap out of vram into your machine ram. generally block swapping will prevent you from going OOM, but it slows down training quite a bit.
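for anyone who hasn't seen it before, a toy sketch of what block swapping boils down to (not musubi tuner's actual implementation): park most of the transformer blocks in system RAM and only move each one onto the GPU while it runs.
[code]
import torch
import torch.nn as nn

# toy illustration of block swapping, not the real trainer code:
# keep a few blocks resident on the GPU, shuttle the rest in and out per step.
class SwappedStack(nn.Module):
    def __init__(self, blocks: nn.ModuleList, keep_on_gpu: int = 4):
        super().__init__()
        self.blocks = blocks
        self.keep_on_gpu = keep_on_gpu
        for i, blk in enumerate(self.blocks):
            blk.to("cuda" if i < keep_on_gpu else "cpu")

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for i, blk in enumerate(self.blocks):
            if i >= self.keep_on_gpu:
                blk.to("cuda")   # PCIe transfer: this is where the slowdown comes from
            x = blk(x)
            if i >= self.keep_on_gpu:
                blk.to("cpu")    # free the VRAM again before the next block
        return x
[/code]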
>>
>>107694368
yeah it's slow alright. it takes hours to finish.
but at least it runs and the results are solid...
>>
File: ComfyUI_00682_.png (1.7 MB, 1480x1128)
>>
why can't comfy save normal fucking FP8 now? Everything is scaled without asking. Torch 2.9 can't compile fp8 on a 3090 if there are scaled tensors, even if you patch triton. Torch 2.7 worked.
It's like there is a silent cabal of faggots who conspire to break older GPU workarounds and then ignore you when you ask about it.
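if you just want plain fp8 without the scales, something along these lines should work outside comfy. the ".scale_weight" key suffix and the weight * scale dequant convention are assumptions about how the scaled checkpoints are laid out, so check your file's keys before trusting the output:
[code]
# sketch: fold per-tensor scales back into the weights and save plain float8_e4m3fn.
# assumes torch >= 2.1 and a safetensors checkpoint; filenames are hypothetical.
import torch
from safetensors.torch import load_file, save_file

sd = load_file("model_fp8_scaled.safetensors")
plain = {}
for k, v in sd.items():
    if k.endswith(".scale_weight"):
        continue  # handled together with its weight below
    scale_key = k.rsplit(".", 1)[0] + ".scale_weight"
    if v.dtype == torch.float8_e4m3fn and scale_key in sd:
        # recasting after folding the scale in can clip large values;
        # that loss is the price of "normal" unscaled fp8
        v = (v.float() * sd[scale_key].float()).to(torch.float8_e4m3fn)
    plain[k] = v
save_file(plain, "model_fp8_plain.safetensors")
[/code]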
>>
File: ComfyUI_00526_.mp4 (732 KB, 1200x720)
the WAN keyframe template is alright I guess. Still waiting for something like qwen-edit for video
>>
>>107694587
Google has had this since 2024. can't believe open source still has no answer
>>
>>107694039
yes, please, moar. I need to get into the right mindset to abandon the (oldshit) I am experienced with and start fresh with Z-image. harder than I thought.


