/g/ - /ldg/ - Local Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/ldg/ - Local Diffusion Genera(...) 04/01/26(Wed)14:21:38 No.108502685

File: 1769681690665286.jpg (1.39 MB, 1967x1967)

/ldg/ - Local Diffusion General Anonymous 04/01/26(Wed)14:21:38 No.108502685

Discussion of Open Source Diffusion Models

Previous: >>108494530

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

Anonymous
04/01/26(Wed)14:27:32 No.108502726

Anonymous 04/01/26(Wed)14:27:32 No.108502726

File: 1766194561883981.jpg (2.5 MB, 3024x4032)

2.5 MB JPG

Anonymous
04/01/26(Wed)14:30:28 No.108502753

Anonymous 04/01/26(Wed)14:30:28 No.108502753

File: 441314770780331.png (1.06 MB, 1664x2432)

1.06 MB PNG

Anonymous
04/01/26(Wed)14:36:09 No.108502790

Anonymous 04/01/26(Wed)14:36:09 No.108502790

>mfw Resource news

04/01/2026

>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite

>MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
https://vcbsl.github.io/MMFace-DiT

>Hallucination-aware intermediate representation edit in LVLMs
https://github.com/ASGO-MM/HIRE

>CutClaw: Agentic Hours-Long Video Editing via Music Synchronization
https://github.com/GVCLab/CutClaw

>Extend3D: Town-Scale 3D Generation
http://seungwoo-yoon.github.io/extend3d-page

>PixlStash 1.0.0 release candidate
https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0rc3

>adetailer-hires-sync: Automatically enables ADetailer in Forge
https://github.com/KazeKaze93/adetailer-hires-sync

03/31/2026

>See-through: Single-image Layer Decomposition for Anime Characters
https://github.com/shitagaki-lab/see-through

>VRAM Pager: Compressed GPU Memory Paging for Diffusion & Video Models
https://github.com/willjriley/vram-pager

>TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark
https://github.com/IDLabMedia/tgif-dataset

>Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
https://differential-query-painter.github.io/DQ-painter

>Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting
https://github.com/aSleepyTree/Drift-AR

>INSID3: Training-Free In-Context Segmentation with DINOv3
https://visinf.github.io/INSID3

>OmniColor: Unified Framework for Multi-modal Lineart Colorization
https://github.com/zhangxulu1996/OmniColor

>Gen-Searcher: Reinforcing Agentic Search for Image Generation
https://gen-searcher.vercel.app

>V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video LLMs
https://github.com/xinyouu/V-CAST

>GEMS: Agent-Native Multimodal Generation with Memory and Skills
https://gems-gen.github.io

>RAWIC: Bit-Depth Adaptive Lossless Raw Image Compression
https://github.com/chunbaobao/RAWIC

Anonymous
04/01/26(Wed)14:37:09 No.108502796

Anonymous 04/01/26(Wed)14:37:09 No.108502796

>mfw Research news
>>108502500
>>108502506

Anonymous
04/01/26(Wed)14:40:17 No.108502818

Anonymous 04/01/26(Wed)14:40:17 No.108502818

File: 57622191487741.png (626 KB, 832x1216)

626 KB PNG

Anonymous
04/01/26(Wed)14:42:36 No.108502834

Anonymous 04/01/26(Wed)14:42:36 No.108502834

Mugen vs animal anon? For nsfw ofc

Also Jenner, share your LoRA please, I start to have a kink for her

Anonymous
04/01/26(Wed)14:44:29 No.108502851

Anonymous 04/01/26(Wed)14:44:29 No.108502851

>>108502834
Anima unless you are a fudding lolcow thread schizo or member of their shillcord. Not even close.

Anonymous
04/01/26(Wed)14:48:39 No.108502883

Anonymous 04/01/26(Wed)14:48:39 No.108502883

>>108502834
sdxl

Anonymous
04/01/26(Wed)14:48:59 No.108502888

Anonymous 04/01/26(Wed)14:48:59 No.108502888

File: 132110849987576.png (995 KB, 1344x768)

995 KB PNG

Anonymous
04/01/26(Wed)14:50:25 No.108502902

Anonymous 04/01/26(Wed)14:50:25 No.108502902

>julienbake

Anonymous
04/01/26(Wed)14:50:35 No.108502903

Anonymous 04/01/26(Wed)14:50:35 No.108502903

Qwen and Klein are great in a lot of ways but I miss Chroma's soul. I haven't been keeping up with the Chroma meta since 1.0, though. What are people using? I heard the flash huen lora was good but it looked really slopped (skill issue?) and Chroma really needs negative prompts. What are people using these days?

Anonymous
04/01/26(Wed)14:55:02 No.108502928

Anonymous 04/01/26(Wed)14:55:02 No.108502928

File: Klein9BDistilledRemakes.jpg (3.54 MB, 2312x2604)

3.54 MB JPG

>>108502637
Remade this a couple slightly different ways (different prompt approaches) with Klein 9B Distilled

Anonymous
04/01/26(Wed)14:56:46 No.108502941

Anonymous 04/01/26(Wed)14:56:46 No.108502941

File: 123436057959205.png (952 KB, 1152x896)

952 KB PNG

Anonymous
04/01/26(Wed)14:58:39 No.108502954

Anonymous 04/01/26(Wed)14:58:39 No.108502954

>>108502903
I use either base or the DC-2K memeversion. As for flash loras, the rank 64 are the best.

Anonymous
04/01/26(Wed)14:59:34 No.108502960

Anonymous 04/01/26(Wed)14:59:34 No.108502960

>>108502685
cake lady looks delish, catbox?

Anonymous
04/01/26(Wed)15:00:48 No.108502965

Anonymous 04/01/26(Wed)15:00:48 No.108502965

>>108502928
The framing/composition is too simplistic, if you want to make it look more realistic you have to make them stand more off-center, make the framing not perfectly horizontally aligned, etc. Probably add a bit of photo blur and turn down the color saturation.

Anonymous
04/01/26(Wed)15:04:05 No.108502985

Anonymous 04/01/26(Wed)15:04:05 No.108502985

File: 491221717145059.png (773 KB, 832x1216)

773 KB PNG

Anonymous
04/01/26(Wed)15:04:49 No.108502994

Anonymous 04/01/26(Wed)15:04:49 No.108502994

>>108502954
>DC-2K
What is this? I found the files but there isn't any description or guidance on sampler settings. Is it just a drop-in replacement for Chroma-HD? Is it meant to gen at 2k?

As far as the flash lora I was using rank 64 and it was generating images that looked like base flux when I was trying to do something realistic. I gave up pretty quickly but it really didn't look promising.

Anonymous
04/01/26(Wed)15:08:29 No.108503012

Anonymous 04/01/26(Wed)15:08:29 No.108503012

>>108502994
>Is it meant to gen at 2k?
No it's just more stable at higher than 1mpx gens. Other than that it's just chroma.
Idk I don't really gen 3dpd so I can't help you here.

Anonymous
04/01/26(Wed)15:09:48 No.108503018

Anonymous 04/01/26(Wed)15:09:48 No.108503018

>>108502994
No one besides the five furtroons who basement dwell their cord 7/24 knows what the fuck exactly the trillion different schizo chroma spinoff experiments precisely are supposed to be.
Chroma is an astroturfed failbake and you will only waste your time with it. None of the variants or autistic workflows work without the most aggressive cherry picking and seed lottery game.

Anonymous
04/01/26(Wed)15:10:38 No.108503023

Anonymous 04/01/26(Wed)15:10:38 No.108503023

File: Klein9BVsZIT.png (2.87 MB, 1566x1168)

2.87 MB PNG

What ZIT thinks white people look like is the biggest issue with it IMO, it's really not that realistic at all when it comes to faces in quite a lot of cases, you can immediately tell it's a Chinese model just by looking at the eyes most of the time

Anonymous
04/01/26(Wed)15:15:59 No.108503059

Anonymous 04/01/26(Wed)15:15:59 No.108503059

>>108503012
Got it. I have to wonder if the flash lora was trained for anime or something because it absolutely tanked the realism which Chroma usually excels at.

>>108503018
Fuck off, we've got people in this thread that are still using fucking SDXL. Chroma has been surpassed in a lot of ways by recent models but I've made plenty of Chroma images that I just can't replicate with new models.

Anonymous
04/01/26(Wed)15:18:12 No.108503076

Anonymous 04/01/26(Wed)15:18:12 No.108503076

>>108503059
desu there are multiple flash loras. I have like 5 and each is baked differrently and gives differrent results.

Anonymous
04/01/26(Wed)15:24:25 No.108503123

Anonymous 04/01/26(Wed)15:24:25 No.108503123

File: 385555633032208.png (959 KB, 896x1152)

959 KB PNG

Anonymous
04/01/26(Wed)15:26:02 No.108503134

Anonymous 04/01/26(Wed)15:26:02 No.108503134

teto is so shit as a waifu it's not even funny anymore

Anonymous
04/01/26(Wed)15:49:15 No.108503312

Anonymous 04/01/26(Wed)15:49:15 No.108503312

Chroma is only good/better if you're a furfags... If not just pick any Qwen/ZiT/Klein nsfw.

Anonymous
04/01/26(Wed)15:58:13 No.108503370

Anonymous 04/01/26(Wed)15:58:13 No.108503370

>>108503312
Hermano, we're all furfags

Anonymous
04/01/26(Wed)16:06:00 No.108503418

Anonymous 04/01/26(Wed)16:06:00 No.108503418

>>108503023
and don't forget to mention that zit, has far more trouble with the loras than klein

Anonymous
04/01/26(Wed)16:10:26 No.108503444

Anonymous 04/01/26(Wed)16:10:26 No.108503444

>chinese anime man's hyped up 'new model' was API wan 2.7
local really is dead. absolutely zero developments this entire year.

Anonymous
04/01/26(Wed)16:12:06 No.108503454

Anonymous 04/01/26(Wed)16:12:06 No.108503454

>108503312
>same one argument

Anonymous
04/01/26(Wed)16:16:50 No.108503483

Anonymous 04/01/26(Wed)16:16:50 No.108503483

>>108503444
No he was referring to Nucleus MoE whose diffusers PR seems to be stuck in development hell.

Anonymous
04/01/26(Wed)16:18:17 No.108503491

Anonymous 04/01/26(Wed)16:18:17 No.108503491

File: Screenshot from 2026-04-0(...).png (144 KB, 342x429)

144 KB PNG

>>108503444
I'm actually glad local is stagnating. it means there's no reason for me to upgrade my PC. which saves me money. i'm going to ride out my 3090 128 GB of ram for the next 5 years.

Anonymous
04/01/26(Wed)16:23:49 No.108503525

Anonymous 04/01/26(Wed)16:23:49 No.108503525

File: wan.png (287 KB, 708x608)

287 KB PNG

>>108503483
No he wasn't, he quoted his own post to refer to wan 2.7. Whatever nucleus is, it will be worse than z-image, qwen 2512, and flux klein. Local has nothing left, even Noob has to resort to shitty GLM-image because Qwen refuses to release image 2.0. The era of API is here

Anonymous
04/01/26(Wed)16:30:09 No.108503571

Anonymous 04/01/26(Wed)16:30:09 No.108503571

>>108503525
Didn't check his recent posts, damn that sucks.

Anonymous
04/01/26(Wed)16:31:53 No.108503579

Anonymous 04/01/26(Wed)16:31:53 No.108503579

>>108503525
so is Wan 2.7 image not something to be excited for?

Anonymous
04/01/26(Wed)16:32:14 No.108503583

Anonymous 04/01/26(Wed)16:32:14 No.108503583

I thought anon would have a good april fools joke but instead hes just repeating the same troll he always uses :(

Anonymous
04/01/26(Wed)16:32:30 No.108503585

Anonymous 04/01/26(Wed)16:32:30 No.108503585

File: 789507225759739.png (826 KB, 832x1216)

826 KB PNG

Anonymous
04/01/26(Wed)16:32:54 No.108503591

Anonymous 04/01/26(Wed)16:32:54 No.108503591

>>108503579
Never mind, i didn't read that it was API. i thought it was being open sourced.

Anonymous
04/01/26(Wed)16:34:20 No.108503603

Anonymous 04/01/26(Wed)16:34:20 No.108503603

The least they could do is releasing earlier wan versions like 2.5 but they won't even do that.

Anonymous
04/01/26(Wed)16:36:51 No.108503621

Anonymous 04/01/26(Wed)16:36:51 No.108503621

>>108503591
it's only local until it's good

Anonymous
04/01/26(Wed)16:40:04 No.108503636

Anonymous 04/01/26(Wed)16:40:04 No.108503636

>>108503603
they REALLY hate the idea of people using their own hardware to make shit. they probably think it's going to be a huge legal risk.

Anonymous
04/01/26(Wed)16:46:27 No.108503685

Anonymous 04/01/26(Wed)16:46:27 No.108503685

>>108503636
I thought china was supposed to be heckin based and redpilled, saving AI from western jews and censorship, releasing local models to end capitalism and defeat OpenAI's monopoly?? Don't tell me that was just a bunch of a shill astroturfing after all and they sold out to SaaS just as quickly as everyone else

Anonymous
04/01/26(Wed)16:48:26 No.108503699

Anonymous 04/01/26(Wed)16:48:26 No.108503699

File: 16436738523.jpg (137 KB, 640x705)

137 KB JPG

Looking through some of my old folders, people had it figured it out in 2023 apperantly.

Anonymous
04/01/26(Wed)16:50:57 No.108503712

Anonymous 04/01/26(Wed)16:50:57 No.108503712

>>108503685
even cumfart sold out to saas

Anonymous
04/01/26(Wed)16:51:37 No.108503718

Anonymous 04/01/26(Wed)16:51:37 No.108503718

>>108503685
Pretty much. you saw how China bent the knee and are going to strip Seedance of any ability to do anything before releasing it locally. China is ultimately just as scared of america as the rest of the world is.

Anonymous
04/01/26(Wed)16:58:42 No.108503759

Anonymous 04/01/26(Wed)16:58:42 No.108503759

File: 1749783908067763.jpg (467 KB, 1744x1432)

467 KB JPG

*zitslops all over u*

Anonymous
04/01/26(Wed)17:02:33 No.108503774

Anonymous 04/01/26(Wed)17:02:33 No.108503774

>>108502726
it sent shivers down my spine

Anonymous
04/01/26(Wed)17:12:17 No.108503824

Anonymous 04/01/26(Wed)17:12:17 No.108503824

>>108503718
good, no one wants to train an overfit model.

Anonymous
04/01/26(Wed)17:18:48 No.108503866

Anonymous 04/01/26(Wed)17:18:48 No.108503866

Klein (at least 4b) and qwen are censored. Qwen phroot is all or nothing. Moving to pony or illustrous

Anonymous
04/01/26(Wed)17:22:29 No.108503885

Anonymous 04/01/26(Wed)17:22:29 No.108503885

>>108503418
Klein generates body horror with loras.

Anonymous
04/01/26(Wed)17:24:19 No.108503902

Anonymous 04/01/26(Wed)17:24:19 No.108503902

>>108503712
you sound jealous

Anonymous
04/01/26(Wed)17:31:17 No.108503953

Anonymous 04/01/26(Wed)17:31:17 No.108503953

>>108503902
you sound like a little bitch boy saas cuck

Anonymous
04/01/26(Wed)17:34:37 No.108503972

Anonymous 04/01/26(Wed)17:34:37 No.108503972

>>108503953
you post about saas here all the time though

Anonymous
04/01/26(Wed)17:35:42 No.108503977

Anonymous 04/01/26(Wed)17:35:42 No.108503977

>>108503972
I don't faggot

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.