/g/ - /ldg/ - Local Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/ldg/ - Local Diffusion Genera(...) 04/19/26(Sun)14:30:31 No.108639162

File: highlights_g_108629083_17(...).jpg (1.43 MB, 3380x1456)

/ldg/ - Local Diffusion General Anonymous 04/19/26(Sun)14:30:31 No.108639162

Discussion and Development of Local Image and Video Models

Previous: >>108629083

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
04/19/26(Sun)14:33:11 No.108639180

Anonymous 04/19/26(Sun)14:33:11 No.108639180

it's not over

Anonymous
04/19/26(Sun)14:38:27 No.108639219

Anonymous 04/19/26(Sun)14:38:27 No.108639219

File: ComfyUI_22149.png (2.58 MB, 1920x1080)

2.58 MB PNG

>>108638173
>>108638184
So... faster, less resources or both?

Anonymous
04/19/26(Sun)14:39:59 No.108639228

Anonymous 04/19/26(Sun)14:39:59 No.108639228

>>108639162
we don't have anything to talk, why did you bake?

Anonymous
04/19/26(Sun)14:42:55 No.108639255

Anonymous 04/19/26(Sun)14:42:55 No.108639255

>>108639228
he only cares about made up schizo drama

Anonymous
04/19/26(Sun)14:44:26 No.108639266

Anonymous 04/19/26(Sun)14:44:26 No.108639266

>mfw Resource news

04/19/2026

>ZPix: Local AI image generator and editor powered by open image models.
https://github.com/SamuelTallet/ZPix

>Comfy Canvas: Local inline layer based image editor
https://github.com/Zlata-Salyukova/Comfy-Canvas

04/18/2026

>Rose: Range-Of-Slice Equilibration PyTorch optimizer
https://github.com/MatthewK78/Rose

04/17/2026

>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
https://yjx-research.github.io/ControlFoley

>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
https://research.nvidia.com/labs/toronto-ai/tokengs

>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
https://aka.ms/mm-webagent

>Qwen2D-VAE
https://huggingface.co/Anzhc/Qwen2D-VAE

>ComfyUI HY-World 2.0 — WorldMirror 3D
https://github.com/AHEKOT/ComfyUI_HYWorld2

>Anima Style Explorer: A free web tool for ComfyUI styles
https://anima.mooshieblob.com

>Stanford AI Index Report 2026
https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf

04/16/2026

>Motif-Video 2B: A micro-budget text-to-video diffusion transformer from Motif Technologies
https://motiftech.io/videoshowcase

>HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
https://huggingface.co/tencent/HY-World-2.0

>ErnieTurbo_extracted_lora
https://huggingface.co/GuangyuanSD/ErnieTurbo_extracted_lora/tree/main

04/15/2026

>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
https://huggingface.co/tencent/DisCa

>Lyra 2.0: Explorable Generative 3D Worlds
https://research.nvidia.com/labs/sil/projects/lyra2

>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://github.com/VAST-AI-Research/AniGen

>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Models
https://gyanendrachaubey.github.io/T2I-BiasBench

Anonymous
04/19/26(Sun)14:44:46 No.108639268

Anonymous 04/19/26(Sun)14:44:46 No.108639268

File: downfall-asset-1054100487.jpg (171 KB, 1920x1088)

171 KB JPG

where am I supposed to get my nsfw resources from now that civitai went full retard, are we back to shady businessmen in back alleys?

Anonymous
04/19/26(Sun)14:45:27 No.108639272

Anonymous 04/19/26(Sun)14:45:27 No.108639272

>mfw Research news

04/19/2026

>Boosting Robust AIGI Detection with LoRA-based Pairwise Training
https://arxiv.org/abs/2604.12307

>A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting
https://arxiv.org/abs/2604.13427

>Decoupled Similarity for Task-Aware Token Pruning in Large Vision-Language Models
https://arxiv.org/abs/2604.11240

>Relaxing Anchor-Frame Dominance for Mitigating Hallucinations in Video Large Language Models
https://arxiv.org/abs/2604.12582

>One-shot Compositional 3D Head Avatars with Deformable Hair
https://yuansun-xjtu.github.io/CompHairHead.io

>Crowdsourcing of Real-world Image Annotation via Visual Properties
https://arxiv.org/abs/2604.14449

>Chaotic CNN for Limited Data Image Classification
https://arxiv.org/abs/2604.14645

>HTDC: Hesitation-Triggered Differential Calibration for Mitigating Hallucination in Large Vision-Language Models
https://arxiv.org/abs/2604.12115

>Degradation-Consistent Paired Training for Robust AI-Generated Image Detection
https://arxiv.org/abs/2604.10102

>On The Application of Linear Attention in Multimodal Transformers
https://arxiv.org/abs/2604.10064

>Reasoning Resides in Layers: Restoring Temporal Reasoning in Video-Language Models with Layer-Selective Merging
https://arxiv.org/abs/2604.11399

>Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models
https://arxiv.org/abs/2604.14888

>Benchmarking Deflection and Hallucination in Large Vision-Language Models
https://arxiv.org/abs/2604.12033

>Why MLLMs Struggle to Determine Object Orientations
https://arxiv.org/abs/2604.13321

>Quality-Aware Calibration for AI-Generated Image Detection in the Wild
https://grip-unina.github.io/QuAD

>Reward Design for Physical Reasoning in Vision-Language Models
https://arxiv.org/abs/2604.13993

>Seeing Through Circuits: Faithful Mechanistic Interpretability for Vision Transformers
https://arxiv.org/abs/2604.14477

Anonymous
04/19/26(Sun)14:46:15 No.108639281

Anonymous 04/19/26(Sun)14:46:15 No.108639281

File: ComfyUI_temp_xnbin_00011_.png (2.83 MB, 1602x1024)

2.83 MB PNG

Anonymous
04/19/26(Sun)14:46:56 No.108639287

Anonymous 04/19/26(Sun)14:46:56 No.108639287

>>108639162
sarah peterson status?

Anonymous
04/19/26(Sun)14:50:08 No.108639316

Anonymous 04/19/26(Sun)14:50:08 No.108639316

>>108639287
Yeah, let me tell you
>Sarah Petersons BBC Holding Dildo FT15
https://civitai.red/models/466318/sarah-petersons-bbc-holding-dildo-ft15
>Sarah Petersons Black Bred Magazine cover
https://civitai.red/models/717113/sarah-petersons-black-bred-magazine-cover
>Sarah Petersons BBC Spoon FT15
https://civitai.red/models/185076/sarah-petersons-bbc-spoon-ft15
>Sarah Petersons BBC Gangbang Kneeling surrounded
https://civitai.red/models/537775/sarah-petersons-bbc-gangbang-kneeling-surrounded

Happy BBCunday ^^!

Anonymous
04/19/26(Sun)14:50:12 No.108639317

Anonymous 04/19/26(Sun)14:50:12 No.108639317

>>108639287
in shambles, Indian GDP dropped by 2%

Anonymous
04/19/26(Sun)14:51:55 No.108639332

Anonymous 04/19/26(Sun)14:51:55 No.108639332

File: ComfyUI_temp_xnbin_00022_.png (3.69 MB, 2756x1024)

3.69 MB PNG

Anonymous
04/19/26(Sun)14:53:04 No.108639342

Anonymous 04/19/26(Sun)14:53:04 No.108639342

File: 5aab3c67-cb64-411e-90ec-b(...).png (1.94 MB, 4096x2713)

1.94 MB PNG

>>108639316
so based..

Anonymous
04/19/26(Sun)14:54:23 No.108639351

Anonymous 04/19/26(Sun)14:54:23 No.108639351

I haven't been ITT since Z Image and Kleins dropped, what's the current meta? Are the threads still under assault by anus? Is lodestones still a retard?

Anonymous
04/19/26(Sun)14:56:56 No.108639372

Anonymous 04/19/26(Sun)14:56:56 No.108639372

File: ComfyUI_temp_xnbin_00030_.png (2.47 MB, 2476x1050)

2.47 MB PNG

Anonymous
04/19/26(Sun)14:59:48 No.108639393

Anonymous 04/19/26(Sun)14:59:48 No.108639393

>>108639162
good boy tran

Anonymous
04/19/26(Sun)15:06:39 No.108639444

Anonymous 04/19/26(Sun)15:06:39 No.108639444

>>108639351
Anima shows promise for anime stuff, and became Ani's latest target. It's a little smaller than SDXL and much slower, but can do both tags and natural-language prompting. There's even a WaiAnima v1 now that noticeably improves high-res results.

Anonymous
04/19/26(Sun)15:09:49 No.108639464

Anonymous 04/19/26(Sun)15:09:49 No.108639464

>>108639372
Aaahhh

Anonymous
04/19/26(Sun)15:12:30 No.108639478

Anonymous 04/19/26(Sun)15:12:30 No.108639478

>>108639372
Wat prompt anon

Anonymous
04/19/26(Sun)15:14:01 No.108639493

Anonymous 04/19/26(Sun)15:14:01 No.108639493

File: AAuE7mAo5osHnoXHx_W7kzrEx(...).jpg (95 KB, 900x900)

95 KB JPG

what the fuck is ERNIE

Anonymous
04/19/26(Sun)15:14:37 No.108639496

Anonymous 04/19/26(Sun)15:14:37 No.108639496

>>108639351
Kekstone is training his last model on pics of his own poop with disposable camera. Sounds promising...

Anonymous
04/19/26(Sun)15:17:41 No.108639510

Anonymous 04/19/26(Sun)15:17:41 No.108639510

>>108639493
an another nothingburger

Anonymous
04/19/26(Sun)15:17:53 No.108639511

Anonymous 04/19/26(Sun)15:17:53 No.108639511

>>108639493
the fastest milkman in the west

Anonymous
04/19/26(Sun)15:19:20 No.108639518

Anonymous 04/19/26(Sun)15:19:20 No.108639518

>>108639351
Klein-9B-KV was released, which used kv-caching to speed up edit gens by a lot.

Anonymous
04/19/26(Sun)15:23:05 No.108639536

Anonymous 04/19/26(Sun)15:23:05 No.108639536

>>108639496
>Kekstone is training his last model on pics of his own poop with disposable camera
sounds retarded enough to be true

Anonymous
04/19/26(Sun)15:28:41 No.108639572

Anonymous 04/19/26(Sun)15:28:41 No.108639572

File: FluxKlein9BDistilled_Outp(...).jpg (2.77 MB, 2048x2048)

2.77 MB JPG

Anonymous
04/19/26(Sun)15:29:26 No.108639575

Anonymous 04/19/26(Sun)15:29:26 No.108639575

>>108639572
wtf I want to die for Israel now??

Anonymous
04/19/26(Sun)15:37:53 No.108639627

Anonymous 04/19/26(Sun)15:37:53 No.108639627

File: 1886981.png (15 KB, 709x86)

15 KB PNG

so its over? owarida?

Anonymous
04/19/26(Sun)15:39:06 No.108639634

Anonymous 04/19/26(Sun)15:39:06 No.108639634

I keep seeing some fucking crazy NSFW videos on DeviantArt with multi-shot character consistency and audio. How are people doing it? No way it's LTX-2.3

Anonymous
04/19/26(Sun)15:40:06 No.108639640

Anonymous 04/19/26(Sun)15:40:06 No.108639640

>>108639493
The husband of HERNIA

Anonymous
04/19/26(Sun)15:40:11 No.108639641

Anonymous 04/19/26(Sun)15:40:11 No.108639641

>>108639634
link

Anonymous
04/19/26(Sun)15:41:43 No.108639653

Anonymous 04/19/26(Sun)15:41:43 No.108639653

trying image editing for the first time with klein 9b on my 8gb vram, absolute magic

Anonymous
04/19/26(Sun)15:48:06 No.108639698

Anonymous 04/19/26(Sun)15:48:06 No.108639698

File: ComfyUI_temp_xnbin_00043_.png (2.54 MB, 2476x1024)

2.54 MB PNG

>>108639478
A character sheet multi-view photo 3x3 grid of the woman for dataset creation, white seamless background,

Anonymous
04/19/26(Sun)16:15:16 No.108639851

Anonymous 04/19/26(Sun)16:15:16 No.108639851

>>108639219
very nice

Anonymous
04/19/26(Sun)16:16:08 No.108639856

Anonymous 04/19/26(Sun)16:16:08 No.108639856

>>108639653
ye once you get the hang of how to prompt klein for edit it's quite good for the size / speed

Anonymous
04/19/26(Sun)16:33:37 No.108639938

Anonymous 04/19/26(Sun)16:33:37 No.108639938

>>108639518
>Klein-9B-KV
is it better in other regards too or just faster

Anonymous
04/19/26(Sun)16:36:00 No.108639953

Anonymous 04/19/26(Sun)16:36:00 No.108639953

>>108639856
how do i prompt Klein to make me a canny filter accurate and not change the style?

Anonymous
04/19/26(Sun)16:37:26 No.108639962

Anonymous 04/19/26(Sun)16:37:26 No.108639962

>>108639938
Worse but faster imo.

Anonymous
04/19/26(Sun)16:40:01 No.108639980

Anonymous 04/19/26(Sun)16:40:01 No.108639980

File: [044861].jpg (227 KB, 1300x1300)

227 KB JPG

>>108639698
>>108639372
what model

Anonymous
04/19/26(Sun)16:45:13 No.108640016

Anonymous 04/19/26(Sun)16:45:13 No.108640016

tdrusell are you here?

Anonymous
04/19/26(Sun)16:46:02 No.108640023

Anonymous 04/19/26(Sun)16:46:02 No.108640023

File: 74567237272.jpg (377 KB, 1344x768)

377 KB JPG

Anonymous
04/19/26(Sun)17:09:35 No.108640173

Anonymous 04/19/26(Sun)17:09:35 No.108640173

>>108639962
Is the quality even supposed to be different? The description sounds like it just avoids redundant recomputes by reusing the part that doesn't change.

https://github.com/black-forest-labs/flux2/blob/main/docs/flux2_klein_kv_cache.md

Anonymous
04/19/26(Sun)17:11:09 No.108640183

Anonymous 04/19/26(Sun)17:11:09 No.108640183

>>108640016
im in my ferrari sports car training v4 but whats up

Anonymous
04/19/26(Sun)17:23:07 No.108640234

Anonymous 04/19/26(Sun)17:23:07 No.108640234

its up
https://www.youtube.com/watch?v=B6dq0Q5UAaE

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.