/g/ - /sdg/ - Stable Diffusion general - Technology



File: 3-corp-art.png (1.42 MB, 896x1152)

/sdg/ - Stable Diffusion general Anonymous 12/26/25(Fri)03:03:53 No.107670801

Previous /sdg/ thread : >>107653059

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody

Anonymous 12/26/25(Fri)03:05:39 No.107670810

>mfw Resource news

12/25/2025

>Lumi Tools v1.1.0 adds LLM processors, new utility nodes, and more
https://github.com/illuminatianon/comfyui-lumi-tools/releases/tag/v1.1.0

>Input-Adaptive Visual Preprocessing for Efficient Fast Vision-Language Model Inference
https://github.com/kmdavidds/mlfastlm

>Rethinking Direct Preference Optimization in Diffusion Models
https://github.com/kaist-cvml/RethinkingDPO_Diffusion_Models

>ComfyUI-LG_SamplingUtils
https://github.com/LAOGOU-666/ComfyUI-LG_SamplingUtils

12/24/2025

>PhotoMapAI: fast, modern image browser and search tool for large photo collections
https://github.com/lstein/PhotoMapAI

12/23/2025

>StoryMem: Multi-shot Long Video Storytelling with Memory
https://kevin-thu.github.io/StoryMem

>Qwen-Image-Edit-2511
https://huggingface.co/Qwen/Qwen-Image-Edit-2511
https://huggingface.co/lightx2v/Qwen-Image-Edit-2511-Lightning
https://huggingface.co/unsloth/Qwen-Image-Edit-2511-GGUF

>CASA: Cross-Attention via Self-Attention for Efficient VL Fusion
https://kyutai.org/casa

>The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
https://github.com/WeichenFan/UAE

>MaskFocus: Focusing Policy Optimization on Critical Steps for Masked ImGen
https://github.com/zghhui/MaskFocus

>Efficient Zero-Shot Inpainting with Decoupled Diffusion Guidance
https://github.com/YazidJanati/ding

>ComfyUI-SpectralVAEDetailer
https://github.com/SparknightLLC/ComfyUI-SpectralVAEDetailer

>Wan2.1 NVFP4 quantization-aware 4-step distilled models
https://huggingface.co/lightx2v/Wan-NVFP4

>Majoor Assets Manager for ComfyUI
https://github.com/MajoorWaldi/ComfyUI-Majoor-AssetsManager

12/22/2025

>Region-Constraint In-Context Generation for Instructional Video Editing
https://zhw-zhang.github.io/ReCo-page

>Infinite-Homography as Robust Conditioning for Camera-Controlled VidGen
https://emjay73.github.io/InfCam

>SAM 3 Segmentation Agent Now in ComfyUI
https://github.com/adambarbato/ComfyUI-Segmentation-Agent

Anonymous 12/26/25(Fri)03:06:40 No.107670814

>mfw Research news

12/25/2025

>VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs
https://arxiv.org/abs/2512.21194

>HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
https://arxiv.org/abs/2512.21338

>Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models
https://sytwu.github.io/BeyondMemo

>GriDiT: Factorized Grid-Based Diffusion for Efficient Long Image Sequence Generation
https://arxiv.org/abs/2512.21276

>ACD: Direct Conditional Control for Video Diffusion Models via Attention Supervision
https://arxiv.org/abs/2512.21268

>DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation
https://dreamontage.github.io/DreaMontage

>FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
https://arxiv.org/abs/2512.21104

>T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
https://arxiv.org/abs/2512.21094

>Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
https://arxiv.org/abs/2512.21004

>FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
https://arxiv.org/abs/2512.21015

>Generalization of Diffusion Models Arises with a Balanced Representation Space
https://arxiv.org/abs/2512.20963

>UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
https://arxiv.org/abs/2512.21185

>Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection
https://arxiv.org/abs/2512.20937

Anonymous 12/26/25(Fri)03:07:41 No.107670821

>mfw Yesterday's Research news

12/24/2025

>AI Image Generators Default to the Same 12 Photo Styles, Study Finds
https://gizmodo.com/ai-image-generators-default-to-the-same-12-photo-styles-study-finds-2000702012

>SemanticGen: Video Generation in Semantic Space
https://jianhongbai.github.io/SemanticGen

>FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
https://arxiv.org/abs/2512.20561

>CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation
https://arxiv.org/abs/2512.20362

>TAVID: Text-Driven Audio-Visual Interactive Dialogue Generation
https://arxiv.org/abs/2512.20296

>How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
https://arxiv.org/abs/2512.20233

>AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model
https://arxiv.org/abs/2512.20157

>HEART-VIT: Hessian-Guided Efficient Dynamic Attention and Token Pruning in Vision Transformer
https://arxiv.org/abs/2512.20120

>Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts
https://arxiv.org/abs/2512.20088

>UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
https://arxiv.org/abs/2512.20479

>How Much 3D Do Video Foundation Models Encode?
https://vidfm-3d-probe.github.io

>Few-Shot-Based Modular Image-to-Video Adapter for Diffusion Models
https://arxiv.org/abs/2512.20000

>Learning to Refocus with Video Diffusion Models
https://arxiv.org/abs/2512.19823

>Beyond Vision: Contextually Enriched Image Captioning with Multi-Modal Retrieval
https://arxiv.org/abs/2512.20042

Lumi (¬ᴗ ´¬ ) 12/26/25(Fri)03:10:18 No.107670840

File: l-nbp-2025-12-26_00011_.png (1.29 MB, 1376x768)

>>107670781
no version in readme no problems :)

Anonymous 12/26/25(Fri)03:10:51 No.107670843

File: deJS_zi_00019_.png (2.63 MB, 1408x1536)

news, posted
thread, filled
xmas, over
food, eaten
sleep, awaits
gn

Lumi (¬ᴗ ´¬ ) 12/26/25(Fri)03:11:50 No.107670847

File: l-nbp-2025-12-26_00015_.png (1.49 MB, 1376x768)

>>107670843
gn

Anonymous 12/26/25(Fri)04:01:48 No.107671094

File: 00299-433697709-_lora_quo(...).jpg (138 KB, 1024x1024)

trying out the old SDXL quokka LoCon on Illustrious, he turns into a bird-like pokemon
>>107670843
gn anon :)

Anonymous 12/26/25(Fri)04:38:06 No.107671300

File: 00300-3999552889-_lora_qu(...).jpg (153 KB, 1024x1024)

yep, Illustrious identifies quokka as a bird, lmao

Lumi (¬ᴗ ´¬ ) 12/26/25(Fri)05:35:57 No.107671621

File: l-nbp-2025-12-26_00027_.png (1.31 MB, 1376x768)

Lumi (¬ᴗ ´¬ ) 12/26/25(Fri)05:42:54 No.107671671

File: l-nbp-2025-12-26_00045_.png (1.74 MB, 1344x768)

Lumi (¬ᴗ ´¬ ) 12/26/25(Fri)05:57:26 No.107671770

File: l-nbp-2025-12-26_00052_.png (1.62 MB, 1344x768)
