/g/ - /sdg/ -Stable Diffusion general - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/sdg/ -Stable Diffusion genera(...) 01/19/26(Mon)17:11:19 No.107914506

File: k_00777_.png (3.09 MB, 1920x1080)

/sdg/ -Stable Diffusion general Anonymous 01/19/26(Mon)17:11:19 No.107914506

Previous /sdg/ thread : >>107899825

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody

Anonymous
01/19/26(Mon)17:14:49 No.107914530

Anonymous 01/19/26(Mon)17:14:49 No.107914530

>mfw Resource news

01/19/2026

>kohya-ss/sd-scripts v0.10.0 released
https://github.com/kohya-ss/sd-scripts/releases/tag/v0.10.0

>Radiance: Professional HDR Image Processing Suite for ComfyUI
https://github.com/fxtdstudios/radiance

>M3DDM+: An improved video outpainting by a modified masking strategy
https://github.com/tamaki-lab/M3DDM-Plus

>ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
http://facebookresearch.github.io/ShapeR

>VidLeaks: Membership Inference Attacks Against Text-to-Video Models
https://zenodo.org/records/17972831

>Moonworks Lunara Aesthetic Dataset
https://huggingface.co/datasets/moonworks/lunara-aesthetic

01/18/2026

>VIBE: Visual Instruction Based Editor
https://huggingface.co/iitolstykh/VIBE-Image-Edit

>Arthemy Live Tuner SDXL ComfyUI
https://github.com/aledelpho/Arthemy_Live-Tuner-SDXL-ComfyUI

>Pixel-Perfect Aligner (AI Fix) for GIMP 3
https://github.com/CombinEC-R/Pixel-Perfect-Aligner

>Stable AI Flow: Phase-Locked Live AI Filter
https://github.com/anttiluode/StableAIflow

>ComfyUI-Flux2Klein-Enhancer: Conditioning enhancement node for FLUX.2 Klein 9B
https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer

>DiffusionDesk: Self-hosted Creative AI server integrating stable-diffusion.cpp and llama.cpp
https://github.com/Danmoreng/diffusion-desk

>WAN 2.6 Reference-to-Video is available in ComfyUI
https://blog.comfy.org/p/wan26-reference-to-video

01/17/2026

>FLUX.2 Prompting: Prompting Guide - FLUX.2 [klein]
https://docs.bfl.ai/guides/prompting_guide_flux2_klein

01/16/2026

>ComfyUI-CapitanFlowMatch: Optimal samplers and schedulers for rectified flow models
https://github.com/capitan01R/ComfyUI-CapitanFlowMatch

01/15/2026

>FLUX.2 [klein]: Generate and edit in less than a second with state-of-the-art quality
https://bfl.ai/models/flux-2-klein

>ComfyUI-TBG-ETUR: 100MP Enhanced Tiled Upscaler & Refiner Pro. Enhance Your Images with TBG's Upscaler
https://github.com/Ltamann/ComfyUI-TBG-ETUR

Anonymous
01/19/26(Mon)17:15:50 No.107914538

Anonymous 01/19/26(Mon)17:15:50 No.107914538

>mfw Research news

01/19/2026

>Your One-Stop Solution for AI-Generated Video Detection
https://arxiv.org/abs/2601.11035

>PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models
https://arxiv.org/abs/2601.11087

>CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
https://lucaria-academy.github.io/CoDance

>SoLA-Vision: Fine-grained Layer-wise Linear Softmax Hybrid Attention
https://arxiv.org/abs/2601.11164

>ATATA: One Algorithm to Align Them All
https://arxiv.org/abs/2601.11194

>Enhancing Vision Language Models with Logic Reasoning for Situational Awareness
https://arxiv.org/abs/2601.11322

>When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models
https://arxiv.org/abs/2601.11444

>MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models
https://arxiv.org/abs/2601.11464

Anonymous
01/19/26(Mon)17:25:42 No.107914608

Anonymous 01/19/26(Mon)17:25:42 No.107914608

File: k_00831_.png (581 KB, 896x1152)

581 KB PNG

Anonymous
01/19/26(Mon)17:35:08 No.107914677

Anonymous 01/19/26(Mon)17:35:08 No.107914677

File: deGG_zi_00015_.png (3.2 MB, 2176x1152)

3.2 MB PNG

we havent seen baker anon since around xmas. would he really vanish without even saying goodbye?

Anonymous
01/19/26(Mon)17:47:33 No.107914783

Anonymous 01/19/26(Mon)17:47:33 No.107914783

File: k_00832_.png (1.45 MB, 896x1152)

1.45 MB PNG

>>107914677
idk. it is depressing. we seem in the latter days of sdg. i am pretty burnt out.

Anonymous
01/19/26(Mon)18:11:06 No.107914989

Anonymous 01/19/26(Mon)18:11:06 No.107914989

File: deGG_zi_00016_.png (3.2 MB, 2176x1152)

3.2 MB PNG

>>107914783
post cadence hasn't been too bad lately, keeping close to a thread a day. I do miss a lot of posters though. maybe we'll see them again some day

Anonymous
01/19/26(Mon)18:23:54 No.107915098

Anonymous 01/19/26(Mon)18:23:54 No.107915098

File: 00000003-238390615843326-(...).jpg (1.02 MB, 3072x2048)

1.02 MB JPG

klein lora attemp 1 (.3) worked well enough

Anonymous
01/19/26(Mon)18:32:09 No.107915184

Anonymous 01/19/26(Mon)18:32:09 No.107915184

File: 00000006-283183995075673-(...).jpg (1.64 MB, 3072x2048)

1.64 MB JPG

Anonymous
01/19/26(Mon)18:36:18 No.107915215

Anonymous 01/19/26(Mon)18:36:18 No.107915215

File: 00021-comfyui-cfg=1.0-ste(...).jpg (171 KB, 1216x832)

171 KB JPG

I created a kpop band. kek

Full video
https://files.catbox.moe/cljkr0.mp4

Anonymous
01/19/26(Mon)18:53:17 No.107915340

Anonymous 01/19/26(Mon)18:53:17 No.107915340

File: deGG_zi_00018_.png (3.26 MB, 2176x1152)

3.26 MB PNG

>>107915098
nice. how do you rate f2k trainability compared to chroma or zimg?

>>107915215
this was all local? pretty nice pacing and scene structure. the faces get pretty demonic in parts tho
how long did the whole project take?

Anonymous
01/19/26(Mon)19:03:03 No.107915432

Anonymous 01/19/26(Mon)19:03:03 No.107915432

File: 00000013-1029224174454383(...).jpg (1.94 MB, 3072x2048)

1.94 MB JPG

>>107915340
>how do you rate f2k trainability compared to chroma or zimg?
taking into account that the more i've trained loras the more i've learned what works and waht doesnt, and that i dont tend to go back to retrain old loras with new knowledge...
i know chroma well and i know what doesnt work on it well too, what works is a challenge because of the nature of training (datasets, samplers, optimizations)
given that..
chroma is hardest and longest to train, but if done right it's like you add your character/style to the model
z-image is fast to train but tends to overfit (because it's not a base model+i didnt spend too much time on it)
klein is not even fully implemented (in onetrainer anyway) and it's as fast if not faster than z, and so far seems to preserve the base model stuff without bullying it (chroma) or overfitting (z). kinda of best of both worlds, but i'm still ironing out some quirks.
it took 6 hrs or so to do 50 epochs (around 2500 steps). using 250+ source images with meh captions, batch 10 at res 1024
same thing on chroma would've been maybe 15-20 hrs (if not more), and i never tried that many source images on z, but probably around 5-6 hrs too

Anonymous
01/19/26(Mon)19:14:28 No.107915514

Anonymous 01/19/26(Mon)19:14:28 No.107915514

File: 00000014-403394901353152-(...).jpg (1.92 MB, 3072x2048)

1.92 MB JPG

gonna leave the chromagirl training overnight, see how it does at 100 epochs lel

Anonymous
01/19/26(Mon)19:26:12 No.107915632

Anonymous 01/19/26(Mon)19:26:12 No.107915632

File: 00000016-44919451335000-f(...).jpg (1.43 MB, 3072x2048)

1.43 MB JPG

Anonymous
01/19/26(Mon)19:46:42 No.107915822

Anonymous 01/19/26(Mon)19:46:42 No.107915822

>>107915340
>how long did the whole project take?
About 3-4 days, all local. Yeah, quality is bad because of just 12gb. Still amazing what you can do with just 12gb.

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.