/g/ - /sdg/ - Stable Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/sdg/ - Stable Diffusion Gener(...) 10/01/25(Wed)00:50:35 No.106754581

File: PW.webm (1.95 MB, 480x576)

/sdg/ - Stable Diffusion General Anonymous 10/01/25(Wed)00:50:35 No.106754581

Previous /sdg/ thread : >>106737217

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt

カガミノコエ
10/01/25(Wed)00:55:45 No.106754614

カガミノコエ 10/01/25(Wed)00:55:45 No.106754614

File: 00019-1540331261.png (1.87 MB, 1536x864)

1.87 MB PNG

forgive him baker, for he knows not what he does.

Anonymous
10/01/25(Wed)00:59:00 No.106754636

Anonymous 10/01/25(Wed)00:59:00 No.106754636

>mfw Resource news

09/30/2025

>Kandinsky 5.0: A family of diffusion models for Video & Image generation
https://github.com/ai-forever/Kandinsky-5

>Wan-Alpha: High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>CharGen: Fast and Fluent Portrait Modification
https://chargen.jdihlmann.com

>Visual Jigsaw Post-Training Improves MLLMs
https://penghao-wu.github.io/visual_jigsaw

>DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space
https://github.com/dc-ai-projects/DC-Gen

>Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
https://github.com/maifoundations/LayerCD

>LayerD: Decomposing Raster Graphic Designs into Layers
https://cyberagentailab.github.io/LayerD

>UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation
https://unilat3d.github.io

>STAGE: Stable and Generalizable GRPO for Autoregressive Image Generation
https://github.com/krennic999/STAGE

>AutoPrune: Each Complexity Deserves a Pruning Policy
https://github.com/AutoLab-SAI-SJTU/AutoPrune

>EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
https://github.com/VectorSpaceLab/EditScore

>GenView++: Unifying Adaptive View Generation and Quality-Driven Supervision for Contrastive Representation Learning
https://github.com/xiaojieli0903/GenViewPlusPlus

>Image MetaHub: Desktop application for browsing, searching, and organizing AI-generated images
https://github.com/LuqP2/Image-MetaHub

>Wan-Alpha; High-Quality Text-to-Video Generation with Alpha Channel
https://donghaotian123.github.io/Wan-Alpha

>California Governor Newsom signs landmark AI safety bill SB 53
https://techcrunch.com/2025/09/29/california-governor-newsom-signs-landmark-ai-safety-bill-sb-53

Anonymous
10/01/25(Wed)00:59:50 No.106754642

Anonymous 10/01/25(Wed)00:59:50 No.106754642

File: GmdeV_37.jpg (335 KB, 2458x2458)

335 KB JPG

Anonymous
10/01/25(Wed)01:00:54 No.106754650

Anonymous 10/01/25(Wed)01:00:54 No.106754650

>mfw Research news

09/30/2025

>Score-based Membership Inference on Diffusion Models
https://arxiv.org/abs/2509.25003

>PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion
https://yuyangyin.github.io/PanoWorld-X

>Scalable GANs with Transformers
https://arxiv.org/abs/2509.24935

>OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
https://arxiv.org/abs/2509.24900

>VAGUEGAN: Stealthy Poisoning and Backdoor Attacks on Image Generative Pipelines
https://arxiv.org/abs/2509.24891

>Training-Free Token Pruning via Zeroth-Order Gradient Estimation in VLMs
https://arxiv.org/abs/2509.24837

>Causal-Adapter: Taming T2I Diffusion for Faithful Counterfactual Generation
https://arxiv.org/abs/2509.24798

>Inducing Dyslexia in VLMs
https://arxiv.org/abs/2509.24597

>SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer
https://arxiv.org/abs/2509.24695

>TokenSwap: Backdoor Attack on the Compositional Understanding of Large VLMs
https://arxiv.org/abs/2509.24566

>Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
https://arxiv.org/abs/2509.24514

>UI2V-Bench: An Understanding-based I2V Generation Benchmark
https://arxiv.org/abs/2509.24427

>CLQ: Cross-Layer Guided Orthogonal-based Quantization for Diffusion Transformers
https://arxiv.org/abs/2509.24416

>TraitSpaces: Towards Interpretable Visual Creativity for Human-AI Co-Creation
https://arxiv.org/abs/2509.24326

>Hyperspherical Latents Improve Continuous-Token Autoregressive Generation
https://arxiv.org/abs/2509.24335

>SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
https://arxiv.org/abs/2509.24299

>Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes
https://johann.wang/Light-SQ

>FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in I2V Generation
https://pku-yuangroup.github.io/FlashI2V

Anonymous
10/01/25(Wed)01:01:55 No.106754656

Anonymous 10/01/25(Wed)01:01:55 No.106754656

>mfw MORE Research news

>DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
https://arxiv.org/abs/2509.25182

>GHOST: Hallucination-Inducing Image Generation for Multimodal LLMs
https://arxiv.org/abs/2509.25178

>Personalized Vision via Visual In-Context Learning
https://yuxinn-j.github.io/projects/PICO

>Aligning Visual Foundation Encoders to Tokenizers for Diffusion Models
https://aligntok.github.io

>Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
https://kunhao-liu.github.io/Rolling_Forcing_Webpage

>Score Distillation of Flow Matching Models
https://arxiv.org/abs/2509.25127

>Not All Tokens are Guided Equal: Improving Guidance in Visual Autoregressive Models
https://arxiv.org/abs/2509.23876

>Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution
https://arxiv.org/abs/2509.23980

>HunyuanImage 3.0 Technical Report
https://arxiv.org/abs/2509.23951

>GANji: A Framework for Introductory AI Image Generation
https://arxiv.org/abs/2509.24128

>Autoregressive Video Generation beyond Next Frames Prediction
https://arxiv.org/abs/2509.24081

>SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
https://arxiv.org/abs/2509.24006

>Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
https://arxiv.org/abs/2509.23919

>Towards Fine-Grained Text-to-3D Quality Assessment: A Benchmark and A Two-Stage Rank-Learning Metric
https://cbysjtu.github.io/Rank2Score

>Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution
https://arxiv.org/abs/2509.23774

>Semantic Editing with Coupled Stochastic Differential Equations
https://arxiv.org/abs/2509.24223

>Asymmetric VAE for One-Step Video Super-Resolution Acceleration
https://arxiv.org/abs/2509.24142

>Origins of Creativity in Attention-Based Diffusion Models
https://arxiv.org/abs/2506.17324

Anonymous
10/01/25(Wed)01:14:54 No.106754737

Anonymous 10/01/25(Wed)01:14:54 No.106754737

File: deUN_cHD_00063_.png (2.12 MB, 1310x1310)

2.12 MB PNG

Anonymous
10/01/25(Wed)01:29:52 No.106754839

Anonymous 10/01/25(Wed)01:29:52 No.106754839

File: PW_143312_.png (2.02 MB, 1280x1800)

2.02 MB PNG

I had to run off for a bit haha
Gonna go to the store too! Brb again

カガミノコエ
10/01/25(Wed)01:30:57 No.106754850

カガミノコエ 10/01/25(Wed)01:30:57 No.106754850

File: 00027-4148845319.png (1.56 MB, 1536x864)

1.56 MB PNG

>>106754839
pick me up a labatt tallboy and a pack of lucky red 100s

Anonymous
10/01/25(Wed)01:35:41 No.106754879

Anonymous 10/01/25(Wed)01:35:41 No.106754879

File: deUN_cHD_00066_.png (2.29 MB, 1310x1310)

2.29 MB PNG

>>106754850
gonna go "grab a pack of smokes" and never come back :(

カガミノコエ
10/01/25(Wed)01:39:31 No.106754905

カガミノコエ 10/01/25(Wed)01:39:31 No.106754905

File: till_death.jpg (372 KB, 1920x1080)

372 KB JPG

till death
https://youtu.be/LQiSEHDRzg8

カガミノコエ
10/01/25(Wed)01:44:27 No.106754941

カガミノコエ 10/01/25(Wed)01:44:27 No.106754941

File: file.png (29 KB, 917x417)

29 KB PNG

i need to be careful, this is the response for that Panic! at the disco rip off i posted the other day. fucking normals.

Anonymous
10/01/25(Wed)01:46:05 No.106754949

Anonymous 10/01/25(Wed)01:46:05 No.106754949

>>106754941
I don't understand.

Anonymous
10/01/25(Wed)01:47:22 No.106754956

Anonymous 10/01/25(Wed)01:47:22 No.106754956

File: deUN_cHD_00067_.png (3.13 MB, 1843x1147)

3.13 MB PNG

>>106754941
line go up in a nice feeling

カガミノコエ
10/01/25(Wed)02:00:03 No.106755031

カガミノコエ 10/01/25(Wed)02:00:03 No.106755031

File: 00024-4148845316.png (1.66 MB, 1536x864)

1.66 MB PNG

>>106754949
it's the view graph for this https://youtu.be/trRoY0ngQjE
you be the judge. it was completely ignored here lmao, which wasn't unexpected.

カガミノコエ
10/01/25(Wed)02:04:41 No.106755062

カガミノコエ 10/01/25(Wed)02:04:41 No.106755062

File: 00025-4148845317.png (1.76 MB, 1536x864)

1.76 MB PNG

Anonymous
10/01/25(Wed)02:06:40 No.106755077

Anonymous 10/01/25(Wed)02:06:40 No.106755077

File: deTF_cHD_00004_.png (3.11 MB, 2081x1190)

3.11 MB PNG

Anonymous
10/01/25(Wed)02:08:01 No.106755087

Anonymous 10/01/25(Wed)02:08:01 No.106755087

>>106755031
I think you might have something here. It's a pretty good song, it has structure and the chorus has the right hooks. I mean it's AI generated but you made a hit song of sorts.

Anonymous
10/01/25(Wed)02:12:40 No.106755101

Anonymous 10/01/25(Wed)02:12:40 No.106755101

File: PW_143404_.png (2.94 MB, 1280x1800)

2.94 MB PNG

>>106754850
You got it!
>>106754905
I love this song! I've listened to it quite a few times hahaha
>>106755031
I like it!

カガミノコエ
10/01/25(Wed)02:14:06 No.106755108

カガミノコエ 10/01/25(Wed)02:14:06 No.106755108

File: 00035-1246292130.png (2.62 MB, 1536x864)

2.62 MB PNG

>>106755087
suno excels at making "the median song," like it will never do anything really great but with some work it makes some pretty listenable tracks.

Anonymous
10/01/25(Wed)02:16:29 No.106755118

Anonymous 10/01/25(Wed)02:16:29 No.106755118

>>106755108
Yeah this is what people like - it's catchy, somewhat progressive but still not too complicated and has a clear structure.
You could use Ableton or Cubase (or whatever) to edit these and switch around parts etc..

カガミノコエ
10/01/25(Wed)02:22:24 No.106755144

カガミノコエ 10/01/25(Wed)02:22:24 No.106755144

File: 00037-3392621873.png (2.01 MB, 1536x864)

2.01 MB PNG

>>106755118
supposedly the new studio thing lets you do that, if they're they're still doing the sale when i run out of credits i might just do upgrade. pw has access to it, idk if he's done anything yet (he hasn't). i do know how to use FL (more less), supposedly you can spit a song out as midi but idk what it is, can't imagine it's got a full suite of effects and vsts and automation, but who knows.

Anonymous
10/01/25(Wed)02:22:45 No.106755146

Anonymous 10/01/25(Wed)02:22:45 No.106755146

File: deTF_cHD_00007_.png (3 MB, 2081x1190)

3 MB PNG

Anonymous
10/01/25(Wed)02:29:06 No.106755166

Anonymous 10/01/25(Wed)02:29:06 No.106755166

File: PW_143414_.png (2.88 MB, 1280x1800)

2.88 MB PNG

>>106755144
The new studio thing is fun!
I've done a couple songs LOL but nothing I was gonna share yet haha mostly testing it out
I wanna make something really good and new in v5 first then really get into it!

カガミノコエ
10/01/25(Wed)02:30:08 No.106755171

カガミノコエ 10/01/25(Wed)02:30:08 No.106755171

File: curse_night.jpg (352 KB, 1920x1080)

352 KB JPG

WHAT A TERRIBLE NIGHT TO HAVE A CURSE!

curse night
https://suno.com/s/075oGhgpHlRgqoBI

Anonymous
10/01/25(Wed)02:30:21 No.106755172

Anonymous 10/01/25(Wed)02:30:21 No.106755172

File: BL_SEK_BURG02.jpg (1.1 MB, 4608x3584)

1.1 MB JPG

>>106755166
>>106755146
Hey a twofer!. What's good?

Anonymous
10/01/25(Wed)02:31:01 No.106755177

Anonymous 10/01/25(Wed)02:31:01 No.106755177

>>106755144
I would find out how to generate individual audio tracks, then produce something on my own in a proper software. Then make a demo.
It's a long path of course especially if you are totally new.

Anonymous
10/01/25(Wed)02:31:38 No.106755184

Anonymous 10/01/25(Wed)02:31:38 No.106755184

File: deTF_cHD_00012_.png (3.13 MB, 2081x1190)

3.13 MB PNG

>>106755172
hey, sorry I missed you earlier today. you caught me during lunch. unusual time for you to pop in!

Anonymous
10/01/25(Wed)02:35:54 No.106755202

Anonymous 10/01/25(Wed)02:35:54 No.106755202

File: JI_SE_LAYS_8.jpg (222 KB, 2048x2048)

222 KB JPG

>>106755184
Hey man its no biggie. What's up?

カガミノコエ
10/01/25(Wed)02:37:11 No.106755209

カガミノコエ 10/01/25(Wed)02:37:11 No.106755209

File: 00041-3940773168.png (2.32 MB, 1536x864)

2.32 MB PNG

>>106755177
i can operate FL with some proficiency, my problem was always coming up with melody and variety. i'll give it a whirl at some point. i could see it being great for bolting vocals on a track tho. they need more voices, not sure how it works exactly because they do have some variety but not a lot

Anonymous
10/01/25(Wed)02:46:29 No.106755253

Anonymous 10/01/25(Wed)02:46:29 No.106755253

File: PW_143443_.png (2.73 MB, 1280x1800)

2.73 MB PNG

>>106755171
Nice!
>>106755172
Heyyy! It's great to see you again :]
Just relaxing at home! It was a great relaxing day haha
How are you?

Anonymous
10/01/25(Wed)02:47:26 No.106755258

Anonymous 10/01/25(Wed)02:47:26 No.106755258

File: RL_SE_CT_COOK_2.jpg (1.01 MB, 4096x3940)

1.01 MB JPG

>>106755253
Sounds nice. You had off work I suppose?

カガミノコエ
10/01/25(Wed)02:52:55 No.106755289

カガミノコエ 10/01/25(Wed)02:52:55 No.106755289

File: 57906.png (3.52 MB, 1440x3120)

3.52 MB PNG

>>106755253
thinking about doing a series of halloween stuff, not sure when to start. last year was fun

カガミノコエ
10/01/25(Wed)03:06:59 No.106755368

カガミノコエ 10/01/25(Wed)03:06:59 No.106755368

File: 57914.png (3.81 MB, 1440x3120)

3.81 MB PNG

while we're old posting, let me just say

カガミノコエ
10/01/25(Wed)03:10:02 No.106755380

カガミノコエ 10/01/25(Wed)03:10:02 No.106755380

File: 57840.png (3.05 MB, 1440x3120)

3.05 MB PNG

カガミノコエ
10/01/25(Wed)03:14:07 No.106755399

カガミノコエ 10/01/25(Wed)03:14:07 No.106755399

File: 57852.jpg (328 KB, 1440x3120)

328 KB JPG

we used to be a real country

カガミノコエ
10/01/25(Wed)03:20:31 No.106755445

カガミノコエ 10/01/25(Wed)03:20:31 No.106755445

File: 15297-1760922233.png (961 KB, 576x1248)

961 KB PNG

Anonymous
10/01/25(Wed)03:22:44 No.106755465

Anonymous 10/01/25(Wed)03:22:44 No.106755465

File: PQ_SE_N_SODA_5.jpg (169 KB, 1792x2304)

169 KB JPG

カガミノコエ
10/01/25(Wed)03:23:03 No.106755466

カガミノコエ 10/01/25(Wed)03:23:03 No.106755466

File: 00064-3909172629.png (1.84 MB, 1536x864)

1.84 MB PNG

Anonymous
10/01/25(Wed)03:24:25 No.106755474

Anonymous 10/01/25(Wed)03:24:25 No.106755474

Maintain thread quality
https://rentry.org/debo

カガミノコエ
10/01/25(Wed)03:25:16 No.106755480

カガミノコエ 10/01/25(Wed)03:25:16 No.106755480

File: 00069-4227813072.png (2 MB, 1536x864)

2 MB PNG

カガミノコエ
10/01/25(Wed)03:28:15 No.106755496

カガミノコエ 10/01/25(Wed)03:28:15 No.106755496

File: 00073-1993612533.png (1.79 MB, 1536x864)

1.79 MB PNG

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.

🎉 Happy Birthday 4chan! 🎉