Previous /sdg/ thread : >>108493483

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
Shithole general
>>108501784
And you are still here, curious.
Morning anons
>>108502001
morning. never forget
>gm
>mfw Resource news

04/01/2026
>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite
>MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
https://vcbsl.github.io/MMFace-DiT
>Hallucination-aware intermediate representation edit in LVLMs
https://github.com/ASGO-MM/HIRE
>CutClaw: Agentic Hours-Long Video Editing via Music Synchronization
https://github.com/GVCLab/CutClaw
>Extend3D: Town-Scale 3D Generation
http://seungwoo-yoon.github.io/extend3d-page
>PixlStash 1.0.0 release candidate
https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0rc3
>adetailer-hires-sync: Automatically enables ADetailer in Forge
https://github.com/KazeKaze93/adetailer-hires-sync

03/31/2026
>See-through: Single-image Layer Decomposition for Anime Characters
https://github.com/shitagaki-lab/see-through
>VRAM Pager: Compressed GPU Memory Paging for Diffusion & Video Models
https://github.com/willjriley/vram-pager
>TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark
https://github.com/IDLabMedia/tgif-dataset
>Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
https://differential-query-painter.github.io/DQ-painter
>Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting
https://github.com/aSleepyTree/Drift-AR
>INSID3: Training-Free In-Context Segmentation with DINOv3
https://visinf.github.io/INSID3
>OmniColor: Unified Framework for Multi-modal Lineart Colorization
https://github.com/zhangxulu1996/OmniColor
>Gen-Searcher: Reinforcing Agentic Search for Image Generation
https://gen-searcher.vercel.app
>V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video LLMs
https://github.com/xinyouu/V-CAST
>GEMS: Agent-Native Multimodal Generation with Memory and Skills
https://gems-gen.github.io
>RAWIC: Bit-Depth Adaptive Lossless Raw Image Compression
https://github.com/chunbaobao/RAWIC
>mfw Research news

04/01/2026
>Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge
https://arxiv.org/abs/2603.29535
>SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
https://arxiv.org/abs/2603.29186
>Abstraction in Style
https://arxiv.org/abs/2603.29924
>Stepper: Stepwise Immersive Scene Generation with Multiview Panoramas
https://fwmb.github.io/stepper
>Gloria: Consistent Character Video Generation via Content Anchors
https://yyvhang.github.io/Gloria_Page
>PromptForge-350k: Large-Scale Dataset and Contrastive Framework for AI Image Forgery Localization
https://arxiv.org/abs/2603.29386
>MEDiC: Multi-objective Exploration of Distillation from CLIP
https://arxiv.org/abs/2603.29009
>Multi-Feature Fusion Approach for Generative AI Images Detection
https://arxiv.org/abs/2603.29788
>CIPHER: Counterfeit Image Pattern High-level Examination via Representation
https://arxiv.org/abs/2603.29356
>MacTok: Robust Continuous Tokenization for ImgGen
https://arxiv.org/abs/2603.29634
>Diffusion Mental Averages
https://diffusion-mental-averages.github.io
>Unify-Agent: Unified Multimodal Agent for World-Grounded Image Synthesis
https://arxiv.org/abs/2603.29620
>SHIFT: Stochastic Hidden-Trajectory Deflection for Removing Diffusion-based Watermark
https://arxiv.org/abs/2603.29742
>Unbiased Model Prediction Without Using Protected Attribute Information
https://arxiv.org/abs/2603.29270
>Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding
https://arxiv.org/abs/2603.29258
>MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines
https://ryanpo.com/multigen
>Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models
https://arxiv.org/abs/2602.15772
>When Test-Time Guidance Is Enough: Fast Image/Video Editing with Diffusion Guidance
https://arxiv.org/abs/2602.14157
>mfw YESTERDAY's Research news

03/31/2026
>On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
https://contextual-repulsion.github.io
>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite
>ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
https://arxiv.org/abs/2603.27862
>EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation
https://arxiv.org/abs/2603.28405
>TokenDial: Continuous Attribute Control in Text-to-Video via Spatiotemporal Token Offsets
https://tokendial.github.io
>Attention Frequency Modulation: Training-Free Spectral Modulation of Diffusion Cross-Attention
https://arxiv.org/abs/2603.28114
>Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection
https://arxiv.org/abs/2603.27800
>MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation
https://arxiv.org/abs/2603.27959
>Rethinking Structure Preservation in Text-Guided Image Editing with Visual Autoregressive Models
https://arxiv.org/abs/2603.28367
>OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
https://arxiv.org/abs/2603.27637
>Customized Visual Storytelling with Unified Multimodal LLMs
https://arxiv.org/abs/2603.27690
>GEditBench v2: A Human-Aligned Benchmark for General Image Editing
https://arxiv.org/abs/2603.28547
>Inference-time Trajectory Optimization for Manga Image Editing
https://arxiv.org/abs/2603.27790
>Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment
https://arxiv.org/abs/2603.27987
>SonoWorld: From One Image to a 3D Audio-Visual Scene
https://humathe.github.io/sonoworld
>CoPE-VideoLM: Leveraging Codec Primitives For Efficient Video Language Modeling
https://microsoft.github.io/CoPE
>>108502481
>adetailer-hires-sync: Automatically enables ADetailer in Forge
>Manually toggling the checkbox each time is friction.
>This extension hooks into the hires fix button and manages the ADetailer checkbox automatically:
checking a box is too much for some people
>>108502532
lol I wasn't sure if there was more to it or not. it's rare for me to find anything to give to the forge folks though so I included it
gm
>>108502778
gm