Previous /sdg/ thread : >>108377950>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicStability Matrix: https://github.com/LykosAI/StabilityMatrix>Z-Imagehttps://comfyanonymous.github.io/ComfyUI_examples/z_imagehttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux.2 Dev/Kleinhttps://comfyanonymous.github.io/ComfyUI_examples/flux2https://huggingface.co/black-forest-labs/FLUX.2-devhttps://huggingface.co/black-forest-labs/FLUX.2-klein-4Bhttps://huggingface.co/black-forest-labs/FLUX.2-klein-9B>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Animahttps://huggingface.co/circlestone-labs/Anima>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Image>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2>Models, LoRAs & upscalinghttps://civitai.comhttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/aco/sdg>>>/b/degen>>>/d/ddg>>>/e/edg>>>/gif/vdg>>>/h/hdg>>>/r/realistic+parody>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vp/napt>>>/vt/vtaiOP https://rentry.co/twkuk8tz
>mfw Resource news03/18/2026>Faster Inference of Flow-Based Generative Models via Improved Data-Noise Couplinghttps://github.com/araachie/loom-cfm>Flash-Unified: Training-Free and Task-Aware Acceleration Framework for Native Unified Modelshttps://github.com/Rirayh/FlashU>ViFeEdit: Video-Free Tuner of Your Video DiThttps://github.com/Lexie-YU/ViFeEdit>SegviGen: Repurposing 3D Generative Model for Part Segmentationhttps://fenghora.github.io/SegviGen-Page>W2T: LoRA Weights Already Know What They Can Dohttps://github.com/xiaolonghan2000/Weight2Token>V-Co: Closer Look at Visual Representation Alignment via Co-Denoisinghttps://github.com/HL-hanlin/V-Co>HeBA: Heterogeneous Bottleneck Adapters for Robust VLMshttps://github.com/Jahid12012021/VLM-HeBA>Parallel In-context Learning for LVLMshttps://github.com/yshinya6/parallel-icl>GDPO-SR: Group Direct Preference Optimization for One-Step SRhttps://github.com/Joyies/GDPO>REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Modelshttps://github.com/Imfatnoily/REFORGE>Mixture of Style Experts for Diverse Image Stylizationhttps://hh-lg.github.io/StyleExpert-Page>GlyphPrinter: Region-Grouped DPO for Glyph-Accurate Visual Text Renderinghttps://henghuiding.com/GlyphPrinter>PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Spacehttps://github.com/ryutaroLF/PureCLIP-Depth>Kimodo: Scaling Controllable Human Motion Generationhttps://research.nvidia.com/labs/sil/projects/kimodo>Learning through Creation: Hash-Free Framework for On-the-Fly Category Discoveryhttps://github.com/brandinzhang/LTC>Vlo: Local open-source video editor with ComfyUI-backendhttps://github.com/PxTicks/vlo>ComfyUI-LCS: Training-free color control via Latent Color Subspacehttps://github.com/facok/ComfyUI-LCS>FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidancehttps://quanhaol.github.io/flashmotion-site
>mfw Research news03/18/2026>Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motionhttps://zhouzhenghong-gt.github.io/Tri-Prompting-Page>Diffusion Models for Joint Audio-Video Generationhttps://arxiv.org/abs/2603.16093>When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systemshttps://arxiv.org/abs/2603.16134>VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignmenthttps://arxiv.org/abs/2603.16271>SNCE: Geometry-Aware Supervision for Scalable Discrete Image Generationhttps://arxiv.org/abs/2603.15150>LibraGen: Playing a Balance Game in Subject-Driven Video Generationhttps://arxiv.org/abs/2603.13506>Generative Video Compression with One-Dimensional Latent Representationhttps://gvc1d.github.io>Semantic One-Dimensional Tokenizer for Image Reconstruction and Generationhttps://arxiv.org/abs/2603.16373>Unlearning for One-Step Generative Models via Unbalanced Optimal Transporthttps://arxiv.org/abs/2603.16489>Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generationhttps://arxiv.org/abs/2603.16211>Feed-forward Gaussian Registration for Head Avatar Creation and Editinghttps://malteprinzler.github.io/projects/match>Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Samplinghttps://arxiv.org/abs/2603.16797>WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representationhttps://cvlab-kaist.github.io/WorldCam>Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-traininghttps://arxiv.org/abs/2603.16139>Visual Prompt Discovery via Semantic Explorationhttps://arxiv.org/abs/2603.16250>Grounding World Simulation Models in a Real-World Metropolishttps://seoul-world-model.github.io>Interact3D: Compositional 3D Generation of Interactive Objectshttps://arxiv.org/abs/2603.16085
>mfw MORE Research news>SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagationhttps://arxiv.org/abs/2603.16864>LICA: Layered Image Composition Annotations for Graphic Design Researchhttps://arxiv.org/abs/2603.16098>Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Modelshttps://arxiv.org/abs/2603.16001>AnyCrowd: Instance-Isolated Identity-Pose Binding for Arbitrary Multi-Character Animationhttps://arxiv.org/abs/2603.15415>Persistent Story World Simulation with Continuous Character Customizationhttps://arxiv.org/abs/2603.16285>Next-Frame Decoding for Ultra-Low-Bitrate Image Compression with Video Diffusion Priorshttps://arxiv.org/abs/2603.15129>Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigationhttps://arxiv.org/abs/2603.16284>WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigationhttps://arxiv.org/abs/2603.15132>Reevaluating the Intra-Modal Misalignment Hypothesis in CLIPhttps://arxiv.org/abs/2603.16100>AI Application Benchmarking: Power-Aware Performance Analysis for Vision and Language Modelshttps://arxiv.org/abs/2603.16164>Real-Time Human Frontal View Synthesis from a Single Imagehttps://arxiv.org/abs/2603.15433>HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenizationhttps://arxiv.org/abs/2603.15228>Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Modelshttps://arxiv.org/abs/2603.15557>AI-Generated Figures in Academic Publishing: Policies, Tools, and Practical Guidelineshttps://arxiv.org/abs/2603.16159>Directional Embedding Smoothing for Robust Vision Language Modelshttps://arxiv.org/abs/2603.15259>What DINO saw: ALiBi positional encoding reduces positional bias in Vision Transformershttps://arxiv.org/abs/2603.16840
left or right?i prefer left, but right has some subtleties that are nicer (like hand pose)
>>108401969the details on the left are more crisp and have move character, overall more life and vibrance. right is kinda washed out and 'airbrushed', though I do like the tighter color range more (and the hands are nicer too but maybe thats just random)
>>108402148nice
jeet thread
a surprisingly accurate globe from zit, except i think antarctica blewed up but oh well, we didn't need it anyway
>>108402053yah i couldnt fix the airbrushed look on that method, ah well
>>108402776>accurate>erased the roman empire
>>108402776>antarctica blewed upomg the secret world government activated the alien pyramid
>>108401870>>Kimodo: Scaling Controllable Human Motion Generation>https://research.nvidia.com/labs/sil/projects/kimodoHas anyone tried using this?
>>108402776Cool.
>>108402994be the pioneer in that type of model, anon
>>108403038>Amish Barn Rumble 2026Damn I missed the event.
in the grim darkness of the far future, there are only 1girls
>>108403156could be worse
i feel like zimage forces certain composition layouts with certain keywords no matter what (and i'm not referring to the seed variance thign which is something different). it's hard to explain it's like a visual balance within the gen. klein does it a bit too, but not as much as zimage seems to. flux and chroma did not. it's possibly somethign to do with distillation i suppose.
>>108403235i get the same kind of thing. it also has a small stock of representations for a bunch of stuff, so "mask" always comes out like in pic rel. i see a lot of repetitive stuff that never gets posted. poses and stuff too. like it starts out as wildcards, sure, but there's no way kimi-k2 is that stable with its rewording of stuff so it boils down to whatever weird constrained latent space from distilling imo
>>108403262yah but with models like chroma or flux dedistilled it was possible to just push the model by messing with the sigmas and whatnot into doing more or less what you want. with these you have to either use excessive +1.5 strength or multipliers to push it or it doesnt work at all, and even then it breaks more often than not. on the one hand these models do well enough out of the box but it just feels constrained .
LOL
great, now i have to train a qwen-2512 lora. i just heard about this model>>108403773nice
>>108403910i mean you don't *have* to unless...
>>108403948What's the deal here?
>now i have to train a qwen-2512 lora. i just heard about this modeljesus are /sdg/ anons behind
>>108403987good question..."Esoteric librarian’s dream: Manly P. Hall’s sacred geometry, gilded zodiac wheels, emerald tablets, candlelit marble halls, occult frescoes, celestial Atlantean sigils, ivory robes, starlit domes, chiaroscuro mysticism, ultra-crisp gothic detailing, 8K visionary realism."[mode:Voynich|whispered herbal cipher, looping marginalia, star-wheels, asemic glyphs]Illuminated parchment folio, a full-foliage dame of generous circumference seated in lotus above the page, tresses like ink poured through water curtaining a single revealed eye whose iris swirls with the center of a spiraling sigil hidden within a meadow of whispering reeds, her mien rapt in cogitation; she is mantled in a short cote-hardie of midnight silk, lacings slack to the waist, slashed sleeves baring soft under-glow of skin yet no nakedness, hems edged with tiny bell-fruit seeds that chime without wind; dramatized by chiaroscuro, moon-cinders stripe her curves while lanterned fireflies orbit; behind, distant edges softened by haze around weathered signs—rotted stelae whose runes have migrated onto her hem; ringing vibration marks around hanging bells with reflective liquid surfaces ripple outward like concentric star diagrams, each droplet a microcosm of unknown constellations, safe for work, no nudity.(sfw: 适合工作场合观看,无裸露画面,无文字。)
>>108404010sorry i dont spend all day filtering through neverending drama just to get the latest flavor of the month
>>108404034was more a joke about the "news" here but you do you sis
>op pic deletedbased thank you jannies
>>108404062>I was only pretending
>>108404076wait till you hear about other models people use /sdg/ anonie
>chud leechun lee but with chud face. why hasnt anyone done this?
>>108404070what was wrong with it?
>>108404017Interesting, thank you.
>>108404117it was the op pic of a literal shithole
lolwut
>>108404162he just wants to show you his mishappen children bro, relax