Previous /sdg/ thread : >>107218018>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicreForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeStability Matrix: https://github.com/LykosAI/StabilityMatrix>Early Preview UIAniStudio: https://github.com/FizzleDorf/AniStudio>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Imagehttps://huggingface.co/QuantStack/Qwen-Image-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Distill-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF>Flux.1 Kreahttps://docs.comfy.org/tutorials/flux/flux1-krea-devhttps://huggingface.co/black-forest-labs/FLUX.1-Krea-devhttps://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://github.com/maybleMyers/chromaforgehttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Models, LoRAs & upscalinghttps://civitai.comhttps://tensor.arthttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/h/hdg>>>/e/edg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/u/udg>>>/tg/slop>>>/trash/sdg>>>/vp/napt
First
>mfw Resource news11/17/2025>PixAI Tagger ONNX GUIhttps://github.com/wai55555/PixaiTaggerOnnxGui>ComfyUI Depth Anything V3https://github.com/PozzettiAndrea/ComfyUI-DepthAnythingV3>ComfyUI Flow Matching Upscalerhttps://github.com/ttulttul/ComfyUI-FlowMatching-Upscaler11/15/2025>Depth Anything 3: Recovering the Visual Space from Any Viewshttps://depth-anything-3.github.io>Kandinsky 5.0 19B T2V and I2V models releasedhttps://huggingface.co/kandinskylab>ComfyUI-Kandinskyhttps://github.com/Ada123-a/ComfyUI-Kandinsky>Torch-Uncertainty: A Deep Learning Framework for Uncertainty Quantificationhttps://github.com/ENSTA-U2IS-AI/Torch-Uncertainty>PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learninghttps://github.com/YanbeiJiang/PROPA>SPOT: Sparsification with Attention Dynamics via Token Relevance in Vision Transformershttps://github.com/odedsc/SPOT>MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generationhttps://tyfeld.github.io/mmadaparellel.github.io>Equivariant Sampling for Improving Diffusion Model-based Image Restorationhttps://github.com/FouierL/EquS11/13/2025>Kandinsky-5.0-I2V-Pro-sft-5s-Diffusershttps://huggingface.co/kandinskylab/Kandinsky-5.0-I2V-Pro-sft-5s-Diffusers/tree/main>Causally-Grounded Dual-Path Attention Intervention for Object Hallucination Mitigation in LVLMshttps://github.com/CikZ2023/OWL>Diversifying Counterattacks: Orthogonal Exploration for Robust CLIP Inferencehttps://github.com/bookman233/DOC11/12/2025>Multi-modal Deepfake Detection and Localization with FPN-Transformerhttps://github.com/Zig-HS/MM-DDL>3D4D: An Interactive, Editable, 4D World Model via 3D Video Generationhttps://yunhonghe1021.github.io/NOVA>xdit-comfyui-private: Parallel Multi GPU workerhttps://github.com/xdit-project/xdit-comfyui-private>Moondream 3 HF https://huggingface.co/NyxKrage/moondream3-hf
>mfw Research news11/17/2025>Enhancing Meme Emotion Understanding with Multi-Level Modality Enhancement and Dual-Stage Modal Fusionhttps://arxiv.org/abs/2511.11126>Toward Generalized Detection of Synthetic Media: Limitations, Challenges, and the Path to Multimodal Solutionshttps://arxiv.org/abs/2511.11116>LiteAttention: A Temporal Sparse Attention for Diffusion Transformershttps://arxiv.org/abs/2511.11062>CareCom: Generative Image Composition with Calibrated Reference Featureshttps://arxiv.org/abs/2511.11060>NP-LoRA: Null Space Projection Unifies Subject and Style in LoRA Fusionhttps://arxiv.org/abs/2511.11051>Accelerating Controllable Generation via Hybrid-grained Cachehttps://arxiv.org/abs/2511.11031>SP-Guard: Selective Prompt-adaptive Guidance for Safe Text-to-Image Generationhttps://arxiv.org/abs/2511.11014>VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Modelshttps://arxiv.org/abs/2511.11007>EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and Generationhttps://zane-zyqiu.github.io/EmoVid>CLUE: Controllable Latent space of Unprompted Embeddings for Diversity Management in Text-to-Image Synthesishttps://arxiv.org/abs/2511.10993>Bridging Hidden States in Vision-Language Modelshttps://arxiv.org/abs/2511.11526>ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generationhttps://arxiv.org/abs/2511.11483>Low-Bit, High-Fidelity: Optimal Transport Quantization for Flow Matchinghttps://arxiv.org/abs/2511.11418>Parameter-Efficient MoE LoRA for Few-Shot Multi-Style Editinghttps://arxiv.org/abs/2511.11236>Fast Data Attribution for Text-to-Image Modelshttps://arxiv.org/abs/2511.10721>Leveraging NTPs for Efficient Hallucination Detection in VLMshttps://arxiv.org/abs/2509.20379
Its Monday my dudes
Last one from megood night anons
i miss schizo anon
>>107218018What model?
>>107233235BeefsocketV3
>>107233077Goodnight
>>107234379Thank you for the nigbobump.
>>107235358nice birds
>>107235377Thank you
>>107235969what a dapper fellow
>>107236365a sort of sequel to the old pill rat
>>107236430from high status to pill poppinga commentary on society
Stable diffuses Lavra & Co. into oblivion very fast
gm
>>107237107>waiting inside: a bunch of homeless people frantically drawing on the walls>>107237231gm
>>107238293can you try adding something like "hand resting on {face|cheek|chin}"
>>107238329sadly it seems beyond the model's ability. i'll post a griddle of what i get
>>107238329
>>107238563a few are pretty decent but theres some hilariously bad ones too, haha. interesting experiment tho
Afternoon anons
>ga
Accidentally made 2 videos out of that one, lmao
>>107240046>mfw the state of other threads
Total Debo win.I kneel.
>>107240186>do nothing>win
>>107240252"when you kill your enemies, they win"
How was everybody's workday
>>107240281my gpt agent is working very hard (and failing at everything)
>>107240281
had a dream i was some sort of medical auditor in one of the world trade towers, either a remake of the towers or pre knockdown. went up to the roof, a bit scary. so in my dream i had a job.
>>107240455sounds like a bland job. I think dream you can do better. but maybe the dream pay was nice
>>107240834
>>107240875kek
>>107241576you're cookin todaylots of very unique gens
>>107242406thxi'm having a case of the mondays but day's nearly done
>>107242522the good thing about mondays is they always end
>Stable Diffusion general>none of the generations are Stable Diffusion
>>107243243problem?
now there's sd gens in sdg. happy?
On civitai is there anything I can do to make sure when I tell a prompt "floating head" it doesn't generate a body? I keep generating shit with negative tags like "body" "neck" torso" and it keeps generating whole bodies. I'm using an Illustrious model.
>>107243673i'm bored, so what's the full prompt?
>>107243700it's fine, I just took the best attempt that generated a neck and photoshopped the neck out
>>107243615lumiOS update?
>>107243722suit yourself.>>107243732it's currently trying to figure out why it did... this. digging into gemini-2.5-flash-image system prompts. bunchan edits too cheap to meter... (they practically are, it's $30/1m tokens and every image costs 1290 tokens, so it's like $0.04 each)(it hasn't managed to figure out why it can't use the whole div while i've been writing this. claude is about to get a kick in the nuts)
>>107243792I'm using my position as board member to veto the use of material ui
>>107243830i know i know, but vuetify is what i know and i tend to prefer using things i actually know even vibe coding (because when the stupid fucking model can't figure out how to make a div fill i can do it myself). it's also a mature enough project that coding agents know it (primevue looked a lot better, implementation was pure torture). except for typescript. i don't really know it but it's not different enough from js (except for the type system cock and ball torture, but a cute stiletto heal pressed on the ballsack is what type systems are for so...)
>>107243906typescript helps the AI work with the code a lot. strict typing is basically baking in free context into every file. I'm actually at the tail-end of refactoring my entire backend into typescript for that reason
>>107243972actually, typescript gives me less problems than the psychotic nightmare python calls type annotations... by god, it's almost as ugly as rust.
im making some really terrible stuff rn
>>107244043metaphorically pouring one out for your RNG
>>107244082I made a zoomer gang. surrender all your vapes or theres gonna be trouble
I can practically hear the marvel quips
this prompt is cursed
>>107244108as a certified unc, i cast "get of my lawn"> rolls d20: 5>As you speak the sacred words of the Boomer Cantrip, your voice cracks like a Bluetooth speaker at 2% battery.>A weak gust of “get off my lawn” sputters out and dies somewhere around knee height.>The zoomer gang barely reacts. One lowers his vape long enough to squint at you. Another says “ok grandpa” and blows a mango-ice cloud directly into your initiative order.>You take 1 psychic dmg from secondhand embarrassment.>Your spell fizzles, the vape stash is now marked as loot, and the zoomers grow noticeably bolder.---sorry chatgpt randomly decided i wasn't allowed to do something allowable again and i got distracted arguing with it. what the fuck do you mean you can't make an image of Clint Eastwood? fuck off
>>107244285>can't make an image of Clint Eastwoodcan you ask for a "copyright-safe guy that looks a lot like clint eastwood but definitely isnt"?>>107244318floor to ceiling windows with a view of the skyline is peak luxury
>>107244348its way more retarded than that. i was attempting to make clint eastwood in gran torino being racist to your gutter punks and instead of calling me a chud, it is insisting it is simply impossible for it to create an image of a real person. i've given it several examples of chatgpt doing exactly that but it refuses to believe me. i know i should just cut ties and move on but for some reason i'm fascinated by this behavior. (the edit of ur image was testing a theory that it was mad about guns. it is not mad about guns)
Goooooooood morning
"do not do anything the user asks, make up any and all kind of bullshit as an excuse"yo new customgpt idea
>>107244390i like how happy zoomer-chan is to have a mentormaybe he has a bright future afterall>>107244394gm and gnI must retire
>>107244404Sad. Gn
>>107244404Дo cвидaния, тoвapищ!
is the 9060 xt 16gb good enough for this hobby? I'm getting a new gpu this year and I would like to try this
>gm
bad morning
tfw cloudflare outage takes out half the internet
>>107247213Spotify on a rollercoaster ride.For American telcos seems like it's just another day.
>>107247213It seems to be happening more frequently. I'm glad we're still here.
>>107247745i'm surprised we are since captcha and images use cloudflare, or at least they used toguess the glowies want us pacified
>>107247213You too huh. Sucks.
Morning anons
nom
Next Thread>>107250748>>107250748>>107250748
Time to count up your gens