Previous /sdg/ thread : >>107201522>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicreForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeStability Matrix: https://github.com/LykosAI/StabilityMatrix>Early Preview UIAniStudio: https://github.com/FizzleDorf/AniStudio>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Imagehttps://huggingface.co/QuantStack/Qwen-Image-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Distill-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF>Flux.1 Kreahttps://docs.comfy.org/tutorials/flux/flux1-krea-devhttps://huggingface.co/black-forest-labs/FLUX.1-Krea-devhttps://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://github.com/maybleMyers/chromaforgehttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Models, LoRAs & upscalinghttps://civitai.comhttps://tensor.arthttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/h/hdg>>>/e/edg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/u/udg>>>/tg/slop>>>/trash/sdg>>>/vp/napt
>mfw Resource news11/15/2025>Depth Anything 3: Recovering the Visual Space from Any Viewshttps://depth-anything-3.github.io>Kandinsky 5.0 19B T2V and I2V models releasedhttps://huggingface.co/kandinskylab>ComfyUI-Kandinskyhttps://github.com/Ada123-a/ComfyUI-Kandinsky>Torch-Uncertainty: A Deep Learning Framework for Uncertainty Quantificationhttps://github.com/ENSTA-U2IS-AI/Torch-Uncertainty>PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learninghttps://github.com/YanbeiJiang/PROPA>SPOT: Sparsification with Attention Dynamics via Token Relevance in Vision Transformershttps://github.com/odedsc/SPOT>MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generationhttps://tyfeld.github.io/mmadaparellel.github.io>Equivariant Sampling for Improving Diffusion Model-based Image Restorationhttps://github.com/FouierL/EquS11/13/2025>Kandinsky-5.0-I2V-Pro-sft-5s-Diffusershttps://huggingface.co/kandinskylab/Kandinsky-5.0-I2V-Pro-sft-5s-Diffusers/tree/main>Causally-Grounded Dual-Path Attention Intervention for Object Hallucination Mitigation in LVLMshttps://github.com/CikZ2023/OWL>Diversifying Counterattacks: Orthogonal Exploration for Robust CLIP Inferencehttps://github.com/bookman233/DOC11/12/2025>Multi-modal Deepfake Detection and Localization with FPN-Transformerhttps://github.com/Zig-HS/MM-DDL>3D4D: An Interactive, Editable, 4D World Model via 3D Video Generationhttps://yunhonghe1021.github.io/NOVA>xdit-comfyui-private: Parallel Multi GPU workerhttps://github.com/xdit-project/xdit-comfyui-private>Moondream 3 HF https://huggingface.co/NyxKrage/moondream3-hf11/11/2025>StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generationhttp://streamdiffusionv2.github.io>Turbo-DDCM: Fast and Flexible Zero-Shot Diffusion-Based Image Compressionhttps://amitvaisman.github.io/turbo_ddcm
>mfw Research news11/15/2025>Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluationhttps://arxiv.org/abs/2511.10547>Intrinsic Dimensionality as a Model-Free Measure of Class Imbalancehttps://arxiv.org/abs/2511.10475>One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Modelshttps://arxiv.org/abs/2511.10629>Adaptive Residual-Update Steering for Low-Overhead Hallucination Mitigation in Large Vision Language Modelshttps://arxiv.org/abs/2511.10292>GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrievalhttps://arxiv.org/abs/2511.10154>Right Looks, Wrong Reasons: Compositional Fidelity in Text-to-Image Generationhttps://arxiv.org/abs/2511.10136>How does My Model Fail? Automatic Identification and Interpretation of Physical Plausibility Failure Modes with Matryoshka Transcodershttps://arxiv.org/abs/2511.10094>Image Aesthetic Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performancehttps://arxiv.org/abs/2511.10055>LampQ: Towards Accurate Layer-wise Mixed Precision Quantization for Vision Transformershttps://arxiv.org/abs/2511.10004>Difference Vector Equalization for Robust Fine-tuning of Vision-Language Modelshttps://arxiv.org/abs/2511.09973>A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Spacehttps://arxiv.org/abs/2511.10555>Fragile by Design: On the Limits of Adversarial Defenses in Personalized Generationhttps://arxiv.org/abs/2511.10382>PRISM: Diversifying Dataset Distillation by Decoupling Architectural Priorshttps://arxiv.org/abs/2511.09905>Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Modelshttps://arxiv.org/abs/2511.09809>SliderEdit: Continuous Image Editing with Fine-Grained Instruction Controlhttps://arxiv.org/abs/2511.09715
>>107218285based faceless contributor
idk why my fitting looks so wacky
erm
>>107218859
mfw
>>107219355
>>107219410dope album cover>>107219430eerie
>>107219454Thanks this is my first time getting into Stable Diffusion I made a python application to make this, I was curious if any of you know a better way to do what I'm doing than to do all of the work locally, any free cloud based options so I can communicate with a cloud which does the heavy lifting so I can generate HD images quicker? My pc cannot handle 1080p or higher, I am upscaling my images at the end to be 4k but they are originally 512x512, I'm trying out 720p in pic related will post result when it's through rendering
>>107219475Forgot to post picture of the software
>>107219475you can do pretty much anything using ComfyUI, including plugging into API services
>>107219534How does this stack up? I changed the color some in paint.net I had to compress it to a JPG
>>107219637Lol just realized the guy has 3 legs
>>107219637you'll want to explore more modern models at some point. sd15 is super old at this point and it def shows its age. although, newer models will be bigger/more resource demandingwhat hardware are you on? (gpu)
>>107219708Thanks for the advice, I'll look into a better model. Is Ollama3 an alright LLM to keep in use as of the moment? Pic related is my GPU. I don't have enough VRAM to outright generate 4k images it gave me an error and when I tried 1080p my PC started dying and freezing lmao I killed the program, 720p takes a while but I can upscale to 4k at the end and it came out alright.
>>107219742>Is Ollama3 an alright LLMI couldn't really say. I don't dick around with LLMs. theres an LLM general you can ask; they might be a good resource>Pic related is my GPUAMD has historically been more difficult to work with than nvidia, but idk what the current status is. AMD has been catching up. 8gb vram is decent, you should be able to use a lot of the newer stuff>I don't have enough VRAM to outright generate 4k imagesthere are a bunch of strategies for upscale. if you use ComfyUI, it has a built in vae tiling. that pretty much means there's no upper limit on gen size, it just chunks the work (so long as you're willing to wait)
>>107218018What is happening to grok's imagine?I could generate porn video a month ago but now it rejects everything.
>>107220535probably the same thing that openai always does. 1) release a new model with no guardrails so people can do anything they want.2) people get really excited, use the model a lot, share the model and their creations. 3) after you have a wide audience using the platform, start adding guardrails so you can avoid being sued and shit4) ???5) ask investors for more money
do not trust "doctors" i'm not joking
>>107220684in retrospect, I don't think dr mario was a even real doctor
>>107221292He dispensed pills, hes a doctor. That's how that works. Pssshht
>>107221866he did dispense pills, thats true. but he ONLY dispensed pills. he never even asked what was wrong. it was just pills, pills, pills. kinda sus
>>107222003An intuitive doctor doesn't have to ask what's wrong. He just knows.
How much regret do you think Lumi has adding me to discord 1-10, place your bets now
>>107219708I love your photorealism, do you have public workflows to study by any chance?My outputs are nowhere near your quality and I've no idea if it's the Prompts, Loras or embeddings.What settings are you using?
>>107222060bmp is back, in mp4 form>>107222062thanks, glad you like them. I'll be honest, this series hasn't been my cleanest one. has some weird ovefitting sometimes and the textures tend to be rubbery. but happy to share the workflow, in case there's anything in there that could be interesting to youhttps://files.catbox.moe/wb3bcp.png>I've no idea if it's the Prompts, Loras or embeddings.could just be the model. would be curious to see what you're working on>What settings are you using?these are using chroma HD with res_multistep+beta @ 2.5 cfgadditional note:while I was just looking through the workflow for the settings, I realized the fresca node was on. thats probably what was causing the issues I mentioned
>>107222003ain't that much different to real doctors desu
>>107222114>hats probably what was causing the issues I mentionedoh yeah, this is def looking better without fresca
>>107222060â–³/10
>>107222148you've been infected by chromagirl diseaseI'm sorry, its terminal (unless dr mario throws enough pills at you)chill was better than feverhttps://www.youtube.com/watch?v=TXOZRsUXVuw
>>107222195>chill was better than fevertruke
>>107222221missed a huge get by 1how did that sky pattern come to be? looks really cool
BE NOT AFRAID (Salt & Shadow)https://youtu.be/TtmiAj7CzAQhttps://suno.com/s/i7MS6ZIz5W9wavUB
so what kind of mental illness do these colors signify? please tell me there's a "obese chick wearing clothes she ought not be wearing" pride flag
>>107222532No but that does look like a pedofile flag
similar colors, dissimilar order. i win again.
Rabbituokka
>>107222536>pedofile flagthey have flags now?I guess that makes them easier to cull
>>107222611
>>107222752fuck I lost so much money on that match
>>107222241flat style, clip art, desert oasis, net grid, space
flat style, clip art, desert oasis, net grid, space
>>107222752was supposed to be the peoples elbow. nb didn't get it
>Decide to train a CNN >Start preparing dataset>Oh I haven't seen that before better caption it so I can use it elsewhere >Get distracting loading captioner >End up doing nothing but listening to soundcloud trying to find a bootleg \ vip I heard to years ago Anudda productive day
>>107222787surprisingly simple>>107222808>>End up doing nothing but listening to soundcloud trying to find a bootleg \ vip I heard to years agowell, did you find it?
>>107222808babbys first yak shaving>>107222817he didn't post anything to fact check. mark him down as scum, and or villian.
>>107222817>surprisingly simpleGood things typically are
>>107222817Afraid not just a bunch of unrelated volocoid tracks which caused me spiral searching for a re-upload
>>107222890i tried this and i died. buyer beware
>>107222890hey, nice to see youi'm falling asleep, otherwise I'd chathow you're well
>>107222997Hwy nice to see you too, sleep good man. Happy weekend.
>>107222752Kek, nice.
>>107223016'twas supposed to be an elbow slam, but the stupid fucking thing couldn't understand. so it is what it is. as is this image. "put a girl in the boot" is not something **I'D** ever write so... idk where sdxl gets off being this degenrate.
>>107222365
Last one from meGood night anons
>>107223311Cozy diorama. It's nice to see you getting into video gens. Goodnight.
i miss schizo anon
>>107218018What is the name of this shade of hair color? It's not gray, and it's not white. I know it's blonde, but what kind of blonde is it.She looks cute, and fox ears don't spoil her image at all. thank you, yours creators!
>>107224517>I know it's blonde, but what kind of blonde is it.try platinum blonde
>>107218018Does anyone have some sort of image comparison chart or comparisons that show the differences in quality between 4-bit, 8-bit, and full precision image outputs for Chroma?
>>107225354this was not on the final model
>>107226184i like how the mouth squiggle overflowed
>>107226263kind of thought it looked like a stache.same with the loras lowered from .8 to .6
>>107226616thats cool. I'm surprised you were able to get this kind of motion without it going totally off the rails
>>107226738was an old fashioned batch/img2img, of frames from a grok video of a sora gen
>>107226616The illustration style works great with this type of animation.
>>107218878purdy
Afternoon anons
>>107227657great minds think alike. I was randomly doing some space stuff too
I've been out of the game for a while, what model do people use nowdays for 1girl or 2girl? tried flux and qwen and didn't find it improved anything, the ilust models with SDXL still produce much better results imo or am I missing something?
>>107227905space is the place>>107227753afternoo
g/a>>107227753 supabear
>>107228808cool (heh) dragonvery cool gen
>>107228859tyjust noticed an extra leg. kek wow, that crow looks badass.
>>107228808>>107228880>>107228907nice
>>107228808Suppuokka to the rescue
a few more dragons.>>107228931ty
thank you for blessing this general with all the nigbobumps
progress has been made
>>107231750What is this?
>>107231750I'd call it lumuior maybe luimior lumi's big box of user interfacingidk I'll get a focus group together of middle-americans and workshop it
>>107231750rn i'm working on an abstract prompting interface, that's the first working gemini-2.5-flash-image from the new system. reference images are next. all part of my bespoke "Lumi UI" (which no one else will want to use lol) to automate my nonsense. at the very least managing wildcards, doing reforge nonsense (via api), nano banana, etc.
>>107232008the waffle cone is the pinnacle of human invention
well it inserted bunchan, so that's a win
it's a brand new day
>>107232794extreme glowie
>>107232808nothing to see here
>>107232831you should make the landscape fat or something idk
>>107232846>>107232846>>107232846
>>107232844bro just reverse engineered chromagirls workflow LMAO
>>107232856the simplified version
>>107232894lol
Filling
Cyclone / Joker