Previous /sdg/ thread: >>108705694

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai
>containment general
>>108719627
obviously it's not working to contain anyone
>mfw Resource news

04/29/2026
>Z-Anime | Full Anime Fine-Tune on Z-Image Base
https://huggingface.co/SeeSee21/Z-Anime
>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
https://github.com/svg-project/Quant-VideoGen
>World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://github.com/microsoft/World-R1
>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
https://github.com/lparolari/cobench
>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
https://github.com/SonyResearch/VibeToken
>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
https://github.com/oceanflowlab/OmniVTG
>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
https://github.com/LeapLabTHU/RvR
>SketchVLM: Vision language models can annotate images to explain thoughts and guide users
https://sketchvlm.github.io
>Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
https://tuna-ai.org/tuna-2
>Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
https://github.com/huaiyi66/PTI

04/28/2026
>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer
>LTX Desktop 1.0.5
https://github.com/Lightricks/LTX-Desktop/releases/tag/v1.0.5
>Meta-CoT: Enhancing Granularity and Generalization in Image Editing
https://shiyi-zh0408.github.io/projectpages/Meta-CoT

04/27/2026
>PixlStash 1.1.0 Update
https://pixlstash.dev/whatsnew.html
>AURA AI Studio Vault: One-stop management app for models, images and more
https://github.com/TheGho7t/AURA-AI-Studio-Vault
>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models
https://mo230761.github.io/UniGeo.github.io
>mfw Research news

04/29/2026
>Golden RPG: Confidence-Adaptive Region-Aware Noise for Compositional Text-to-Image Generation
https://arxiv.org/abs/2604.25314
>A Systematic Post-Train Framework for Video Generation
https://arxiv.org/abs/2604.25427
>ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latent
https://arxiv.org/abs/2604.25128
>ViPO: Visual Preference Optimization at Scale
https://liming-ai.github.io/ViPO
>GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution
https://github.com/aimagelab/GramSR
>Exploring Time Conditioning in Diffusion Generative Models from Disjoint Noisy Data Manifolds
https://arxiv.org/abs/2604.25289
>The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents
https://arxiv.org/abs/2604.25299
>DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing
https://arxiv.org/abs/2604.25477
>Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation
https://mutualforcing.github.io
>Learning Illumination Control in Diffusion Models
https://nishitanand.github.io/relighting-diffusion-website
>Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization
https://arxiv.org/abs/2604.24952
>Improving Diversity in Black-box Few-shot Knowledge Distillation
https://arxiv.org/abs/2604.25795
>QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
https://arxiv.org/abs/2604.25306
>When the Forger Is the Judge: GPT-Image-2 Cannot Recognize Its Own Faked Documents
https://arxiv.org/abs/2604.25213
>The Forensic Cost of Watermark Removal
https://arxiv.org/abs/2604.25491
>GPT-Image-2 in the Wild: A Twitter Dataset of Self-Reported AI-Generated Images from the First Week of Deployment
https://arxiv.org/abs/2604.25370
>Can We Change the Stroke Size for Easier Diffusion?
https://arxiv.org/abs/2603.26783
evening anon
>>108720100
howdy
>>108719619
It may be time to add some agents to the mix.
https://x.com/NousResearch/status/2049584595465572752
https://github.com/NousResearch/hermes-agent/tree/main/skills/creative/comfyui
>>108720995
that zoomout gave me extreme anxiety
I wonder how much value you can really get out of agentic workflow creation tho. maybe helpful for intricate video timecoding or something
When did last thread highlights stop?
>>108721118
was that ever a thing? we had an anon do it on rare occasion but it wasn't a constant thing
>>108721041
>that zoomout gave me extreme anxiety
Same.
>I wonder how much value...
Not sure. It'll be interesting to see how it handles setups and novel workflows based on a prompt description. I'm expecting bloat and redundancy of course.
Prompting a gen directly in Hermes is a token whore of course.
quick request, can someone prompt classic winnie the pooh fighting red shirt disney winnie the pooh? red shirt pooh bear is winning
erm, my zimg anime isnt working
well, it looks like something at least
still very melty hm
kinda works. will have to tinker more
gn
i miss schizo anon
gm
>>108722161
So do we
>>108723090
gm
>gm
>>108723300
nice and shiny.
posting some anima stuff while I try to figure out zia more
>>108724971
me in the back
>>108726430
did you bog him on purpose lol
Afternoon anons
>>108726534
naw, just the normal sd 1.5 result
>>108726541
howdy
Deep in the heart of Germany
Good morning, afternoon, or night.
I got tired of asking for edits in /trash/ (the Edit, Lineart, and Coloring Threads are practically unusable due to spam).
Could you please edit this image?
What I need is for you to change the design and color of the body fur of this female furry cheetah to transform her into a jaguar, and also change the design of her ears to resemble those of a real jaguar.
https://en.wikipedia.org/wiki/Jaguar
https://upload.wikimedia.org/wikipedia/commons/0/0a/Standing_jaguar.jpg
https://en.wikifur.com/w/images/1/1f/Miles-df_xian-jaguar.jpg
>>108727030
>>108727180
Not OR, but nice edit
>>108727030
Send tokens
>>108727180
>>108727202
>>108726839
(n)ice
is this a real character or an ai construct?
>>108727349
beepboop
>>108727256
>>108727180
>>108727231
OR of edit request here
Thank you so much.
Blessings to you.
It's nice to see that at least in this thread, you can find some good things.
>>108727628
science has gone too far
>>108727641
did you flush z image anime? I was able to get it kinda tuned in but I can't solve 100% of the meltiness
>>108727658
>Think we get a fire this time messing with genes,,
>>108727628
LOL
This image is appreciated nonetheless.
I imagine the female jaguar as a character from the 2000s (2003 or 2004) and as a Linkin Park fan.
>>108727658
i doubt i'll try it again unless something really draws my attention to it, but i havent deleted it
are you using cfg >3.5 and steps >30?
>>108727782
cfg 6 seemed like the sweet spot and I felt like I had to go above 40 steps for full convergence
its weird cuz the sample workflows use cfg 1.1 for some reason
>>108727805
there's a turbo version isnt there? or distilled at least
i've used cfg 1.1 with zturbo before, currently at 3.5
as someone said, "just because it's turbo doesnt mean you *have* to use it at cfg 1"
>>108727814
ah, it mustve been for the distilled version
>>108727820
cfg 1.1 is a cheat to give turbo models neg conditioning, but it does slow it down. higher cfg on turbo = slower and slower
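for anyone wondering why cfg 1.1 brings the negative prompt back at all, here is a minimal sketch of the usual classifier-free guidance mix. It assumes the standard formula (uncond + scale * (cond - uncond)) and assumes the sampler skips the negative pass when the scale is exactly 1. `cfg_combine` and `model_calls_per_step` are made-up names for illustration, not ComfyUI functions, and the numbers are toy values rather than real model outputs.

```python
def cfg_combine(cond, uncond, scale):
    """Standard classifier-free guidance mix, element by element."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def model_calls_per_step(scale):
    """Assumption: the negative/unconditional pass is skipped only at scale == 1."""
    return 1 if scale == 1.0 else 2


# Toy "predictions" for the positive and negative prompt.
cond = [1.0, 2.0]
uncond = [0.5, 1.0]

at_1 = cfg_combine(cond, uncond, 1.0)    # uncond cancels out, result equals cond
at_1_1 = cfg_combine(cond, uncond, 1.1)  # uncond now nudges the result slightly

print(at_1, model_calls_per_step(1.0))
print(at_1_1, model_calls_per_step(1.1))
```

at scale 1 the negative term cancels, so the negative prompt does nothing and only one model pass is needed. In this sketch the slowdown comes from that second pass, so 1.1 costs about the same per step as 3.5 or 6; whether a given UI behaves exactly this way is an assumption here.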
>>108728036
>they bleated ne toog ditied
aint that the truth
>>108728061
some of these are pretty funny. wish zit were just a smidge better at text lol. nb2 would have a field day with this
cat is yelling at me to go to bed. i obey. gn!
cereal summon technique
>>108728452
gn
How do you organize your pics when genning with ChatGPT?
Especially annoying with all my alt accounts
gm
>>108730046
gm
huehuehue
>>108720659
Hands are reversed.
>>108730935
early start?
>>108730935
gm
happy friday
>>108731090
weekend eve gets everyone excited
>>108731090
the same cat what made me go to bed woke me up too. little fucker
>>108731196
gm
Morning anons
https://youtu.be/k4hjX6ZsplU?si=Df3Aw0-i3_GFPG7g
>>108732073
morning
>>108730178
>>108730214
I like all these, but esp the elephants
>>108732301
been trying to gen using creatures i've seldom used
guess suno has a 'create your own model' thing now, currently it is 'cooking' one for me
>>108732657
well it was lame, or i didn't do it right
>>108732657
>suno has a 'create your own model' thing now
ohh really... thats very interesting
im curious how you'd say it failed. it just didn't follow the inputs well?
>>108733054
well, i gave it sounds of various sorts of experimental punk styles, and the results i get are the very generic results i would get from before, like the custom model isn't having an effect.
>>108733078
>high resolution, clean image
>fucked up hands
come on anon...
>mfw Security news
https://github.com/huggingface/diffusers/security/advisories/GHSA-98h9-4798-4q5v
>>108733078
>>108733054
as i understand it you're supposed to be uploading your own non-ai songs so it can mimic your style/voice. i guess you could use anything that wouldn't trigger ContentID (or whatever they use to keep the RIAA reptilians off their backs)
https://help.suno.com/en/articles/11362497
>>108733240
ah, i was being bad.. just an experiment. was hoping to have a good model based off of the obscure music i like. oh well.
me on the right
>>108733367
i mean if they let you upload it then whatever, it's on their head lol! i haven't tried it myself. the voice feature is pretty rad tho even if it does drag some style prompt in (via the voice itself, not the optional style prompt section). i'd guess it's a half-baked feature rn anyway (the custom models)
>>108733456
the rare moment when the AI catches the genner in action
>>108733496
>>108733514
these come together like a pokemon battle
>>108734058
raichibis
hmmmm
G'evenin Anons,
>>108734293
heyo
>>108734293
evening
>>108734325
Nice.
>>108734354
>>108734366
TGIF!
>the crystal ball shows the future within,
>>108734410
thx
>>108734410
>TGIF!
thank /g/ its frog
>>108734568
>>108734759
nice ones
testing settings
baking
>>108735043
>>108735043
>>108735043
>>108723451
holy shit is this a god prompt or are you using more than just turbo lora?