Previous /sdg/ thread : >>108841225 >Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicStability Matrix: https://github.com/LykosAI/StabilityMatrix>Z-Imagehttps://comfyanonymous.github.io/ComfyUI_examples/z_imagehttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux.2 Dev/Kleinhttps://comfyanonymous.github.io/ComfyUI_examples/flux2https://huggingface.co/black-forest-labs/FLUX.2-devhttps://huggingface.co/black-forest-labs/FLUX.2-klein-4Bhttps://huggingface.co/black-forest-labs/FLUX.2-klein-9B>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Animahttps://huggingface.co/circlestone-labs/Anima>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Image>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2>Models, LoRAs & upscalinghttps://civitai.comhttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/aco/sdg>>>/b/degen>>>/d/ddg>>>/e/edg>>>/gif/vdg>>>/h/hdg>>>/r/realistic+parody>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vp/napt>>>/vt/vtaiOP https://rentry.co/twkuk8tz
>>108853501my back hurts just looking at thattry making the pose yourself
>mfw Resource news05/18/2026>Lance: Unified Multimodal Modeling by Multi-Task Synergyhttps://lance-project.github.io>GridLoraTester: Workbench for character LoRA training on FLUX.2: dataset curationhttps://github.com/Mandrakia/GridLoraTester>FLUX MCP serverhttps://docs.bfl.ai/api_integration/mcp_integration>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimizationhttps://shredded-pork.github.io/Flash-GRPO.github.io>LongLive2.0 5B BF16: AR-trained Wan2.2-TI2V-5B generatorhttps://huggingface.co/Efficient-Large-Model/LongLive-2.0-5B>DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformerhttps://github.com/haha-lisa/DealMaTe>Deep Pre-Alignment for VLMshttps://github.com/THUMAI-Lab/Deep-Pre-Alignment>Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP modelshttps://github.com/Fabian-Mor/sae-ft>VAGS: Velocity Adaptive Guidance Scale for Image Editing and Generationhttps://github.com/Harvard-AI-and-Robotics-Lab/Velocity_Adaptive_Guidance_Scale>Neural Companion: Local desktop AI companion shellhttps://github.com/Rakile/NeuralCompanion>PixlStash 1.2: easy sharing, cleaner UI and faster background processing for your image managementhttps://pixlstash.dev/whatsnew.html05/17/2026>Comfy-mesh LTX 2.3 support — separate node + separate server GUIhttps://github.com/shootthesound/comfyui-mesh#ltx-23--separate-node--separate-server-gui>Rebels_HiDream-01_Image_Dev_NODES: Run HiDream-01 Image Dev bf16 and GGUFhttps://github.com/RealRebelAI/Rebels_HiDream-01_Image_Dev_NODES05/16/2026>ComfyUI-Mesh Icarus & Daedalus: Split a diffusion model across two GPUshttps://github.com/shootthesound/comfyui-mesh>Pixal3D-ComfyUIhttps://github.com/Saganaki22/Pixal3D-ComfyUI>ArXiv to Ban Researchers for a Year if They Submit AI Slophttps://www.404media.co/new-arxiv-rules-ai-generated-papers-ban
>mfw Research news05/18/2026>DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformerhttps://arxiv.org/abs/2605.15682>ElasticDiT: Efficient Diffusion Transformers via Elastic Architecture and Sparse Attention for High-Resolution Image Generation on Mobile Deviceshttps://arxiv.org/abs/2605.15684>Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learninghttps://hongxiii.github.io/mstedit>Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generationhttps://arxiv.org/abs/2605.16003>One Pass Is Not Enough: Recursive Latent Refinement for Generative Modelshttps://arxiv.org/abs/2605.15309>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimizationhttps://arxiv.org/abs/2605.15980>Evaluating Design Video Generation: Metrics for Compositional Fidelityhttps://arxiv.org/abs/2605.16223>Sound Sparks Motion: Audio and Text Tuning for Video Editinghttps://amirhossein-razlighi.github.io/Sound_Sparks_Motion>Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidancehttps://arxiv.org/abs/2605.15533>Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?https://arxiv.org/abs/2605.15855>GenShield: Unified Detection and Artifact Correction for AI-Generated Imageshttps://arxiv.org/abs/2605.16122>Efficient Image Synthesis with Sphere Latent Encoderhttps://arxiv.org/abs/2605.15592>Neutral-Reference Prompting for Vision-Language Modelshttps://arxiv.org/abs/2605.15615>HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusionhttps://arxiv.org/abs/2605.15741>Registers Matter for Pixel-Space Diffusion Transformershttps://arxiv.org/abs/2605.16147>RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representationshttps://arxiv.org/abs/2605.15908
>>108853585I dont want to have opinions on the posing. I wanna leave it to the AI to see what it does, good or not
yeahslop
>>108853621also that leg (on previous image)what's up wit dat
>>108853639anatomy isn't anima's greatest strength
>>108853666lel
>>108854051
>>108854089you're not selling me anima here looks like sd15 gen lel
>>108854113its a cool layout and composition but, yeah, the face/hand details are rough. I might experiment with some different upscaling and genning at a higher native resolution later. I can't really find the sweet spot
>>108854120i've seen people using 50 steps lel
>>108854161I am also using 50 steps
>>108854168ooof and yikes
me on the bottom left
>>108854120what are the prompts like? u may have shared before... i've been encouraging the prompt enhancer to go full sdxl
death to clankers
do your worst, i have already won
>>108854336>ooof and yikeser_sde is the recommended sampler for anima and it needs higher steps>>108854679>what are the prompts like?heres the full workflow: https://files.catbox.moe/mhgxt1.png>encouraging the prompt enhancerI haven't been using the enhancer cuz its too slow locally
>>108855031did you try any other samplers/schedulers?maybe start with euler + beta at like 35 stepsor res_2m whoever recommended that appears to be wrong
>>108855041yeah when I first started playing with anima, I tried a bunch of different combos. researched what other people seemed to be using too. there honestly wasn't a huge difference between most of them, but er_sde still slightly outperformed the match-upsthat was on beta3, so idk if its different for base1. I assumed it wasn't
>>108855053i just dont think sde samplers are good for any model post sdxl or its derivatives. from my experience with flux/chroma/z its' been either single step euler-types or multistep, not so much exponential onesalso the scheduler makes a big diff. try something like res-2m/deis-2m/abnorset with beta/beta32 if you have extra samplers/schedulers (res4lyf), or even kl-optimal/power/shift,with less steps for the multistep samplers (since they do 2+ steps per regular step)
>>108855053idk it could be the weighting, 1.4 is pretty high esp on two of them. picrel what i pulled out of the workflow but w/o any weighting. main issue is the character is super small so the detail is scuffed.https://files.catbox.moe/oydhs9.png is run through my sdxl prompt enhancer (no weighting)
it's just like that time i took acid in school
>>108855183neat
>>108855095I can try turning the style weights down but they tend to lose stickiness cuz the prompt is busy otherwise>>108855101>>108855119nice character sheets>>108855183really cool composition
>>108855196chromagirl is overpowered
>>108855193>>108855196it does a lot of neat stylish stuff, i think it's the "bold outlines, flat colors, minimal hard-edged shading". i also just noticed i have a fuckup in my rewrite block lol time to see fixing it breaks the spell
heres one with- higher base gen- different sampler/scheduler- different upscaler settings- lower weightingdoesn't really perform any better on tiny faces>>108855241>also just noticed i have a fuckup in my rewrite block loldoesn't beat when I accidentally had my positive prompt plugged into my negs for a whole month before I noticed
>>108855275most models (especially small ones) will suck at small facessee >>108855183no details granted that was probably on purpose but it'll have a hard timehave you done any portrait/"medium shot" types?
>>108855282sometimes it seems like these things have a maximum detail budget. these chibi ones have the advantage of being flat shading so it can deal with small faces being just eyes and a blob
>>108855282>have you done any portrait/"medium shot" types?yeah, ofc it can do face details when it has a lot of space to work withyou know me tho, i like the more "lived in" expansive scenes with characters tucked into the environment
>>108855301yah>>108855302well you're gonna have to really push the model, use a lora, or admit defeat
gn all
>>108855310or I can enjoy the stuff I'm making even tho its not perfect :)>>108855319gn
>>108855319gn
>>108855420pls stop mogging the thread with every gen
>>108855439lol i've been pretty choosy
last one. bedtime. gn
wtf>>108855785gn
>>108855798thicc lol
https://www.youtube.com/watch?v=a856jos1bSo
i miss schizo anon