Discussion and Development of Local Image and Video Models

Previous: >>108851016

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>mfw Resource news

05/18/2026

>Lance: Unified Multimodal Modeling by Multi-Task Synergy
https://lance-project.github.io

>GridLoraTester: Workbench for character LoRA training on FLUX.2: dataset curation
https://github.com/Mandrakia/GridLoraTester

>FLUX MCP server
https://docs.bfl.ai/api_integration/mcp_integration

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://shredded-pork.github.io/Flash-GRPO.github.io

>LongLive2.0 5B BF16: AR-trained Wan2.2-TI2V-5B generator
https://huggingface.co/Efficient-Large-Model/LongLive-2.0-5B

>DealMaTe: Multi-Dimensional Material Transfer via Diffusion Transformer
https://github.com/haha-lisa/DealMaTe

>Deep Pre-Alignment for VLMs
https://github.com/THUMAI-Lab/Deep-Pre-Alignment

>Sparse Autoencoders enable Robust and Interpretable Fine-tuning of CLIP models
https://github.com/Fabian-Mor/sae-ft

>VAGS: Velocity Adaptive Guidance Scale for Image Editing and Generation
https://github.com/Harvard-AI-and-Robotics-Lab/Velocity_Adaptive_Guidance_Scale

>Neural Companion: Local desktop AI companion shell
https://github.com/Rakile/NeuralCompanion

>PixlStash 1.2: easy sharing, cleaner UI and faster background processing for your image management
https://pixlstash.dev/whatsnew.html

05/17/2026

>Comfy-mesh LTX 2.3 support: separate node + separate server GUI
https://github.com/shootthesound/comfyui-mesh#ltx-23--separate-node--separate-server-gui

>Rebels_HiDream-01_Image_Dev_NODES: Run HiDream-01 Image Dev bf16 and GGUF
https://github.com/RealRebelAI/Rebels_HiDream-01_Image_Dev_NODES

05/16/2026

>ComfyUI-Mesh Icarus & Daedalus: Split a diffusion model across two GPUs
https://github.com/shootthesound/comfyui-mesh

>Pixal3D-ComfyUI
https://github.com/Saganaki22/Pixal3D-ComfyUI

>ArXiv to Ban Researchers for a Year if They Submit AI Slop
https://www.404media.co/new-arxiv-rules-ai-generated-papers-ban
>mfw Research news

05/18/2026

>DreamSR: Towards Ultra-High-Resolution Image Super-Resolution via a Receptive-Field Enhanced Diffusion Transformer
https://arxiv.org/abs/2605.15682

>ElasticDiT: Efficient Diffusion Transformers via Elastic Architecture and Sparse Attention for High-Resolution Image Generation on Mobile Devices
https://arxiv.org/abs/2605.15684

>Self-Prompting Diffusion Transformer for Open-Vocabulary Scene Text Editing via In-Context Learning
https://hongxiii.github.io/mstedit

>Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation
https://arxiv.org/abs/2605.16003

>One Pass Is Not Enough: Recursive Latent Refinement for Generative Models
https://arxiv.org/abs/2605.15309

>Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
https://arxiv.org/abs/2605.15980

>Evaluating Design Video Generation: Metrics for Compositional Fidelity
https://arxiv.org/abs/2605.16223

>Sound Sparks Motion: Audio and Text Tuning for Video Editing
https://amirhossein-razlighi.github.io/Sound_Sparks_Motion

>Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance
https://arxiv.org/abs/2605.15533

>Do Less, Achieve More: Do We Need Every-Step Optimization for RL Fine-tuning of Diffusion Models?
https://arxiv.org/abs/2605.15855

>GenShield: Unified Detection and Artifact Correction for AI-Generated Images
https://arxiv.org/abs/2605.16122

>Efficient Image Synthesis with Sphere Latent Encoder
https://arxiv.org/abs/2605.15592

>Neutral-Reference Prompting for Vision-Language Models
https://arxiv.org/abs/2605.15615

>HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion
https://arxiv.org/abs/2605.15741

>Registers Matter for Pixel-Space Diffusion Transformers
https://arxiv.org/abs/2605.16147

>RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations
https://arxiv.org/abs/2605.15908
>>108855284 >>108855287 thanks!
BUSHYARMPITFARMWOMEN
whats the best client and model to use on an amd 6800xt?
>>108855361 I have a 6950xt. I'm on Linux, and I use ComfyUI and for models I use Flux Klein 9B, but you may want to use the smaller Flux Klein one. I suspect at this point people are using gguf to save memory. the thing is, you have to learn to use linux, you have to use venv and install python stuff, so idk, it's like a gaping time hole. btw, for anime, it's all about anima v1.
>>108855507
>>108855361 oh yeah, I should obviously mention Zit, I use it the most. Z Image Base. lol
Is there a way to use reference background images for t2i? Depth, canny isn't enough.
>>108855690 no, it's impossible
>>108855690 Inpainting? Just oldschool photobashing?
>3 hour old thread >barely any posts
>>108855894 It's midnight in america chud
>https://xcancel.com/LodestoneRock/status/2056533258746396705 Damn. I could also use a new gpu
Claude is having me install llama shit for a node, compiling shit for 5090. Wish me luck.
>Fucking piece of shit Civitai always breaks at night. >Tfw Night NEET REEEEE
Anima diffusers FUCKING when
>>108855953 nice! time to burn them on some shitty useless experiments which will be abandoned midway, instead of finetuning anima on his dataset!!!
Nodes 2.0... whatever happened there? https://github.com/Comfy-Org/ComfyUI_frontend/discussions/12330 A lot of my nodes are broken in this vue (cancer) rendering system.
>>108856053 Nice, it's actually working, allowing me to use all 81 frames from any 5-second video I've done previously. Hate how ltx prefers an essay of useless prompts.
>>108855953 >LodestoneRock
>>108855953 @Comfy can you send me one too? Fax: 9844 1529
>>108855953WHY CAN'T THIS STUPID NIGGERMONKEYFAGGOTRETARD JUST DO A SIMPLE FINETUNE
bruh i found my old 1.5 pngs and loading them into newer models with all the schizo weights produces some shit ill tell u wat
>>108856609 Mass copying my prompts from midjourney to sdxl based furry porn models does some pretty weird shit too.
>>108856600 More fun to try something new and complicated I'd guess
>hourglass figure hmmmm
What tech or node am I looking for to run 2 gens with everything the same except the model? right now I'm running one, switching the model manually, then running the other, but I'm sure there is a better way
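One way to script the two-model comparison asked about above, as a rough sketch rather than a recommended node setup: export the workflow in ComfyUI's API (JSON) format, then make one copy per model with only the loader input changed, so seed and sampler settings stay identical. The workflow dict, node ids, and model filenames below are made up for illustration; `make_model_variants` is a hypothetical helper, not part of ComfyUI.

```python
import copy


def make_model_variants(workflow, loader_node_id, input_name, model_names):
    """Return one deep-copied workflow per model name, identical otherwise."""
    variants = []
    for name in model_names:
        wf = copy.deepcopy(workflow)  # never mutate the original workflow
        wf[loader_node_id]["inputs"][input_name] = name
        variants.append(wf)
    return variants


# Hypothetical, heavily trimmed workflow in API format (node id -> node).
workflow = {
    "1": {
        "class_type": "CheckpointLoaderSimple",
        "inputs": {"ckpt_name": "model_a.safetensors"},
    },
    "2": {
        "class_type": "KSampler",
        "inputs": {"seed": 1234, "steps": 20, "cfg": 4.0, "model": ["1", 0]},
    },
}

variants = make_model_variants(
    workflow,
    loader_node_id="1",
    input_name="ckpt_name",
    model_names=["model_a.safetensors", "model_b.safetensors"],
)

for wf in variants:
    # Same seed in every variant, only the checkpoint differs.
    print(wf["1"]["inputs"]["ckpt_name"], wf["2"]["inputs"]["seed"])
```

Each variant could then be queued by POSTing `{"prompt": wf}` to a running ComfyUI instance's `/prompt` endpoint. Models loaded through a different loader node would need a different `input_name`, so check the exported JSON first.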
>>108856775 did you use it like a tag or "The woman’s body has an hourglass figure."
>>108856600 >REEE
>>108856782 a tag, copypaste from old prompt
bnuy
>>108855690 of course there is
oink oink
>>108855690 reference? you mean targeting? make a mask with REMBG.
>>108856929 and brotip: you can invert the masks.
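For anyone unsure what inverting a mask means in practice, here is a minimal sketch, assuming an 8-bit grayscale mask stored as rows of 0..255 values (the sample values are invented). Inverting flips which region gets repainted: if the subject was masked, the inverted mask targets the background instead.

```python
def invert_mask(mask):
    """Invert an 8-bit mask given as a list of rows of ints in 0..255."""
    return [[255 - value for value in row] for row in mask]


# Toy 2x2 mask: 255 = masked (e.g. subject), 0 = unmasked (background).
mask = [
    [0, 255],
    [128, 0],
]

inverted = invert_mask(mask)
print(inverted)
```

In an actual workflow the same flip would normally be done by a mask-invert node or an image editor rather than by hand.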
>>108856929 That makes it an i2i, right?
>>108856991 how do you reference an image without it being i2i?
Anybody use Claude TUI for image gen?
>>108855690
>>108857039 is the unsloth version of klein 9b any less censored than normal? isn't it pure snakeoil for image models?
>>108857059 i run the unsloth gguf because it's smaller file size and works on my 16gb vram 32 ram setup. i doubt it's less censored in any way.
^_^
I never see people posting their gens to Anima lora galleries on civit. It's weird.
>>108857227 civit only upvotes the most indian images so I won't put anything I like into that toilet
>>108857227 I post them once or twice per day. It gets drowned immediately. It's unironically better to stick to older models like chroma if you want visibility.
>>108857227 all the best slop gets posted on twatter nowadays
someone make adetailer for anima please I am so tired of hard slopped eyes we have the tech to fix
>>108857197 did u copy
>>108857278 isn't she cold? also anima yume for 1.0 based when?
>>108857291 Use gimp and i2i on low denoise.
>>108857327 i'd fix with inpaint illustrious, I just want other lazy retards to have a tool I can tell them to use to stop posting shit like the image above
>>108857241 you can hide any boring members
>>108857321 i can't wait for ANY competent anima finetune (base is just not usable by itself, image burning into undetailed flat color blobs even at cfg 4, etc.)
Gyaruren
>>108857439 how do I remove filthy males from being genned?
stop using piece of shit overfit sdxl garbage
>>108857468 >1girl, solo,
>>108857468 that is clearly another woman with breasts, but just write solo, retard.