Discussion and Development of Local Image and Video Models
Previous: >>108629083
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
it's not over
>>108638173
>>108638184
So... faster, less resources or both?
>>108639162
we don't have anything to talk about, why did you bake?
>>108639228
he only cares about made up schizo drama
>mfw Resource news
04/19/2026
>ZPix: Local AI image generator and editor powered by open image models.
https://github.com/SamuelTallet/ZPix
>Comfy Canvas: Local inline layer based image editor
https://github.com/Zlata-Salyukova/Comfy-Canvas
04/18/2026
>Rose: Range-Of-Slice Equilibration PyTorch optimizer
https://github.com/MatthewK78/Rose
04/17/2026
>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
https://yjx-research.github.io/ControlFoley
>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
https://research.nvidia.com/labs/toronto-ai/tokengs
>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
https://aka.ms/mm-webagent
>Qwen2D-VAE
https://huggingface.co/Anzhc/Qwen2D-VAE
>ComfyUI HY-World 2.0 — WorldMirror 3D
https://github.com/AHEKOT/ComfyUI_HYWorld2
>Anima Style Explorer: A free web tool for ComfyUI styles
https://anima.mooshieblob.com
>Stanford AI Index Report 2026
https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf
04/16/2026
>Motif-Video 2B: A micro-budget text-to-video diffusion transformer from Motif Technologies
https://motiftech.io/videoshowcase
>HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
https://huggingface.co/tencent/HY-World-2.0
>ErnieTurbo_extracted_lora
https://huggingface.co/GuangyuanSD/ErnieTurbo_extracted_lora/tree/main
04/15/2026
>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
https://huggingface.co/tencent/DisCa
>Lyra 2.0: Explorable Generative 3D Worlds
https://research.nvidia.com/labs/sil/projects/lyra2
>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://github.com/VAST-AI-Research/AniGen
>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Models
https://gyanendrachaubey.github.io/T2I-BiasBench
where am I supposed to get my nsfw resources from now that civitai went full retard, are we back to shady businessmen in back alleys?
>mfw Research news
04/19/2026
>Boosting Robust AIGI Detection with LoRA-based Pairwise Training
https://arxiv.org/abs/2604.12307
>A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting
https://arxiv.org/abs/2604.13427
>Decoupled Similarity for Task-Aware Token Pruning in Large Vision-Language Models
https://arxiv.org/abs/2604.11240
>Relaxing Anchor-Frame Dominance for Mitigating Hallucinations in Video Large Language Models
https://arxiv.org/abs/2604.12582
>One-shot Compositional 3D Head Avatars with Deformable Hair
https://yuansun-xjtu.github.io/CompHairHead.io
>Crowdsourcing of Real-world Image Annotation via Visual Properties
https://arxiv.org/abs/2604.14449
>Chaotic CNN for Limited Data Image Classification
https://arxiv.org/abs/2604.14645
>HTDC: Hesitation-Triggered Differential Calibration for Mitigating Hallucination in Large Vision-Language Models
https://arxiv.org/abs/2604.12115
>Degradation-Consistent Paired Training for Robust AI-Generated Image Detection
https://arxiv.org/abs/2604.10102
>On The Application of Linear Attention in Multimodal Transformers
https://arxiv.org/abs/2604.10064
>Reasoning Resides in Layers: Restoring Temporal Reasoning in Video-Language Models with Layer-Selective Merging
https://arxiv.org/abs/2604.11399
>Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models
https://arxiv.org/abs/2604.14888
>Benchmarking Deflection and Hallucination in Large Vision-Language Models
https://arxiv.org/abs/2604.12033
>Why MLLMs Struggle to Determine Object Orientations
https://arxiv.org/abs/2604.13321
>Quality-Aware Calibration for AI-Generated Image Detection in the Wild
https://grip-unina.github.io/QuAD
>Reward Design for Physical Reasoning in Vision-Language Models
https://arxiv.org/abs/2604.13993
>Seeing Through Circuits: Faithful Mechanistic Interpretability for Vision Transformers
https://arxiv.org/abs/2604.14477
>>108639162
sarah peterson status?
>>108639287Yeah, let me tell you>Sarah Petersons BBC Holding Dildo FT15https://civitai.red/models/466318/sarah-petersons-bbc-holding-dildo-ft15>Sarah Petersons Black Bred Magazine coverhttps://civitai.red/models/717113/sarah-petersons-black-bred-magazine-cover>Sarah Petersons BBC Spoon FT15https://civitai.red/models/185076/sarah-petersons-bbc-spoon-ft15>Sarah Petersons BBC Gangbang Kneeling surroundedhttps://civitai.red/models/537775/sarah-petersons-bbc-gangbang-kneeling-surroundedHappy BBCunday ^^!
>>108639287in shambles, Indian GDP dropped by 2%
>>108639316
so based..
I haven't been ITT since Z Image and Kleins dropped, what's the current meta? Are the threads still under assault by anus? Is lodestones still a retard?
>>108639162good boy tran
>>108639351
Anima shows promise for anime stuff, and became Ani's latest target. It's a little smaller than SDXL and much slower, but can do both tags and natural-language prompting. There's even a WaiAnima v1 now that noticeably improves high-res results.
>>108639372
Aaahhh
>>108639372
Wat prompt anon
what the fuck is ERNIE
>>108639351
Kekstone is training his last model on pics of his own poop with a disposable camera. Sounds promising...
>>108639493
another nothingburger
>>108639493
the fastest milkman in the west
>>108639351
Klein-9B-KV was released, which uses KV caching to speed up edit gens by a lot.
>>108639496>Kekstone is training his last model on pics of his own poop with disposable camerasounds retarded enough to be true
>>108639572wtf I want to die for Israel now??
so it's over? owarida?
I keep seeing some fucking crazy NSFW videos on DeviantArt with multi-shot character consistency and audio. How are people doing it? No way it's LTX-2.3
>>108639493
The husband of HERNIA
>>108639634
link
trying image editing for the first time with klein 9b on my 8gb vram, absolute magic
>>108639478
A character sheet multi-view photo 3x3 grid of the woman for dataset creation, white seamless background,
>>108639219
very nice
>>108639653
ye once you get the hang of how to prompt klein for edit it's quite good for the size / speed
>>108639518
>Klein-9B-KV
is it better in other regards too or just faster
>>108639856
how do i prompt Klein to give me an accurate canny filter without changing the style?
>>108639938
Worse but faster imo.
>>108639698
>>108639372
what model
tdrussell are you here?
>>108639962
Is the quality even supposed to be different? The description sounds like it just avoids redundant recomputes by reusing the part that doesn't change.
https://github.com/black-forest-labs/flux2/blob/main/docs/flux2_klein_kv_cache.md
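For anyone wondering what "reusing the part that doesn't change" looks like, here is a toy sketch in Python. It is not BFL's implementation: `ToyEditAttention`, `project`, `run`, the scalar weights and the fake "denoising update" are all made up for illustration. The assumption is that the reference image tokens stay fixed across sampling steps and their keys/values do not depend on the noisy tokens, so they can be projected once and reused. In this toy the cached and uncached outputs are identical by construction; whether that holds for the real model is a separate question.

```python
import math


def project(tokens, weight):
    """Toy linear projection: scale every feature by a scalar weight."""
    return [[x * weight for x in tok] for tok in tokens]


def attend(queries, keys, values):
    """Plain softmax attention over lists of float vectors."""
    out = []
    dim = len(values[0])
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(len(q)) for k in keys]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        out.append([sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)])
    return out


class ToyEditAttention:
    """Hypothetical attention layer for an edit model (illustration only)."""

    def __init__(self, wk=0.5, wv=2.0):
        self.wk, self.wv = wk, wv
        self.projection_calls = 0  # how often reference K/V were recomputed
        self._ref_cache = None

    def _ref_kv(self, ref_tokens, use_cache):
        if use_cache and self._ref_cache is not None:
            return self._ref_cache  # reuse: the reference tokens never change
        self.projection_calls += 1
        kv = (project(ref_tokens, self.wk), project(ref_tokens, self.wv))
        if use_cache:
            self._ref_cache = kv
        return kv

    def step(self, noisy_tokens, ref_tokens, use_cache):
        ref_k, ref_v = self._ref_kv(ref_tokens, use_cache)
        # Noisy-token K/V change every step, so they are always recomputed.
        keys = ref_k + project(noisy_tokens, self.wk)
        values = ref_v + project(noisy_tokens, self.wv)
        # Queries are the raw noisy tokens (no Q projection in this toy).
        return attend(noisy_tokens, keys, values)


def run(steps, use_cache):
    layer = ToyEditAttention()
    ref = [[1.0, 0.0], [0.0, 1.0]]       # fixed reference-image tokens
    noisy = [[0.3, -0.2], [0.9, 0.4]]    # tokens being denoised
    for _ in range(steps):
        out = layer.step(noisy, ref, use_cache)
        noisy = [[0.9 * x for x in tok] for tok in out]  # stand-in for a sampler update
    return noisy, layer.projection_calls


if __name__ == "__main__":
    for flag in (False, True):
        result, calls = run(4, use_cache=flag)
        print(f"use_cache={flag}: reference K/V projected {calls} time(s)")
```

With 4 steps the uncached run projects the reference tokens 4 times and the cached run once, and both return the same tokens, which is the whole trick: the saving grows with step count and with how many reference tokens there are.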
>>108640016
im in my ferrari sports car training v4 but whats up
it's up
https://www.youtube.com/watch?v=B6dq0Q5UAaE