You Guys Goon To That? EditionDiscussion and Development of Local Image and Video ModelsPrevious: >>108727613https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>108733994>>108734009You posted a brunette lol
>inb4 nigbo
>>108733994no anime in the collage, no posting
>>1087341235th image bint
>>108733928
for anime collage please don't miss out on our sister thread /adg. very nice thread. high quality.
>>108734145Ok, but I am fat, I wear glasses, and have a beard and long dark brown hair. That guy looks very unattractive - no long hair, no glasses, and possibly a prisoner of war (women like winners).
>>108734145She looks old af. Probably all used up and dry.
>>108734149it's /adt/ faggot, the name stands for Anime Diffusion Thread. Why a thread and not a general? Because it was intended to be a subordinate thread to /ldg/, that is, it was made to discuss anime THERE and to help organize and reduce the amount of information in /ldg/, NOT HERE.
the new Comfy search is the first actually good UI update in a long time lol
>>108734169The lady in sunglasses is absolute
>>108734176>>108734149check the archives, it's original purpose was for that
>>108734145now make the guy look like the average /g/ poster
>>108734187He does, look at him, tight shirt, combover, very gay (he's noticing how he could cur her hair better)
>>108734176can't believe I missed that detail
apache2 anima status?
>>108734187Just edited my original gen directly lol, straight through 2160x1600 -> 2160x1600, 4 steps DPM++ 2S Ancestral / Simple scheduler."Completely replace the man on the left side of photographic image 1 with a very fat nerdy fedora-wearing bearded slob while keeping the original character design and facial likeness of the woman on the right and overall layout and lighting and text exactly the same in every way."
>>108734227too well groomed
>>108734176>>108734185>>108734205It was always a troll. Cope, seethe, and dialate.
>>108734230Yeah, no glasses, plus no long hair. I would also point out no hats in church.
>>108734230I mean I'm sure I could fully prompt a new image with a slobbier ldgian but I feel like this is good enough lol
>>108734242He looks like he's a secret millionaire tho
>>108734227a neckbeard would look so much nicer
>mfw Resource news05/01/2026>Representation Fréchet Loss for Visual Generationhttps://github.com/Jiawei-Yang/FD-loss>Caption Generator Pro: Tkinter app for generating image captions with LLaVA-style modelshttps://github.com/CoolGenius-123/Caption-Generator-Pro>Metascan v0.3.0 Updatehttps://github.com/pakfur/metascan/releases/tag/v0.3.0>Phosphene: Local video and audio generation for Apple Silicon ( LTX2.3 )https://github.com/mrbizarro/phosphene>MoCapAnything V2: End-to-End Learning of Generalizable Motionhttps://animotionlab.github.io/MoCapAnythingV2>Diffusers <0.37.1 Security Vulnerability - Code Injectionhttps://github.com/huggingface/diffusers/security/advisories/GHSA-98h9-4798-4q5v04/30/2026>ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Pythonhttps://github.com/princeton-vl/procfunc>Efficient, VRAM-Constrained xLM Inference on Clientshttps://github.com/deepshnv/pipeshard-mlsys26-ae04/29/2026>Z-Anime | Full Anime Fine-Tune on Z-Image Base https://huggingface.co/SeeSee21/Z-Anime>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantizationhttps://github.com/svg-project/Quant-VideoGen>World-R1: Reinforcing 3D Constraints for Text-to-Video Generationhttps://github.com/microsoft/World-R1>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settingshttps://github.com/lparolari/cobench>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generationshttps://github.com/SonyResearch/VibeToken>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Groundinghttps://github.com/oceanflowlab/OmniVTG>Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Modelshttps://github.com/LeapLabTHU/RvR>SketchVLM: Vision language models can annotate images to explain thoughts and guide usershttps://sketchvlm.github.io
>mfw Research news05/01/2026>AesRM: Improving Video Aesthetics with Expert-Level Feedbackhttps://arxiv.org/abs/2604.28078>TripVVT: A Large-Scale Triplet Dataset and a Coarse-Mask Baseline for In-the-Wild Video Virtual Try-Onhttps://arxiv.org/abs/2604.27958>HiMix: Hierarchical Artifact-aware Mixup for Generalized Synthetic Image Detectionhttps://arxiv.org/abs/2604.27903>Frequency-Aware Semantic Fusion with Gated Injection for AI-generated Image Detectionhttps://arxiv.org/abs/2604.27875>Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraininghttps://arxiv.org/abs/2604.27715>Leveraging Verifier-Based Reinforcement Learning in Image Editinghttps://arxiv.org/abs/2604.27505>Post-Optimization Adaptive Rank Allocation for LoRAhttps://arxiv.org/abs/2604.27796>Generate Your Talking Avatar from Video Referencehttps://gseancdat.github.io/projects/TAVR>AdvDMD: Adversarial Reward Meets DMD For High-Quality Few-Step Generationhttps://arxiv.org/abs/2604.28126>The Effects of Visual Priming on Cooperative Behavior in Vision-Language Modelshttps://arxiv.org/abs/2604.27953>Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modelinghttps://arxiv.org/abs/2604.28185>Are DeepFakes Realistic Enough? Exploring Semantic Mismatch as a Novel Challengehttps://arxiv.org/abs/2604.28022
> >108734249> >108734257fuck off
>>108734273Why are you so angry all the time?
>>108734273what is the point of this ritual post you do?
>>108734292Mainly attacking :^) it's an indian thing. He's hired to post, and is very lazy.
>>108733568>https://rentry.co/s8fg8berUpdated the captioning script to properly populate lyrics instead of raw_lyrics, note formatted_lyrics may also be redundant.
Funny troonbo comes in here like he's welcome. Kekd.
>>108734184lol
>disabo is lonely againGo back to your containment general thread schizo
>>108734383kek
>>108734249>>108734257thanks!
who's hyped for another month of local stagnation and saas innovation?
>>108734447heh, nice try SATAN, she's no supermodel church woman.
>>108734447thought this said saars innovation
that's right, localkeks. api comes first
kino alert
Local shroom.
>>108734751i just tested it and the audio is perfectly seamless across a 1 minute video. i seems like i can generate full music with this now
>>108732769just in case people didn't believe me that this works for basically anything BTWoriginal image:https://files.catbox.moe/3he15x.pngcaption from Gemini 3.1 Pro:https://pastes.io/mJzbYB4p
>>108734825Those of us with a brain knew this months ago, it's only freetards still stuck in 2023 seething at API who think otherwise. Gemini is the best NSFW NL captioner available right now
>>108734870yeah. It does take a good prompt to get the most out of it though, my one I linked before I've been refining forever.
calm down fag
>>108734447huh, Ernie version of this actually goodrare Ernie Wcan't upscale as much though
>>108734825>tokenflood>sending your porn prompts to googlelol
>>108734856lol you can tell it went through zit because it's so fucking shit at doing panties. It always does that retarded crease in the middle and it never looks natural. The only way to fix it is hitting the gen lotto on klein.
>>108735123>The only way to fix it is hitting the gen lotto on klein.skill issue.
>>108735036wat? the caption is CREATED by Gemini, FROM the image
>>108735226and yet, those panties
>>108735233>it's okay, I didn't gen the furry porn, only send them the pic directlyok