I Love LDG Edition

Discussion and Development of Local Image and Video Models

Previous: >>108652848

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>mfw Resource news

04/21/2026

>MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping
https://jeoyal.github.io/MegaStyle

>UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
https://github.com/Yovecent/UDM-GRPO

>Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning
https://github.com/NA-HMC/NA-HMC

>Evolutionary Negative Module Pruning for Better LoRA Merging
https://github.com/CaoAnda/ENMP-LoRAMerging

>DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization
https://github.com/Hsu1023/DuQuant

>Generalizable Face Forgery Detection via Separable Prompt Learning
https://github.com/OUC-YER/SePL-DeepfakeDetection

>Adaptive receptive field-based spatial-frequency feature reconstruction network for few-shot fine-grained image classification
https://github.com/ICL-SUST/ARF-SFR-Net.git

>ComfyUI-DiffAid-Patches: Inference-time Diff-Aid-inspired text-conditioning patches for ComfyUI
https://github.com/xmarre/ComfyUI-DiffAid-Patches

>modl: Train LoRAs and generate images on your own GPU. Web UI + CLI
https://github.com/modl-org/modl

>ComfyUI-KleinRefGrid: Turns reference images into reference_latents
https://github.com/xb1n0ry/ComfyUI-KleinRefGrid

>node-banana: Free and open node based generative workflows
https://github.com/shrimbly/node-banana

>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
https://github.com/AMAP-ML/EMF

04/20/2026

>Elucidating the SNR-t Bias of Diffusion Probabilistic Models
https://github.com/AMAP-ML/DCW

>(1D) Ordered Tokens Enable Efficient Test-Time Search
https://soto.epfl.ch

>Frequency-Aware Flow Matching for High-Quality Image Generation
https://github.com/OliverRensu/FreqFlow

>From Zero to Detail: A Progressive Spectral Decoupling Paradigm for UHD Image Restoration with New Benchmark
https://github.com/NJU-PCALab/ERR
>mfw Research news

04/21/2026

>DreamShot: Personalized Storyboard Synthesis with Video Diffusion Prior
https://arxiv.org/abs/2604.17195

>Speculative Decoding for Autoregressive Video Generation
https://arxiv.org/abs/2604.17397

>LIVE: Leveraging Image Manipulation Priors for Instruction-based Video Editing
https://arxiv.org/abs/2604.17021

>AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generation
https://arxiv.org/abs/2604.18348

>Coevolving Representations in Joint Image-Feature Diffusion
https://arxiv.org/abs/2604.17492

>Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion Models
https://arxiv.org/abs/2604.17415

>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models
https://arxiv.org/abs/2604.17565

>FlowC2S: Flowing from Current to Succeeding Frames for Fast and Memory-Efficient Video Continuation
https://arxiv.org/abs/2604.17625

>ReCap: Lightweight Referential Grounding for Coherent Story Visualization
https://arxiv.org/abs/2604.18575

>UniCSG: Unified High-Fidelity Content-Constrained Style-Driven Generation via Staged Semantic and Frequency Disentanglement
https://arxiv.org/abs/2604.17850

>Towards Robust Text-to-Image Person Retrieval: Multi-View Reformulation for Semantic Compensation
https://arxiv.org/abs/2604.18376

>mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval
https://scene-the-ella.github.io/meol

>Depth Adaptive Efficient Visual Autoregressive Modeling
https://arxiv.org/abs/2604.17286

>When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Models
https://arxiv.org/abs/2604.17375

>Cross-Modal Attention Analysis and Optimization in Vision-Language Models: A Study on Visual Reliability
https://arxiv.org/abs/2604.17217

>Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models
https://arxiv.org/abs/2604.17873
>>108655785 >>108655793 stop deblessing /ldg/ thread schizo
mogged
>>108655803 There's a picture of my penis in there. How the fuck?
>>108655803 here's the context lol >>108654985 >>108655069
Why are people so naive or ignorant when it comes to NSFW AI-generated content? I always get asked "what AI you use bro?" when I share something on reddit or X; they think I just write a prompt on a site and then AI does all the magic?
>>108656011 Almost every human is an absolute retard, that should have been clear by now.
>>108656011 For context, this OF whore wrote me a DM asking how I generate my videos (I use Wan, VACE, LTX, post-processing, etc.). I told her that I have a local setup and use several workflows, that it's not that simple but I'm happy to collab (for money ofc), and then she wrote me that crap >>108656011
>>108656011
>they think I just write a prompt on a site and then AI does all the magic?
yes? the future is AI models using tools to correct themselves and build all the pieces together (like here, GPT Image-2 makes an image -> looks at it -> notices the issues -> fixes those issues with an image edit process) >>108655670
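For what it's worth, the generate -> look -> notice issues -> fix loop described above can be sketched in a few lines of Python. This is only a minimal sketch under loud assumptions: `Image`, `refine`, `fake_generate`, `fake_inspect` and `fake_edit` are all hypothetical names made up for illustration, and the stubs stand in for whatever model calls a real pipeline would make. Nothing here is a real GPT Image, Gemma or ComfyUI API.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Image:
    """Hypothetical stand-in for a generated image plus its known defects."""
    prompt: str
    issues: List[str] = field(default_factory=list)
    edits: int = 0


def refine(
    prompt: str,
    generate: Callable[[str], Image],
    inspect: Callable[[Image], List[str]],
    edit: Callable[[Image, List[str]], Image],
    max_rounds: int = 3,
) -> Image:
    """Generate once, then alternate inspect/edit until clean or out of rounds."""
    image = generate(prompt)
    for _ in range(max_rounds):
        issues = inspect(image)
        if not issues:
            break  # nothing left to fix
        image = edit(image, issues)
    return image


# Stub "models" (assumptions, not real tools): the generator always produces
# two defects, and each edit pass fixes exactly the first reported one.
def fake_generate(prompt: str) -> Image:
    return Image(prompt, issues=["garbled text", "extra finger"])


def fake_inspect(image: Image) -> List[str]:
    return list(image.issues)


def fake_edit(image: Image, issues: List[str]) -> Image:
    return Image(image.prompt, issues[1:], image.edits + 1)


result = refine("generate cat image", fake_generate, fake_inspect, fake_edit)
print(result.edits, result.issues)
```

The `max_rounds` cap is the design choice that matters here: without it, an inspector that keeps finding new issues would loop forever, so the loop bails out and returns whatever it has.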
>>108656084 generally when dealing with retards you charge a retard tax. I always give a stupid big quote to retards and sometimes they take it and it's worth the headache
>>108656085 Have you not learned anything from the past few years? That's not how it works, especially with SaaS (tools get nerfed, rugpulled by the big corpos) and even more with NSFW content. And we're talking about today, not the future, you idiot
>>108656105
>not the future
definitely the future, the /lmg/ fags are incorporating tools on gemma 4, as usual /ldg/ is completely clueless about the news and how to move forward, it's filled with retards like you
Why do anons lurk and post here if they think every other anon here is a retard? Surely they'd find some other place to post...
>>108656122 AI doesn't think, so any workflow based on AI "thinking" to itself will just end in piles of aesthetic trash. Even the best models still can't handle 2000 lines of code without going schizo on the task, and that's code that is relatively simple. Recursion destroys most models, same with complex A() -> F() -> C() -> E() -> D() relationships
>>108656122 gemma4 is local thoever? anons here use it to caption and write prompts all the time desu desu.
>>108656147
>thoever
saar?
>>108656122
>/lmg/ fags are incorporating tools on gemma 4
no?
>>108656139 localchads live rent free in their minds.
I saw GPT-Image 2 released. Can anon share some gens?
It's over for local
>>108656122 I can tell you that any serious genner/trainer (myself included) is using gemma4. You're just ignoring my original point, which was that people overlook the process of generating NSFW AI content, especially images and videos. LLM and text/code-based crap is easy as shit, that's why /lmg/ threads are filled with happy people and /ldg/ is filled with frustrated anons that can't generate anything that a few other anons can
>>108656165
yes, cute cat
prompt:
>generate cat image
>>108656174 How many rugpulls until you learn that they will take away the good model after the good press dies down? We've done this like 6 times now.
>>108656011 >>108656084 Total OF whore death
>>108656165
>Can anon share some gens?
no, there's a thread for that and the images are already shared here >>108653190
>>108656189 it's always the same cycle with these "groundbreaking" models: they get hyped, users start generating viral stuff, copyright holders get mad, tools get nerfed, userbase gets mad and tools get dropped
>>108656227
>it's always the same cycle
there's even a name for that
https://en.wikipedia.org/wiki/
>>108656227 No, it's the costs. They are expensive to run, so the first thing they do is cut compute; that's aside from the safety stuff. Nano Banana is so bad because they do bullshit like switch you to fast mode even though it's completely shit. Also the core model just got worse, slowly but surely.
>>108656174 They killed Sora for this.
Tested GPT-Image in the API.
Can do celebs by attaching picture (edit mode).
No NSFW as expected though, maybe allows limited artistic stuff but didn't bother pushing too much.
Was able to give a woman cleavage by cloth swapping, but that's the extent of what I got.
Dogshit for style transfers, hallucinates what's style, what's content, omits or makes up details. 4 years into this and still not a single good model on this front.
Can make detailed infographics with text without slopping the text, for whatever that's worth.
>>108656284 >>108656199
>>108656243 nice wiki page.
>>108656320
lmao
https://en.wikipedia.org/wiki/Enshittification
I want to make an anime style LoRA. I checked the official repository, but it seems like the program is only compatible with Linux. Is there another workaround?
>>108656284
How about copyrighted anime characters?
>>108656153
>no?
yes >>108656365
>>108656376
*Anima
I want to make an Anima LoRA but it seems it is only Linux compatible
>>108656227
nope, its different now
google finally has a worthy image gen competitor, so they wont be able to fuck around with their users anymore.
here's whats going to happen. very soon, google will release nbp 2, which will be better than gpt image 2
it will look similar to the vibecoding war between claude and codex. if either of them starts to enshittify their model, then users will just jump ship.
apichads won, and as a consequence, localcucks will benefit since the chinese models will train off the api outputs
you're welcome
>>108656383 https://github.com/gazingstars123/Anima-Standalone-Trainer
>>108656383
https://github.com/gazingstars123/Anima-Standalone-Trainer
I'm using this on Linux but it shows Binbows support. No idea if it's any good on Winblows but try it I guess.
>>108656389 Indians should be banned from the internet
>>108656389
>localcuks benefit since chinese models will train off the api outputs
fuck that shit dude, China must stop eating the shit of western API models and go the Z-Image Turbo way (the kino way)
>>108656389 NBP is still better than 2 at some things. But it's good that OpenAI has something that isn't complete dogshit.
>>108656389
>words words words
APIcucks cant meme
>>108656398 >>108656391 Thanks. And can you replicate the same settings tdrussell shows for his official Anima LoRA?
>>108656084 think she'd let a humble localchad suck on her toes or something? how big are her tits
>>108656423 stupid sexy gooks
>>108656389
>so they wont be able to fuck around with their users anymore.
LOLOL
>>108656376
>I want to make an anime style LoRA. I checked the official repository, but it seems like the program is only compatible with Linux. Is there another workaround?
sd-scripts supports it. I think it works on Windows, not sure.
>How about copyrighted anime characters?
I am done testing for today but I wouldn't expect it to be too pissy about it. If anything you are more likely to run into issues with Disney, Sony, Nintendo etc. characters.
>>108656426 I don't know who that is or what you're talking about. Sorry.
What is the status of copyrighted anime characters with GPT Image 2? We won?
>>108656165 A plain prompt "Warhammer 40k crossover with one piece"
>>108656426 Well I can't replicate the exact lora as he (understandably) does not provide the images used, but the settings he provides seem like sane defaults. No "catastrophic" forgetting or anything I've heard claimed about training loras on it.
>>108656466 What I mean is that TDRussell shared some LoRA training settings to use in his Linux-only workflow. My question is whether I can select the same settings here >>108656391
>>108656466 He shared the training dataset for his rutkowski lora.
>>108656458 why does it look like someone injected extra noise into the last steps of the diffusion process
>>108656466 Sorry, I understood, thanks
>>108656486 There's nothing wildly out of the scope of the average trainer that I can tell. So you should be able to use the same settings just fine.
>>108656486 Training settings do not depend on OS, so the answer is yes if that tool has implemented every relevant feature.
>>108656165
>>108656513 that's crazy good. too bad no porn so it's worthless.
>>108656458 howd they manage to keep the ugly sepia poison
>>108656513 wtf this is next level, holy fuck...
reminder that if you want to talk about that model you have to go here, this is a fucking local thread in case you forgot
>>108653190
>>108653190
>>108653190
>>108656513 the hands are still fucked though
>>108656513 hands are on another level
>>108656389 Midjourney still mogs both NB2 and GPT Image-2 in terms of aesthetics.
>>108656552 emoboy4ever had an accident. leave his hands out of this
>>108656513 No fucking way...
>>108656513 I didn't realize how much better an AI image gets when the text is correct, this makes the difference