I Love LDG EditionDiscussion and Development of Local Image and Video ModelsPrevious: >>108652848https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>mfw Resource news04/21/2026>MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mappinghttps://jeoyal.github.io/MegaStyle>UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Modelshttps://github.com/Yovecent/UDM-GRPO>Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuninghttps://github.com/NA-HMC/NA-HMC>Evolutionary Negative Module Pruning for Better LoRA Merginghttps://github.com/CaoAnda/ENMP-LoRAMerging>DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantizationhttps://github.com/Hsu1023/DuQuant>Generalizable Face Forgery Detection via Separable Prompt Learninghttps://github.com/OUC-YER/SePL-DeepfakeDetection>Adaptive receptive field-based spatial-frequency feature reconstruction network for few-shot fine-grained image classificationhttps://github.com/ICL-SUST/ARF-SFR-Net.git>ComfyUI-DiffAid-Patches: Inference-time Diff-Aid-inspired text-conditioning patches for ComfyUIhttps://github.com/xmarre/ComfyUI-DiffAid-Patches>modl: Train LoRAs and generate images on your own GPU. Web UI + CLIhttps://github.com/modl-org/modl>ComfyUI-KleinRefGrid: Turns reference images into reference_latentshttps://github.com/xb1n0ry/ComfyUI-KleinRefGrid>node-banana: Free and open node based generative workflowshttps://github.com/shrimbly/node-banana>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representationhttps://github.com/AMAP-ML/EMF04/20/2026>Elucidating the SNR-t Bias of Diffusion Probabilistic Modelshttps://github.com/AMAP-ML/DCW>(1D) Ordered Tokens Enable Efficient Test-Time Searchhttps://soto.epfl.ch>Frequency-Aware Flow Matching for High-Quality Image Generationhttps://github.com/OliverRensu/FreqFlow>From Zero to Detail: A Progressive Spectral Decoupling Paradigm for UHD Image Restoration with New Benchmarkhttps://github.com/NJU-PCALab/ERR
>mfw Research news04/21/2026>DreamShot: Personalized Storyboard Synthesis with Video Diffusion Priorhttps://arxiv.org/abs/2604.17195>Speculative Decoding for Autoregressive Video Generationhttps://arxiv.org/abs/2604.17397>LIVE: Leveraging Image Manipulation Priors for Instruction-based Video Editinghttps://arxiv.org/abs/2604.17021>AdaCluster: Adaptive Query-Key Clustering for Sparse Attention in Video Generationhttps://arxiv.org/abs/2604.18348>Coevolving Representations in Joint Image-Feature Diffusionhttps://arxiv.org/abs/2604.17492>Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion Modelshttps://arxiv.org/abs/2604.17415>UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Modelshttps://arxiv.org/abs/2604.17565>FlowC2S: Flowing from Current to Succeeding Frames for Fast and Memory-Efficient Video Continuationhttps://arxiv.org/abs/2604.17625>ReCap: Lightweight Referential Grounding for Coherent Story Visualizationhttps://arxiv.org/abs/2604.18575>UniCSG: Unified High-Fidelity Content-Constrained Style-Driven Generation via Staged Semantic and Frequency Disentanglementhttps://arxiv.org/abs/2604.17850>Towards Robust Text-to-Image Person Retrieval: Multi-View Reformulation for Semantic Compensationhttps://arxiv.org/abs/2604.18376>mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrievalhttps://scene-the-ella.github.io/meol>Depth Adaptive Efficient Visual Autoregressive Modelinghttps://arxiv.org/abs/2604.17286>When Text Hijacks Vision: Benchmarking and Mitigating Text Overlay-Induced Hallucination in Vision Language Modelshttps://arxiv.org/abs/2604.17375>Cross-Modal Attention Analysis and Optimization in Vision-Language Models: A Study on Visual Reliabilityhttps://arxiv.org/abs/2604.17217>Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Modelshttps://arxiv.org/abs/2604.17873
>>108655785>>108655793stop deblessing /ldg/ thread schizo
mogged
>>108655803There's a picture of my penis in there. How the fuck?
>>108655803here's the context lol >>108654985>>108655069
Why people are so naive or ignorant when it comes to NSFW AI generated content, I always get asked "what AI you use bro?" when I share something on reddit or X, they think I just write a prompt on a site and then AI does all the magic?
>>108656011Almost every human is an absolute retard, that should have been clear by now.
>>108656011For context, this OF whore wrote me a DM asking me how I generate my videos (I use wan, VACE, LTX, post processing, etc, etc), I tell her that I have a local setup and I use several workflows, that is not that simple but I'm happy to collab (for money ofc) and then she writes me that crap >>108656011
>>108656011>they think I just write a prompt on a site and then AI does all the magic?yes? the future is AI models using tools to correct themselves and build all pieces together (like here, GPT Image-2 makes an image -> looks at it -> notices the issues -> fixes those issue with an image edit process) >>108655670
>>108656084generally when dealing with retards you charge a retard tax, I always give a stupid big quote to retards and sometimes they take it and it's worth the headache
>>108656085Have you not learned anything from the past few years? That's not how it works, especially with SaaS (tools get nerfed, rugpulled by the big corpos) and even more with NSFW content and we're talking about today, not the future you idiot
>>108656105>not the futuredefinitely the future, the /lmg/ fags are incorporating tools on gemma 4, as usual /ldg/ is completly clueless about the news and how to move forwards, it's filled with retards like you
Why do anons lurk and post here if they think every other anon here is a retard? Surely they'd find some other place to post...
>>108656122AI doesn't think so any workflow based on AI "thinking" to itself will just end in piles of aesthetic trash. Even the best models still can't handle 2000 lines of code without going schizo on the task and that's code that is relatively simple, recursion destroys most models same with complex A() -> F() -> C() -> E() -> D() relationships
>>108656122gemma4 is local thoever? anons here use it to caption and write prompts all the time desu desu.
>>108656147>thoeversaar?
>>108656122>/lmg/ fags are incorporating tools on gemma 4no?
>>108656139localchads live rent free in their minds.
I saw GPT-Image 2 released. Can anon share some gens?
It's over for local
>>108656122I can tell you that any serious genner/trainer (myself included) is using gemma4, you're just ignoring my original post that was that people overlook the process of generating NSFW AI content, especially images and videos, LLM and text/code based crap is easy as shit thats why /lmg/ threads are filled with happy people and /ldg/ is filled with frustrated anons that can't generate anything that other few anons can
>>108656165yes, cute catprompt:>generate cat image
>>108656174How many rugpulls until you learn that they will take away the good model after the good press dies down? We've done this like 6 times now.
>>108656011>>108656084Total OF whore death
>>108656165>Can anon share some gens?no, there's a thread for that and the images are already shared here >>108653190
>>108656189its always the same cycle of these "groundbreaking" models, they get hyped, users start generating viral stuff, copyright holders get mad, tools get nerfed, userbase gets mad and tools dropped
>>108656227>its always the same cyclethere's even a name for thathttps://en.wikipedia.org/wiki/
>>108656227No it's the costs, they are expensive to run so the first they do is cut compute, that's outside of the safety. Nano Banana is so bad because they do bullshit like switch you to fast mode even though it's completely shit. Also the core model just got worse slowly but surely.
>>108656174The killed sora for this.
Tested GPT-Image in the API.Can do celebs by attaching picture (edit mode).No NSFW as expected though, maybe allows limited artistic stuff but didn't bother pushing too much.Was able to give a woman cleavage by cloth swapping, but that's the extent of what I got.Dogshit for style transfers, hallucinates what's style, what's content, omits or makes up details. 4 years into this and still not a single good model on this front.Can make detailed infographics with text without slopping the text, for whatever that's worth.
>>108656284>>108656199
>>108656243nice wiki page.
>>108656320lmaohttps://en.wikipedia.org/wiki/Enshittification
I want to make an anime style LoRA. I checked the official repository, but it seems like the program is only compatible with Linux. Is there another workaround?>>108656284How about copyrighted anime characters?
>>108656153>no?yes >>108656365
>>108656376*AnimaI want to make an Anima LoRa but it seems it is only Linux compatible
>>108656227nope, its different nowgoogle finally has a worthy image gen competitor, so they wont be able to fuck around with their users anymore.here's whats going to happen. very soon, google will release nbp 2, which will be better than gpt image 2it will look similar to the vibecoding war between claude and codex. if either of them starts to enshittify their model, then users will just jump ship.apichads won, and as a consequence, localcucks will benefit since the chinese models will train off the api outputsyou're welcome
>>108656383https://github.com/gazingstars123/Anima-Standalone-Trainer
>>108656383https://github.com/gazingstars123/Anima-Standalone-TrainerI'm using this on Linux but it shows Binbows support. No idea if it's any good on Winblows but try it I guess.
>>108656389Indians should be banned from the internet
>>108656389>localcuks benefit since chinese models will train off the api outputsfuck that shit dude, China must stop eating the shit of API western models and do the Z-image turbo way (the kino way)
>>108656389NBP is still better that 2 at some things. But it's good that OpenAI has something that isn't complete dogshit.
>>108656389>words words wordsAPIcucks cant meme
>>108656398>>108656391Thanks and you can replicate the same settings tdrusell shows in his official Anima Lora?
>>108656084think she'd let a humble localchad suck on her toes or something? how big are her tits
>>108656423stupid sexy gooks
>>108656389>so they wont be able to fuck around with their users anymore.LOLOL
>>108656376>I want to make an anime style LoRA. I checked the official repository, but it seems like the program is only compatible with Linux. Is there another workaround?sd-scripts supports it. I think it works on Windows, not sure.>How about copyrighted anime characters?I am done testing for today but I wouldn't expect it to be too pissy about it. If anything you are more likely to run into issues with Disney, Sony, Nintendo etc. characters.
>>108656426I don't know who that is or what you're talking about. Sorry.
What is the status of copyrighted anime characters with GPT Image 2? We won?
>>108656165A plain prompt "Warhammer 40k crossover with one piece"
>>108656426Well I can't replicate the exact lora as he (understandably) does not provide the images used but the settings he provides seem like sane defaults. No "catastrophic" forgetting or anything I've heard claimed about training loras on it.
>>108656466What I mean is that TDRussell shared some LoRA training settings to use in his only Linux compatible workflow. My question is whether I can select the same settings here >>108656391
>>108656466He shared the training dataset for his rutkowski lora.
>>108656458why does it look like someone injected extra noise into the last steps of the diffusion process
>>108656466Sorry, i inderstood thanks
>>108656486There's nothing wildly out of the scope of the average trainer that I can tell. So you should be able to use the same settings just fine.
>>108656486Training settings do not depend on OS so the answer is yes if that tool has implemented every relevant feature.
>>108656165
>>108656513that's crazy good. too bad no porn so it's worthless.
>>108656458howd they manage to keep the ugly sepia poison
>>108656513wtf this is next level, holy fuck...
reminder that if you want to talk about that model you have to go here, this is a fucking local thread in case you forgot>>108653190>>108653190>>108653190
>>108656513the hands are still fucked though
>>108656513hands are on another level
>>108656389Midjourney still mogs both NB2 and GPT Image-2 in terms of aesthetics.
>>108656552emoboy4ever had an accident. leave his hands out of this
>>108656513No fucking way...
>>108656513I didn't realize how much better an AI image gets when the text is correct, this makes the difference
nice samefagging desu
Cloudcucks, I dedicate this one to you (generated by yours truly, ACEStep XL 0.7 Merge)https://vocaroo.com/15lInkgzMLR4What good is a censored and gated model? VISA/Mastercard ain't changing any time soon, cloud is forever cucked and just a toy that waits 2-3 years for local to catch up.
>>108656513>solved text>hands still fucked:(
>>108656513Ok the level of detail mogs local to oblivion, sure.But there are lots of errors too, such as everyone but the Drama Queen shirt girl having fucked hands.Honestly they seem to have overtuned the detail during the iterative generation process.Anyway this still proves that the local needs:1) Autoregressive generation that iterates on the prompt2) A smart VLM that inspects and guides the generation throughout it3) Non-slop training datasetIf it wants to compete with API.
after years of trying stuff out and practicei'm a master of baking goon lorasand you know whati will not share even a single one with you plebs kek
>108656603Didn't ask but cool story bro.
>>108656591>VISA/Mastercard ain't changing any time soondo you understand that civitai got more and more cucked over time it's because of VISA/Mastercard? it's a poison for both API and localfags
ive stopped genning as much. i guess i am depressed
>>108656598>Ok the level of detail mogs local to oblivion, sure.it mogs everything, the lmarena ranking is absurd, never seen such a gap before >>108656174
>>108656603training a lora is easy bro, get over it
>>108656603>im not sharing my precious loras>5 billion loras on civitai alonewow bro, how will I ever cope?
>>108656591>ACEStep XL just needs a merge to match the best cloud has to offer in terms of overall audio quality, not even a LoRALocal audio is saved. Also, I've tested a few LoRAs with it, and even though they're all underbaked, it's absolutely insane at replicating style/voice, far trumps the earlier version.
>>108656591can you make latina dance music? or quirky goblin music?
>>10865665290% of civitai loras are crap tho
>>108656676good thing I make my own because it's literally EZPZ. you should consider joining MAID.inb4 more cope inb4 yeah but YOUR LORA IS BAD AND STINKYget real; grow up.
>>108656513>so many bad handschroma bros, api has same limit kek
This is embarrassing to read even by non-existent Julien standards.He sounds completely mentally buckbroken at this point.
>>108656719>talking about e-celeb lolcowsB O R I N G
>>108656591>ACEStep XL 0.7 Mergewhere do i get it?
>>108656591it sounds so bad lool
>>108656652i don't think even 2% of them come close to my datasets and finetuning but enjoy your low tier gooning slopper lol
>>108656851cool story, bro
>>108656851fuck you
I want to play with anima but I don't want to play with comfy.What are my options for frontends?
>>108656671>can you make latina dance musicHoly shit kek, prompt/lyrics included moans and it just killed ithttps://vocaroo.com/1iwOck91jlFh
>>108656591kek based
>>108656749https://huggingface.co/scragnog/ace-step-1.5-gguf-merge-models/tree/mainI'm using ACEStep cpp.
>>108656591>Cloudcucks, I dedicate this one to youI won't be listening to this shit the quality is awful
>Wan 2.2>LTX-2>Chroma>Z>Flux KleinDid we plateau?
>>108656591>ACEStep XL 0.7hey is there now comfyui support?
>>108656911Oh I'm sorry you thought local was about GOOD?use case for GOOD?Personally, I use it to make songs that praise the mustache man.
>>108656885damn nice
>>108656911Show us your definition of "quality"Oh that's right, you can't even properly meme with audio
>>108657019how does that even make sense when 80% of contemporary music is about fucking and sucking?
AIs still think chubby = morbidly obese
>>108657056>he used an API model to make an image saying that API is badthe irony is on point
>>108657062can't even put racial slurs on udio/suno as well, that means that all those poor black rappers can't use it proprely, this is racism!
>>108657056why dont you gen the same image on local?oh wait, local cant do textLOLOLOLOLOLOLOL
>>108657100Ikr kek
35 melties
>>108657082>>108657100got 'em bubblin' kek
suno has anti- ace step shills. It's wild.
>>108657110if you love local so much, then why is it an API image?
>>108656882Vibecode your own front end that uses comfy as a back end. Unironically. I haven't touched ComfyUI web interface in months. It's bliss.
>>108657125Because its a meme, the only good thing cloudkeks are good for
>CLOUD API SUCKS>.... well except for this one case in which it's really good>and this other case but that's it i swear!!>okay fine cloud is way better than local but ummm its not free and im poor!!!
>still seething
>>108657202Can you show another example for good use of cloud generation that is not a meme or an infographic?
>>108657202>im poor!!!Cloudcucks say you need a million dollar GPU to gen though???
It's actually crazy how far ahead of local API is. Local isn't even at the level of last year's ghibli GPT. It's no surprise most of us switched to API Nodes.