Discussion and Development of Local Image and Video ModelsPrevious: >>108681463https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
no more trolling
this thread is for FRENS ONLY
>>108683023why don't you ever upscale these?
>>108683042overheating. got an old rig.
>>108683059>overheatingput a temp limit with MSIAfterburner nigga
>>108683066I click on that and nothing happens. It be like read only mode or sumpin.
>mfw Resource news04/24/2026>MAI-Image-2https://playground.microsoft.ai/chat>ComfyUI-NAG-Extended: NAG support for Flux 2 Klein and Animahttps://github.com/BigStationW/ComfyUI-NAG-Extended>UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detectionhttps://github.com/Zhangyr2022/UniGenDet>VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolutionhttps://github.com/EternalEvan/VARestorer>Sapiens2https://github.com/facebookresearch/sapiens2>Vista4D: Video Reshooting with 4D Point Cloudshttps://eyeline-labs.github.io/Vista4D>Pre-process for segmentation task with nonlinear diffusion filtershttps://github.com/cplatero/NonlinearDiffusion04/23/2026>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Controlhttps://shelley-golan.github.io/ParetoSlider-webpage>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusionhttps://github.com/Adamlong3/DynamicRad>Normalizing Flows with Iterative Denoisinghttps://github.com/apple/ml-itarflow>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Modelhttps://github.com/inclusionAI/LLaDA2.0-Uni>Illustrious XL & NoobAI-XL Style Explorerhttps://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer>AI Model & ‘MAGA’ Influencer Emily Hart Unmasked as Indian Manhttps://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html04/22/2026>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Modelshttps://github.com/cvims/EMBEDDING-ARITHMETIC>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generationhttps://github.com/CompVis/patch-forcing>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generationhttps://github.com/Hong-yu-Zhang/TS-Attn>AnyRecon: Arbitrary-View 3D Reconstruction with VDMhttps://yutian10.github.io/AnyRecon
>mfw Research news04/24/2026>AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probehttps://arxiv.org/abs/2604.20936>KD-CVG: A Knowledge-Driven Approach for Creative Video Generationhttps://kdcvg.github.io/KDCVG>Linear Image Generation by Synthesizing Exposure Bracketshttps://arxiv.org/abs/2604.21008>Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generationhttps://arxiv.org/abs/2604.21291>AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editinghttps://arxiv.org/abs/2604.21289>Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attackshttps://arxiv.org/abs/2604.21041>Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generationhttps://arxiv.org/abs/2604.21221>StyleVAR: Controllable Image Style Transfer via Visual Autoregressive Modelinghttps://arxiv.org/abs/2604.21052>Building a Precise Video Language with Human-AI Oversighthttps://linzhiqiu.github.io/papers/chai>Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Modelshttps://arxiv.org/abs/2604.21523>ID-Eraser: Proactive Defense Against Face Swapping via Identity Perturbationhttps://arxiv.org/abs/2604.21465>When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMshttps://pegah-kh.github.io/projects/prompts-override-vision>Seeing Fast and Slow: Learning the Flow of Time in Videoshttps://seeing-fast-and-slow.github.io>Addressing Image Authenticity When Cameras Use Generative AIhttps://arxiv.org/abs/2604.21879>Multiscale Super Resolution without Image Priorshttps://arxiv.org/abs/2604.21810>Prototype-Based Test-Time Adaptation of Vision-Language Modelshttps://arxiv.org/abs/2604.21360>Latent Denoising Improves Visual Alignment in Large Multimodal Modelshttps://arxiv.org/abs/2604.21343
>>108683002>Why isn't 24GB enough?you think I'm some sort of poorfag?
>>108683114all that gpu power for nothing, it's not like there's a big local model that can be used and is competitive with the best API models
>>108683132LLMs
> >108683096> >108683101Fuck off
>>108682974<- not mine, but she's gorgeous~ I love Raiden and I'll use her as my paper on my phone.
>>108683154how do you gen these? is it just z image turbo?
>>108683157gpt-image-2
>>108683157Zimage Turbo + lora for photos + lora for likeness. Latent upscale gives realistic detail
>>108683158anima + zit? can you share anima prompt/workflow? i've found that res_2m is really good for realism
>>108683290fine wine needs time.
>>108683290aged perfectly, based comfy
>>108683290Based. He knew API was the future and invested heavily in it. China learned from this and quickly pulled local support for WAN right after. We probably wouldn't have models like GPT-Image-2 without him.
>>108683290I forget, what model was this about?
>>108683328it was hunyuanimage 3.0 (80b model lool)
>localpoors threw a fit because hunyuan was too big for their 24gbaged like wine >>108683002
>>108683339hunyuan wasn't good though, so it was big and bad
hunyaun was great, but comfykeks wouldn't know because local is verboten
>>108683352Nice
the bigger the model, the better it is, it's common sense
>>108683349>hunyaun was greatcare to show some images?
>>108683290>>108683330>>108683349Wait, I looked into this and it's true?? Hunyuan 3 was better than Nano Banana but never received a ComfyUI implementation because it threatened the API nodes ecosystem. Holy shit can we ditch ComfyUI already? It has undoubtedly harmed local thanks to this.
>>108683366ty! nice lora, is it the k-pop girl?
>>108683405>Hunyuan 3 was better than Nano Bananasource: (((the media))), oy vey
>>108683405do you enjoy seething since months about comfy being the most relevant local ui and successful?
>>108666242If you're still here what's the second textbox for that we can't edit?
>>108683405we should all switch to InvokeAI
>>108683423it's not local
>>108683441
>>108683441how do i run it locally then anon?
So API nodes are local now??? Sweet!
>>108683449>>108683456why do you keep feeding him
>>108683458who said that anon?
>>108683468it's in the OP
>>108683458>>108683477are you really that bored? like you really have nothing else to do with your life? kinda sad when you think about it
i just discovered ltx2.3 loras
>>108683477where in op is the statement that "API nodes are local now" anon?
black snape, but ltx 2.3:https://litter.catbox.moe/gktnj1crp42z1o9g.mp4
>>108683500I still can't believe they made Snape black. I think it would have been more tasteful to make hermione black if they were shooting for DEI
>>108683513>I think it would have been more tasteful to make hermione black if they were shooting for DEIhermione is a female, she's already a DEI
>>108683513>I think it would have been more tasteful to make hermione black if they were shooting for DEIron is already a ginger
*yawn*
I just downloaded the ComfyUI desktop app from the link in the OP. How many credits do I deposit to get started with anima?
>>108683513>I still can't believe they made Snape black.don't underestimate the willingness of the wokies to stir the pot, they're so good at that
>>108683531making the weasleys black would have been great actually. they already live in a shit hole.point stands, though snape should never be black. that is the epitome of a white character
n*gbo-esque honestly
snape is an incel, that is white culture >>108683541
>>108682673from scratch
>>108683541>though snape should never be black. that is the epitome of a white characterhe's literally described as a man with a pale skin on the book lol
>>108683540>don't underestimate the willingness of the wokies to stir the pot, they're so good at thatThey're good at stirring the pot but their shit makes no money. This harry potter remake is going to flop hard because the woke crowd hates jk rowling and the fact that she makes money from everything related to harry potter and it's going to piss off the normal people too so who does that leave to even watch that shit?
>>>/tv/
>>108683497careful icarus
They made Snape black? This reminds me of when ComfyUI added API nodes into a UI that was originally meant for local models. It's all about testing the waters until people get too tired to care about it anymore. By that point it's already normalized and the subversive vermin won.
>>108683583can you answer the question anon? >>108683498
>>108683497which loras? They seem really hit or miss
>>108683577fucking slut tease
i NEED another COMPUTER.(for genning)I am gaming and need to GEN.and another one to talk to my virtual people.
>>108683591lets just say... the kino lora
>>108683540it's an English series starring English people, they don't give a shit about retarded American Zoomer politicslike you know this guy has a posh London accent and an extensive background in Shakesperean stage productions, righthttps://youtu.be/96GAI4ioekM?t=11
>>108683591this one seems coolhttps://civitai.red/models/2557755/retro-90s-anime-style-lora-ltx-23?modelVersionId=2874411
>>108683563Rowling doesn't give a shit, SHE retconned Dumbledore into being gay herself years after the book series had ended, just as a public anecdote
>>108683605looks hilariously bad
>>108683603>it's an English series starring English peopleare you implying that woke only happens in the US?? lmao
>>108683610nobody cares that old nigga dun got kilt
>>108683616i'm implying American Zoomers are the overwhelming majority of people who give any kind of fuck about the Woke Boogeyman
>>108683615this is nice, what model?
>>108683629Anima p3
>>108683583why is anyone supposed to give a fuck about api nodes in comfyui? it literally doesn't matter. if they switch to full api support and drop local, doesn't matter because someone will just fork it and local will be completely unaffected. >but then local won't have all the latest API support! ?
>>108683627>the Woke Boogeymanmy fucking ass, there's no boogeyman about this, they knew Snape was canonically a white man in the book, they knew that rewriting history and Netflix'ed him into a nigga would stir the pot, they know what they're doing, they're taunting people, and you defend them because you're probably some gay ass liberal who loves this woke slop, right
>>108683627today I learned that the entire right wing party in america are all zoomers. that's crazy
>tardbo back to shilling groids and api nodes
>>108683647based zoomers
toe socks.
I don't want to gen after today's incident. Because I know my gen will be used as bait for investors by Comfy to create speculation that there is activity in local models. I don't want to be part of this fake engagement system.
>>108683290I love when daddy comfy decides whats best for me. that way I dont have to think for myself <3
>>10868372035 stars status?
>>108683132>all that gpu power for nothingrtx 6000 pro is the only gpu that can handle video gen workloads without having to unload and reload models all the time, totally worth it
Havent genned in over a year and was just looking to get back into it. Not trolling or baiting, i'm reading this shit about local being obsolesced by api and don't know if I should be taking that seriously or not. Have local models actually hit a ceiling or stopped getting meaningful updates? The last models I was using were XL Illustrious merges and I have a 4090. I quit before video was much of a thing.
>>108683781If you haven't used api, the newer local models are quite impressiveif you have, they'll seem way behind
>>108683745new koff game ideakoff kickervamp survivor like, you kick away ghosts as they swarm you. you get xp and power up your kickscould be a platformer too
>>108683156Damn, is she AI? I want more photos of her. Can you whip some nudes up for me, and also pictures of her on wholesome dates with me?
>>108683795sent ;)
I'm genning a RAP MUSIC HIT SINGLE.This is the FIRST RAP SONG to achieve SIGNIFICANT ATTENTION FROM THE JEWISH RAP PRESS(I predict)
>>108683806>This is the FIRST RAP SONG to achieve SIGNIFICANT ATTENTION FROM THE JEWISH RAP PRESSAll rap is funded and created by Jews, though.
>>108683788What immense resources do api models require that they can't be run local anymore?
>>108683815open weights for starters
>>108683815They can be run locally with blackwell pro gpus, but companies stopped releasing them after an agreement with ComfyUI. Such models were deemed 'too big' to be worth implementing, so now they'll just be behind API. That's what happened with WAN 2.5, and now Qwen.
>>108683789that's a good idea
>>108683002everyone laugh at the poor!
standby, generating some ltx2.3 kinos right now
>>108683815indians think every image they gen on banana or image 2 requires hundreds of gigs of vram.
>>108683096>>108683101thanks!
>>108683883no one proved those saars wrong though, is the local 6b model that is on the same level as NBP or GPT-image 2 in the room right now?
>>108683806>THE JEWISH RAP PRESSI'm kinda humored by the idea of a bunch of hasidics being like "really diggin the flow of the new j cole album. got some b.i.g. styling to it"
>>108683821Does that mean the usual gatekeeping then if I'm feeding my prompt to idk the wan2.5 server or whatever?>>108683820I don't know what that is or why it's important
>>108683906That's Rick Rubens entire job.
>>108683917which one is rick rubens
>>108683905>every local model is actually SDXLwe will ignore the fact that after a couple of weeks every api model gets hit with the same sea of complaints about reduced image fidelity when they quietly switch over to quantized models.you already have people complaining about the image quality of gpt2, next week they will update their censorship and copyright guardrails. and then it's the same old ballgame of "well it's still better than *insert 2-3 year old model*."
>>108683934the guy with the big forehead
Anon, link the realism LoRA for anima
>>108683941sure, a local model that'll reach GPT Image 2's level will surely be there anytime soon, 2 weeks!
>>108683771chroma is like searching a junk yard for valuables people have accidentally thrown out
how can i use this thinghttps://civitai.com/models/253383/super-danceis just a bunch of pictures
>>108683963how many giggle bits of super compute to cook up these beauties? api bros eatin good
>>108683947I've tried uploading to civitai but the fucking site wont work.
>>108683970Control net
>>108683976Imagine taking pride on API models while posting girls, when in reality can't do generate anything adult/nsfw related, is like hearing an ultra religious guy talking about sex and porn
>>108683976>random shit that doesn't make sense>multiple fingers>fucked up gun>"details" are just noise added across the whole imageI swear this thing is just a 10b active parameter MoE model that has been hyper optimized for text and chart rendering, with a prompt enhancer LLM put in front of it.
>>108683986Use hugging face anon? CIVITAI runs ok today in EU
>>108683976wtf is that lmao
>>108683976the policewoman has two foreheads lmao
>>108684012well they did say it was a thinking model.
OH WE GENNING
>>108683976The only thing you're eating is Sam's dick faggot
>>108684008That right there is ground truth.
gen image of sexy woman and spot the errors then fix them on repeat until it's perfect, no mistakes. thanks agentic image genning.
>>108683822Abysmal background for a 9b model.
Can't gen this with a [FEMINIST] ai:https://files.catbox.moe/a036xv.flacAs a coincidence, catbox will be 18 in exactly 7 years.
>>10868406718 is too old
>>108684022>512x512its 2023 again
>>108684083It's a flac, you can change the lyrics etc. but uh. I just now realized I had like... a broken personally custom node (for debug).Let me gen a new one, and you can work on that lol.My point is that (again, imo) the feminism of the commercial music models won't allow this.
>>108684093what model? it's not super great right now but I can see this improving
>>108684084It's my sd1.4 in my Ace Step 1.5 (now XL) wf.It's basically instant, like idk let me check... well, for me it's a pathetic 9 seconds, but on an nvidia rig it will instantly appear.sd1.4 easily trounces modern models for album art, it's not even close.picrel is genning (the audio part of the wf)
>>108684084Here's a gen from 2022
>>108684102SOUL
>>108684098Ace Step 1.5 XL. I was doing pre-XL, but I think XL really is better (it's like 2x the size at 9gb). It's fun. That's all it has to be.
>>108684104:) I like soft jawlines.
That is the speech of a soulless corpo, worthy of being from Apple, Windows, or Google, what hat has ComfyUI transformed into? They're celebrating like venture capitalists, throwing around metrics about user growth and annualized bookings as if this community project was always meant to be a startup pitch deck. This corporate doublespeak about "investing in what the community cares about" while simultaneously courting top talent and scaling like a Silicon Valley unicorn is exactly the kind of grifting that betrays what open source is supposed to represent.
>>108684125wasn't the comfy guy some anon on here who just wanted to learn about how stable diffusion works?
>>108684125>That is the speech of a soulless corpo, worthy of being from Apple, Windows, or GoogleMore like a cryptobro bragging about their coin drops. This dude isn't professional at all. Fucking loser.
>>108684102Same prompt today
>>108684125comfy is the most retarded bullshit program ive ever fucking used convoluted dog shit that you have to load 40000 different mods to do anything at all fucking need to just kill themselves already
>>108684125all you had to say was that he sounds like a faggot
>>108684138seething brainlet confused by a handful of nodes
>>108684135i wanna fuck marie rose
>>108684125This is very sad and Friday's events marked a before and after in the history of /ldg/.
>>108684142ah yes just a few nodes! 60000 nodes laterkill yourself
>>108683806
>>108684146>had to exaggerate massively to make his point.just say your tiny brain is overwhelmed and ask for help
>>108684161shut the fuck you fucking pajeet faggot retard. im sure you use templates just shut the fuck up before i smash your bloody skull in
I will continue to post api gens in ldg, you lot deserve it after that absolute comfyshill embarrassment
>>108684171we really should rebrand or remove comfyui from the OP because of that. why are we keeping corpo shit in the OP? at least link to one of the de-saas'd comfyui forks instead
>>108684169YOU BLOODY FOCKIN BASTAR
>>108684169maybe ms-paint is more your speed? Or do all the buttons and toolbars anger and confuse you?
>>108684169having a melty again anifart? you forgot to take your meds today?
>>108684193your fuckin existence angers me you worthless fucking street shitting monkey fuck!
>>108683577holy shit this got this good when I was out? maybe I can finally ditch my wan
>>108684204>meltyplease leave zoomie>anifartwhomst?
>>108684177what would that change exactly? everyone uses comfyui because it is the best tool for the job, bar none.
>>108684209>whoyou don't remember your own name?
Dall-e API users:>Hey OpenAI dropped a new image model>Sweet, I'll check it out and see if it's any goodLocalkeks>WE JUST RAISED 500 BILLIONS DOLLARS OF FUNDING FOR API NODES, PLEASE LIKE RETWEET AND SPREAD THE NEWS TO WIN COMFY CRYPTO! THANK YOU BLACKROCK AND CHASE CAPITAL, MAKE SURE TO SCAN YOUR ID TO USE THE NEWEST BYTEDANCE NODES!Why are they like this? 'local' shills for API more than API themselves.
>>108684125does it hurt your feelings to discover all comfy ever wanted was to be a san francisco techbro?
>>108684205>your fuckin existence angers megood, I'm glad I'm making you seethe that much, feelsgoodman