Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109185426https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>>109188075>>Maintain Thread Qualitywhy are these still here
>>109188091to maintain thread quality, what a dumb question "anon"
>>109188102well...has there been an issue with the thread quality for a while?
>>109188103Claude is so damn useful I'm thinking about upgrading to the pro plan.
claude sucks fucking dogshit
>>109188108you're proving my point, there's no issue with the thread quality because the links are here, you remove the links, the schizos come back to shit up this place
>>109188117that makes zero sense but ok
>>109188116way faster/better than asking /g/ for questions.
>>109188121>that makes zero sensethat's because your IQ is too low to get it I guess
>>109188091I think its mainly because the guy is openly black and thats enough to fracture the egos of 4channers into tiny pieces, lmao.
>>109188075>>109188088It may be that krea could do better with more early steps, idk. it's sort of like body horror behind a newspaper. Like liquid bones.
>>109188115Krea 2 raw with cfg set to 1 and the turbo lora at 0.6 strength seems to produce better results than the turbo model with minimal speed difference
claude is good but the company is a bunch of sóy guzzling reddit jews who hate their customers
>>109188134krea is just bad loras. It doesn't know what a person looks like.
>>109188112Why use Claude when you can just use Gemma 4 12B locally?
>>109188112>>109188112Yeah, I want a corpo AI in my explroer, yes, please! Let it browse and explore my folders. Please, Mr. Claude, feel free to browse and inspect my loli folder!!
>>109188140Gemma is actually shit
Gemma is better than claude simply off the basis that if I say "sex" or "cum" Gemma-chan won't start lecturing me about guidelines.
>>109188148retarded take
>>109188158HeyFuck you
>>109188158Shit meant for >>109188102
>>109188154Claude is for productivity. Why would you be prompting it degenerate sex terms?
>>109188154Last GLM on OpenRouter is already complete trash and it's a very heavy model. A 12B mosel running locally must be actual shit.
>>109188162Well I get horny at work sometimes and I need to goon sooooo
>>109188112>>109188116>>109188137>>109188154>>109188162>>109188167Local diffusion?
>>109188162It fucking sucks ass for productivity too dweeb-kunthe code sucks as of Opus 4.5 - everything after has been absolutely fucked through miles and miles of system prompts made to ensure anthropics shitty fucking business model. nerfed outputs, outputs that make no sense, outputs that are in response because the model thinks you are distilling it. Clude fuckin sucks, end-a-story!
>>109188170so do you go like "gen me an ascii pussy" or what
>>109187157>>109186814int8 convrov might be breaking Krea Turbo gens a bit more than anticipated
>mfw Resource news07/02/2026>PAPA: Online Personalized Active Preference Alignmenthttps://github.com/NasikNafi/papa>Condensing Large-Scale Datasets Directly with Minimal Information Losshttps://github.com/LINs-lab/CIM>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoninghttps://y-research-sbu.github.io/VisReason>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts https://github.com/tin2tin/Asset_Generator-2D-3D>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUIhttps://github.com/trx7111/ComfyUI-TrixLoader07/01/2026>Elastic Diffusion Transformer: Accelerating SOTA generation modelshttps://github.com/wangjiangshan0725/Elastic-DiT>Boogu-Image-0.1-Edit-Turbohttps://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo>GEAR: Guided End-to-End AutoRegression for Image Synthesishttps://github.com/Tencent-Hunyuan/GEAR>SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPEhttps://orhir.github.io/SpheRoPE>ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMshttps://github.com/yao-ustc/ADAPT>Phase-Aligned RoPE for Mixed-Resolution Diffusion Transformerhttps://hao-yu-wu.github.io/mixed_res>Ecocoro Preview 1 https://huggingface.co/alfredplpl/ecocoro-preview-1>ComfyUI FL-MCPhttps://github.com/filliptm/ComfyUI_FL-MCP>Magnificent 7 value shrinks by $2.3 trillion amid AI spending jittershttps://www.cnbc.com/2026/06/30/magnificent-7-stocks-sell-off-investors-grow-jittery-on-ai-spending.html>ShutterMuse: Capture-Time Photography Guidance with MLLMshttps://lijayutnt.github.io/ShutterMuse>ASASR — Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super-Resolutionhttps://huggingface.co/wafer-bob/ASASR>Qwen3.6-27B NVFP4: Quantized version of Alibaba's Qwen3.6-27B modelhttps://huggingface.co/nvidia/Qwen3.6-27B-NVFP4>Horus Lens 1.0https://huggingface.co/tokenaii/Horus-Lens-1.0
>>109188179you couldn't handle my gens
>>109188180real its why i dont even use nvfp4 and stick to bf16 because the former has always brought lora troubles and id totally expect the same from this new int8 meme format
>>109188185>mental illness haircutthat is unfortunate.........
>>109188185Usecase for the male in the picture?
>mfw Research news07/02/2026>Training-Free Debiasing of Diffusion Models via CLIP-Guided Denoising Optimizationhttps://arxiv.org/abs/2607.00817>AVSR-Diff: Scale-Agnostic Diffusion Priors for Temporally Consistent Arbitrary-Scale Video Super-Resolutionhttps://kaist-viclab.github.io/AVSR-Diff>EquiSteer: Cross-Attention Steering Towards a Fairer Text-Guided Image Generationhttps://arxiv.org/abs/2607.01147>Towards Memory-Efficient Autoregressive Video Generation via Instance-Specific Parametric Absorptionhttps://arxiv.org/abs/2607.00712>DriftScope: Measuring The Hidden Effects of Diffusion Model Adaptationhttps://arxiv.org/abs/2607.00183>Vitality-Aware Compression for Efficient Image-to-Shape Diffusion Transformershttps://arxiv.org/abs/2607.00382>Decoupled Guidance: Disentangling Subject and Context Pathways in Text-to-Image Personalizationhttps://arxiv.org/abs/2607.00766>Post-Training Pruning for Diffusion Transformershttps://arxiv.org/abs/2607.00927>The Illusion of High Utility in Safety Alignment of Text-to-Image Diffusion Modelshttps://adeelyousaf.github.io/SAGE_ECCV26_Project_Page>Not All Prediction Targets Keep Training-Free Diffusion Guidance on the Manifoldhttps://github.com/ManLuML/on-manifold-tfg>MEPA: Multi-Scale Representation Alignment for Visual Autoregressive Modeling with Mixture of Expertshttps://arxiv.org/abs/2607.00371>Flow-Map GRPO: Reinforcement Learning for Few-Step Flow-Map Generators via Anchored Stochastic Compositionhttps://arxiv.org/abs/2607.00535>M2Note: Continual Evolution of Vision Language Models via Mistake Notebook Learninghttps://arxiv.org/abs/2607.00685>MoHallBench: A Benchmark for Motion Hallucination in Video Large Language Modelshttps://arxiv.org/abs/2607.01117>Selective Test-Time Debiasing for CLIP via Reward Gatinghttps://arxiv.org/abs/2607.00423>LeVLJEPA: End-to-End Vision-Language Pretraining Without Negativeshttps://arxiv.org/abs/2607.00784
>>109188191that haircut is sexy and if you don't like it...you're gay
>>109188194none i can see
>>109188042You're wrong, and here's why.>weShould be capitalized.>,Should be a semicolon.>country bashingShould be hyphenated.>ffsThis expression of frustration at such an early point in the post gives the impression of impotent inarticulacy; this is deleterious to the post's rhetorical effectiveness.>iShould be capitalized.>how good their englishEnglish should be capitalized. "Fluent" should be chosen over "good", and of course you must append the word "is" to make this grammatically correct.>they as nice peopleAre*.>rightAs an interjection, this smells of a lower-class upbringing.>i'mShould be capitalized.>britishShould be capitalized, and British is not a country identity, it is a weak national identity typically associated with recent immigrants and "Multicultural Britain" ideologues; although a handful reactionaries who fondly remember the Empire will also self-identify as British.I would continue but I fear it would be cruel to go any further, as I suspect reading is laborious for you.
>>109188204holy unemployment
>>109188204absolutely fucking creampied that dudes mum with that post lowk
>>109188204go touch grass
>>109188204>>109188212>samefagthat's exactly why we need IDs, this faggotery needs to stop
>>109188134Catbox for that gen?
>>109188216cope
Feet are nice and all, but not nice enough to have pics like those, wtf are yall doing
>>109188216>samefagSee the attached image.>that'sShould be capitalized.>,Should be a colon.>faggoteryShould be spelled "faggotry"; I will excuse your choice of a vague and vulgar catch-all (rather than a word exactly suited to the purpose) on the grounds that "faggotry" echoes the "fag" in the "samefag" accusation.>needs to stopNeeds a stop, as it happens. Or you may know this piece of punctuation as a "period".
>>109188140this ffs this local man come home. gemma 4 is very good actually trained on Gemini. its purpose? to assist you and its good enough for almost any problems you might have and if you code something to give it to web search even better. It can vibe code solutions for giving it self web access and well you got a really powerful llm for free. it can do images, well there a few variants of gemma 4 that do certain task's better so people should check that out. I don't bother as googles search ai is usually all you need for most things. Claude if for coding but its expensive I hear, gemma 4 can do vibe coding but you got to know when to refresh the context window etc. Most people think that a larger context window = better, it actually don't because you get more hallucinations the large that context grows due to the llm reading the entire context window every turn.
>>109188144>no using a fucking clean no personal shit isolated system when using a powerful llm
>>109188254>See the attached image.you mean the photoshopped image?
>>109188254>>109188204shieet nigga y u actin a fool maneu sum kinda nazi? jus press da run butten and post dem waifus bruva
>>109188204>>109188254new schizo in town?
>>109188275there's only one schizo in hereand it's likely not me
>>109188134Muh dick
>>109188281There's only one schizo in here, and he's responsible for all of the posts I dislike, including many seemingly-innocuous gens which are coded attacks on me personally.
>>109188281>there's only one schizo in here>and it's meindeed
A hero appearshttps://civitai.red/models/2749801/rough-stuff?modelVersionId=3093366
there's only one person in this thread, and it's me.
>>109188319I know. Please stop persecuting me.
Now I see why the rentry is needed
>>109188180>>109188189Nvm, seems like the VAE is just too awful to handle group photo generations.
>>109188302it always seem to generate a dude with black gloves grabbing the woman's head
>model knows asukaperfect, this is the future
>>109188194>>109188198I guess that's true.
>>109188374Why does her upper body turn into a DAZ 3D weg gobbo?
>>109188374are you trying to get a vacation or
comfyui, ideogram.does face detailer work for you anons or it errors out?
>>109188493newer models dont need face detailer anon, thats a thing from the past
ziimage is but a relic now. rest in pss china, you lost the moment you sold out to api. slant-eyed jews
>>109188354yeah its all fucking bullshit i'm seriously considering hiring some GPU and creating dedicated lora's to smite those who will attempt to farm buzz because fuck those people. mutli concept lora's are shit, always was and always will be.
>>109188493Why do you upscale these to a retardedly high resolution
>>109188553Meant for >>109188542
>>109188553for my ultrawide giganto monitor that I work onalso for fun
>>109188374glad you've come around. got a catbox?
>1.6GB LoKrits ogre
im too sick to gen i feel like doo doo :(
>>109188354since you seem to be the only anon interested if you can find the gay anime lora for krea and also the inne pussy lora it works quite well together for anal at least. the inne pussy lora cancels out putting a dick on the receiving 1girl and the gay lora ensures anal sex always because that is mostly what its trained on and probably blowjob, but that lora is anime so it makes it look like a cartoon.basically this https://civitai.red/models/2745590/krea-2-nsfw-anime-yaoi-lora?modelVersionId=3088233and this https://civitai.red/models/2744291/innie-vagina-puffy-labia-majora?modelVersionId=3086654despite the limitations we have all things point to krea being a really good model for nsfw gens once we have decent lora's for each concept and not just all in one slop loras to farm buzz by stable yogi or some other fucking retard. among all the other trash these creatures upload to civitai such as AI influencers and other trash like 3 tits or face on crotch legs, amputees and all the usual fucking low effort slop that always comes first.
>>109188712>>109188712weird combination but I'll give it a shot laterthx anon
RAW gens look like absolute doo doo regardless if steps are adjusted or style loras are applied.
>>109188195Am I misremembering or did something come out to make safetensors more vram efficient? A new type
>>109188354are you using any filter bypass loras or nodes?