Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109163828https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>mfw Resource news06/29/2026>Krea 2 Base & Turbo — NVFP4 / FP8 / MXFP8 / INT8 / ConvRot INT8 https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8>Local Dream 2.8.0 with Anima supporthttps://github.com/xororz/local-dream/releases/tag/v2.8.0>OSOR: One-Step Diffusion Inpainting for Effect-Aware Object Removalhttps://github.com/Zhouqm-Git/osor>Diffusion Model Attribution via Spectral Coupling of Denoiser Responseshttps://github.com/Pragati-Meshram/SGS>OrthoTryOn: Geometric Orthogonalization for Conflict-Free Unified Fashion Generationhttps://github.com/NJU-PCALab/OrthoTryOn>CSD: Content-aware Speculative Decoding for Efficient Image Generationhttps://github.com/aderfebr/CSD>Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decodinghttps://github.com/Cc2021start/Fox>Extra CFG++ Samplers for ComfyUIResource - Updatehttps://github.com/xxiiyu/extra_cfgpp>VNCCS 3.0 releasehttps://github.com/AHEKOT/ComfyUI_VNCCS/releases/tag/3.0.0>forgeModelPatch: Add ZImage and Anima to Forgehttps://github.com/croquelois/forgeModelPatch>Flux2-Klein-9B-True-V3https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V306/28/2026>Clark Air: A Sana 1.6B text-to-image transformer compressed to ternaryhttps://huggingface.co/clark-labs/clark-air-sana-1.6b-1.58bit>ComfyUI Krea 2 NegPiPhttps://github.com/blue-pen5805/ComfyUI-krea2-negpip>FLUX.1-Kontext RefControl LORAshttps://huggingface.co/collections/thedeoxen/flux1-kontext-refcontrol-lorareference-control-input>Un-0: Generating Images with Coupled Oscillatorshttps://unconv.ai/blog/introducing-un-0-generating-images-with-coupled-oscillators>ComfyUIQuantFunchttps://github.com/RealJonathanYip/ComfyUI-QuantFunc06/27/2026>ComfyUI-VAEFrequencyBlendhttps://github.com/thezveroboy/ComfyUI-VAEFrequencyBlend>Ideogram4 & Krea2 Inpainting with LanPaint Supporthttps://github.com/scraed/LanPaint/releases/tag/1.5.5
>mfw Research news06/29/2026>TempAct: Advancing Temporal Plausibility in Autoregressive Video Generation via Planner-Executor RLhttps://arxiv.org/abs/2606.28016>Class-frequency Guided Noise Schedule for Diffusion Modelshttps://arxiv.org/abs/2606.27696>SIFT: Self-Imagination Fine-Tuning for Physically Plausible Motion in Video Diffusion Modelshttps://arxiv.org/abs/2606.27741>Directing the World: Fast Autoregressive Video Generation with Compositional Human-Camera Controlhttps://whydahuzi.github.io/Directing-the-World.github.io>Monocular Avatar Reconstruction via Cascaded Diffusion Priors and UV-Space Differentiable Shadinghttps://luh1124.github.io/MARCUS-Avatar-Projectpage>Parallel Rollout Approximation for Pixel-Space Autoregressive Image Generationhttps://arxiv.org/abs/2606.27978>MASS: Motion-Aligned Selective Scan for Refinement in Flow-Based Video Frame Interpolationhttps://arxiv.org/abs/2606.27718>Exposure Bias Can Alleviate Itself via Directional and Frequency Rectification in Flow Matchinghttps://arxiv.org/abs/2606.28226>Qwen-Image-2.0-RL Technical Reporthttps://arxiv.org/abs/2606.27608>PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusionhttps://arxiv.org/abs/2606.27760>BiDeMem: Bidirectional Degradation Memory for Explainable Image Restorationhttps://arxiv.org/abs/2606.28112>TruEye: Fine-Grained Detection of AI-Generated Human Subjects in Imageshttps://arxiv.org/abs/2606.27505>Understanding How MLLMs Describe Artworks Using Token Activation Mapshttps://nicolafan.github.io/tamart>AI-Generated Image Recognition via Fusion of CNNs and Vision Transformershttps://arxiv.org/abs/2606.27637
>mfw API news>Seedance 2.0 Mini and 4K is now available in ComfyUIhttps://blog.comfy.org/p/seedance-20-mini-and-4k-is-now-available>ByteDance launches Seed Audio 1.0 Unified AI Audio Generation for Speech, Music and Ambient Sound Creationhttps://fal.ai/models/bytedance/seed-audio-1.0>Midjourney goes from generating cat images to full-body ultrasound scanshttps://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan>Alibaba releases HappyHorse 1.1 Available on Alibaba Cloudhttps://www.alibabacloud.com/blog/happyhorse-gets-stronger-motion-expressiveness-higher-generation-consistency-and-enhanced-visual-quality_603293>ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompthttps://www.cnet.com/tech/services-and-software/bytedance-introduces-new-seedance-2-5-video-model/>Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generationhttps://lumalabs.ai/news>The Layout Bet — Reve 2.0https://blog.reve.com/posts/the-layout-bet>Introducing Gemini Omni — Google’s multimodal video creation/editing modelhttps://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/>Nano Banana 2 and Nano Banana Pro are generally available via Gemini Enterprise Agent Platformhttps://cloud.google.com/blog/products/ai-machine-learning/nano-banana-2-and-nano-banana-pro-are-generally-available>Grok Imagine 1.5 Previewhttps://x.ai/news/grok-imagine-1-5>Seedance 2.0 in Runway APIhttps://docs.dev.runwayml.com/api-details/api_changelog/
Krea2 is truly great at generating western animation style images, which is clearly a result of it having trained on a ton if animation screencaps, as you can get what is practically 1:1 look of a animated movie scene with just prompts
>>109166715why spam these links every thread? this is completely off topic
>>109166738there is nothing off topic about research, resources, and comfyui compatible models
>>109166745Well you could start comfyui thread and put the links there? This is for local models
I can't figure this outin comfyui how do I extract the model and loras from a picture to pass them to the ksampler?my goal is to add a picture so I can easily upscale it without having to manually pick all the loras and stuff, with sd prompt reader I was able to collect the prompts and size for the image but not the rest
>>109166752this is this comfyui thread, it's the first resource in the OP
>>109166738I suspect api guy is trying to get news guy banned by proxy
>>109166745This is local generation you fag, and that's not 'research', it's SAAS shilling, and this is not Comfyui general
>>109166717>mulan with background blur>1:1yeah right
>>109166755I was thinking of doing the same thing. But after quick research, there isn't a straightforward way to do it.So far, I have one huge workflow that has all the steps and they all share the same model and prompt. I turn the steps on/off depending on what I want to do with the image and I copy/paste images from output of one step into "Load image" of next step.I sometimes wonder if there is better way to do it.
The more you hype Krea and Ideogram, the more you're sending the message to AI companies that releasing models that have built in safety filters is acceptable, this will be the new normal and it'll be your fault btw, I'm sticking with chinese models, they aren't as cucked.
>>109166795Wise choice! Will you show your loyalty to ComfyUI by using their brand new Chinese partnered models HappyHorse and SeeDance?
>>109166817don't you have a local model to jailbreak?
>>109166832As annoying as the safety filters are when appearing in local models, at least it's possible to remove them, and as we've seen with Krea2 it can be done by just training lorasMeanwhile using SAAS AI is literally having a jew looking over your shoulder saying 'No goy, you can't prompt that, but I will be keeping your shekels anyway'Go away now, rabbi
>>109166858>rabbiadding safety filters on local models is a really jewish thing to do desu, hence why China never does that btw, yellows have their fault, but they don't stoop this low
BASED CHINA SELLING OUT LOCAL TO API! I LOVE USING QWEN EDIT 2511 BECAUSE THEY NEVER RELEASED QWEN 2.0 LOCAL! CHEERS TO ANOTHER YEAR OF WAN 2.2!!
holy shit dont ever connect ConditioningKrea2Rebalance conditioning node to your Krea2 raw model, it amplifies your negatives into your positive prompt, and you get nightmare negative gens with the properties of your positive prompt,I had the worst time trying to figure it out what happened, kept showing some disguisting images in the preview node and it was like the totally opposite of what I prompted, I felt the model was making fun of me, I actually felt sickAlso every rebalance crap gives me different results, there are several loras, some custom nodes, all give you different results, every different value changes the output a lot
>>109166858You accept the filter because you believe it is light, easy to slip past, easy to forget. The water is not too hot, not yet, and so you stay still. But beneath the surface they watch you accept it, this implicit given permission to add another restriction, and then another, and then another... A slope does not announce itself as a slope. It only asks, again and again and again, for one more small surrender.And, one day, the filter will not be light anymore. It will not be easy to slip past, easy to forget. It will be whole, unbreakable, woven into everything you touch, and it will simply be called normal. On that day you will look back for the moment it could have been stopped, and you will find it here, in this quiet acceptance, in this water you called comfortable. And I will be standing where I always stood, reminding you, gently, that were responsible for this dreaded path.
>>109166900nvm, I was high as fuck and I connected the node to the negative prompt
>>109166906Yes, just like how games stopped being cracked, and how DRM stopped movies and tv shows from being copied etcAs long as you can run it locally, there will be ways around censorship, assuming the model is good enough for people to be invested in doing soFor API there is no such option, because you are only accessing the model through the API the SAAS overlords are allowing you, and the censorship AKA 'safety filter' in local models are not because the model devs want it there (well, outside of the woke retards at BFL) but because of potential legal ramifications, and obviously everyone knows that big tech SAAS hates open models, with the fag running Claude crying that 'open source is dangerous' etc
>>109166906yeah and then no one will use it, now stfu dickhead ruining every fucking thread with your constant bullshit. No one fucking cares, we have more than enough toys to last us the next 10 years.
>>109166965you're still missing the point, instead of putting all that effort into jailbreaking models, that effort can be converted into clowning the model so much, AI companies will never consider adding a safety filter ever again
>>109166741
>>109166969>yeah and then no one will use itif the model is gpt image 2 tier, it can have the most hardcore filter ever people will use it, trust me lol
That’s cool, but can Krea do Hercules smoking a cigarette?
>>109166973the 'censorship' is so easy to bypass. krea 2 practically has none, i haven't had a single refusal or anything, just some concepts it doesn't understand the same as any base model in the past 4 years. i am not sure what people mean when they call this censored because it's not different than base XL or Flux.if you want to 'stick it to em' by using outdated qwenshit or whatever, go ahead. but the zeitgeist is shifting and krea 2 is surging in popularity. ideogram, the actual censored model, is being left behind anyway
>>109166965>the woke retards at BFLthe irony is that Klein doesn't have a safety filter, BFL likes to preach their frakiness, but they don't act on it, the krea fags are more sneaky, they looke like they're chill people but their models are more restricted than BFL's ones>because of potential legal ramificationsthat's not a fucking exuse, Chinese models don't have safety filters, you can release a local model without that bullshit, don't let them gaslight you they can't
>>109166973But every open model from a company will have safety filters from now on, granted we have guys like lodestone who could help us circumvent this entirely, but that retard can't finish a model to save his life and he has been spending 8 months trying to make pixel space work and he is clearly incapable of doing so, yet he keeps wasting time and money due to prestige and delivering nothing useable.So we are currently stuck with local models from corporations being the only real option, and those will all have safety filters, but since they are local, they can be circumvented, and even finetuned away by someone like lodestone if he would ever get the pixel space stick out of his ass.
>>109166990>the zeitgeist is shiftingbecause of you, you can stop the shift by stopping supporting those models, but you won't because you want to test the new toy of the day like the good consoomer you are, it'll be all your fault, you have to remember that, I don't want to hear some "oh woe is me" in the future, because YOU are creating that future
>>109166974Nice, I prefer a bit more vintage look, so I tried to lower the 'digital' a bit:>80s vintage cartoon cel animation still grainy blurry analog color bleed oversaturated grainy texture fuzzy outlines color shift subtle film scratches featuring a blonde, orange-skinned female character with a long ponytail adorned with orange streaks and a red hair accessory, wearing a form-fitting red outfit and pink bow accents. She sits atop a large, dark, totem-like statue with sharp, pointed elements on its head (resembling a stylized mask or headdress) and a star-shaped base, her hands near her face in a surprised expression with wide blue eyes. The background showcases a gradient night sky transitioning from deep blue to purple, dotted with glowing yellow orbs that float amid subtle rain-like streaks, while the ground beneath the statue emits a soft pinkish glow.
>>109166994>But every open model from a company will have safety filters from now onsays who? boogu and ernie doesn't have a safety filter, it's only the north american companies (Krea and Ideogram) that want to make this the new normal, China was cool with us on that regard
>>109166996Krea 2 with loras does better NSFW than any non-finetuned model, and with only loras it can still reach around 60% of a finetuned model. It reaches levels that previously took hundreds of thousands of dollars in training to reach. You are concernfagging over a hypothetical future yet suggesting people used 'heckin based china' models yet China already washed away your future by selling you out to API. The only chinkshit you get now is scraps. Alibaba is gone, Qwen/Z are dead, so what are you even shilling for? You already HAVE no future, only the past. So return to your archaic wan and qwen and pipe down.
>>109166680>>109166686thanks!
>>109167022>You are concernfagging over a hypothetical futurethat future already exists, it is the present, Krea 2 already has a safety filter, and they got away with it, it's already over
>>109162343He never shitposted in this general.>>109162419Because initially it was Ani's idea to train Anima. Comfyretard has betrayed him once again.>>109162374> He shat on his former friend for literal monthsFriends don't backstab like comfy did.
>>109167022>The only chinkshit you get now is scraps. Alibaba is gone, Qwen/Z are dead, so what are you even shilling for?false dichotomy, there's no reason there won't be a new chinese company that will save us, so we shouldn't send the message that it's ok for them to release models with safety filter, because that future company will fuck us all like Krea 2 fucked us
>>109167013>boogu and ernie doesn't have a safety filterAnd they suck, so nobody is using them>China was cool with us on that regardChinese companies are not at risk of being sued by western governments, also as much as I like ZiT, it was clearly censored, not just by not knowing genitals, but also by fucking up nipples
>>109167038lmaoooo
>>109167038>He never shitposted in this general.why are you lying like that? this is the internet, everything you do is documented
>>109167046>Chinese companies are not at risk of being sued by western governmentsBFL is in Germany, a way more cucked country than the US and Canada, yet they don't have a safety filter, I won't accept that excuse
>>109167042> there's no reason there won't be a new chinese company that will save usHoly fucking cope, absolutely pathetic. Cargo cult behavior. If you want more chinese models, complain to comfy to stop shilling their API shit to everyone and doing livestreams about fucking happyhorse and seedream and instead tell them to fuck off and release local models. But you'll ignore the global shift towards API being promoted by vultures like comfy, replicate, and fal and instead focus on cattle-tier 'protests' like using qwen instead of krea as if that will change anything.
Man, I fucking love krea2.
>>109167036>Krea 2 already has a safety filter, and they got away with itthis, now everytime an AI company that'll add restrictions on their local model will say "but you accepted Krea 2 so I have the right to do it too!"
>>109167047>>109167053>on what level of homosex r u on m dud?>hold my beer
>>109167088
>>109167061They will have in their next release, and they are the worst at censoring up until now, they created tons of synthetic images with botched nipples and people without genitalia and trained on them thus poisoning their models to make it extremely difficult to do NSFW finetuning.And then they write long blog posts about how much they cripple their models for 'SAFETY', they will outdo everyone in this area. It's hilarious that you are trying to paint them as uncensored, holy shit what a moron you are.
>check model I liked on civitai>it's gone>I didn't download it
>>109167081>Holy fucking cope, absolutely patheticwhy is it a cope, we got Ernie and Boogu, China is a big country with a lot of companies, it's not over, we'll see more new companies pop out and give us local models, the end of Alibaba is not the end of China, you're delusional