Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>109166674https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUISDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineageWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://huggingface.co/modelshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Krea 2https://huggingface.co/krea/Krea-2-Rawhttps://huggingface.co/krea/Krea-2-Turbo>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
First for anima is on par with illustrious
Blessed thread of frenship
>38 stars>last update 3 months ago
Int8-convrot is the best thing that happened to the diffusion community in 2026, and it's not even close, I said what I said.
>inb4 nigbo is lonely again
where NVFP4
gguf status?
>>109169252my gguffy is full
>>109169252when convrot gguf? I want my int5 convrot
>>109169277I can fix her
>>109169277ZiT's still the goat in realism, some things just don't change
Last day of local pride :(
>mfw API news>Seedance 2.0 Mini and 4K is now available in ComfyUIhttps://blog.comfy.org/p/seedance-20-mini-and-4k-is-now-available>ByteDance launches Seed Audio 1.0 Unified AI Audio Generation for Speech, Music and Ambient Sound Creationhttps://fal.ai/models/bytedance/seed-audio-1.0>Midjourney goes from generating cat images to full-body ultrasound scanshttps://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan>Alibaba releases HappyHorse 1.1 Available on Alibaba Cloudhttps://www.alibabacloud.com/blog/happyhorse-gets-stronger-motion-expressiveness-higher-generation-consistency-and-enhanced-visual-quality_603293>ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompthttps://www.cnet.com/tech/services-and-software/bytedance-introduces-new-seedance-2-5-video-model/>Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generationhttps://lumalabs.ai/news>The Layout Bet — Reve 2.0https://blog.reve.com/posts/the-layout-bet>Introducing Gemini Omni — Google’s multimodal video creation/editing modelhttps://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/>Nano Banana 2 and Nano Banana Pro are generally available via Gemini Enterprise Agent Platformhttps://cloud.google.com/blog/products/ai-machine-learning/nano-banana-2-and-nano-banana-pro-are-generally-available>Grok Imagine 1.5 Previewhttps://x.ai/news/grok-imagine-1-5>Seedance 2.0 in Runway APIhttps://docs.dev.runwayml.com/api-details/api_changelog/
>>109169286So GOATed that you forgot to attach a gen?
>>109169286It's a model not a goat, styupid chinese man.
any data hunters, scraper and hoarding fags here. how many hours do you spend saving and downloading content off of social media, youtube, telegram and other media sharing sites. do you ever plan to use your massive pile of files for ai training? would you donate your collection for a open source model training.
>>109169301>>109169302>safety filter enjoyers cucks got uppity
>>109169299This is the local thread debo fuck off
>mfw Resource news06/30/2026>OmniDance: Multimodal Driven Dance Video Generation with Large-scale Internet Datahttps://github.com/AMAP-ML/OmniDance>SAFE-DiT: Semantics-Aware Fast-path Execution for High-Resolution Diffusion Transformershttps://github.com/xuanhuayin/SAFE-DiT>EcoVideo: Entropy-Orchestrated Video Generation Paradigm in Cloud-Edge Dynamicshttps://github.com/IF-LAB-PKU/EcoVideo>See Only When Needed: Context-Aware Attention Intervention for Mitigating Hallucinations in LVLMshttps://github.com/Iris1946/CAI>Spanning the Visual Analogy Space with a Weight Basis of LoRAshttps://research.nvidia.com/labs/par/lorweb>Krea 2 LoRA Trainerhttps://github.com/CaptainGrock/Krea2Trainer>Ideogram JSON Captioner Kit - making ID4 datasets slightly less painfulhttps://github.com/Adudeguyman/Ideogram-fantastic-upgraded-captioning-kit06/29/2026>Krea 2 Base & Turbo — NVFP4 / FP8 / MXFP8 / INT8 / ConvRot INT8 https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8>Local Dream 2.8.0 with Anima supporthttps://github.com/xororz/local-dream/releases/tag/v2.8.0>OSOR: One-Step Diffusion Inpainting for Effect-Aware Object Removalhttps://github.com/Zhouqm-Git/osor>Diffusion Model Attribution via Spectral Coupling of Denoiser Responseshttps://github.com/Pragati-Meshram/SGS>OrthoTryOn: Geometric Orthogonalization for Conflict-Free Unified Fashion Generationhttps://github.com/NJU-PCALab/OrthoTryOn>CSD: Content-aware Speculative Decoding for Efficient Image Generationhttps://github.com/aderfebr/CSD>Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decodinghttps://github.com/Cc2021start/Fox>Extra CFG++ Samplershttps://github.com/xxiiyu/extra_cfgpp>VNCCS 3.0 releasehttps://github.com/AHEKOT/ComfyUI_VNCCS/releases/tag/3.0.0>forgeModelPatch: Add ZImage and Anima to Forgehttps://github.com/croquelois/forgeModelPatch>Flux2-Klein-9B-True-V3https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V3
>>109169304>would you donate your collection for a open source model training.you can't really share copyrighted data on the internet, you'll get nuked instantly
>>109169154Hmm... now do I dare pull?
>mfw Research news06/30/2026>Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesishttps://arxiv.org/abs/2606.29814>Intermediate Text Representation Guided Text-to-Image Generation for Enhancing One-and-Only Alignmenthttps://basedoun-won.github.io/one-and-only-ir-guidance>Your Data Manifold is Secretly a Reward Model: Shell-LCC for Text-to-Video Generationhttps://arxiv.org/abs/2606.30248>Mural: Transferring LLM knowledge to image generation via Mixture-of-Transformershttps://arxiv.org/abs/2606.29013>Concept Removal Guidance: Evidence-Calibrated Negative Guidance for Safe Diffusion Samplinghttps://arxiv.org/abs/2606.29801>Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editinghttps://arxiv.org/abs/2606.30599>Illuminating Unified Multimodal Model for Free-form Interleaved Text-Image Generationhttps://arxiv.org/abs/2606.30054>MuseBench: Benchmarking Intent-Level Audiovisual Arts Understanding in MLLMshttps://musebench.github.io>Rigel: Self-Distilled Score Adaptation for Image and Video Captioning Evaluationhttps://arxiv.org/abs/2606.29997>MAVIN: Multi-Shot Audio-Visual Generation with Narrative Controlhttps://arxiv.org/abs/2606.29473>DreamForge-World 0.1 Preview: A Low-Compute Real-Time Controllable World Modelhttps://trydreamforge.com>ScaleErasure: Inference-Time Minimal Intervention for Precise Concept Erasure in Next-Scale Autoregressive Image Generationhttps://arxiv.org/abs/2606.29282>The Human Creativity Benchmarkhttps://arxiv.org/abs/2606.30561>What Color is the Sky (for a non-human)?https://arxiv.org/abs/2606.28912>W4A4 Quantization for Inference on Wan2.2-I2V-A14Bhttps://arxiv.org/abs/2606.29337>Self-Evolving Agentic Image Restoration via Deliberate Planning and Intuitive Executionhttps://arxiv.org/abs/2606.28971>StackingNet: Collective Inference Across Independent AI Foundation Modelshttps://arxiv.org/abs/2602.13792
>>109169321>>109169333Fuck off d*bo you're not welcome here
>>109169304>collecting ai vids lol>do you ever plan to use your massive pile of files for ai training? Why wouldn't I? That's the whole point. >would you donate your collection for a open source model training.If I liked the trainer or genuinely thought they'd do something cool then yes. I don't think any of my datasets are large enough to warrant posting as-is on hf like others thoesu
>>109169321>>109169333Thanks!
>>109169328depend on how you share it and what is the content. i believe the sulphur team would value donations of authentic videos to improve training for the next sulphur model. There's plenty of videos on the internet that are not "copyrighted" material that could technically be used for ai training purposes.
>>109169391>non-copyrighted porndoes that even exist?
Copyright is dead anyway none of that matters
>>109169414>Copyright is deadOh god I wish...
looking at the comfyUI custom nodes section I notice there are several options when it comes to background removal. But all these custom nodes are making me shudder. Can you guys suggest me a good and reliable node for background removal?
>>109169414post a complete song on youtube to test that claim and see how long it lasts, that is if it doesn't identify it during the upload phase and prevents you from even posting it
holy shit anon was right about local captioners>Qwen VL 8b Instruct hallucinates things that aren't even happening>wd-eva02-large-tagger-v3 (20x faster from 2024 btw) gets all the tags rightlocal really has no proper tooling for the natural language era. There are no large-scale NL NSFW datasets and as such no proper captioners for them either. local really spent 4 years coasting off the hard work of danbooru taggers while doing nothing to prepare themselves for the future. i bet novelai has a solution though!
Is there a tag for these specific sexo lips like princess peach's?
>>109169475"slop lips"
>rgthree's fast bypasser node has been broken for weeks>ask claude to make me a custom node that does the same thing>works and even has more feature completelol, i love ai
>>109169438You're looking for "layer diffuse" or "layer diffusion"
>>109169475inflated bimbo lips
long dick general
>>109169439Jewtube being jewish has no baring on the state of copyright being effectively dead
>implying Qwen VL 8b is local captioning sota
>>109169497>the state of copyright being effectively deadin what lala land are you living in anon?
>>109169414this, for example, I host all the game music from my waifus games on my website and I claim to own the story she comes from publicly on the site, appropriate web content from other wiki owners, and make vns essentially fixing the shit lore, made a huge manifesto shitting on the creator for being an alcoholic rapist and they ain’t gonna do shit about it cept cry
>>109169407how enforceable is DNP(do not post) towards onlyfans leaks? nobody has been prosecuted and sued for leaking a ethots onlyfans. Only thing e-thots and foids can do is send dmca takedowns and beg the simps and white knights to hunt, report and snitch on the leakers.
>>109169508>no argument
>>109169283miracle penis cures everything>>109169286I think Krea2 might handle this better. OneTrainer support waiting room
>>109169514imagine even saving these
lol looks like mystic v2 and krea are the perfect smut machine. fast and pretty realistic.
>>109169517you didn't provide any argument in the first place, it was just a claim
quarterturn/krea2-yoimiya
>>109169321>>109169333thanks!
>>109169349>>109169522It's the same anon who posts the same 2.5 slop style ad nauseum while chimping out about local being dead. Did you expect him to have good taste?
krea 2 is absolutely mogging at anime and i'm tired of pretending it's not. russ has a lot of training to do if he wants to stay relevant into the second half of 2026.
>>109169545he'll always stay relevant, anima is a 2b model, do you think a lot of people can run a 12b models in the first place?
>>109169558but a 2b model is slower than a 12b model, how is that?
>>109169558> anima> 2026lul
>>109169558not even mentioning that krea can't into danbooru... THE anime database
>>109169360>>109169541:)
>>109169567Only if you compare krea turbo to anima base instead of turbo to turbo I guess, but why would you be disingenuous?
are anons only allowed one model on their computer or something? i just simply use both krea2 and anima
>>109169522twitter/x has soo much good shit. fucking hell, i have 180 tabs open right now. So much good shit gets posted over there and the algorithm is based and addictive as fuck. Spent 6 hours straight just building a stellar blade image collection various fan profiles over there.
>>109169578use case for using more than one model?
>>109169567me when I can gen 8 unique anima gens and 8 identical krea2 turbo gens
>>109169578its not about that, its about where resources will shift towards, you can have all the models in your machine but only a few of them get things released for them, thats what this is about
>>109169578Trolls love to sow discord that way.
anyone else collect huge datasets for future training but never actually do it?
>>109169590disk space resources?
>>109169578>animaThat is not allowed here
>>109169577even then, it/s wise anima is plainly quicker than krea. i dont know why some anon keeps trolling with something so easily verifiable.
>trainingpull back the curtain and you realize this tech hasn't evolved at all since 2022. still training loras and hoping the AI fills in the gap to understand the concept. why are diffusion models so far behind LLMs? nobody trains LLM loras, it's not needed, it's not even a thought anymore. you just provide a skill.md and it can understand how to do practically anything.meanwhile image models don't even understand that they're learning a style, you still have to brute-force them with epochs of training just to get them to understand the elements that make up a character. and even then you have to handle it delicately because they're janky as shit and will bake in elements of the image that aren't tagged properly.
>>109169475"thick lips" maybe also "pounty lips" for that shape
>>109169608i think its coming from where i said krea2 turbo outputs are about the same speed as anima-base ones but again thats 8 steps vs 30
>>109169614"pouty lips"*
holy meltie wall of text
>>109169614>>109169623I appreciate the lip gen but it looks nothing like what I'm after. This specific lip shape you commonly see with Princess Preach but also on other bimbos.
>>109169646something lip filler
How do you make the egg real size relative to her head? I mean it's okay for an ostrich's egg but let's be real, nobody is throwing it without being jailed for attempted murder.
she caught the egg
>>109169613Kneecapped and retarded synth datasets, please understand
>>109169613>why are diffusion models so far behind LLMs?diffusion model usecase? no one is developing it because it has high cost and low yield
>>109169646Those are Krea 2 gens, it's not trained on danbooru tags, gotta try those on Anima.
no Taylor Swift on flux
>>109169762who cares? I don't understand why she got popular in the first place, she doesn't make good music in the first place
>>109169778this is why I wanted to egg her
>>109169589real.
>>109169778She is classically good looking and her music is easy to listen to. Perfect recipy.
>>109169778as with many famous people today, she was bootstrapped by family wealth. not only that, but I think her dad or something was directly involved in the music industry or somethingwhat elevated her from neobaby to superstar was her ability to truly capture the essence of `mid`. nothing she produces is challenging or asks anything of the listener. perfectly digestible and forgettable. shes particularly resonant with mid white ladies because they love the idea that you can be a completely unremarkable white girl and still be a superstar somehow
more and more people are waking up to the fact that api is in another league of intelligence. comparing api to local today is like comparing krea to sd1.5. api is just that far ahead
>>109169846Off-topic spam to sdg please.
>>109169846Stop off topic spamming
>>109169872>>109169881>they hated him for he told them the truth
>>109169846hilarious how bad you still are at genningeven after 4 years you can't seem to figure out the basics and still post blurry slop
I might use Illustrious for life, since style mixing was a fortuitous side effect of CLIP and model architectures going forward might never have it again
>>109169923Nice, very sdxl
>>109169905he has mobile phone gacha slop taste in art. no amount of genning skills is going to fix his bad taste
>>109169846that looks pretty creative i like it :3
someones training overwatch porn on krea 2.... right?
>>109170278nah
>>109170278yeah, lora soon
Looking at GPUs and seriously considering this 9070XT because it's cheaper than a 5070ti. I'd mostly be playing vidya, but how are AMD these days for AI gen?
>>109169475part of it is artist dependent if you want a specific shape, probably? like hotglossed/gloss, or donaught, or even someone who does bimbos like thepit.there's also a lip slider for anima on civit now, it was early access before. i have yet to try but i'm gonna.
>>109170382>but how are AMD these days for AI gen?not good now and never will be good. it's functional, though
>>109170382same as always, shit
>>109170430>>109170437Lame.Was hoping it'd be better by now. My ass is sitting on a 3070 right now.
Ideogram train loras really nicely but running it on both the unconditional model and conditional model slows gen time by like 50% even though I have VRAM to spare
>>109170442local hardware (like many things local) is stagnant, purposefully to prevent local from outpacing API. they fabricated a whole 'vram shortage' just to excuse this and don't forget that even prior to this phony 'shortage' they were purposely cutting consumer vram short to upsell their H100 cards>2080ti, 11gb vram, $1000>3090, 24gb vram, $1500and then suddenly AI hits and consumer cards barely scratch above 24gb, a coordinated effort from every GPU manufacturer (amd included) to sell overpriced enterprise cards. nothing has improved, nothing will ever improved. soon enough consumer GPUs won't even be manufactured anymore because they want you forever dependent on cloud subscriptions.
>>109170499Best part is it doesn't overly bleed into similar looking characters already in the model.
>>109170499>>109170519Still flexible with styles too. It's a shame there's basically zero interest in this model for lora trainers. Civitai is a wasteland
been having fun with bullying Gemma. She gets sassy sometimes. I don't like how Llama is. He reminds me of the fat dude from Severance. In a not-good way.Been using LM Studio, but it honestly kind of sucks. I tried a persistent-memory plugin yesterday and every model started to hallucinate like crazy, despite writing a hard-lined system prompt with lots of guard rails. Having fun with things overall though.I really like the chirpy noise my GPU makes while the answers are being written out. Sounds like some kind of digital typewriter.
>>109170557is it as easy as krea2 and z image base to train with? wouldn't mind trying.
>>109170557>It's a shame there's basically zero interest in this model for lora trainers. Civitai is a wastelandtheir licence killed the model, no NSFW allowed
>>109170580Yes. But .json captions are a ball ache. I used thishttps://github.com/Auryg/Ideogram-Json-CaptionerBut a lot of the auto generated captions using Qwen3-VL-8B-Instruct-Q8 were garbage but the UI is very useful for boxing stuff up yourself
I vibecoded .json captioner for my dataset preparation thingy but haven't bothered to put it up anywhere.
>>109170382yes
>>109170595it's more than that, unlike krea2 ideogram wastes your time generating an "image blocked" output. the bounding-box prompting style is annoying (nobody likes this indian 'prompt engineering' shit), the model is slower than krea, and it has a gpt brown piss filter on it.
>>109170620everyone dropped id4. krea is the new hotness
>>109170648>unlike krea2 ideogram wastes your time generating an "image blocked" output.krea 2 also has a safety filter lol
>>109169613This is something that goes over most people's heads since most people are still running tiny vramlet models that need to be bruteforced to produce anything that isn't slop. Krea2 suffers from its VAE, but they at least put in effort to bolster the dataset. Chroma, despite the shitty seed lottery, is still pulling way ahead in terms of knowledge because it was a neckbeard effort and didn't have endless synthetic shit clogging up the dataset.
>>109170663>chroma mogs>posts gen that looks like it was made with SDXLokay
>>109169545How? All the anime generations are ass. Is it some trigger prompt? A lora? Every generation I get goes pixar cartoon or fucked up anatomy
>>109170663>and didn't have endless synthetic shit clogging up the dataset.Is this a joke? aesthetic 11, hello???
>>109170663neat impressionism background
>>109170676It's just anima-hating shizo on his never ending crusade. You can just ignore him.
>>109170663Why do I feel like you purposely made a dogshit krea gen to make a non-point.
>>109170651It's just a fun project. Implementing it wasn't that big of a task.
>>109170651Not me ID4 is objectively the better model
>>109170690Nah Chromaschizo posted that before Krea release initially.I can't make my mind if he is a real schizo or troll but that's not a Krea image.
>You can get past the ideogram filter by adding a couple more bounding boxes>You can get past the krea filter by slopping your shit up with a bypass lora that smooths everything out and reduces prompt adherence
>>109170701>jailbreaking a local modelreally, in front of my Z-image turbo?
>>109170701both are illegal and you will go to jail soon.
>>109170675Did you even read the post anon. I wasn't talking about image quality at all.
>>109170714Z-image "mutant pepperoni nipples" TurboThe filter is trained into the dataset
>>109170721My grandpa has tons of knowledge but that doesn't make him a good painter.
>>109170663i was on board with what you were saying until you went full retard and brought up chroma out of nowhere despite it being all-around terrible. what is with you and that model? it knows less characters, and styles than basic SDXL finetunes like illustrious and the 'natural language' is shit. chroma is in no way "pulling ahead" in anything
>>109170684I like to call it schizophrenic, thank you.>>109170699>poopdickschizo
>>109170729I hope your grandpa is doing well.
>>109170701Do we even know what exactly they did to censor Krea like that? Ideogram was easy enough to figure out.If we are going to have a solution that doesn't involve the slop it lora, we need to figure that out first.
>>109170768>Do we even know what exactly they did to censor Krea like that?they don't want to go to jail dude, I cri everitim :'(https://youtu.be/31jiUhCEjJ4?t=1668
>>109170768https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer
this krea2 lora works pretty well, even if you dont want full lewds it allows the model to be a bit more spicy with details. default strength 1, can go up to 10k, basically use it as a slider (if you get a black image, just lower it to like 10)https://civitai.com/models/2728234/krea2filterbypass?modelVersionId=3067151
>>109170779Scared of deepfakes? How about removing the celebrities instead of turbo cucking your model?
>>109170663chroma is a dead meme of a model. stop shilling that janky garbage and give it a rest. the failure of spark chroma v1 was the absolute final nail in the coffin for such unstable janky model.
>>109170791same prompt, bypass lora off, still good but if you want nsfw stuff (or nudes) then you can use the bypass.
>>109170809>How about removing the celebrities instead of turbo cucking your model?this, they say they're scared and stuff but have no issue adding celebrity slop in their model, the maths doesn't add up at all
>>109170827Yeah it's the celebrities that scare them and not the ability to literally generate CSAM.100% true fact checked by patriots. Taylor Swift did this personally.
>>109170791mm sweaty.
Am I the only one who actually used Krea here? What are people even talking about with 'censorship' regarding the model. How is this any different than base SDXL, base Cosmos, etc? Ideogram is actually censored, it spits back a denial image. Krea just generates a normal SFW image to the best of your prompt just like every base model before it.
>>109170839it's just angry ideogram and anima niggas
>>109170839yes
>>109170789I know about this. Have you actually used it with a specific character or person? It destroys likeness.>>109170809The explanation I can think of: They didn't initially intended this model to be open source. So they trained with celebrities, they can control against deepfakes easily on the API anyway.But then flux 2 vae came. So this last gen model was thrown away while they worked on a more SOTA version.After their API model was ready they made whatever surgery they did to the model and shipped it.
>>109170832Oh yeah, they're definitely scared of CSAM, which is why they trained their model on litteral porn.100% true fact checked by patriots.
>>109170839>Ideogram is actually censored, it spits back a denial image.You can bypass it by using bounding boxes without affecting the quality. Krea bypasses whether the node or lora have a negative impact on the quality. I mean look at these>>109170791>>109170817They're not even the same style
>>109170853Taylor Swift did this.100% fact checked by a sweating european muslim.
a 25 year old anime girl at the beach licking a blue popsicle. On the popsicle is the text "Krea 2" in white ice that is a part of the popsicle. she is looking at the camera.kino, such a versatile model. also kinda looks like misaki from NHK
>>109170852Just don't use Krea, you cry baby bitch.
>>109170869>such a versatile modelbecause it can do "close up, 1girl"? is this a joke?
>>109170832>Taylor Swift did this>>109170860>Taylor Swift did thisguys, I have a feeling that Taylor Swift did it
>>109170857Well of course if you try to force it into generating nsfw content with a fried lora or weight lobotomization then it's going to look fried. The exact same thing was the case with SDXL 'nsfw' loras pre-pony. You are using copium slop produced by civitbrowns in an attempt to farm buzz. You can use actual loras trained on real images and generate nsfw content fine with krea.Ideogram censorship being easy to bypass may be true, but the fact that the message appears at all is still way more censorship than Krea has. And if both are easy to bypass, whats with all the kvetching and melting over it? Is this not just a good thing for local that we have two models that take to NSFW concepts easily?
>>109170883>Point out valid problem>Wahhh you are crybaby bitch waaaahIt's just some model bro, I didn't insult your mother.I want to use it btw, that's why I am trying to figure out how to properly unfuck it.
>>109170909>And if both are easy to bypass, whats with all the kvetching and melting over it?https://en.wikipedia.org/wiki/Boiling_frog
>>109170912>that's why I am trying to figure out how to properly unfuck it.no you're not. you just bitch and moan about censorship meanwhile im genning anal porn
>>109170924You're generic plastic porn you dumb ESL.
>>109170930You're mother is benchode mudderfukker raciss
>>109170939kek
>>109170893no cause it's versatile and does a range of styles whether real or anime.
>>109170917why are you schizomelting when krea 2 is the easiest model to train NSFW loras for to date? why live your life as a schizo concern-trolling over big-tech censoring your models instead of just using what you have for fun? every model since SD1.5 was 'censored' in some way yet you're still here every day.krea is no different than any base model before it. plenty of NSFW loras on civitai show it can do nsfw at levels comparable to the previous gen's finetunes.
>>109170997klein edit 9b is so fun, with that and krea 2 you can do basically anything.>prompt "remove all the criminals">the black people vanishchina is funny.
a group of 4 25 year old anime girls at the beach each holding a vanilla ice cream cone with "Krea 2" etched into the vanilla ice cream scoop. they are looking at the camera.so apparently, for anime or text based gens, qwen lora is superior, wan vae may be better for photorealistic gens.
>>109171013neither of those are chinese models
>>109170988>krea is no different than any base model before it.krea has a built in safety filter, the other base models don't, why are you lying like that schizo?
>>109171023which can be avoided extremely easily with a lora or conditioning node, they probably only added the safety filter to make investors happy.
krea2https://files.catbox.moe/740eai.jpgf2k9bhttps://files.catbox.moe/goqird.jpgzithttps://files.catbox.moe/740eai.jpg
>>109171023point to a demonstrable effect of the 'safety filter'. because you can go on civitai right now and look for base krea nsfw loras and compare them to base sdxl nsfw loras. i don't know how you can see images like these and claim there is a working safety filter on the modelhttps://civitai.red/images/135328580https://civitai.red/images/135211992https://civitai.red/images/134915115
>>109171048if you zoom into the pussy lips it says "blocked by safety filter ;)"
>>109171048I found that there is an indeed safety filter for example if i wanted to write "cum" it changed the word, those safety bypass loras/nodes/conditioning change the output tho, some give you better image quality but hinder the text rendering, others do the opposite, there is not definitive "fix" for the censorship since every solution gives you a different result
>>109171089Looks like a pony realism gen
>>109171045Lost the ZIT link and also would you do Krea with only the censor-bypass 3-vector lora? What about swapping for abliterated or smut-finetuned Qwens?
>>109171097Blocked in your country sorry saar
>>109171020
I have in iMac, I'm downloading Comfy UI desktop, I want to turn my favourite images into nsfw video, what models should I download? Is it like LM Studio where you download a model and then just install something like Qwen etc?
iMacs can iGen? wtf?
>>109170701>You can get past the ideogram filter by adding a couple more bounding boxesThat doesn't work.
>>109171132my iMac with the silicon chip is pretty powerful, miles better than my old intel mac
>>109171145False
>>109171149Cool very powerful. I don't think the software has very good support for your in-house chips though.
>>109171152You can't even generate a picture of a bread with Ideogram.
>>109171170A bread?
>>109171170That bread might have had a vagina in it. Gotta block it.
cute krea miku chibisa chibi hatsune miku at the beach holding a green leek, she is smiling.
>>109171191
>>109171169yeah I doubt it, it's usually the main complaint of Macs is that there isn't much cross-platform support, but I wondered if any anon was doing the same thing
>>109171191>>109171201That's not chibi, that's Paper Mario style.I have seen Krea2 spit it out when asking for other styles too.It's probably a quirk of the turbo distill.
>>109171216I don't see why you would bother. Better off setting up a GPU server that you can use on your mac rather than using the mac hardware itself.
>>109171230NTA but when I google chibi I immediately get results that look just like his so I'm inclined to believe you're a retard chibi supremacist or are just lying.
>>109171232I can't find any decent image to video websites that don't require bajillions in tokens.So far I've just been using Perchance, the website for images, and I wanted to animate them.
>>109171255Try the API thread. This is local. Video generation is not cheap, though.
2023:>controlnet is amazing, it lets you pose anything! if you want full control, local is the only option!2026:>half the models lack controlnet, community gave up>only 2 models have even half-competent reference/edit and are way outdated by API standards>character/style transfer still sucks>you can just tell API to "take this character, put them in this outfit, and in this pose" and it workshow did local fall so far behind? it's not this bad with LLMs
>>109171264Controlnets are overrated. Most of the time they don't work.
>>109171264controlnet fuckin' sucks, though
>load up krea2 workflow>found an emma myers lora>prompt "show bobs and vagene">is perfectmy benis is experience concentration camp levels of abuse rn. zit and zim is ancient tech in comparison.
>>109171264The only thing local has going for it is nsfw. That is it. It's the only reason anyone would touch it
>>109171201and a regular miku
>>109171264>it's not this bad with LLMs???Local imagegen is closer to API than LLMs are lol
>>109171258For a while Grok of all things was letting me generate like 10 videos in high quality for free. OK, the prompting usually took some work, but they paywalled it. I was just wondering if there was a site that had something to that affect. I can't even find one that does more than 1 video per day.
>>109171274Which workflow?
>>109171278and a ghibli miku, cute!
You can train Krea2 loras with 12gb vram with Musubi Tuner.
>Musubi TunerI need to buy a car to train Krea 2? wtf
>>109171452Also with AI Toolkit, which has a great UI and easy to use.
>>109171464>AI Toolkitpukes
>>109171255Upload image and prompt and I'll generate one for you with wan 2.2 assuming you're not trying to touch yourself to it.
>>109171464I couldn't get a fresh install of AI toolkit to work with training Krea2 loras. It stalls without any errors at the start of training.
>>109171258what are the best nsfw API's then?
>>109171546the beginning takes a while. even on 5090. be patient.
>>109171546AI toolkit always rushes to be the first to implement new models and it results in rushed garbage implementation.
>>109171546Stalling usually means the python env isn't correct and it errors out before it even starts. The setup instructions are bad use the one click installer because it actually works and puts things where they need to be.
>>109171279not so sure about that. api is leagues ahead in image generation tooling. i can take random anon gens >>109170740 and ask GPT for an outfit breakdown, character turnaround, etc. it will analyze the image in detail with it's trillion parameter LLM and generate accurate relevant text. even the hex-codes it added are quite accurate. local is years away from this, 5 years at least.
>>109171546Are you using a pre-downloaded model or downloading through toolkit? there's an open issue on the github about that
I think it stalled for ~5 minutes before starting the training on 5090.
What is the best lora trainer for newer stuff
>>109171583i'll get shit for this but i've never had much problems with toolkit. never went back to other trainers after I started using it.
>>109171546I got this too. I re-installed and made sure the venv was created with python 3.12 and it worked. AI toolkit is dogshit really, it doesn't give any detailed logs, it wipes your config if you switch UI tabs, and had a cope issue for years that required you to manually edit sql to fix.
>>109171583just learning: ai-toolkitanima: this fork of easy scripts https://github.com/67372a/LoRA_Easy_Training_Scriptseverything else: musubi tuner
>>109171588Was that the one without a web ui?I really want that as I have it on a server for cooling reasons
>>109171609no ai-toolkit is the one with a web ui. musubi tuner is the one that's cli only i believe
>>109171575I used a predownloaded model as I always do when training with AI toolkit. For Krea2, it only needed the single large safetensors file and the text encoder safetensors. I got frustrated and browsed the stable diffusion subreddit. Someone vibecoded a krea2 character lora trainer that can use both AI toolkit and Musubi Tuner. It only allowed AI toolkit to train the turbo version and the Musubi tuner to train the raw version. Since I downloaded only the raw version of Krea2, I had to try the Musubi tuner. That didn't work. However, it saved me research time because the app provided the commandline output. I had to manually setup the musubi tuner since the app didn't do anything except download the github files.
Follow up then, how fast is Krea training compared to Anima?I got an image set that is just sd 1.5 tagged for a style I like will it be enough?
so did we figure out what the best local caption model is for Krea2?
>>109171654LickitySlickRicksCaptionerYou have to be verified to get it, though.
>>109171639quite a bit slower. turbo is just makeup, beneath the mask it's a bloated slow nu-model like the rest of them. on a 4090, krea-raw at 3000 steps at 1024x took 4-5 hours with ai-toolkit. with anima i can train 8000 steps in 2 hours. i didn't really like krea's results that much either
>>1091717263 ugly clone bitches. what a nightmare
1girl, peace sign, big boobs
>>109171774Krea really hates wet skin, you can use the enhancer node or the bypass lora but it just turns into really unpleasant thick slime
>activate anima clothing lora>instantly shifts the image towards realismloras really are outdated copium designed for 2022 models
>>109171828Poorly trained lora. Wrong dataset
>trainedyeah, that's outdated
>>109171847This. Any lora that completely changes anything outside what it was trained for was poorly made. I saw this all the time with Illustrious loras.
>>109171743is this k2 or ideogram
>>109171828whats the alternative?
>>109171881you can just show images to GPT. training is outdated, only local is still stuck training loras for basic tasks. it's time local moves on
>>109171881the alternative is using cloud models that can only need to be feed a few reference images
>>109171938Super Nigtendo
>>109171828>loras really are outdated copium designed for 2022 modelsnot if you knew anything about the technology
>>109171569gemma4 31b with ideogram official enhancer prompt and that image fed into vision
>>109171990Workflow?
>>109171990noooo you can't do that yet! you gotta wait 5 more years!
You guys are playing "paper dolls" ?
>>109171995llama.cpp webui to generate prompt and copy paste that into stock ideogram workflow
using asian women that are known for heavy plastic surgery/filters/make-up as a source for testing realism is a bit retarded, ngl
>>109172002we are prayin paperu dorru, gweilo pig monkee
>>109172006if you don't want to copy paste you can usehttps://github.com/hekmon/comfyui-openai-apidirectly inside comfy
>>109171990so it's completely wrong, nonsensical, hallucinated, and slopped? got it. local models have no intelligence, they are still stuck in the 'hallucinating' stage and never moved on to the reasoning stage
>he thinks my qwen3vl 4 billion parameter uncensored prompt enhancer can't reasonapifags are such uncs
>>109172051did you even look at the image?
>>109172006Could we get the system prompt? I had issues making those jsons when I tried
>>109172147https://github.com/ideogram-oss/ideogram4/blob/main/src/ideogram4/magic_prompt_system_prompts/v1.txt
>>109172115>hair is wrong>hair accessories are wrong>colors are wrong>shorts are longer, nice local censorship!>slopped WAI-XL styleare localkeks actually just blind? embarrassing really...
Every time anon makes a post I just laugh
Joke's on (You).
Has anyone else noticed OnlyFans girls starting to mix in AI generated photos of themselves with real ones? I've seen this a ton recently...
>>109172179I don't get it? was that supposed to be a known character?
>>109171938kek
>>109172375Did you not read? I fed it another anon's gen (>>109170740) and successfully extracted the features with 97% accuracy, the prompt was just>Create a fashion breakdown image showing all the elements of this character's outfit
>>109172372It's been like that for awhile now. It makes it so much harder for me to tell, not that it's impossible. But it does force me to double check.Thankfully they have no idea how long you and I have trained ourselves to detect ai same face.
97% you say?
LTX is really terrible for Realism thanks to the face drift. Its decent for anime though.... except sometimes LTX cant keep the artstyle as well....
>>109172391I didn't follow the thread that far up. Thanks I see what you mean now.
>>109172372Everyone does this now, even my sister. Welcome to the new age
>>109172428Marie?
>>109172428You're gonna have to catbox the whole gen anon.
>>109172372I don't check onlyfans. what's the purpose tho? pics in outfits you don't actually own? making yourself look hotter?
>>109172428It's terrible for I2V, but it does well when it comes to T2V
Is anyone actually getting a 2x speedup from wan22 i2v using int8-convrot?
>>109172391>>109172434You're not enhancing your own dating app pics with AI? And you call yourself a genner?
>>109172460>dating appthat's cancer. most sane guys know to avoid dating apps like the plague.
>>109172447I never tried T2V
Krea 2 is decent, but prompt understanding and the obvious grain patterns remain the biggest issues so far
>>109172478hot
>>109172478the problem is finetuning is hard without destroying knowledge. the more base models know, the more they forget when finetuning. it wasn't a big deal because models like sdxl and cosmos were quite dumb and unusable but krea actually has a lot of concepts that I could see being lost if they aren't represented in the finetune dataset. plus their training process is really intricate. it's getting harder and harder to finetune models without a lab-sized team and budget
>>109172434Your sister has what seems to be a highly accurate Flux lora of herself?
>>109172434Post your sister's LoRA or it never happened.
Fresh when ready >>109172561>>109172561>>109172561Fresh when ready
seeing grifters attempt to do capitalism and getting owned will never not be funny to me.
>>109172546>>109172537retards, she uses cloud shit. you think she has the technical knowledge to do local? lol
what prompt and lora are you using for that anon. love the realism here.
>>109169231What's /lmg/'s consensus on Anima? How does a compare to illustrious?