Are You Living In The Same Universe As Me Edition

Discussion and Development of Local Image and Video Models

Previous: >>108609718

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
Can someone explain the anima > zit workflow?
>>108615707gen with anima, hiresfix with zitretard
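For anyone else asking: the two-stage flow above (first pass with Anima, hires fix with ZIT) can be sketched as a small planning helper. This is only a rough sketch. The scale, step count and denoise values are placeholders, not settings anyone in the thread recommended, and `steps * denoise` is just one common convention for how many steps the second pass actually runs; some UIs handle denoise differently.

```python
from dataclasses import dataclass


@dataclass
class HiresPlan:
    base_size: tuple      # (width, height) for the first-pass model
    target_size: tuple    # (width, height) for the second-pass model
    refine_steps: int     # denoising steps actually run in the second pass


def snap(value: float, multiple: int = 16) -> int:
    """Round to the nearest multiple so the size stays latent-friendly."""
    return max(multiple, int(round(value / multiple)) * multiple)


def plan_hires_fix(width: int, height: int, scale: float = 1.5,
                   steps: int = 8, denoise: float = 0.4) -> HiresPlan:
    """Plan a two-pass gen: base image, upscale, then partial re-denoise.

    Assumption: the second model only runs roughly steps * denoise steps,
    because img2img starts from a partially noised copy of the upscaled image.
    """
    target = (snap(width * scale), snap(height * scale))
    refine_steps = max(1, int(steps * denoise))
    return HiresPlan((width, height), target, refine_steps)


if __name__ == "__main__":
    # Pass 1: gen at base_size with the first model (Anima in the post above).
    # Pass 2: upscale to target_size, then img2img with the second model (ZIT).
    print(plan_hires_fix(1024, 1024))
```

A low denoise keeps the first model's composition and lets the second model only redo fine detail; a high one lets it repaint more of the image.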
>mfw Resource news

04/16/2026
>Motif-Video 2B: A micro-budget text-to-video diffusion transformer from Motif Technologies
https://motiftech.io/videoshowcase
>HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
https://huggingface.co/tencent/HY-World-2.0
>ErnieTurbo_extracted_lora
https://huggingface.co/GuangyuanSD/ErnieTurbo_extracted_lora/tree/main

04/15/2026
>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
https://huggingface.co/tencent/DisCa
>Lyra 2.0: Explorable Generative 3D Worlds
https://research.nvidia.com/labs/sil/projects/lyra2
>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://github.com/VAST-AI-Research/AniGen
>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Models
https://gyanendrachaubey.github.io/T2I-BiasBench
>Generative Refinement Networks for Visual Synthesis
https://github.com/MGenAI/GRN
>VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
https://videoflextok.epfl.ch
>DiffusionPrint: Learning Generative Fingerprints for Diffusion-Based Inpainting Localization
https://github.com/mever-team/diffusionprint
>Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
https://github.com/deep-optimization/CoM-PT
>Self-Adversarial One Step Generation via Condition Shifting
https://github.com/LINs-lab/APEX
>See-through WebUI
https://github.com/BeamManP/see-through-webui
>ERNIE-Image: Repackaged model files for ComfyUI
https://huggingface.co/Comfy-Org/ERNIE-Image

04/14/2026
>Nucleus-Image Released
https://huggingface.co/NucleusAI/Nucleus-Image
>ERNIE-Image: Text-to-image generation model built on a single-stream Diffusion Transformer
https://huggingface.co/baidu/ERNIE-Image
>Danbooru Dataset Filter: High-Speed Metadata Explorer for AI Training
https://github.com/ThetaCursed/Danbooru-Dataset-Filter
>>108615744Thank you shitstain
>mfw Research news

04/16/2026
>Creo: From One-Shot Image Generation to Progressive, Co-Creative Ideation
https://arxiv.org/abs/2604.13956
>DiT as Real-Time Rerenderer: Streaming Video Stylization with Autoregressive Diffusion Transformer
https://arxiv.org/abs/2604.13509
>Enhanced Text-to-Image Generation by Fine-grained Multimodal Reasoning
https://arxiv.org/abs/2604.13491
>MaMe & MaRe: Matrix-Based Token Merging and Restoration for Efficient Visual Perception and Synthesis
https://arxiv.org/abs/2604.13432
>Bias at the End of the Score
https://arxiv.org/abs/2604.13305
>ASTRA: Enhancing Multi-Subject Generation with Retrieval-Augmented Pose Guidance and Disentangled Position Embedding
https://arxiv.org/abs/2604.13938
>What Are We Really Measuring? Rethinking Dataset Bias in Web-Scale Natural Image Collections via Unsupervised Semantic Clustering
https://arxiv.org/abs/2604.13610
>Who Gets Flagged? The Pluralistic Evaluation Gap in AI Content Watermarking
https://arxiv.org/abs/2604.13776
>Rethinking Image-to-3D Generation with Sparse Queries: Efficiency, Capacity, and Input-View Bias
https://arxiv.org/abs/2604.13905
>DiffMagicFace: Identity Consistent Facial Editing of Real Videos
https://arxiv.org/abs/2604.13841
>Seedance 2.0: Advancing Video Generation for World Complexity
https://arxiv.org/abs/2604.14148
>MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models
https://arxiv.org/abs/2604.13287
>VibeFlow: Versatile Video Chroma-Lux Editing through Self-Supervised Learning
https://lyf1212.github.io/VibeFlow-webpage
>ReConText3D: Replay-based Continual Text-to-3D Generation
https://mauk95.github.io/ReConText3D
>Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding
https://arxiv.org/abs/2604.13540
>Grid2Matrix: Revealing Digital Agnosia in Vision-Language Models
https://arxiv.org/abs/2604.09687
>>108615765you're welcome
>>108615104generated from scratch
>mfw nigbo
>>108615891I love the zitslop face
>>108615361
There's no straightforward process with these models; it all comes down to luck. You never know how the AI will react to whatever dataset you throw at it, and you always have to sacrifice something to get improvements. Base Noob has the best aesthetics but also the worst limb deformities, especially legs. How did they fix it? More neutral/slopped/semi-realistic data, which killed the aesthetic. No model can balance aesthetics and accuracy yet.
>>108616034tyty!
>>108615891Prompt?
>>108616156i meant shit gens sorry
usecase for posting 2+ images from the same batch?
wai-anima may be sloppy but it's the only anime tune that has working franchise/copyright styles out of the box, base is terrible at this
>>108616208Your sdg buddies can help with that question :]
>>108616233
It just sounds like you're not good at prompting
>>108616240but that's a dead shithole?
>>108616233Based
>>108616233
>tune
lel
>>108616170
Hey, you are not me!
>>108616264
Lora?
>>108616159just write what you see in the image?
is he lying
>>108616324You are a complete and utter newfren if you still take "coming soon" posts seriously
>>108616336Why? Z released base like they said they would. They were going to release edit until BFL surprised them.
>>108616316jujutsu kaisen culling game arc 1 from civitai
>>108615635
What can I do with a GTX 1050 and 16gb of ram?
Any model recommendation?
>>108616363
>Why?
Lurk for a couple more years and you'll understand
>>108616324
no, you are shamelessly shilling.
did ernie labs pay you? it's clear that ernie flopped
>>108616370
NAI
>>108616370
>gtx card
>for ai
>in 2026
Is this bait?
>>108616370
>2gb
Ouch!
Probably some SDXL variant like Noob vpred.
Either run at fp16 and eat the offloading penalty, or run at q8 (int8 if Pascal can accelerate that and if you can figure out how to get it working).
You are SOL for anything newer.
>>108616324They will release it 2 weeks after bigma is released
>>108616425
You also need to run a distill lora for sane speeds.
I think there is a 2025 lora considered meta for step distilling SDXL, but I don't recall the name.
>>108616370There is a Noob Vpred Nunchaku
>>108616454Nunchaku needs at least 2000 series.
>>108616370
anon, you need 12-16gb vram and at least the healthy minimum of 32gb of ram for ai workloads. if you can't afford a decent prebuilt gaming pc from costco or bestbuy then forget it and go the saas route. You're not running any good ai model under 8gb of vram.
>>108616370cpu is probably faster
>>108616425
>>108616454
>>108617006
>>108617015
Well, I appreciate the help.
Throwback from 2023, same setup. WebUI doesn't work for me anymore.
What do people use to train anima? I don't wanna use wsl2.
>>108617289
>not running linux bare metal
NGMI
>>108617393
>not having a proxmox server with dedicated VMs with k8s running on em
LOL, fucking peasant
the problem with bluvoll is that he contaminates mugen and chenkin rf with his pedo hag dataset, it is not a 1:1 with noob dataset
So how many artist styles are you using at a time with anima? I set up my prompts to randomly pick between 1, 2, and 3, and at 3 it still seems coherent. One style seems to dominate overall, but you can still pick up hints of the others. I also have to try using no artists more often.
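The random 1/2/3 artist pick described above can be sketched like this. The artist names are placeholders, and the thread does not say what artist tag syntax Anima expects, so treat the output format as an assumption and adapt it to whatever your wildcard setup uses.

```python
import random

# Hypothetical artist tags, purely for illustration.
ARTISTS = ["artist_a", "artist_b", "artist_c", "artist_d", "artist_e"]


def build_prompt(base_tags, artists, rng, max_styles=3):
    """Prepend between 1 and max_styles distinct artist tags to the prompt."""
    count = rng.randint(1, max_styles)                     # inclusive on both ends
    chosen = rng.sample(artists, k=min(count, len(artists)))  # no repeats
    return ", ".join(chosen + list(base_tags))


if __name__ == "__main__":
    rng = random.Random()  # pass a seed here for reproducible picks
    for _ in range(3):
        print(build_prompt(["1girl", "solo"], ARTISTS, rng))
```

Setting the lower bound of `randint` to 0 would also cover the "no artists" case mentioned at the end of the post.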
>>108617502i'm using 3
So is Ernie Image Turbo better than ZIT?
does anima include copyright tags? tried one with 800 entries on gel and 300 on dan but it didn't recognize it
>>108617289
https://github.com/67372a/LoRA_Easy_Training_Scripts
>>108617453
>pedo hag dataset
Que?
Also training on clip garbage in 2026 is the biggest problem with his models.
And being an obnoxious dipshit in general.
>>108617502
Just one. I am not sure how more artist tags help coherency.
>I also have to try using no artists more often.
The default style is too sloppy and soulless for me.
>>108617453>pedo hag datasetwhat that? straight shota?
>>108617289sd-scripts, bare metal Chadux.
>>108617540Did you use the correct tag syntax
>>108617562
>Did you use the correct tag syntax
is that crap really a thing? It makes me mad, let me retry
>>108617568Special booru syntax has been a thing since 2023 thobeit
>>108617578
I thought he meant the tag positioning (which fortunately you don't really need to abide by), yeah I used the tag exactly as it is on danbooru/gelbooru
>>108617584>exactly as it is on danbooru/gelbooruyou have to escape parentheses with a backslash
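A minimal sketch of the escaping mentioned above, assuming your UI treats bare parentheses as emphasis syntax (as A1111-style and ComfyUI prompts do). `escape_tag` is a made-up helper name, not part of any tool.

```python
import re


def escape_tag(tag: str) -> str:
    """Backslash-escape parentheses that are not already escaped."""
    return re.sub(r"(?<!\\)([()])", r"\\\1", tag)


def escape_prompt(prompt: str) -> str:
    """Escape every comma-separated tag in a prompt."""
    return ", ".join(escape_tag(t.strip()) for t in prompt.split(","))


if __name__ == "__main__":
    print(escape_tag("toki (blue archive)"))
```

Running it twice is harmless, since parentheses that already have a backslash in front are left alone.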
>>108617592
I know, it doesn't have those. Seems like for whatever reason this one in particular wasn't included; it isn't even present in the tag autocomplete
>>108617651you should also try creating a lora for ltx2.3. imagine this style with sound
>everyone pretends that a 2b model can learn over a million character images and artist styles, when an LLM with the same parameters struggles learning a tenth of that
You need at least a 24b model to achieve what you want.
How slow is Anima at doing 7680×2160? Can you even gen something over 1440p on it with consumer hardware?
>he scales at all costs
Is Chroma really not surpassed yet? We've had it for about a year now...
>>108618018There haven't been any other major NSFW capable tunes, yes. Shame it's too schizo. And memestone's other vibe training attempts have managed to become far more dysfunctional trainwrecks. At least you get lucky enough with Chroma sometimes.
>>108617709could be fun but probably requires latest hardware
>>108617954It works best at typical resolutions. Circlestone did release a Lora recently where 1536x1536 works without any major issues, and even 2048 (4 MP) works without falling apart. That's genning straight-up. You need to upscale to go bigger.
>>108616159>>108616323What he said, but here it is anyway>toki \(blue archive\), toki \(bunny\) \(blue archive\), blue archive, 1girl, alternate hairstyle, animal ear hairband, animal ears, ass, back, backless leotard, bare shoulders, blonde hair, blue eyes, blue hairband, blue leotard, blue nails, blue streaks, braid, breasts, bun cover, detached collar, expressionless, fake animal ears, fake tail, from behind, grabbing own ass, hair bun, hairband, half up braid, halo, highleg, highleg leotard, large breasts, leotard, looking at viewer, mechanical halo, median furrow, multicolored hair, nail polish, official alternate costume, playboy bunny, rabbit ear hairband, rabbit ears, rabbit tail, short hair, simple background, single hair bun, sitting, solo, strapless, strapless leotard, streaked hair, tail, white background, wrist cuffs
>>108618018For girl full nudity some ZiT and even some FK29B on civitai are better than chroma. If you're in ultra hardcore porn and weird kink though...
>>108617954
I found anima to be completely predictable with the scaling of time in relation to image size.
It takes 30 seconds at 1024x1024 and 2 minutes at 2048x2048 on my 3090. So you just take the time it takes to gen a 1024x1024 image on your hardware and multiply it by how many times larger the image is than that.
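The rule of thumb above as a quick calculation. It is a linear extrapolation from the two data points in the post (30 s at 1024x1024, 2 min at 2048x2048), so it assumes the gen still fits in VRAM and that time keeps scaling with pixel count at larger sizes, which may not hold.

```python
def estimate_seconds(width: int, height: int, base_seconds: float,
                     base_pixels: int = 1024 * 1024) -> float:
    """Scale a measured 1024x1024 gen time by relative pixel count."""
    return base_seconds * (width * height) / base_pixels


if __name__ == "__main__":
    # 30 s per 1024x1024 image, the 3090 figure quoted above.
    print(estimate_seconds(2048, 2048, 30))   # 4x the pixels
    print(estimate_seconds(7680, 2160, 30))   # the size asked about earlier
```

By this estimate 7680×2160 would be roughly 16 times the pixels of the baseline, so around 8 minutes on the same card if the linear trend held.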
>>108618795
>>108618615
>>108618702
So Anima then Zit for hires fixes won over Chroma?
Realistic models were saved by weebs?
Why is Lodestone not fail tuning Anima yet?
>>108618702
>some ZiT and even some FK29B on civitai are better
such as?
What API node do I use now that local is dead?
can't believe the last hope for local video is ltx...
is it finally safe pulling latest cumfart? didnt do it for a month
>>108619125why are you using comfyui anifart?
>>108619102Yeah... It's dogshit
Qwen3-VL-8B-Q8 or Qwen3-VL-32B-Q4? 5090 btw
>>108619186always bigger at a smaller quant
>>108619102>literal jews are my our best hope
>>108619186
Why are you going q4 with 32b? You can easily do Q6 with a 5090.
Anyway 3.6 will probably mog both even as MOE.
Probably get 3.5 27b q6 hauhaucs if you need NSFW (although it will try its best, it has low knowledge of NSFW subjects because its training data lacks them).
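Back-of-envelope numbers for the quant question above. This is a sketch using nominal bit widths: real quant formats store scales and run a bit higher per weight, and the vision encoder, KV cache and activations come on top, so read these as lower bounds for weights only.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GiB for a given parameter count and bit width."""
    return params_billion * 1e9 * bits_per_weight / 8 / 2**30


if __name__ == "__main__":
    for name, params, bits in [
        ("8B at 8-bit", 8, 8),
        ("32B at 4-bit", 32, 4),
        ("32B at 6-bit", 32, 6),
    ]:
        print(f"{name}: ~{weight_gib(params, bits):.1f} GiB")
```

On a 32 GB card all three fit on paper; the 6-bit 32B case leaves the least headroom for context.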
>>108619252and not even the best jews, like with sora, midjourney. we've got the team of talentless jews. what luck...
>>108619102That's like saying BFL was the "last hope" for local image kek
>>108619429
So much money, so much sense
>>108619455Make Yaoyao in her new outfit pls
https://files.catbox.moe/f1w6g6.jpgI really like Anima.
>>108619693SDXL and ControlNet still has potential...
>>108619804I understand /hgg/ fags and Oekaki shizo because Anima it's better at handling multiple characters and intricate poses, as well as abstract kino minimalist concepts with multiple characters in the case of Oekaki. But anyone else praising Anima is a poser. For example, >>108619433, >>108619455, and >>108618795 can be done with SDXL and ZiT hires fix.
>>108619872wicked
>>108619433I'm surprised it got her weird mid-spine tail correct. this is i2i maybe?
>>108618702I don't even care about nudity particularly. I just want a model that can make nice pics of cute chicks with some cleavage, the occasional bikini pic or some lingerie. Rarely nudity, it's not really essential. I find nothing is as good as Chroma. I've tried all the other FOTMs and I wasn't blown away.
Is it coming to API nodes anytime soon?
>>108620088took them awhile to add seedance, so wait and see.
I havn't used SDXL in such a long time by now lol, and some faggots are still hanging on that deprecated model lolImagine using clip in 2026
>>108619446Well, technically Flux Klein is still the best open model though
Some good coomer gens last thread.
>>108620144
you gave me your workflow the other day
have you had much luck generating realistic hardcore with it?
>>108619125
It fucks up handling memory less often on VAE loading now, but still fucks up... BUT! I now get a lot of "Windows fatal exception: access violation" when refreshing the page Comfy loads, which needs to be done because the RTX node doesn't load properly without a restart. So I have long stretches where I'm just trying to get it to work.
I really don't think they (or more likely, Claude!) know what they're vibing out over there memory-wise.