Discussion and Development of Local Image, Video, and Music ModelsPrevious: >>108972752https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, & Upscalershttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.info>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/tdrussell/diffusion-pipehttps://github.com/kohya-ss/sd-scriptshttps://github.com/kohya-ss/musubi-tuner>Zhttps://huggingface.co/Tongyi-MAI/Z-Image>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/https://animadex.net>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>Wanhttps://github.com/Wan-Video/Wan2.2>LTX-2.3https://huggingface.co/collections/Lightricks/ltx-23>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
1st for Ani.
so like. where are the 1girls yo. like I came for the hot pointy chinned 1girls
sfw vageen is ascended
>bullet impacts sounding like drumsmade me leap out of my chair and scream "KINO!!!!!!!!!!!!!!!!!" 3 times and then do a partial backfliphttps://files.catbox.moe/qjzu25.mp4
In the standalone anima trainer, does flash attention made things faster in exchange for quality hit, or just faster?
>Scorsese uses FLUXzit faggots keep seething
>>108976848ask AI
Tested Wan22 Bernini. Here are my initial result on the single test case.R2V: Subject to Video generation.Best: 0.8 Megapixel 81 frames at 30 FPS, OOM on higher res/frames length on a 5090/128gb RAM. Heavily dependent on subject resolution, so best results may varies. Most accurate at 30FPS, lowering FPS seems to degrade reference accuracy. Accuracy also degrades after 81 frames, just like Wan22 base I guess. Bernini can be extended if you are determined to stich 81 frames video together. Seems to lose out against SCAIL on ease of use, VRAM requirements, but SCAIL can only do rigid open pose reference. Bernini can supposedly can do more things, need to test further.>vid related, Bernini 81 + 81https://github.com/Comfy-Org/ComfyUI/pull/14216https://bernini-ai.github.io/
>>108976878everyday I hate myself for being a VRAMlet
I don't get the appeal of video generation
>>108976878make moot do cute things
>>108976889its ok, its not your fault you were born brown
>>108976889making porn of unsuspecting women
>>108976887
Is it possible to train a lora on small (<64x64) sprites?
>>108976878
>>108976783I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT
>>108976998ma'am, i need that for studying
>>108976889for realistic nsfw, nothing else has the prompt understanding and adherence
>>108976889Its only good for porn. For other stuff its cringe
Surely Krea 2 open release won't be like their previous open release and will be better than ZIT, a model from half a year ago, right?
>>108975467not that hard, though 100k seems a bit thin for a full finetune. get a regularisation data set at the very leastLR at batch size 1 around 6e-6 to 8e-6, scale up from there correspondinglycaptioning is the painful part, what i did is run through WD14 or animetimm first, then filter out false positives that pop up when one uses these models on photos (asian, realistic, etc), then gemma4 31b with grounding from these tags and a good system prompti recommend to not tune existing photography tags like photo (medium) or cosplay girl as your main triggers, but do something fresh like an artist tag. trying to build atop the existing ones only resulted in slop semi realism for me
pretty good seed for the plane
american ship cloaking technology captured on filmhttps://files.catbox.moe/nw00el.mp4
>>1089768891girl, plot
>>108977124Hot glue gun to ass? I'd rather take a tattoo
>>108976783Baker, next OP, please:"Discussion and Development of Local Image, Video, Music and Anime Models"
>>108977173MEW my beloved
>>108977019You have Anima, why care?
>>108977173stop posting my gf
>>108977182>>108977193