[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108639162

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>108645380
he makes new threads
not that bad
>>
thank you comfy for providing us with this amazing software free of charge
>>
>>108645380
>after being found out
what happened? i havent been binge watching these threads lately
>>
Blessed thread of frenship
>>
>>108645413
he shits in threads. very unheigenic
>>
File: emad-mostaque.jpg (145 KB, 900x599)
145 KB JPG
what happened to him?
>>
File: 1769432142254647.jpg (1.73 MB, 979x2558)
1.73 MB JPG
>>108645453
>he shits in threads.
yeah, Anifart is definitely a nuisance
>>
>>108645478
who gives a fuck? he's not useful at all in this modern age of AI
>>
>>108645344
Thank you for baking this thread, anon
>>108645432
Thank you for blessing this thread, anon
>>
>>108645478
crypto scammer since he left sai
>>
>>108645488
> fake screen
> jan
>>
>civitai is now two domains of censored or removed loras
Jesus Christ someone kill this garbage.
>>
>>108645600
muh payment processors :(
>>
>>108645507
he made this modern age of ai
>>
>mfw Resource news

04/20/2026

>Elucidating the SNR-t Bias of Diffusion Probabilistic Models
https://github.com/AMAP-ML/DCW

>(1D) Ordered Tokens Enable Efficient Test-Time Search
https://soto.epfl.ch

>Frequency-Aware Flow Matching for High-Quality Image Generation
https://github.com/OliverRensu/FreqFlow

>From Zero to Detail: A Progressive Spectral Decoupling Paradigm for UHD Image Restoration with New Benchmark
https://github.com/NJU-PCALab/ERR

>China’s Alibaba launches 10,000-card computing cluster
https://www.scmp.com/tech/article/3349335/ai-race-us-intensifies-chinas-alibaba-launches-10000-card-computing-cluster

>Modly: Local, open source, AI-powered image-to-3D mesh generation
https://github.com/lightningpixel/modly

>DCW: Elucidating the SNR-t Bias of Diffusion Probabilistic Models
https://github.com/AMAP-ML/DCW

04/19/2026

>ZPix: Local AI image generator and editor powered by open image models.
https://github.com/SamuelTallet/ZPix

>Comfy Canvas: Local inline layer based image editor
https://github.com/Zlata-Salyukova/Comfy-Canvas

04/18/2026

>Rose: Range-Of-Slice Equilibration PyTorch optimizer
https://github.com/MatthewK78/Rose

04/17/2026

>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
https://yjx-research.github.io/ControlFoley

>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens
https://research.nvidia.com/labs/toronto-ai/tokengs

>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation
https://aka.ms/mm-webagent

>Qwen2D-VAE
https://huggingface.co/Anzhc/Qwen2D-VAE

>ComfyUI HY-World 2.0 — WorldMirror 3D
https://github.com/AHEKOT/ComfyUI_HYWorld2

>Anima Style Explorer: A free web tool for ComfyUI styles
https://anima.mooshieblob.com

>Stanford AI Index Report 2026
https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf
>>
>mfw Research news

04/20/2026

>Towards In-Context Tone Style Transfer with A Large-Scale Triplet Dataset
https://arxiv.org/abs/2604.16114

>Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration
https://arxiv.org/abs/2604.15829

>Motion-Adapter: A Diffusion Model Adapter for Text-to-Motion Generation of Compound Actions
https://arxiv.org/abs/2604.16135

>TwoHamsters: Benchmarking Multi-Concept Compositional Unsafety in Text-to-Image Models
https://arxiv.org/abs/2604.15967

>Repurposing 3D Generative Model for Autoregressive Layout Generation
https://fenghora.github.io/LaviGen-Page

>The Amazing Stability of Flow Matching
https://arxiv.org/abs/2604.16079

>DINOv3 Beats Specialized Detectors: A Simple Foundation Model Baseline for Image Forensics
https://arxiv.org/abs/2604.16083

>Sketch and Text Synergy: Fusing Structural Contours and Descriptive Attributes for Fine-Grained Image Retrieval
https://arxiv.org/abs/2604.15735

>AHS: Adaptive Head Synthesis via Synthetic Data Augmentations
https://keh0t0.github.io/AHS

>VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects
https://arxiv.org/abs/2604.16272

>Adapting in the Dark: Efficient and Stable Test-Time Adaptation for Black-Box Models
https://arxiv.org/abs/2604.15609

>From Competition to Coopetition: Coopetitive Training-Free Image Editing Based on Text Guidance
https://arxiv.org/abs/2604.15948

>UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs
https://arxiv.org/abs/2604.15871

>Efficient Video Diffusion Models: Advancements and Challenges
https://arxiv.org/abs/2604.15911

>Making Image Editing Easier via Adaptive Task Reformulation with Agentic Executions
https://arxiv.org/abs/2604.15917

>Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models
https://arxiv.org/abs/2510.21783
>>
ok now it's unblessed again
>>
>>108645488
i like how julien tries to shit on nigbo
infighting lolcows are always fun
>>
is a lora dataset treated exactly like a prompt? e.g. would you need to backslash escape like \(final fantasy\)?
>>
>>108645947
backlash escape my dick
>>
>>108645600
On the bright side, 99.9% of what's there are shitty fried jeet loras, so not much of worth is being lost.
>>108645947
Yes.
>>
>>108645995
>Yes.
No lmao. Unless you're using some fucked up trainer that has prompt weighting turned on by default (why would you even weight prompts at training time?). Every trainer I know of just processes your captions as is, so you don't escape anything.
>>
Any decent controlnet or inpainting tutorials (for dummies)?
>>
File: deLC_zi_00003_.png (2.35 MB, 1792x896)
2.35 MB PNG
>>
>>108646105
>plastic slop
you didn't need to tell us it's local we would have known lol
>>
>>108646105
Deformed goonery? Api is better than that, yes.
>>
>>108646105
>3dcgi style
*vomits*
>>
>>108646105
is this real?
>>
File: 1674062767414593.jpg (55 KB, 474x484)
55 KB JPG
>install new custom node for great audio
>brick install
>>
>>108646118
well he threw down the gauntlet. post one of your api gens that is better than that.
>>
File: file.png (2.55 MB, 1248x1824)
2.55 MB PNG
>>
>>
dont bother trying to ragebait us api chads
we dont post our gens often because we dont want to hurt your feelings
>>
>>108646229
better than what? I don't see anything
>>
>>108646207
How I feel you, chinlet baby.
>>
>>108646299
i don't think a low res face swap is going to ruffle any feathers.
>>
>>108646398
maybe so, but we can turn it up a notch, no biggie
>>
just train a lora retard
>>
Are we still waiting for nunchaku updates?
>>
given a 3d model, you should be able to create the perfect character lora, right? I wish there was a tool that automatically does this instead of manually taking screenshots from multiple angles and poses.
>>
>>108646476
why train a lora when you can toss a bunch of reference into latent space?
>>
>>108646491
>I wish there was a tool that automatically does this instead of manually taking screenshots from multiple angles and poses.
"Hey chatgpt can you create a python script to take N renders of this character on *insert 3d software*"
>>
>>108646515
i would just render a t pose doing a 360 and then dump it into comfy to convert it to images and caption it.
probably overkill unless you had a ridiculously complicated model.
>>
>>108646515
You'd need a way to have the model do atleast 10 or so poses for variety. I don't think any AI model can dynamically create poses from a rig yet.
>>
>>108646507
>why train a lora when you can toss a bunch of reference into latent space?
that sounds like it should work but it doesnt
>>
>>108646551
isn't that how image edit models work?
>>
>>108646547
also close-up shots for fine details or else you'll get slopped accessories
>>
File: qwen_wf.png (539 KB, 1640x949)
539 KB PNG
is this Qwen-image-edit workflow I'm using still up2date?
including that 8 steps lightnting lora?(what does it even do? is it making the gens faster but resulting in lower quality?)
>>
File: 5125124516146134.jpg (535 KB, 1344x768)
535 KB JPG
>>
>>108646599
no one knows how image edit models work, its a black box
>>
File: 7568653734673473.jpg (328 KB, 1344x768)
328 KB JPG
>>
>>108646686
Bad dataset
>>
>>108646740
What?
>>
>>108646854
increase your browser text size, BAD DATASET
>>
holy shit why is ldg suddenly so fucking dead?
>>
>>108647159
Ani is probably busy for once and other than than schizo posting, ldg has been pretty dead for months. Nothing exciting has been released since zit.
>>
calm down anonies its not that deep
>>
File: 1749844493790541.png (111 KB, 384x313)
111 KB PNG
>>108647159
there's nothing to talk about
>>
>>108647203
It's funny to see posts like "ZOMG Y THREAD DEADZ????" as if it's the first time LDG hasn't burned through a thread in less than 4 hours
>>
>>108647193
>ZiB
>anima
>"nothing exciting"
You will reply to this with cope and or seethe
>>
has anyone got a good qwen2512 img2img workflow? Seems simple but I can't get it working
>>
>>108647304
is the default comfyui example broken?
>>
>>108647309
am I missing something but it doesn't seem to exist for simple img2img, only text2img or controlnet
>>
>>108647193
LMAO my dude, LMAO.
you always crack me up.
>>
>>108647315
you're a retard,
qwen edit 2511 is for i2i (edit mode)
or if you meant class i2i with denoise shenaningans then just dilate bro
>>
>>108647345
i dont get it
>>
>>108647348
i want to do img2img with normal loras
>>
File: animapreview3base_00002_.jpg (668 KB, 2048x2048)
668 KB JPG
>>108647159
I’m recruiting anons from anime general to post here, /edg/, /hgg/, etc.
What are (You) doing for /ldg/, anonnies?
>>
One day you will appreciate the slow times.
>>
>>108647159
Local is a graveyard atm.
>>
>>108646434
based
>>
>>108647387
Just vae encode your image to latent and use low denoise. There's a good pose controlNet for 2512 too.
>>
>>108647572
Proof?
>>
>>108646299
This moment you realize Api fag have no idea what they're talking about...
>>
>>108646434
Kek... Sam's cum bucket fap on swim wear catalog...
>>
>>108647749
no u
>>
>>108645728
>>108645736
who rattled nigbo's cage again?
>>
>>108647749
Even if they knew about Malcom's HF, they're unable to use it so... Let them be.
>>
>>108646299
top hinge needs a minor fix xd
>>
>>108647762
>Kek... Sam's cum bucket fap on swim wear catalog...
its a blue board
>>
>>108647834
her hinge needs a minor fix, look at her legs.
>>
File: Z-Image_00005_.png (1.2 MB, 1080x1294)
1.2 MB PNG
>>
>>108647897
Bruh what are your settings even for Z Turdbo that is some heavy artifacting
>>
>>108647854
API cuck don't even knows how to catbox?
>>
https://www.reddit.com/r/StableDiffusion/comments/1squ6in/open_source_crt_animation_lora_for_ltx_23/
this is cool
https://litter.catbox.moe/70a5t3y01avkj291.mp4
>>
File: 1772910684322616.png (1.27 MB, 1344x768)
1.27 MB PNG
>>108648103
>dataset is only 20 clips
yup, thats a hard pass from me.
>>
>>108648139
kek I thought it was an actual screenshot from a porno video and then I saw the NBP logo on bottom left
>>
cascadeur, my beloved...
>>
>>108648139
Anon you really fap to this? Just buy a magazine or watch TV at this point xD
>>
>>108647159
>holy shit why is ldg suddenly so fucking dead?
Most people left when they realized the petr* schizo is samefagging with bots. It's just non-stop FUD aimed at contributors. Do you want to read FUD all day? Most people don't, so this general died. You should leave too.
>>
>>108648229
>You should leave too.
apply your own advice
>>
why is anima genning so slow? am I doing something wrong i'm just not trying it today
>>
>>108648031
localfags can't browse nsfw boards?
lmao
>>
API anon is such a plague, can anyone teach him how to use cumfart? I think he has no idea what we are able to gen...
>>
File: 1771039218200030.jpg (340 KB, 1080x1934)
340 KB JPG
>>108648228
i gave up fapping a long time ago
if i fap then i stop genning, and i dont want to stop genning
>>
>>108648252
lmaoo this is insanely good
>>
*yawn*
>>
>>108648244
We can... Andd we've never seen any Gemini logo there.
>>
>>108645344
can zit ai generate celebrities
>>
>>108648327
then you're not paying attention
>>108648367
yes, see >>108647897
>>
File: ZiT.png (2.68 MB, 1280x1280)
2.68 MB PNG
>>108648367
>can zit ai generate celebrities
it can do some
>>
seems like cloudcuck still prefers to post here instead of his own general. what causes this?
>>
File: 1752932406353692.png (17 KB, 369x304)
17 KB PNG
Has anyone done mask training with OneTrainer? Right now i have masked out faces with a black mask, with everything else i want to train in white. What should these settings be for that case? (i can easily invert the masks if i have it the wrong way around)
>>
>>108648392
the only cuck here is you, we're all posting gens while you just sit in the corner and watch
>>
>>108647911
not that anon but how do i lower the artifacting in zit
>>
>>
>>108648423
whatever helps you sleep at night little buddy
>>
https://civitai.red/models/2560840/anima-turbo-lora
Official Anima turbo lora. With this, the model is now 2x faster than SDXL. Has minimal style shift so you can use it with other checkpoints.
>>
>>108648423
lmao you destroyed that freak
>>
>>108648433
dont worry about me, i'll be sleeping like an angel tonight knowing i put these localdorks in their place
>>
File: arigato.jpg (161 KB, 648x1056)
161 KB JPG
>>108648449
https://civitai.com/models/2466415/cosmos-predict25-2b-base-distilled-extracted-dmd2-lora
This one works in 4 steps, even on Anima Preview 3, it's some sort of back magic. I will give the new one a try however, looks promising.
>>
>>108648449
>>108648500
yeah but you don't have cfg anymore, can't they just turbo'ed their models while keeping the cfg?
>>
>>108648500
That one destroys details to the point of being unusable. It's a miracle it works at all. I think the official turbo lora is much better than any other one I've tested. The latest RDBT 0.25 is pretty good also, but that one is a full checkpoint and has a strong style bias.
>>
>>108648449
POST
IN
ANIME
GENERALS
YOU FUCKING FAGGOOT
>>
>>108648544
uh oh, meltyy
>>
>>108648449
Dude, what did the anime generals on 4chan do to you that you never post your news there?
>>
Reading api cuck post... He's literally schizo praising his own post and answer. Take your meds anon and leave local, no one cares about api here.
>>
>>108648449
based KING DRUSSEL
>>
>>108648449
KYS
>>
>>108648449
>>108648573
Us in the know call him TouchDownRuss.
>>
>>108648544
>>108648577
we know who's fuming lmao
>>
>>108648529
The hell do you need high CFG for? If it's negative prompt you're missing you can give parts of your prompt negative weight with this: https://github.com/pamparamm/ComfyUI-ppm Works with Anima.
>>
>>108648449
based Tdrussel knowing that /ldg/ is the best diffusion general
>>
>>108648573
>>108648582
i call him mommy actually
>>
>>108648449
omfg im so tired of speedniggers

make a lora that removes VAEshit so it takes 10 times longer to generate but ACTUALLY MAKES THE IMAGE AND COLORS BETTER
FUCK NIGGERS
>>
>>108648449
THIS GENERAL IS TRASH DUDE WHY YOU KEEP HERE!!!! WHY DO YOU POST TO ANOTHER DEVS SHILLING ERNIE AND CHROMA!!?!?
>>
File: 1758144888888090.png (296 KB, 640x640)
296 KB PNG
>>108648604
>THIS GENERAL IS TRASH DUDE WHY YOU KEEP HERE!!!!
SAAR DO NOT REDEEM
>>
>>108648601
>that removes VAEshit
lodestone is trying that, with really mixed results...
>>
>>108648544
>>108648577
>>108648614
now I want russel to post more here, this schizo's melties are so funny lmao
>>
>>108648585
>https://github.com/pamparamm/ComfyUI-ppm Works with Anima.
first time I've heard about it, usually when you want to use negs without a CFG you go for NAG
>>
>>
>>108648449
Tdrusell! please! my wife is yours! fuck her!
>>
>>108648449
Thankies!
>>
>Another Touchdown Russell!
>/ldg/ - 100
>/hdg2/ - 0
SOMEBODY CALL THE DAMN MATCH
>>
File: shinonome.jpg (183 KB, 608x1408)
183 KB JPG
>>108648640
All the nodes I checked for NAG didn't work with Anima, but this does so I went with that and I haven't bothered to check up if anyone has finally implemented NAG for anima yet. Also, it might have been a skill issue but my experience with NAG was that it completely fucked my images half the time and I haven't had that problem with this.
>>
>>108648449
Someone call the Saber Alter slopper! We need him to test this and give some vague commentary. He’s the only one who care for this faggot!
>>
>>108648449
Since this is sort of a base version, do you plan to make some controlnets or will you wait for the final release?
>>
>>108648698
Ho ho ho, testing this turbo lora @takeuchi takashi is more “takauchi” than “Takeuchi.” Good lora!
>>
>>108648449
Why don’t you sell some wheels of your comfy granted Lamborghini and train a good Anima ControlNet?
>>
>>108648700
Unlikely until after full release. The current turbo lora was actually just an experiment with testing the techniques that I'll use for the full Turbo checkpoint. It was good enough that I decided to release it.
>>
>>108648760
any plans to release the realism lora?
>>
>>108648449
Isn't there an /adt/ thread for this. Why bring the updates to realism thread, it's not even relevant to anyone here????
>>
>>108648760
Thanks, can you repost this in any anime general of your choice?
>>
>>108648777
No, for optics reasons. But literally just take 1000 photos from any decent pre-captioned photo dataset and train a rank 32 lora, that's the realism lora basically, it's not complicated. Or just wait for someone to inevitably do that.
>>
>>108648791
>realism thread
read again, /ldg/ means local diffusion generals
>>
>>108648791
based, he is a fucking faggot, in /adt/we are planning to never use his model again
>>
russgod you singlehandedly saved local anime from noob slop thank you kind sir
>>
>>108648801
I judge by what people do, not what they say.
>>
File: 1776716800944930.png (1.4 MB, 1536x1536)
1.4 MB PNG
>>108648791
>>108647620
this is an everyday /adt/ anima gen, but turdrusell prefer to post here, in Lodestone general
>>
>>108648377
>>108648382
interdasting.
>>
>>108648802
>we
you and your voices?
>>
>>108648809
This is cute! why tdrusell don't post in /adt/?
>>
>>108648800
ah yeah thats fair. guess ill have to save up some money and make one but was debating on general photos or cosplay specifically for the dataset
>>
>>108648808
>I judge by what people do
good, because what people did was to post an anime lora here
>>
>>108648809
wow look it's my gen
>>
Squirm slop anime posters. Squirm and bow to the superior general known as LDG.
>>
Why tdrusell post an Anime lora in a thread that everyday posts 3d asian woman? Is part of the Comfy grant contract?
>>
>>108648826
unironically its pretty nuts that the more general local diffusion anime posts blow 90% of the "anime centric" general out of the water kek
>>
>>108648824
wow look you losted, and you keep using his model, you are a cuck
>>
>>108648822
How are you going to use this lora to generate Ortega's feet though?? It's just not relevant no matter how you look at it.
>>
>>108648840
>How are you going to use this lora to generate Ortega's feet though??
is this a bot or something?
>>
>>108648449
But this is a realism general, I want Katy perry, what are you doing here?
>>
File: ol yuffie.png (2.67 MB, 1856x1536)
2.67 MB PNG
>>108648449
pretty cool
>>
I just looked at adt for the first time why do all the gens look all slimy and not like anime?
>>
>>108648856
real anime is moe, and /adt/ is a moe general
>>
>>108648777
Look at this dude already salivating over new model he can use for CP diffusion. Ewwww this is just not the place for anima.
>>
>>108648837
I don't have anything to say to Mr. Touch Down, so I don't really care that he doesn't post in /adt/. I guess I lost(ed) and am a cuck.
>>
>>108648856
>all slimy
signature of model merges. in the biz we call them "slop merges" or "jeetmixes" for that very reason. because they are ugly.
>>
>>108648449
>>>/h/8864351
>>
>>108648851
wow, your gen is slop, but you have the SOTA dev posting here, the irony is beautifull
>>
File: ComfyUI_10363_.png (1.08 MB, 1024x1024)
1.08 MB PNG
>>108648856
>slimy
>>
>>108648449
>>108648630
>>
File: 1776022168238295.png (80 KB, 480x360)
80 KB PNG
>>108648878
>the SOTA dev posting here
a SOTA dev posting on a SOTA general, yeah that fits
>>
>>108648878
i like the way it looks, that's all i care about desu. if i wanted artsy shit i would've invested my time into drawing or painting
>>
best ai image gen general on the chanz no bullshit just cutting edge tech and gens
>>
>>108648882
>posted a 3d reaction pic
yeah, more Anima news for this thread! :^)
>>
>>108648885
>if i wanted artsy shit i would've invested my time into drawing or painting
are you implying AI can't do fine art? lol
>>
>>108648449
>>>/e/3073046
>>
>>108648449
is anime that good? people react to it like crazy, jesus
>>
Mass schizo suicide
>>
>>108648899
It is the only anime model that matters right now. Everything else is a cope unironically.
>>
File: 9967946948484.jpg (632 KB, 1344x745)
632 KB JPG
>>108647068
Ohhh, I think I understand what you meant by that. No, it looks the way it is supposed to. It was trained on Mushoku Tensei anime screenshots. I'm still playing around with the lora settings, but results are promising
>>
>>108648889
i'm not implying anything. i want it to look good and the computer makes it look good. i don't care about things looking "artistic" or any other definition some random person on the internet cares about like "slop"
>>
>>108648449
Big Russ! Where's the realism lora?
>>
File: 94784794.jpg (635 KB, 1344x768)
635 KB JPG
>>
File: DCom3xEYkMbioLO28D62t.png (988 KB, 832x1216)
988 KB PNG
An anime model being what finally kills this thread was not what I expected.
>>
>>108648760
Since the highres training is already in progress, I'm assuming the next release will be the "final" 1.0 base model?
>>
>>108648760
is there a reason why you went for that base model? don't get me wrong it's a really good finetune, but there were better candidates no?
>>
>>108648997
yeah like uuuuh uuuum
>>
>>108648977
>kills this thread
?
>>
>>108648737
I don't understand your bad faith approach, I don't see what point you're trying to make.
>>
File: ComfyUI_temp_cslne_00005_.png (2.04 MB, 1056x1440)
2.04 MB PNG
>>
File: 191562510455854.png (2.25 MB, 1248x1824)
2.25 MB PNG
>>
>>108649062
>>108649053
Good gens, you deseve Anima news!
>>
File: 75656885667.jpg (167 KB, 768x768)
167 KB JPG
>>108649011
The shitposting and samefagging orbiting around it is dwarfing the zit meltdowns and chroma meltdowns from last year. I don't know if it's just site traffic being down or what, but when 90% of posts are some bored asshole with a script it gets annoying.
>>
File: comfy__14.jpg (1.96 MB, 1501x2208)
1.96 MB JPG
>>
File: 674267857621504.jpg (2.11 MB, 2496x2006)
2.11 MB JPG
>>108648449
It's very fast, pretty good quality.
>>
>>108649100
Wtf? I like more the turbo version. Can you make more?
>>
>>108649079
>is dwarfing the zit meltdowns and chroma meltdowns from last year
Mayhaps. But there's been worse schizo meltdowns desu.
>>
>>108648449
>2x faster than SDXL
Does it have x2 the aesthetics of Noob?
>>
File: 705872094488448.png (1.98 MB, 1248x1824)
1.98 MB PNG
>>108649115
The turbo version captures less of the artist style and more of it's own distilled style, maybe that's why.
>>
julien status?
>>
File: 813892313650215.png (3.25 MB, 2303x1673)
3.25 MB PNG
>>108649115
>>108649131
Curiously though, when using an artist style lora it seems to adhere to it maybe even closer than the base model by itself.
>>
better i2v??? WHERE IS IT? SEED?
>>
File: 277170810591572.png (3.31 MB, 2303x1673)
3.31 MB PNG
>>
hm... what to train on.... hm ...
>>
>>108648449
This news is a nothingburger. It is known that Anima has worse style interpretation than Chenkin/Noob/Mugen, and now you add speed at the cost of more diluted styles.
What use is it to me to use such a diluted model if I can obtain superior results with Chenkin and its superior ControlNet that can capture any scene I propose with Cascadeur?
These little experiments you make are a nothingburger until you release a ControlNet for Anima.
>>
>>108649072
anima is limited with z. genital parts are horrible
>>
>>108649226
just use a pencil faggot
>>
File: 901834508854949.jpg (2.82 MB, 2048x1737)
2.82 MB JPG
>>
Cool example gen you posted to help illustrate your point anon
>>
>>108649093
Kino
>>
>>108649244
I don't post anime gens here, i'm not part of the problem
>>
File: 434430340517180.png (2.41 MB, 1536x1024)
2.41 MB PNG
>>108648909
>>108648937
serene
>>
/ldg/ is the true anime general
>>
>>108649239
Google Cascadeur before saying stupid things. It's a free 3D software where you can build 3D scenes and characters with integrated AI. For example, you can move any part of the body and the character moves along with the limb you drag. It's much more intuitive than drawing and much more intuitive than designing a 3D scene in Blender. Then you export that in depth or canny format and you own whatever scene you set out to create. With that, I don't need some stupid new random anime model that has +0.005 prompt adherence in exchange for 70% worse aesthetics.
>>
anon absolutely SEETHING keke
>>
File: 841977107163987.png (2.23 MB, 1536x1024)
2.23 MB PNG
>>
what a retard lmao
>>
File: 366465205940280.png (1.3 MB, 1536x1024)
1.3 MB PNG
>>
ControlNet, regional prompter, and Homing's new mosaic outpainting solved the majority of SDXL problems. I don't need a new anime model that understands "the cat is on the table" when I can mask where I want the cat to be and I can maks where I want the table to be. It seems retarded to not rely on peripheral tools.
>>
>>108649226
>>108649341
Did the journal factory explode
>>
shut the fuck up artcel
>>
File: 43195659524137.png (1.24 MB, 1536x1024)
1.24 MB PNG
>>
>>108649260
>>108649302
neat
>>
>>108648449
When are you going to share how to make a character lora on Windows ? Your tutorial only works for Linux and python.
>>
File: 7674373473472.jpg (702 KB, 1344x768)
702 KB JPG
>>
>>108649371
You don't already run Linux? What are you doing here?
>>
File: z-image_00957_.png (1.27 MB, 1280x720)
1.27 MB PNG
>>
File: ComfyUI_06674_.jpg (1.3 MB, 1536x2560)
1.3 MB JPG
Big russ can you train a lora that fixes the fingers?
>>
>scrape 2k+ img from online account
>now must manually sort through imgs good enough for dataset
why has this not been automated yet
>>
>>108649480
the fingers will be fixed at high res training, don't forget he's at 512x right now
>>
>>108649517
fpgaminer made this but I have no idea if is decent
https://github.com/fpgaminer/joyquality
>>
>>108649531
kekstone is that you???
>>
>>108649371
use wsl
>>
File: ComfyUI_03110_.webm (3.92 MB, 1200x2000)
3.92 MB WEBM
>>108649531
They said the same thing about Chroma...
>>
>>108649480
>fingers
desu still better than sdxl
>>
it's been a full year of wan gens and im still not burnt out and gen 24/7.
>>
>>108649480
no one care about fingers for drawings
>>
>>108649773
wan is special. Most vramlets here don't know
>>
cozy
>>
copy
>>
>>108649773
this. wan is even better than grok sometimes.
>>
>>108649773
I like it just wish we could get something new. Really want something with decent cartoon/anime lipsync.
>>
>>108649517
>why has this not been automated yet
Takes few hours, 2k images is nothing if you only have to sort them and not remove watermarks and crop
>>
>>108650024
Grok is actually worse by many metrics, but it's pretty good at randomly gluing dicks and balls onto things.
>>
>>108649773
There are enough models now that i can already forever bounce between ZIT/Chroma for realism, Noob for tranime, Wan for videos.
>>
As someone who followed Chroma until the "official" 1.0 release which was v50 or whatever it was, where can I learn more about all of the actual versions kekstone is training? There has to be a fucking list that explains them, right? It cant STILL all be in trooncord only?

There was some 2k or whatever the fuck that was touted as better than v48 and below, is that still the "best" regular current Chroma model or is there something after?

There is ZIT Chroma thats in training still, Radience pixelspace thats in training still, what else?
>>
>>108649398
Would mating press
>>
>>108649397
Where?
>>
File: 1775240292788259.png (1.03 MB, 705x928)
1.03 MB PNG
>crtanim — CRT / Retro Terminal Video LoRA for LTX‑2.3 22B
https://www.reddit.com/r/StableDiffusion/comments/1squ6in/open_source_crt_animation_lora_for_ltx_23/
https://huggingface.co/lovis93/crt-animation-terminal-ltx-2.3-lora
>>
>>108650124
I have no idea how chroma works. Every time I try some prompts on it, on get garbled shite now matter how much I tweak the settings. I'm using chroma1-hd and genning at 30 steps
>>
This is AI music right?
https://youtu.be/Ikkizhy3nMw
>>
pic related was made on a MacBook Neo with Anima preview 3 using the turbo lora that came out today. 8 steps took 253 seconds in total. This is around the maximum resolution you can do, and I had to use the Q8 GGUF version of Anima as well.

Good to know I can still generate a 1girl every 5 minutes if I'm bored on vacation with my wife's laptop I guess

>>108649773
the wan 2.1/2.2 rollout was one of the top 5 open source model releases of all time. if I didn't crave audio now as a result of trying SaaS models I wouldn't be burnt out on it either. It's also permanently "good enough" for many basic I2V tasks for simple memes or prototype animations so it'll keep being used for years to come.
>>
>>108650189
Looks pretty cool
>>
>>108650144
It's pretty much all in the discord now since most (all?) of the regular posters have left the general and nobody posts on HF. I'll try to cover them all.

Base is really just v48 since v50 introduced some quirks with its outputs. 2k was an attempt to compensate for the 512 training, but I haven't used it so I can't comment on if the anatomy is any better with it or not. Radiance and Kaleidoscope (Vae-less and Klein-based model respectively) have both had training failures and are effectively cancelled. That just leaves Zeta Chroma (Zimage-based) as the only current project. Unrelated to lode, there's a checkpoint called Spark that is currently in progress.

As for which one is better? Some people like the flash models since they clamp down on the model's schizo creativity, but they're more slopped as a result. For my niche use case the base model is still unmatched.
>>
tdruss and anima once again based for not hiding discussion behind some coomcord
>>
>>108650373
what training failure happened with radience? rip. is there a known best checkpoint?
>>
>>108650440
Sorry anon, I haven't really kept up that closely with it so I'm not certain. If I remember correctly Radiance was simply too expensive to keep maintaining.
>>
>>108648449
Imagine Anima but more slopped lmao
Looking forward to seeing some epic 'five porcupines on the left anime girl on the right abstract style with a gundam in the background' nonsense gens in /adt/ and /sdg/ that somehow look even uglier than usual
>>
File: 1747029221116112.png (2.57 MB, 1328x2048)
2.57 MB PNG
>>
File: 1761168635420497.png (3.07 MB, 1328x2048)
3.07 MB PNG
>>
File: 1770360678208696.png (1.25 MB, 1024x1024)
1.25 MB PNG
>>
File: 1753678298627795.png (557 KB, 2100x6300)
557 KB PNG
I don't know anything about model training, is that bump good or bad?
>>
>>108650610
oh dear SWEET MOTHER OF GOD WHT THE FUCK
>>
File: 1775262013602130.png (1.42 MB, 832x1280)
1.42 MB PNG
>>
File: Flux2-Klein_00078_.png (1.88 MB, 1024x1024)
1.88 MB PNG
>>
>>108650615
so it's bad?
>>
File: 1757831978637399.png (1.35 MB, 832x1280)
1.35 MB PNG
>>
Anyone knows a good AI 2hu themed general? I want to share some gens and some 2hu chatbot logs?
>>
File: sabergelion alter.jpg (2.6 MB, 3264x2020)
2.6 MB JPG
>>108648449
Holy sorcery, first the high-res lora and now this.

>>108648698
Shoukan ni ouji sanjoushita~

1920 height stretches the body. I should test if it can be combined with the high-res lora.
>>
File: 1280x1600_compare.jpg (2.55 MB, 3840x1700)
2.55 MB JPG
>>108650793
Works better at 1280x1600.
>>
>>108650793
>>108650809
Looks way worse with turbo lora
>>
File: boxer alter.jpg (2.63 MB, 3840x1700)
2.63 MB JPG
>>108650793
>>108650809
Forgot to mention the non-turbo results use 20 steps.

Unintentional text comes out clearer with 12 steps. Coloring is fancier than base's version of Nipi style.
>>
File: hires+turbo.jpg (2.98 MB, 3264x2020)
2.98 MB JPG
>>108650793
Stacking the high-res and turbo loras works decently. Dat hand tho.
>>
File: pill dond by nttruslan.jpg (2.69 MB, 3840x1700)
2.69 MB JPG
>>108650809
Hires + Turbo at 1280x1600. Clearer unintentional text at 12 steps again.

Takeuchi style comes out sharper and more-simplified with turbo overall. Tends to have some AI artifacts, but the speed is fun when 20 steps takes me several minutes.
>>
SD 1.5 fag here, are character LORAs a thing of the past with modern models?
>>
File: nipi27 (redrop absorbido).jpg (2.55 MB, 3840x1700)
2.55 MB JPG
>>108650853
Hires + Turbo with this one almost looks more like Redrop than Nipi.
>>
the most impressive thing about the turbo lora is that most other ones always introduce a bunch of artifacts but this remains clean
>>
>>108650997
They're still useful.
>>
>>108650982
>>108651029
can you share prompts?
>>
File: WaiAnima1+TurboLora.jpg (2.47 MB, 3264x2020)
2.47 MB JPG
>>108650793
Turbo lora works with WaiAnima too. Wai's aesthetic tuning over base preview 3 still shines through.

>>108650853
>Forgot to mention the non-turbo results use 20 steps.
Correction: 30 steps for the gown ones, and 20 for the MMA one.

>>108651160
For the gown ones:
>masterpiece, 1girl, saber alter, fate \(series\), @takeuchi takashi, evening gown, nightclub, bar, looking at viewer, serious
>Seed: 106979513466387
>er_sde simple
>30 steps

For the MMA one:
>1girl, saber alter, fate \(series\), @nipi27, low ponytail, black sports bra, petite, flat chest, black dolphin shorts, black fingerless gloves, bruise, dirty, sweat, heavy breathing, looking at viewer, serious, stretching, boxing ring, crowd
>Neg prompt: letterboxed, border, black border, white border, black background
>Seed: 790426829359823
>er_sde simple
>20 steps
>>
>>108648449
my man based, thanks
>>
>>108651255
test with prompt
>Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.
>>
When is anima finishing the training, sisters?
>>
When are of 35 stars status??????
>>
>>108650809
>>108651255
Wai+Turbo, 1280x1600. I like the middle one.
>>
File: WaiAnima1+Turbo_00001_.png (2.68 MB, 1600x1280)
2.68 MB PNG
>>108651265
Dozo. Wai+Turbo
>>
File: Anima0.3+Turbo_00011_.png (3.08 MB, 1600x1280)
3.08 MB PNG
>>108651265
>>108651306
And here's base preview3 + turbo. Hope you like guys, haha.
>>
File: 1773177153069470.png (2.46 MB, 2935x1604)
2.46 MB PNG
https://jeoyal.github.io/MegaStyle/
>style transfer lora
nice
>Flux 1
OH COME ON
>>
What's the best model for coomgen?
>>
File: WaiAnima1+TurboLora_boxer.jpg (2.33 MB, 3840x1700)
2.33 MB JPG
>>108650853
And finally Wai+Turbo of that. Shinier and grew an extra finger. This is the prompt that does NOT start with "masterpiece", interestingly.
>>
File: Untitled.png (1012 KB, 2995x559)
1012 KB PNG
Is the right one the best one? For nsfw
>>
>>108651380
It's the one I've used and like, FWIW. Haven't tried the other two.
>>
>>108651380
check out Pony Diffusion V6 XL, lil bro
>>
>>108651392
Im not a furry bwo
>>
Anima previews:
>latent2rgb: 1.67 s/it
>none: 1.75 s/it
??
>>
>>108650853
>>108650905
>>108650982
>>108651029
Saber Slopper, you anime poster traitor. Such a casual slopper, your posts are pure amateur cringe. You only open your UI to pretend you test things when tdrusell posts.
>>
>wai
stop eating shit, poojeets
>>
>>108651492
you're jealous because tdrusell got the grant and not you
>>
>>108651380
Chenkin is a good all rounded model. Noob, if you have a clearer idea of what you want, or Mugen for the first pass to get good colors and textures.
>>
>>108651495
Regardless of who I am, that does not mean the things I said about you are lies, on the contrary, they are very true. You are very much a newfag casual. You post whenever tdrusell goes.
>>
>>108651505
35 stars status?
>>
If everyone contributed their grain of salt and posted anime where it is supposed to be, none of this would be happening.
>>
File: WaiAnima1+Turbo_00014_.png (761 KB, 1024x1024)
761 KB PNG
>>108651492
It's a sloppy job, but hey, three testcases over time.
>>
Does anyone found a solution how to use Euler A with anima without the reoccuring weird noisy patterns? This hasn't really improved since preview1, so I don't think it's pixelstretching due to lack of 1024px training. Maybe ancestral sampler not working well with flow-matched model in general?
>>
>>108651494
But it's good shit, gens are quick and the coom rises quick through my balls
>>
>>108651582
I tried euler_a normal for a while and didn't notice anything. But I have no concept of how the various samplers and schedulers differ.
>>
>>108649226
>It is known that Anima has worse style interpretation than Chenkin/Noob/Mugen

Lol, lmao even.
>>
File: 1757585511364086.jpg (1.63 MB, 3256x1658)
1.63 MB JPG
https://www.reddit.com/r/StableDiffusion/comments/1srk0xx/release_comfyui_diffaid_patches_inferencetime/
https://github.com/xmarre/ComfyUI-DiffAid-Patches
https://arxiv.org/abs/2602.13585
>Beyond improving generation quality, Diff-Aid yields interpretable modulation patterns that reveal how different blocks, timesteps, and textual tokens contribute to semantic alignment during denoising.
looks like it can be used on flux 2 klein
>>
File: Gpt-Image 2.png (2.48 MB, 1448x1086)
2.48 MB PNG
>Gpt Image 2 is being rolled out to all ChatGPT accounts
local losted'ed
>>
File: 1772307853516045.png (110 KB, 510x183)
110 KB PNG
>>108651704
this is impressive desu, it even managed to write this little text here
>>
>>108651627
You never seen the pretty big circle-shaped grain patterns on details/shadows/random shit? Sorry but I can't believe this.
>>
File: big if true.png (361 KB, 736x444)
361 KB PNG
>>108651687
https://www.reddit.com/r/StableDiffusion/comments/1srk0xx/comment/ohfayx2/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>I think the more interesting part of this is SDXL, which will now be able to "edit"
GROK IS THAT TRUE??
>>
>>108651737
I asked gemini and it said no, that ledditor is tripping
>>
>>108651737
why ask grok when you can just test it yourself.
>>
>>108651728
Hmmm, nyope.
>>
>>108651687
can this be used on Z-image?
>>
File: Capture.png (3.52 MB, 3134x1302)
3.52 MB PNG
>>108651687
>https://arxiv.org/abs/2602.13585
if this can make Klein less slopped I'm all for it
>>
File: eulera.jpg (934 KB, 2812x1999)
934 KB JPG
>>108651772
>>108651582
for reference. I think you just never looked at your gens then. This is not a debatable issue. Anyone who used euler a has seen this.
>>
>>108651796
fuck euler a, blurry piece of shit
>>
>>108651810
mostly i agree, but sometimes i really want the softness of euler for my textures but the get cucked by that shit. Upscaling with tiled diffusion can fix this though.
>>
>>108651796
I have literally never seen this on euler a
what are your other settings, and try negging "halftone" and "screentones"
>>
>>108651737
dev, responded. the person was retarded.

> No, you are missing the point too. It's not about giving a model edit capabilities. It's about giving a model better prompt adherence (which in turn can boost pre-existing edit capabilities since they are intertwined with the prompt) and overall image quality by modulating text conditioning.

Sure sdxl will probably be better with inpaint tasks with the help of DiffAid, but it doesn't just suddenly give it the capabilities of a proper edit model like flux klein.
>>
File: dem hips.png (653 KB, 1024x1100)
653 KB PNG
>>108651796
That print-looking pattern? I don't seem to get it on mine though.
>>
>>108651927
its not an issue on cel shaded 2d or soft shaded semi-realistic styles. But whenever something a little bit more drawish is involved, it craps its pants.
>>
sex
sex with saber
sex with ARTORIA
>>
>>108651796
bro just go look at the average booru image and be disgusted
euler a is probably just making it easier for that shit from training to come up
>>
File: lul.png (531 KB, 2098x1492)
531 KB PNG
this is it, LTX3, as good as seedance 2.0, are you ready? :^)
>>
>>108652028
ltx 4.20 to the moon :rocket:
>>
>>108652028
I'm not expecing much (or anything at all perhaps), but I do like that they've taken the initiative so far; they had a positive response and they built on it, unlike some I could name who had a fantastic reception and followed it with radio silence
>>
these are seedanece 2.0 text2video nsfw, there is still some safety cucking filters and heavy copyright filters. I wish the chinks would stop with the endless wan2.1 and wan 2.2 finetunes and just focus on finetuning ltx 2.3. I really wish lightricks trained the model on accurate detailed nudity and anatomy.
https://litter.catbox.moe/bvwxgx.mp4
https://litter.catbox.moe/hy925n.mp4
https://litter.catbox.moe/0a9y3l.mp4
https://litter.catbox.moe/936897.mp4
https://litter.catbox.moe/9vigar.mp4
https://litter.catbox.moe/n1lgxt.mp4
https://litter.catbox.moe/j6ls3t.mp4
https://litter.catbox.moe/70lrzq.mp4
>>
>>108651492
Butthurt loser
>>
>>108652101
share this here >>>/wsg/6126746
>>
>>108652101
no one cares retarded faggot, stop spamming your shitty api gens here every single fucking time
>>
>>108652101
lol
>>
>>108651737
SDXL bros??? I knew that Anima was the false messiah, the antichrist of anime disguised as a good anime model, I knew it! Nobody in their right mind posts Anime model news here unless they are the antichrist of anime diffusion!
>>
>>108652177
>>108651528
>>
>>108651737
>rectified text and image
Bluvoll won and Mugen won
>>
File: 1770573245233488.png (65 KB, 333x498)
65 KB PNG
>>108652177
>>108652187
you fell for misinformation
>>
>>108652101
the world would explode if people would be able to get something locally lmao
>>
Did russell mention if there is going to be a preview4 etc or is the p3 "base" it now? Since he writes he will release a turbo model for Anima, I assume its also based on preview3 which would imply that that's it, we are done? Havent followed the thread much in recent weeks so idk if he said something to this end.
>>
>>108652215
fud status?
>>
File: 1762664986876495.png (1.52 MB, 1024x1024)
1.52 MB PNG
>can gen kino in 5 seconds now
HOLYYYYYYYYYYYYYYYY
>>
>>108652256
?
>>
File: 56546872341885848.png (2.12 MB, 1152x1536)
2.12 MB PNG
>>108651982
check yourself before you wreck yourself
>>
>>108652270
why are you poor
>>
File: 1772852862285135.jpg (836 KB, 1840x2456)
836 KB JPG
>>108652296
fixed her
youre welcome
>>
>>108652211
>locally with consumer enthusiasts gpu of 32gb vram?
The model would have trained and developed from the begin to designed from consumer level gpus. Currently the SOTA chink and western ai labs don't want to commit to making video models fit within 24-40gbs of vram at fp16. Ltx 2.3 fp8 720p is pushing the limit for a 5090/64gb of ram pc build.
>>
>>108652215
Anima 3 is the latest version of Anima. There won't be any more updates now.
>>
>>108652313
nice. what'd you use, zit?
i just started fucking around with anima, that's just cosplay photo of and then upscale by.
>>
>>108652344
I basically hires-fix'd your image with zit after captioning it, I love my zitslop
>>
>>108652353
i'm kind of loving anima, for such a small model it's surprisingly competent.
>>
>>108652338
Thank you
>>
File: 856725727272.jpg (1.09 MB, 2016x1152)
1.09 MB JPG
>>
>>108652215
>>108652338
It's still called "preview", so there's clearly more planned.
>>
>>108652338
>>108652441
go for a non meme base model though, why did you go for fucking chronos :(
>>
>>108652441
It's also called "base". And what's clear about that
>>
ltx2.3 chads where we at?
>>
>>108652361
Can you mix artist tags? Can you weight them? Can you use ControlNet? For such a small and new model it is pretty shit THO!
>>108652441
Can you show me where it says anima 3 preview?
>>
File: 64565.png (170 KB, 3522x621)
170 KB PNG
>>108652101
why does this happen for all those links?
>>
>>108652456
>For such a small and new model it is pretty shit THO!
what are you waiting for to make something better? you promised to >>108648583
>>
>>108652456
https://huggingface.co/circlestone-labs/Anima/tree/main/split_files/diffusion_models
>anima-preview3-base.safetensors
>>
>>108652456
Everywhere it's hosted, Ani. Everywhere it's hosted.
>>
>>108652456
in other words you need controlnets to get anything remotely looking nice. Skill issue. And yea you can weight artists.
>>
arggh why are you guys using anima just use mugen UGHHHH im so mad why cant you use the model we made??? yeah its melted and it works like fucking garbage but who cares? I'd rather shill my garbage model and fud 24/7 the competition otherwise I will realize I have no reason to live and sudoku because im a clueless talentless grifter and failed developer.
>>
>>108652501
kinda crazy that mugen was tailormade for 1girl yet /edg/, 1girl central, doesn't use it
>>
>>108652508
yeah I wonder why lmao
>>
File: 1753718602004154.png (959 KB, 1280x720)
959 KB PNG
>>108652501
kek, it do be like that
>>
/edg/ will never migrate from illustrious, they are too obsessed with specific gay little aesthetics they have on it. they care more about making their "signature" images than anything else
>>
>>108652528
wai anima is already that shiny brown ai slop "aesthetic"
>>
>>108652468
looks like it expired too early forgot to change the settings for longer than a hour. here you go
https://litter.catbox.moe/ilq97n.mp4
https://litter.catbox.moe/1pul90.mp4
https://litter.catbox.moe/b2f5jy.mp4
https://litter.catbox.moe/0y5gmp.mp4
https://litter.catbox.moe/m47p16.mp4
https://litter.catbox.moe/xwp2l7.mp4
https://litter.catbox.moe/60u9w5.mp4
https://litter.catbox.moe/jszt0b.mp4
https://litter.catbox.moe/86ypn3.mp4
https://litter.catbox.moe/qdvb2x.mp4
>>
>>108652564
uh oh...
localcucks are about to have a melty
>>
>still fudding
>>
>>108652211
i don't care about heavy seedance models. i want wan2.5, which can be local without a doubt
>>
it's over
>>
>>108652534
they want their specific loras that let them make their "signature" images
>>
File: 1763912753777134.jpg (52 KB, 499x499)
52 KB JPG
>>108652694
we are too smart for china. they know that if they share their new video models, they will never beat us again kek
>>
it's only just beginning.
>>
it's on going, with ups and downs
>>
Fresh

>>108652848
>>108652848
>>108652848
>>108652848
>>
>>108646491
>>108646541
>>108646515
Not OP, but using 3D models does make consistency way easier.

My workflow involve bringing Daz3D characters into Blender, rendering multiple angles into a 16:9 character sheets, then using a realism Lora. After that, I create all my scenes in Blender and just render first & last frames with the same realism Lora, only doing a face swap using the character sheet when needed.

The initial pre-production is tedious, but you can also just batch run these overnight and wakeup to all the frames you need.
>>
...guys I dont know even know what is going on or what is needed for these python scripts. I am that dumb. I'm not able to just run the scripts as posted in my Python shell, I'm like not even smart enough to ask the right questions. Just this whole git/pip thing eludes me. Trying to run WAN locally, have used StableDiffusion fine



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.