[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


iscussion and Development of Local Image and Video Models

Previous: >>108938075

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>model that lets you imitate a sound with your voice, then uses that vocal imitation together with text as input to generate the sound you actually want.

https://github.com/thxxx/VTS
https://www.reddit.com/r/LocalLLaMA/comments/1trve9e/open_source_turning_vocal_imitations_into_sound/

Is there another project like this? Surely there must be, this would be even better with a bigger audio gen model.
>>
File: 200245CUI_00001_.png (1.19 MB, 1192x1536)
1.19 MB PNG
>>
>>108943787
nice style
>>
File: Wan2.2_i2v_00020_.mp4 (532 KB, 784x1024)
532 KB
532 KB MP4
>>108943765
>>
File: zImageturbo_00052_.jpg (816 KB, 1920x1376)
816 KB JPG
>>
File: 200954CUI_00001_.png (1.7 MB, 1192x1536)
1.7 MB PNG
>>108943810
https://civitai.red/models/2256925/firedotinc-style-animanewbie-01
goated lora with insane versatility
>>
>>108943888
thanks, nice tripps
>>
File: zImageturbo_00059_.jpg (694 KB, 1920x1376)
694 KB JPG
>>
>>108943921
Kek
>>
File: 203018CUI_00001_.png (1.59 MB, 1192x1536)
1.59 MB PNG
>>108943898
Thanks. Check this 9.
>>
>mfw Resource news

05/30/2026

>Pixal3D — Apple Silicon (MPS / Metal) Port
https://github.com/pawel-mazurkiewicz/Pixal3D-mac

>Comfy-Org/PixelDiT (diffusion models & upscalers)
https://huggingface.co/Comfy-Org/PixelDiT/tree/main/diffusion_models

>Orion4D Generative Paint: ComfyUI advanced painting interface
https://github.com/orion4d/Orion4D_generative_paint

>ComfyUI Anima IP-Adapter
https://github.com/Wenaka2004/comfyui-anima-ipadapter

05/29/2026

>Colored Noise Diffusion Sampling
https://hadardavidson.github.io/CNS

>VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
https://videomla.github.io

>minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
https://github.com/shengshu-ai/minWM](https://github.com/shengshu-ai/minWM

>GPIC: A Giant Permissive Image Corpus for Visual Generation
https://gpic.stanford.edu

>SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
https://github.com/ModelTC/LightX2V

>Native Audio-Visual Alignment for Generation
https://ernie-research.github.io/NAVA

>GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
https://github.com/L-YeZhu/GASS_T2I

>SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
https://github.com/JiachengZ01/SAVVA

>Nexus BTA: Local AI image and video studio built around an embedded ComfyUI runtime.
https://github.com/JpAndreBTA/Nexus-BTA

05/28/2026

>MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation
https://github.com/AIM-SCU/CRAFT

>Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept Decompositions
https://github.com/vitryt/label-free-bias-identification

>VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer
https://github.com/Lrrrr549/VidPrism.git

>Wan2.2-NVFP4-Sparse (NVFP4)
https://huggingface.co/lightx2v/Wan2.2-NVFP4-Sparse
>>
>mfw Research news

05/30/2026

>Teaching Video Generators to Remember: Eliciting Dynamic Memory for Out-of-Sight State Evolution
https://arxiv.org/abs/2605.25333

>Geometry-Aware Image Flow Matching
https://arxiv.org/abs/2605.25294

>D3S2: Diffusion-Guided Dataset Distillation for Semantic Segmentation
https://arxiv.org/abs/2605.25022

>LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV
https://arxiv.org/abs/2605.26244

>Timestep-Aware SVDQuant-GPTQ for W4A4 Quantization of Wan2.2-I2V
https://arxiv.org/abs/2605.27003

>SP-MoMamba: Superpixel-driven Mixture of State Space Experts for Efficient Image Super-Resolution
https://arxiv.org/abs/2605.25892

>Beyond Surrogate Gradients: Fully Differentiable Token Pruning for Vision-Language Models
https://arxiv.org/abs/2605.28051

>BEAT: Rhythm-Elastic Alignment for Agentic Music-guided Movie Trailer Generation
https://arxiv.org/abs/2605.27067

>The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
https://arxiv.org/abs/2605.26415

>Leveraging Visual Signals for Robust Token-Level Uncertainty in Vision-Language Generation
https://arxiv.org/abs/2605.27136

>Quaternion Self-Attention with Shared Scores
https://arxiv.org/abs/2605.24920

>Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering
https://arxiv.org/abs/2605.29881

>DiscoForcing: A Unified Framework for Real-Time Audio-Driven Character Control with Diffusion Forcing
https://arxiv.org/abs/2605.28491

>Unified Neural Scaling Laws
https://arxiv.org/abs/2605.26248

>How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
https://arxiv.org/abs/2605.30260

>Revealing the core dimensions underlying representations in brains, behavior and AI
https://arxiv.org/abs/2605.26921

>Noise Scheduling as Information-Guided Allocation in Diffusion Training
https://arxiv.org/abs/2602.18647
>>
File: 204909CUI_00001_.png (1.87 MB, 1192x1536)
1.87 MB PNG
>>
File: Wan2.2_i2v_00021_.mp4 (1.68 MB, 784x1024)
1.68 MB
1.68 MB MP4
>>108944012
>>
>>108944142
does anima know any of these girls?

1 girl, {laura kinney, green eyes, long hair, black hair, lace choker, small breasts, petite|korra, dark-skinned female, blue eyes, ponytail, brown hair, hair tubes, medium breasts, armlet|tifa lockhart, final fantasy, 1girl, red eyes, black hair, long hair, low-tied long hair, medium breasts, earrings|kasumi \(doa\), dead or alive, 1girl, brown eyes, brown hair, long hair, ponytail, large breasts}, nude, {dimples of venus, ass, from behind, rear view|side view, from side, nipples, sparse pubic hair|dynamic angle, sparse pubic hair, nipples}, white background, on chair, folding chair, ((sitting backwards)), {candid, looking ahead, looking to the side|looking at viewer, seductive smile, aroused, come hither, pinup pose}, by kiriyama taichi, watercolor (medium), traditional media, elegant, spread legs, nude modeling, straddling,
>>
Anima has so much more potential, and they've poisend their model.

- NSFW concepts are mapped partly to refusal vectors
- refusal vectors degrade the SFW concepts

both are clearly visible when generating with and without abliterated text encoder

If they were to continue training Anima now - say, to version 1.5 or so - but use an abliterated text encoder, the image quality would improve significantly.
annoy them with that, they can fix it just with finetune
>>
>>108944165
i can't tell if this is a legit complaint or you're just the schizo who always whines about anima
>>
File: q_o47cv2.png (862 KB, 1344x768)
862 KB PNG
>>
>>108944165
abliteration doesn't tell you anything how it was trained
>>
>>108944177
all of the complaints are legit. the screaming faggot that dismisses complaints is a tranny that should have killed themselves a long time ago
>>
so whats going to beat DiT
>>
>>108944207
autoregressive whenever they figure out how to fit it on our shitware
>>
File: 210154CUI_00001_.png (1.52 MB, 1192x1536)
1.52 MB PNG
>>108944153
What prompt do you use do get that animated splash art style?

>>108944158
you can search for them here
https://animadex.net/?mode=characters&q=laura
>>
>>108944226
thanks
>>
>>108944214
>autoregressive
nah
>>
File: 210937CUI_00001_.png (2.3 MB, 1192x1536)
2.3 MB PNG
>>108944239
ure welcome
>>
>>108944188
An NSFW prompt triggers a refusal vector, and this vector is mapped to Anima. However, since the image also contains concepts that exist in SFW -such as faces, hands, backgrounds, etc. - a mismatch occurs with the rest of the training data.

Anyone can test this.
If you use an abliterated text encoder and trigger a refusal vector with an NSFW prompt, everything that exists in SFW concepts improves, but everything that normally falls under refusal vectors degrades.
This training error affects the entire model due to the high proportion of NSFW content. It’s as if a large portion of its dataset were incorrectly captioned.
If you continue finetuning with abliterated textencoder, this mismatch disappears.
A lot of NSFW concepts are being pressed into just a few refusals vectors; this would also significantly improve the NSFW quality.

It’s not hard to understand.
>>
> >108944113
> >108944124
Fuck off
>>
File: 211606CUI_00001_.png (2.27 MB, 1192x1536)
2.27 MB PNG
>>
File: Wan2.2_i2v_00026_.mp4 (446 KB, 720x544)
446 KB
446 KB MP4
>>
I'm getting the hand of ZIM, if only the gen times weren't so bad
>>
File: bell-curve-4216979229.png (43 KB, 465x263)
43 KB PNG
>>108944325
> muh license
> can't generate Taylor Swift
> what is a picture style?
just why?
>>
File: 213135CUI_00001_.png (1.74 MB, 1192x1536)
1.74 MB PNG
>>
File: 213620CUI_00001_.png (1.9 MB, 1192x1536)
1.9 MB PNG
>>
File: Wan2.2_i2v_00027_.mp4 (647 KB, 720x544)
647 KB
647 KB MP4
>>108944226
i just prompt regular. Simple sentences. Maybe it's the source image. Maybe it's those lightning loras. They tend to loop and they tend to make videos that feel "stiffer" than regular videos. It's fast though.
>>108944350
That's a truly scary graph. Only 16% of people have an IQ above 115.
>>
File: 214046CUI_00001_.png (1.82 MB, 1192x1536)
1.82 MB PNG
>>
>>108944290
Nah bro
>>
>>108944192
>the screaming faggot that dismisses complaints
But is the developer himself, and now he is dismissing HF complaints about the Anima license in the Anima license thread. He is literally a bigger dramafag than Julien.
>>
>>108944459
he's a gigantic grifting faggot that scammed the community's good will
>>
Unimaginably raped ^
>>
>>108944312
nice
>>
>>108944165
35 star status?
>>
File: 215753CUI_00001_.png (1.19 MB, 1192x1536)
1.19 MB PNG
>>
>>108944459
>He is literally a bigger dramafag than Julien.
hey now that's a bit harsh anon
>>
whats a solid body proportion slider for klein 9b? i have this one tit slider thats not even on civitai anymore and seems to work at random, even at really high strength. klein's 99% pulling off this style and idea i have but all i need now is crazy near-hyper proportioned bodies.
>>
so biggest res anima can gen at natively? 1024 x 1536 seems to work
i dont want to fuck with finding a wf with highresfix
>>
>>108944613
2.7MP is close to the upper limit from my testing
>>
>>108944459
>"dismissing complaints"
I read that thread and others, this is basically what happened:
>russ makes commercial license deal with TensorArt
>Tensor immediately makes Anima a subscription-only model without his knowledge or approval
>whole Tensor community lashes out at russ, calls him a scammer, greedy
>russ says that legally speaking Tensor is allowed to do this but that he'll reach out to them for more info and that negotiations are still ongoing
>meanwhile a Tensor community manager lies about the nature of the license to direct more blame onto russ
>some people in the HF thread are now like "wtf you get PAID by these platforms for a license? you greedy FUCK!"
>russ attempts to explain how the licenses work and why it's like that, openly shares all the details
It's literally like 2 people screeching that the model isn't being given away for free combined with TensorArt being shady af as usual.
>>
>>108944433
>Only 16% of people have an IQ above 115.
This isn't a measure of absolute intelligence. IQ score is a relative measure, with 115 simply being one standard deviation. IQ tests are periodically renormed so that 100 always represents the current average. This means if you took a test standardized from 1950, the average person today would score noticeably higher than 100 on that older scale. Researchers estimate roughly 3 IQ points per decade of real gains over much of the 20th century.
>>
>>108944645
you forgot the part where artists aren't being compensated from a commercial model. SAI was taken to court over this and they weren't even commercial
>>
>>108944657
>the average person today would score noticeably higher than 100 on that older scale
the average person now is a brown. so no.
and despite the renorm, iq is lowering in certain countries, for no reason at all of course.
>>
>>108944645
Thanks for your feedback BigRuss, I love being gaslighted by you, the anime messiah.
>>
>>108944666
>muh artists
there's a whole lot of commercial or API-only imagegen models and they're all trained on copyrighted data
>>
the pedo is: IN
>>
>>108944694
nearly every single one is dealing with lawsuits
>>
i wish i was blue
>>
File: 222618CUI_00001_.png (974 KB, 1192x1536)
974 KB PNG
me rn (I'm 5'11)
>>
>>108944666
the creators of art resources aren't being compensated when artists pirate their works either
>>
any news on big tunes of anima incoming?
what about illust and noobai new versions?
>>
>>108944730
U're darn tooting! We're gonna be alright right
>>
File: 223335CUI_00001_.png (845 KB, 1192x1536)
845 KB PNG
ain't nobody else gonna post slop

>>108944730
>what about illust and noobai new versions?
it's joever
>>
personally, I hate the commercialization of the local space
>>
>>108944698
>every single one
Don't fall for this narrative. If you have money, it's okay. If you don't, you're screwed.
Look at Seedance 2.0. Do you really think Hollywood pushed to censor it over copyright?
No, it's all about control. Guaranteed they have access to the full uncensored model internally.
>>
>>108944771
>commercialization of the local space
when did this start?
>>
>>108944772
it sounds like russ might be in some trouble if he gets sued
>>
>>108944783
he's basically untouchable with comfy backing him up
>>
>>108944780
I want to say tencent but it was probably SAI around sd3
>>
>>108944767
What is the legitimate purpose for posting multiple AI generated outputs if it's not just spam or shilling?
>>
>>108944789
no they would throw him under the rug. they censor complaints on the reddit. They are avoiding anything that would harm their valuation
>>
>>108944798
You would understand if you ever actually genned any images.
>>
File: 224021CUI_00001_.png (1.84 MB, 1192x1536)
1.84 MB PNG
>>108944798
I was under the impression that this was the designated spam/shilling ground.
>>
i mainly like genning anime girls and boys.
Are those anima checkpoints nice? Any reason to use them over sdxl?

btw what are your favourite checkpoints rn bros?
>>
>>108944828
If you are doing something complex with art styles I think it's worth it. Otherwise just stick with shitmixes if you want to keep it fast and simple
>>
We've been over this before. Spam is good. We need people constantly spamming images.
It solidifies the impression that AI is the hip new thing.
It's very current year
>>
>>108944835
Complex like what? I just like making nice fantasy portraits for my cards.
>>
>>108944845
just stick with Illustrious based models. If you inpaint then you would still need to use ill anyways
>>
>>108944837
Hip to be square. I have to return some sissy-hypno tapes
>>
File: 225607CUI_00002_.png (863 KB, 1192x1536)
863 KB PNG
leave the image limit to me
>>
>>108944883
>>108944816
>>108944767
>page4
Whatever BigRuss, you are killing the already dead /ldg/ thread monopolizing the conversation on only one model for almost half a year.
At least last year with Chroma we had more varied discussions, we used Qwen Edit and WAN. You don't even use Klein Edit to edit your gens nor do you change models even though you know there are better models that can also achieve good illustrations.
Congrats, enjoy the party alone faggot.
>>
File: Wan2.2_i2v_00038_.mp4 (662 KB, 816x1024)
662 KB
662 KB MP4
>>108944883
kek. Who's the stiff?
(she was supposed to blow the smoking barrel. oh well)
>>
File: 1778652524263722.png (1.45 MB, 1200x900)
1.45 MB PNG
>be looking at porn on civitai
>look at the prompt
>"The character in image 1"
>>
File: 3views.png (633 KB, 1024x768)
633 KB PNG
What's the most reliable option to generate back and side views of an already generated character? Editing? Outpainting with lora? Mesh generators do this, so I expect there is already a pre-existing solution
>>
File: 225812CUI_00001_.png (1.2 MB, 1192x1536)
1.2 MB PNG
>>108944994
how long does it take to gen these videos?
>>
>>108945032
220 - 240 seconds. I was just marveling at this fact right now. When this all started I was using those online sevices that gave you like one free gen a day but they later took that away.
When Wan came out it was fantastic but it took like half an hour for one 5 second video.
Now I'm sitting here genning a pretty high res video in 4 minutes. Pretty damn good if you ask me. It sounds like my PC is taking off and the temps hit 80 degrees C but that's okay.
I'm wondering if PID will make genning even more attractive but I still need to figure out the how of it.
>>
>>108945015
i ask klein 9b to do a front side and back perspective while feeding it all the reference images i have at various angles
>>
File: generated_videoNA.mp4 (1.41 MB, 464x688)
1.41 MB
1.41 MB MP4
>>108945073
What's your card? I'm sitting on a 12gb AMD GPU and Qwen and ZIT both shat the bed, so I'm not even gonna try messing with video gen. Back when Grok was free I used to play with it and that thing was bonkers. Video related.
>>
>>108945116
I have a 4090 with 24 GB. I consider myself very fortunate. I decided to buy soon after SD was released. It was pretty expensive at the time and I considered waiting, glad as hell now I didn't 'cause... well you know why.
You can run a Q4 or Q5 4B WAN 2.2 with your card but unfortunately won't be able to use the 14B WAN model with your 12GB. I've had some good results with 4B but my main gripe with it is that it will often lose the character's likeness in I2V gens, especially in fast moving video. Otherwise it's pretty fast, much faster than 14B
>>
>>108945116
you can run anything on 12gb it will just take some more time
wan2gp for video
comfyui for image gen
use Q6-Q8 GGUF quants of models and thats it
>>
File: jokka.jpg (40 KB, 1280x720)
40 KB JPG
>throw fuckhuge dataset into civitai not sure if they even train klein 9b loras
>They do
>mfw it fucking works right from the first epoch
>but the server went down
>>
File: 000209CUI_00001_.png (1.71 MB, 1192x1536)
1.71 MB PNG
>>108945209
>>108945226
Maybe I'll try again later. Since I'm new to comfy, there's a chance I'm messing up basic stuff.
>>
>using wan in the ltx era
shiggydiggy
>>
finally got around to trying JANIMA, good shit
https://civitai.red/models/2642932/janima?modelVersionId=2967640

the creator posted a gofile link to the model a few threads back
>>
>>108945298
What does it do better than anima 1.0?
>>
>>108945306
J stuff
>>
>>108945306
im not a weebnigger who compared all the previous tranime models but this feels similar to what illustrious did to its base model it was trained on when it came out, only to a lesser degree as this is a smaller lora obviously

basically i dont have to prompt a novel to get something that focuses on the subject while still putting the subject in a much more natural, creative scene/pose etc

>>108945298
found the link
https://gofile.io/d/jo1lcf
>>
>>108945328
also has more details etc
>>
File: ComfyUI_02158_.jpg (866 KB, 1474x1105)
866 KB JPG
>>
>>108945328
>found the link
>https://gofile.io/d/jo1lcf
do base loras work on it? the ones I tried didn't
>>
>>108945298
>>108945306
extract it into a LoRA and then say "nothing, it's just a big LoRA"
problem solved
>>
>>108945328
>finally! an anima shitmix that resembles illustrious!
heh
>>
This website is lowk boring as fuck.
>>
>>108945412
fr
>>
https://github.com/AdamNizol/ComfyUI-Anima-Enhancer
Thoughts? It kinda works but also makes everything look washed out.
>>
>>108945421
how is the fr pronounced?
>>
>>108945444
like fur
>>
so the main UI for diffusion is corpo enshittified, nobody called out Russ over his greedy licence in time, trainers are more incentivised to train with commercial models and nobody who develops comes here anymore. what can I actually look forward to because this space ended up getting worse after comfyui got funding
>>
35
>>
>>108945490
That's how things work. Something is good at first but commercialism creates enshittification.
>>
>>108945501
sad we don't have a blender equivalent for ml at this point
>>
File: 005727CUI_00002_.png (2.04 MB, 1192x1536)
2.04 MB PNG
>>
>>108945511
Maybe one day.
>>
>>108945490
code is just a means to an end. in 5 years you will be able to make any ui you want with 1 prompt
>>
>>108945847
then why would anyone invest in cumfart if that was the case?
>>
>>108945850
because you can earn the money now
>>
File: 4322456.webm (908 KB, 256x448)
908 KB
908 KB WEBM
OH N-
>>
>>108945859
but if I can replace cumfart in one click what's the point?
>>
>>108945871
because you cant make any ui u want right now in one click and have it work.
what part of this simple conversation arent you getting?
>>
File: ComfyUI_00635_.png (576 KB, 896x1152)
576 KB PNG
>>
File: ComfyUI_00636_.png (567 KB, 896x1152)
567 KB PNG
>>
what is hotter for horse cock
loli futa
curvy futa
tomboy muscular futa?
>>
>>108945865
wtf is that real?
>>
File: lamour haunted mesa.png (1.47 MB, 1219x2000)
1.47 MB PNG
>>108943691
Basic prompt strategy for worlds within world gens? It reminds me of picrel.


>>108943138
>video
>actually not shit
huh
>>
>wan2.2
>prompt character walking into a different room or towards the viewer, does it
>ltx2.3
>same thing, character just stands there while ambient noises play in the background
ltx's prompt adherence is fucking AWFUL holy shit.
>>
>>108946003
Then when you try to add end frames you get a PowerPoint gen, kek.
>>
Is there a Moore's Law for AI?
>>
>>108946047
yes. moore parameters!
>>
>>108946031
it really is awful for no reason but wan takes too goddamn long for 5 seconds.
>>
>>108946047
clearly not if there is still retarded shit still being hallucinated
>>
>>108946047
Yes, we call it the Law of More-ish. It states that every six months, AI will become twice as confident in its hallucinations while requiring four times as many GPUs to explain why it cannot browse the live web, simultaneously forgetting how to count the number of Rs in the word strawberry
>>
The number of crappy Loras will double every six months.
>>
It's been really funny seeing some lora creators say Anima is the future and then going back to Illustrious uploads after their Anima loras barely get engagement.
>>
>>108946047
Yep, the memory law:
>Every 12 months, llm memory requirements are halved, for the same performance.
>>
>>108946089
(and if this is true, it has profound implications for retro computing)
>>
File: 36477.webm (1.04 MB, 256x448)
1.04 MB
1.04 MB WEBM
>>108945973
yeah it happened a few hours ago
>>
File: z-image-turbo_00052_.png (1.22 MB, 1024x1024)
1.22 MB PNG
>>
>>108946104
I love how they run around randomly lmao
>>
File: z-image-turbo_00053_.png (1.07 MB, 1024x1024)
1.07 MB PNG
>>
File: ANIMA_bface_bad_00002_.png (1000 KB, 832x1216)
1000 KB PNG
>>
>>108946113
i think when i prompt "frantic running" then it does that. it makes it look less choreographed though
>>
>>108946047
is google offline?
https://en.wikipedia.org/wiki/Neural_scaling_law
>>
>>108945412
You probably shouldn't have ruined it then. Oh well. You live and learn.
>>
File: 4568468566.webm (1.75 MB, 256x448)
1.75 MB
1.75 MB WEBM
>>
>>108946047
Moore's Law isn't real because it didn't account for real life physics for some reason. Things always grow exponentially with a good base, then plateaus at a certain point when you reach the physical limit of how it could grow. Same thing will happen with AI. We're still in the blow up phase and will probably take a few more decades before it actually plateaus.
>>
if anima is just... plain superior to illustrious sdxl checkpoints. Is there any point to using it still?
Are there *any* downsides to anima?
>>
Hey there!

I was contemplating the idea of opening an OnlyFans account, pretending to be a girl, and posting full body and feet pics

I've messed around with generative AIs to generate cartoon/anime girls but I've not messed with realistic images

Which kinda tools would you recommend for this task? Extra points if ComfyUI compatible
>>
>>108946202
gm saar
>>
>>108946047
Yes, every three months the benchmark scores double
>>
>>108946202
how will you verify yourself, fag8?
>>
>>108944883
BASED
>>
What should I gen? Do you have any ideas, anon?
>>
>>108946218
Don't know about the inner works of OnlyFans, never had an account, that's why Im asking

Wouldn't I be able to pass the verification with the AI pics themselves? Is OnlyFans really not permitting AI content?
>>
>>108944988
holy raped retard
>>
>>108946202
lil indian bro you are 4 years late to this grift
>>
File: ANIMA_bface_bad_00003_.png (1.26 MB, 832x1216)
1.26 MB PNG
>>108946122
thanks to anon who pointed out this lora exists.

>>108946125
doing wan or lxt?

>>108946202
die of cancer <3
>>
>>108946232
you'll need a government ID for that
>>
>>108946238
It's like phone scams, it's here forever, now. picrel.
>>
>>108946248
They're available in bulk for cryptocurrency.
>>
>>108946202
What about Fansly?
>>
>>108946202
you can immediately spot these grifts in the wild because they all use SaaSlop so since you're new try out z image turbo
>>
>>108944113
>>108944124
thanks!
>>
>>108946202
Why do you fags do this instead of making an openly AI RP account that's just a virtual personality? One at least has some creative merit to it and you could even still do lewd porn of the character. The other you're just deceiving and being a faggot.
>>
>>108946301
>Why do you fags do this instead of making an openly AI RP account?

Because the idea is to make money
>>
>>108946240
>thanks to anon who pointed out this lora exists.
Which one?
>>
>>108946301
I'm curious too. I mean virtual tubers make money in spite of the fact that everybody knows it's literal troons behind it.
>>
>>108946202
>Indians wasting time doing this type of redeem saars shit instead of 10x the money by riding the AI rocket.

ngmi
>>
File: ANIMA_bface_bad_00005_.png (967 KB, 832x1216)
967 KB PNG
>>108946240
>>
>>108946318
https://civitai.com/models/2661055/realitymix-anima?modelVersionId=2988046
>>
>>108946337
>urban terrorist Jesus
>>
File: ANIMA_bface_bad_00006_.png (878 KB, 832x1216)
878 KB PNG
>>108946337
>>
>>108946345
he used to be a rural terrorist back then
>>
>>108946436
kek
>>
>>108946115
>>108946109
need more cute bois itt
>>
is comfyui getting native trellis?
>>
File: debo_anima_00034_.png (2.54 MB, 2048x1117)
2.54 MB PNG
>>108946485
>>
>>108946490
https://huggingface.co/Comfy-Org/TRELLIS.2

related. idk
>>
The reason I am glad to hear about local trellis is the custom nodes seem to be trash, for trellis.
>>
>>108946515
isnt pixal3d better?
>>
>>108943765
when we get I2v which run on basic Low end hardware ?

Never !!!

That is main point behind whole
aka Jensen AI operation !
>>
>>108946533
Others say no, not really, not when you rotate the object. But maybe for limited rotation?

I have never succeeded at getting trellis to work on my rdna2, so like I said, looking forward to it.
>>
>>108946534
what does low end hardware mean?
>>
File: 1765323030555243.png (3.4 MB, 1728x1536)
3.4 MB PNG
>>
>>108946534
>>108946549
I actually mostly don't like video (as in, in the whole world of cinema and tv etc), but this one was cool:
>>108943138
>>
File: 07545-3098156157.png (450 KB, 512x560)
450 KB PNG
I don't see a template for Trellis on Comfyui
>>
>>108946152
chad general standing directly underneath the helicopter with his hands on his hips
>>
File: desolation.png (1.58 MB, 1400x704)
1.58 MB PNG
>>
>>108946549
Low end Hardware Is pretty clear definition!
But Ok ...

You need clear definition...
Low end notebook which can run Cyberpunk on 30FPS.

Enough?
>>
File: ComfyUI_00638_.png (726 KB, 896x1152)
726 KB PNG
all grown up, eh?
>>
>>108946572
that sounds high end to me
>>
File: FK9B__00001_.png (1.43 MB, 832x1216)
1.43 MB PNG
>>108946426
I think non-q8 base flux 9b is better than q8. not sure yet though if it matters. the qwen I am using is the q8, not sure if that's okay either.

Gonna try kv.

My gpu is slow, so eh.
>>
Anon?
>>
>>108945209
> but unfortunately won't be able to use the 14B WAN model with your 12GB
This is not true. Real constrain is ram - it needs at least 32gb and even with 64gb ComfyUI will OOM if you change loras or their strength few times.
>>
>>108943765
I need ask this question guys It is only ME?

Maybe some of you help me confirm my suspicions ...

So... lately I get finally some free time to start looking into fundamental design of AI Specially I2V one .
It always scratched back of my brain with question
How is possible that this thing run so Inefficiently when Basic GPU from 10 years ago was able doing same job 30FPS with just 512 MB/1 GB V ram and 2 to 4 GB of Main Ram

And after Month or so of thorough research It start clearly looking that whole thing is designed to run slow!

And whole thing lead to Samsung and Nvidia
It even looks that All those AI companies Altman Group was founded by Both companies back in 2015 Strangely Both Companies are 2 biggest investors behind Iron Man Movie AI propaganda.

Now ... Am I too Paranoid ?

Or there is more people here who look behind curtain an can see what is behind.

Worst part is what is coming ... but For now I am not going elaborate on that!

Your thoughts on that Guys...?

Am I too Paranoid?
>>
>>108946679
you're not an artist so your opinion doesn't matter
>>
I fingerpaint in my spare time but I can only get off to my own gens now
>>
File: 1767285601661480.png (3.03 MB, 1728x1536)
3.03 MB PNG
harder to make cool wizards than it is cool knights
>>
File: 1775607026220251.png (3.98 MB, 2176x1216)
3.98 MB PNG
>>
The lack of various kinds of amputation loras for anima is disappointing. There are different kinds, and all should be supported.
>>
File: ComfyUI_00639_.png (731 KB, 896x1152)
731 KB PNG
>>108946807
just prompt "amputee" no?
>>
File: ComfyUI_00643_.png (1.27 MB, 896x1152)
1.27 MB PNG
>>
File: cutie.png (1.65 MB, 832x1216)
1.65 MB PNG
>>
>>108946810
I will need to try more/again, with v1, but I'm sure it's not gonna be great.

types of amputations:
1. no stump
2. stump, various lengths
3. at elbow/knee (so some % of the joint)
4. below elbow/knee
5. at wrist/ankle (so some % of the joint)

so there are 4 limbs and 5 types, so you have 25 types, but some of them can be mirrored.

Some combos have specific names.
>>
>>108946859
nice
>>
>>108946859
Cool
>>
Am I getting it wrong? Although flux klein 9b "kv" version is faster, it seems worse.
>>
the kino reactor has melted down. i'm seeing lines across my screen
>>
File: ComfyUI_00646_.png (1.14 MB, 896x1152)
1.14 MB PNG
prison arc
>>
File: ComfyUI_00647_.png (1.21 MB, 896x1152)
1.21 MB PNG
>>
>>108947018
:( almost amputated.
>>
At this point, ai models need to be tagging everything for cropping, so the model knows how to not crop the subject unless specified.

I should never gen a cropped subject.

but, no model actually understands the idea of "subject"
>>
File: self pleasure.png (1.35 MB, 1024x1024)
1.35 MB PNG
>>
File: 1767046141391763.mp4 (1.1 MB, 496x768)
1.1 MB
1.1 MB MP4
>>
>>108945902
very cool
>>
File: 1762430374196139.png (1.27 MB, 896x1152)
1.27 MB PNG
>>
File: 1775111421428060.png (1.35 MB, 896x1152)
1.35 MB PNG
>>
>>108947065
there's an anima g string lora. I haven't downloaded it. I'm not sure if I need it.
>>
File: ComfyUI_00653_.png (1.87 MB, 896x1152)
1.87 MB PNG
>>
>>108946058
quality
>>
>>108944417
catbox or name of art style please? I know it's anima. Looks really nice.
>>
>>108944444
checked.
>>
https://files.catbox.moe/jcgf9k.mp4
>>
>>108947057
Badass
>>
show yours

(sigmas)

This is what I'm playing with.
>>
>>108947192
and this is with flux 9b base bf16, and with the unquant qwen.

I find about 50% was body horror with more of s a slope at the start.

but will this fix it? idk, working with it. Every model has its ideal zone, and slopiness.
>>
File: 3548645.webm (357 KB, 256x448)
357 KB
357 KB WEBM
>>
File: 347597596.webm (1.52 MB, 448x256)
1.52 MB
1.52 MB WEBM
>>
File: 1573897305298.jpg (12 KB, 257x294)
12 KB JPG
Do you need some lora for realism with Anima or is it just about proooompting and anime in negs?
>>
Anyone using LLMs to make prompts? Any suggestions what to include in system prompt or how to work with it? Anyone has prompt for Anima, that is just about describing the scene and characters, not style?
>>
>>108947277
What goes in people's heads, when they pick a model that is explicitly not meant to be used for realism, and want to do realism with it?
>>
>>108947277
I use the one I mentioned
>>108946343
there's also the lenovo one. And he says to use some custom node. I haven't. I'm not a serious prompter.

>>108947285
with anima, if you are doing anime you need to have @joewhatever

where joewhatever is your artist.

https://tagexplorer.github.io/#/artists?tagFilter=
>>
why DO people gen realistic shit anyway? you're either boring asf or a pedo
>>
>>108947291
We had niggas dumping 3dpd made in anima so I'm curious.
>>
>>108947296
half of the images in the battlestation thread are my ai images and no one has even realized it yet
>>
>>108947302
based
>>
File: ANIMA_bface_bad_00001_.png (1.15 MB, 832x1216)
1.15 MB PNG
>>108947291
:^)
>>
>>108947333
box onegai? I need some template
>>
>>108947291
because realism models don't know/can't do what Anima does out of the box
>>
File: ANIMA_bface_bad_00002_.png (1.76 MB, 832x1216)
1.76 MB PNG
>>108947333
slow gpu, don't care.
>>
File: 48556.webm (2.1 MB, 256x448)
2.1 MB
2.1 MB WEBM
kino audio at the end https://files.catbox.moe/dgtwrt.mp4
>>
File: ANIMA_bface_bad_00003_.png (1.88 MB, 832x1216)
1.88 MB PNG
>>108947346


>>108947338
magic workflows don't exist.

You need
0. luck. everyone cherrypicks. Think most photos taken by Bresson were keepers?
1. good prompting, knowing how many gens to gen before trying changes, too.
2. sampler choice, hinges on model, also the content (because different content emphasizes different size stuff), and number of steps
3. sampler matters, but like euler is a standby for a reason. I suggest exp_heun_2_x0_sde
4. a lot of people use Auraflow, which despite where it goes, has the effect of just modifying the scheduler.

scheduler just supplies a literal ass list of numbers starting with 1.0 and going down to 0.0, except maybe not including 1.0 or 0.0, cuz of reasons
>>
File: ANIMA_bface_bad_00004_.png (1.61 MB, 832x1216)
1.61 MB PNG
>>108947374
behead all satans
>>
>>108947379
a man with a wild smile on his face holds a bloody sword midswing in front of three beheaded demons, whose heads are flying midair amidst much blood.

TRIGGER


my negative, which desu is just junk I accumulated
anime, cartoon, anime style, pointy chin, sharp chin, cleft chin, chin cleft, exaggerated chin, plastic surgery, deformed chin, unnatural chin, perfect skin, flawless skin,

ugly feet, bad feet, ugly hands, bad hands, worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, sepia


I just got a crappy gen. so, I won't post it. That's how it is, some gens are more like you get 1 good one out of 20.
>>
>>108947379
anti-semitic dogwhistle
>>
File: ANIMA_bface_bad_00006_.png (1.33 MB, 832x1216)
1.33 MB PNG
>>108947379
>>
i never tried using negative prompt. i will try it
>>
File: 35432745.webm (1.99 MB, 256x448)
1.99 MB
1.99 MB WEBM
https://files.catbox.moe/aohowd.mp4
>>
>>108947462
why
>>
>>108947462
Kino time! I'm grabbing my popcorn!
>>
>>108947465
cause they are our greatest ally
>>
>>108947135
@lynune
>>
>>108947413
stole it from wherever google says, I guess
>>
File: 468356.webm (1.89 MB, 256x448)
1.89 MB
1.89 MB WEBM
>>108947469
https://files.catbox.moe/k7xzmh.mp4
>>
>>108947462
Did you try giving it a prompt, for example, an excerpt from a battle sequence in Ernst Jünger's Storm of Steel?
>>
>>108947505
no, and it probably wouldn't be very good unless you fully renovated it to fit the ltx prompting style
>>
>>108947493
these are boring, could you at least give the choppers jiggling big naturals?
>>
File: ANIMA_bface_bad_00007_.png (1.68 MB, 832x1216)
1.68 MB PNG
Batch of EIGHT bangers
>>108947411
>>
File: ANIMA_bface_bad_00008_.png (1.16 MB, 832x1216)
1.16 MB PNG
>>108947529
*ALL* satans LINE UP!!!

>>108947529
btw he's the famous double lefty. always takes satans by surprise.
>>
File: ANIMA_bface_bad_00009_.png (1.7 MB, 832x1216)
1.7 MB PNG
>>108947532
>>
>>108947539
Wog
>>
File: ANIMA_bface_bad_00010_.png (1.67 MB, 832x1216)
1.67 MB PNG
>>108947539
>>
File: ANIMA_bface_bad_00011_.png (1.58 MB, 832x1216)
1.58 MB PNG
>>108947548

>>108947547
https://www.youtube.com/watch?v=B-vTgehex7E
>>
File: ANIMA_bface_bad_00012_.png (1.61 MB, 832x1216)
1.61 MB PNG
>>108947554
>>
File: ANIMA_bface_bad_00013_.png (1.71 MB, 832x1216)
1.71 MB PNG
>>108947557
I think that action is a surprising result of real anima
>>
File: 246245544.webm (1.76 MB, 256x448)
1.76 MB
1.76 MB WEBM
>>108947525
>boring
wrong way of spelling KINO
https://files.catbox.moe/hm5u36.mp4
>>
File: 094608CUI_00002_.png (1.69 MB, 1192x1536)
1.69 MB PNG
>>
>>108947563
still no big naturals
>>
File: ANIMA_bface_bad_00014_.png (1.65 MB, 832x1216)
1.65 MB PNG
>>108947560
>>
File: Enough from the clown.jpg (59 KB, 1122x799)
59 KB JPG
>>
File: 246255566.webm (2.64 MB, 256x448)
2.64 MB
2.64 MB WEBM
>>108947579
i don't know what that means
https://files.catbox.moe/ukw84r.mp4
>>
>>108947593
too late. already your demons are abandoning you. Soon, you will be unable to perform 2 digit multiplication.
>>
>>108947493
FIRST-PERSON POV, world war I, aerial combat over Athens, Greece, seen through the eyes of a fighter pilot. Camera mounted inside cockpit of a Hawker Hurricane. Golden Mediterranean spring morning, crystalline air, pale blue sky above ancient Athens. Twelve Hurricanes flying in tight formation, wings nearly touching, sunlight flashing from aluminum surfaces. The Acropolis and Parthenon drift beneath through thin haze, marble ruins glowing gold.

The camera remains locked to the pilot's perspective. Slight vibration from the Merlin engine. Gloved hands gripping controls.

Flight leader ahead, small silhouette against the sun. The formation wheels over Athens. Below, white buildings, olive groves, distant mountains.

Suddenly the sky fractures.

Dark specks appear high above. Then dozens. Then hundreds. German Bf 109s and Bf 110s diving from the sun. Enemy fighters pour downward like a steel storm. Contrails, sunlight glinting from canopies, aircraft multiplying.

Camera whips violently. Formation dissolves. Hurricanes move away in every direction. Near misses measured in feet. Machine-gun tracers crossing from every angle. The pilot jerks the aircraft into hard turns, horizon spinning, Athens revolving below like a giant map.

A Messerschmitt fills the gunsight. Trigger pressed. Muzzle flashes. Empty shell casings tumbling into sunlight. Another fighter flashes across frame. Another. Another. Aircraft everywhere.

sunlight caught on a propeller blade, oil streaks across the windscreen.

The aircraft collide almost accidentally through overcrowding.
Gradually the storm thins.
>>
whats with the schizoposting
>>
>>108947604
why all these shitty gens?
>>
File: 924318256.gif (82 KB, 498x468)
82 KB GIF
>why all these shitty gens?
>>
>>108947629
schizo
>>
>>108947609
ltx is very bad at planes for some reason, it makes every plane look like this https://unidentifiedphenomena.com/topics/multi-pointed-star-shaped-ufos/
>>
>>108947623
Fuck off schizo
>>
>>108947671
schizo melty
>>
>>108947672
Schizo
>>
>>108947686
guess you ran out of shitty chopper gens
>>
>>108947609
https://files.catbox.moe/8el1tw.mp4
you are going to need to inject some reference frames so it knows how the planes are meant to look like, and the greek buildings too
>>
>>108947698
nice little ambient tune there before the aids sound
>>
>>108947698
Thank you, anon. This is very interesting, I can't wait for a future where we can watch our books or recreate scenes like that.
>>
>>108947697
Fuck off dude who the fuck do you think you are?
>>
>>108947729
fuck off with your shitty gens
>>
>I don't like free things
ahahahah oh you sweet summer retard
>>
>>108947737
Shitty for you worthless faggot.
>>
>>108947719
yep. soon i'll try making a war kino that involves being a pilot in a dog fight and figure out what kind of prompt i need for ltx to make it look good, and then i can try to make it match your script
>>
File: 102403CUI_00001_.png (2.01 MB, 1192x1536)
2.01 MB PNG
>>
>>108947737
You're not the boss of this place.

I am.

I'm the king of around here.
>>
>>108947743
You are unable to explain how those shitty schizo gens have value.
and that coward denying me my Yous loves picking up dog shit off the ground and shoving it down his throat, it is free after all
this place is better when you bully schizo shit genners away
>>
Looks like someone is getting angry when we are not talking about his worthless anime model 24/7.
>>
I'm the boss.
>>
just download ltx and make some kinos then
>>
take it to /sdg/ you faggots
>>
ltx can't make kinos
>>
europoor hours are weird.
>>
>>108947761
I wont use Anima, and you can fuck yourself and find a real job Turd Rusell
>>
if you're not gonna have any actual discussion then atleast post gens
>>
how I sleep knowing I'm not retarded enough to spam the same shitty gens with shitty audio
>>
>>108943765
https://rentry.org/tdrusell
>>
File: 3473545687.webm (1.88 MB, 256x448)
1.88 MB
1.88 MB WEBM
https://files.catbox.moe/nimw75.mp4
>>
>>108947786
are you okay anon? you keep posting the same file over and over
>>
>>108947786
Based and freethinker pilled.
>>
I need to win the lotto.

1. pc for gaming
2. pc for local llms (useful)
3. pc for local llms (girlfriend)
4. pc for 3d modeling
5. pc for genning images
6. pc for genning images (anima / anime)
7. pc for genning music

I need these all at once. Each one, correctly configured, is at least $3,000. So just $21k in lotto, is that too much to ask?
>>
>>108947795
are you okay anon? you keep posting the same 1girl cowboy shot over and over
>>
File: 4525467.webm (1.69 MB, 256x448)
1.69 MB
1.69 MB WEBM
>>108947800
the kinos will flow
https://files.catbox.moe/lyxn4a.mp4
>>
>>108947809
>the kinos will flow
where?
>>
>>108947815
into yo mama ooooohhhhhhhh *air horns*
>>
>>108947815
Fuck off
>>
>>108947809
at least use litterbox
>>
>>108947826
that's where those gens belong yeah
>>
>>108947837
How about you kill yourself slowly and painfully?
>>
>>108947853
that's what looking at those repetitive gens is doing to us, yeah, we should stop
>>
File: 105500CUI_00002_.png (1.38 MB, 1192x1536)
1.38 MB PNG
>>
monstergirl hour
>>
you mean sdxl (ai-generated:1.9) slop hours
>>
brown hours
>>
>>108947971
How about you get in the truck.
>>
>>108943765
Is there any controlnet similar to CN-anytest for Klein?
>>
>>108948033
why don't you trust the clip? clip knows best.
>>
>>108947969
u guys dont post attractive females anymore
>>
why is anon melting
>>
Anyone using CFGnorm with anima?
>>
isn't this much nicer without the shitty gen spam?
>>
File: ComfyUI_00658_.png (1.03 MB, 896x1152)
1.03 MB PNG
>>
>>108948244
>>108948244
>>
>>108947803
get a dgx spark
>>
>>108948051
Because those of us that use to got hated by the mentally ill faggots that only ever post their cartoon slop. Just look at the articles section over at civitai, its full of these mentally ill freaks, they've infected everything with their filth.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.