/g/ - Technology


Thread archived.
You cannot reply anymore.




Pride, Ego, Autism, Edition

Discussion and Development of Local Image and Video Models

Previous: >>108550008

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: ska.webm (998 KB, 512x896)
Should I look at this and think, "Yep! This is the ceiling, right here. It doesn't get any better than this." Or, is there much more room for improvement?
>>
Could just be my imagination, but I think the OP collage is biased towards anima. Surely that wasn't the only thing being posted in the entire thread.
>>
>mfw Resource news

04/07/2026

>Anima preview3 released
https://huggingface.co/circlestone-labs/Anima#preview3

>FrameFusion Image Interpolation: Compact image interpolation model for generating in-between frames
https://github.com/BurguerJohn/FrameFusion-Model

>An Inside Look at OpenAI and Anthropic’s Finances Ahead of Their IPOs
https://www.wsj.com/tech/ai/openai-anthropic-ipo-finances-04b3cfb9

>PrismML debuts energy-sipping 1-bit LLM in bid to free AI from the cloud
https://www.theregister.com/2026/04/04/prismml_1bit_llm

>ComfyUI Hires Fix Ultra - All in One
https://github.com/ThetaCursed/ComfyUI-HiresFix-Ultra-AllInOne

>ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity
https://github.com/hwang-cs-ime/ATSS

>1.x-Distill: Breaking the Diversity, Quality, and Efficiency Barrier in Distribution Matching Distillation
https://thu-accdiff.github.io/1.x-distill-page

>Your Pre-trained Diffusion Model Secretly Knows Restoration
https://sudraj2002.github.io/yptpage

>Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
https://github.com/lijun2005/CVPR26-DreamPRVR

>A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
https://deltatok.github.io

>SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
https://github.com/EasonXiao-888/SpatialEdit

>OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
https://github.com/OpenDCAI/OpenWorldLib

>KupkaProd Cinema Pipeline: Powered by LTX 2.3
https://github.com/Matticusnicholas/KupkaProd-Cinema-Pipeline

>Wan VACE Prep: ComfyUI nodes for video editing workflows
https://github.com/stuttlepress/ComfyUI-Wan-VACE-Prep

04/06/2026

>UNICA: A Unified Neural Framework for Controllable 3D Avatars
https://github.com/zjh21/UNICA

>WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models
https://github.com/SAI-Lab-NYU/WSVD
>>
>mfw Research news

04/07/2026

>Erasure or Erosion? Evaluating Compositional Degradation in Unlearned Text-To-Image Diffusion Models
https://arxiv.org/abs/2604.04575

>Beyond Few-Step Inference: Accelerating Video Diffusion Transformer Model Serving with Inter-Request Caching Reuse
https://arxiv.org/abs/2604.04451

>Training-Free Image Editing with Visual Context Integration and Concept Alignment
https://arxiv.org/abs/2604.04487

>Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning
https://arxiv.org/abs/2604.04746

>ExpressEdit: Fast Editing of Stylized Facial Expressions with Diffusion Models in Photoshop
https://arxiv.org/abs/2604.03448

>Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
https://hyunsoocha.github.io/vanast

>KiToke: Kernel-based Interval-aware Token Compression for Video Large Language Models
https://arxiv.org/abs/2604.03414

>DiffSparse: Accelerating Diffusion Transformers with Learned Token Sparsity
https://arxiv.org/abs/2604.03674

>Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models
https://arxiv.org/abs/2604.03556

>OP-GRPO: Efficient Off-Policy GRPO for Flow-Matching Models
https://arxiv.org/abs/2604.04142

>AvatarPointillist: AutoRegressive 4D Gaussian Avatarization
https://kumapowerliu.github.io/AvatarPointillist

>The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models
https://arxiv.org/abs/2603.27139

>Beauty in the Eye of AI: Aligning LLMs and Vision Models with Human Aesthetics in Network Visualization
https://arxiv.org/abs/2604.03417

>ITIScore: An Image-to-Text-to-Image Rating Framework for the Image Captioning Ability of MLLMs
https://arxiv.org/abs/2604.03765

>Banana100: Breaking NR-IQA Metrics by 100 Iterative Image Replications with Nano Banana Pro
https://arxiv.org/abs/2604.03400
>>
>>108553813
nah it can't be, because anima is DOA and no one is using it
>>
>>108553789
kinobox for the braphog in top right?
>>
can someone please change the writing on her shirt to debian?
>>
File: 1583345919393.jpg (113 KB, 990x990)
TDRussell... please... I'm begging you... I want to see oversized insects get absolutely destroyed by futa's with huge cocks. Put it in the dataset, homie.
>>
>>108553861
i bet anima can do that with a good enough prompt
>>
File: Oh nyo.jpg (62 KB, 720x347)
>>
File: 20230314_171527.jpg (5 KB, 192x159)
>>108553868
I only tried, several, times with Preview2 and all I could get is is the futa rubbing up against their abdomen. I'm talking about t2i, i2i works but that's more effort on my part.
>>
>>108553833
wow, a thot video just like millions of others
>>
>>108553878
>all I could get is is the futa rubbing up against their abdomen
Proof?
>>
>>108553869
fuck its over...just deleted anima and resubbed to NAI
>>
>>108553858
Not a request thread and I am too busy testing newest anima rn, but I will make a quick Klein gen later if no one else responds.
>>
>>108553886
Don't judge me, anon, please, my heart can't handle it.
https://files.catbox.moe/wzbh2c.png
>>
>>108553916
>no metadata
Let me help you anon provide an actual futabox
>>
File: 4752158635.jpg (137 KB, 640x640)
>>108553925
Alright here you go, anon. If I remember correctly I kept overly specifying things in the prompt because Anima was being stubborn, like turning the insect into an insect boy/girl.
https://files.catbox.moe/uk8mej.jpg
>>
>>108553916
https://youtu.be/VliOCZKBM-w?si=si5q3thSrWTAaIkR&t=3
>>
>>108553962
You won't make me download metadataless insect futa a third time anon. I won't fall for your tricks again.
And I was going to help you.
>>
>>108553977
Anon, does catbox not have metadata on images? I specifically uploaded the original and then downloaded it off catbox to make sure it has the original metadata on it. Also I'm retarded and I don't know what the hell a futabox is and googling it gave me nothing.
>Also what do you mean by third time? That's the second link I shared
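One way to settle the metadata question for PNGs is to look at the file's text chunks directly. This is a minimal sketch using only the standard library; the function name is made up for illustration, and the assumption that ComfyUI-style images keep their data under keys like `prompt` and `workflow` (or `parameters` for A1111-style UIs) should be checked against your own files. It only reads uncompressed `tEXt` chunks and does not verify CRCs, and it does nothing for JPGs, which store metadata differently.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_text_chunks(data: bytes) -> dict[str, str]:
    """Return keyword -> text pairs found in a PNG's tEXt chunks."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found: dict[str, str] = {}
    pos = len(PNG_SIGNATURE)
    # Each chunk is: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt body is "keyword\0text", both Latin-1 encoded.
            keyword, _, text = body.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length  # skip length, type, body and CRC
    return found
```

Calling `png_text_chunks(open("image.png", "rb").read())` on the original and on the copy downloaded back from the host should return the same keys if the metadata survived; an empty dict means there is nothing for anyone to read.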
>>
File: 1766581180856600.jpg (616 KB, 1416x2064)
>>
File: deNS_zi_00018_.png (3.62 MB, 1663x1164)
>>
>>108553990
Post the prompt instead
>>
>the preview of my gen changing from a chinese woman to my waifu
https://www.youtube.com/watch?v=n5b_nIzdvfI
>>
File: ComfyUI_08584_.png (801 KB, 1024x1024)
what did anima mean by this
>>
New version of anima seems alright. Feels just a bit side-gradey. Still want to see that realism lora...
>>
>>108554182
It is the russian equivalent of snickers.
>>
File: 1749391890567071.jpg (36 KB, 612x574)
>>108554182
what a huge chocolate bar. i've never found one this size in my darn fallen eu...
>>
File: HFTMJnObgAA090R.jfif.jpg (869 KB, 3600x2025)
seedance 2.0 beaten by a new model, likely also chinese
>>
and i should care about sneederboards for what reason?
>>
>>108554249
sora 2.5 maybe? the whole Disney thing falling apart still feels odd
>>
File: 1768147362790899.png (1.35 MB, 1836x2338)
Any good Anima 3 upscale workflows?
>>
>>108554265
if youre not first youre last
>>
File: 1767907395674028.png (1.55 MB, 1164x2048)
>>108553858
here u go broski
>>
File: file.png (293 KB, 1302x1044)
Techlet here, I know more than your average techlet but i'm vastly out of my depth compared to even a dumb /g/ anon. I've got a 2080 Super and my gens are seizing at 50% moving at a glacial speed with a rampaging 95% gpu workload going on in task manager. Is it possible to input some sort of command to break the workload up into smaller resource dumps, or dick down my CPU a bit to try and peel some off of my GPU? I'm 99% sure it is based on my available VRAM that is causing this but I can be totally blind, retarded, and wrong.
>>
>>108554249
it's going to be open source
this will be the deepseek moment of videogen
>>
>>108553880
you don't understand
it's his thot video
>>
File: HappyHorse.png (201 KB, 800x810)
>>108554249
That name... xD
>>
>>108554424
windows users btfoed
>>
File: Anima_01706_.png (1.05 MB, 896x1152)
>>>/vg/562917381
>>>/co/153133847
I am not a discord fudder or thread schizo and I love this model but I am disappointed and worried about anima. I compared roughly 40 images on preview 2 vs 3:
Potential for prompt adherence is higher (could do some prompts 2 struggled a lot to do), but it's still inconsistent, perhaps even more so than before.
Backgrounds are noticeably worse sometimes, I would guess on average too.
Hands are still seed lottery despite more high-res training.
Text is more or less the same, maybe slightly better but still inconsistent seed lottery.
It's struggling to do very common characters it could easily do in the past previews consistently, that might be the single most worrisome thing about it.
I never trained a large scale finetune before so I am unqualified for armchair engineering here but my amateur lora trainer impression is if I am being optimistic it's the weird halfway epoch before the lora churns into the optimal place. And if I am being pessimistic it's getting fried already.
Maybe prompting or sampler meta changed with new release? (Using cfg circa 5, euler ancestral/er sde simple, 30-40 steps, as tdrussell says, I really doubt this is the cause though. Tried different kinds of prompts too.)
I didn't notice any uptick in <100 instance artist prompt adherence despite what Tdrussell says, though I haven't tested that extensively.
Photo-realistic gens are maybe a bit better, but memes aside still feels practically unusable for this purpose without a lora of some sort.
>>
File: 1751824648004360.jpg (660 KB, 1536x1536)
>>
File: 85686358389.jpg (1.15 MB, 1664x2432)
>>
>>108554562
it's smaller than SDXL yet tries to do natural language as well. it's too complex for its size, the weights are full
>>
>>108554616
That's a new one hah
>>
>>108554249
>It's LTX 2.4 and will be open sourced tomorrow
just imagine
>>
>>108554249
https://xcancel.com/AngryTomtweets/status/2041640342764843097#m
>metalic sound
that alone will not make the model better than seedance 2.0 lol
>>
are you demoralized by the current state of local image generation? just finished burning through tons of credits testing recraft v4 pro and I'm speechless at the results. The model is still censored to a small degree with some generation rejections but holy shit.
https://files.catbox.moe/yg9m7m.png
https://files.catbox.moe/0zlujr.png
https://files.catbox.moe/avl42o.png
https://files.catbox.moe/fqgb4x.png
https://files.catbox.moe/863rm4.png
https://files.catbox.moe/aux9cj.png
https://files.catbox.moe/1yszq8.png
https://files.catbox.moe/ttbxkn.png
https://files.catbox.moe/bt73u2.png
https://files.catbox.moe/wc402a.png
https://files.catbox.moe/lmwtm4.png
https://files.catbox.moe/4wc6ug.png
https://files.catbox.moe/ptw1ny.png
https://files.catbox.moe/wbd3nl.png
https://files.catbox.moe/873kp6.png
https://files.catbox.moe/nkhia0.png
https://files.catbox.moe/wi5hl4.png
https://files.catbox.moe/pd6g9b.png
https://files.catbox.moe/h279o0.png
>>
File: it's so over.png (176 KB, 460x310)
>>108554832
>how does it feel to see a set of API models that are way less cucked than the local models we currently have
it feels bad anon, what else can I say?
>>
>>108554769
Less than 0.00001% chance of that imo.
Most probably just new Veo version
Too soon and too good for Wan (won't be open source anyway)
Maybe some other dark horse Chinese model (very unlikely but a lot more likely than Israeli FOSS suddenly getting order of magnitude better and still be released for free)
>>
>>108554806
i hope that's a new MiniMax Hailuo or Vidu Q model. Fuck bytedance and their greedy cucked and nerfed seedance 2 model. I hope it's not veo4 because google is bound to improve on the censorship with the latest gemini models.
>>
File: 1752320850043753.png (443 KB, 640x480)
>>108554861
>I hope it's not veo4 because google is bound to improve on the censorship with the latest gemini models.
google has become quite based recently, gemma 4 is so uncensored and smart it's laughable, I really have some hopes they'll also release an image model locally
>>
>>108554832
it's time to confront the fact that localkeks are here for ideology, not quality. they coped by saying saas was 'censored' but now more and more APIs are emerging with minimal restrictions. the goalposts move, the mask slips. it really is all about being poor and entitled.
local has been abandoned. it was obvious over a year ago. 'based china' was the eternal cope of 2024/2025, and now they've sold out too. but local models were shit long before everyone quit releasing them. midjourney was always more aesthetic, dall-e always had better cultural knowledge. flux was always just budget dall-e 3 with zero styles or characters, just coping with text on signs no different than what emad shilled with sd3.
local will never receive a SOTA image/video model again, it will be endless stagnation. by the time local receives a model as good as nano-banana 1, we'll already be exploring realtime coherent deep-dive virtual worlds with API.
>>
h-
>>
File: 1757693582715351.png (1.03 MB, 1250x754)
https://huggingface.co/jdopensource/JoyAI-Image-Edit
When are you going to implement this you fucking ComfyUi fuckers???
>>
>>108554857
we desperately need new players in the field for local open source image models. Alibaba has abandoned open sourcing image and video generation models and black forest labs are just absolute safe cucks that continue to poison both their saas and open source models with safety bullshit and synthetic slop. 1mp image generation is a dead end and seedvr2 has its limitations.
>>
>>108554832
>apicuck dumping his shit
kill yourself faggot
>>
>>108554885
there's qwen edit with the camera lora iirc
>>
>>108554911
it can do more than just rotate things, I want to see if it makes less slopped outputs than klein
>>
>>108554912
some of the examples with human subjects result in turboslopped outputs, so I wouldn't hold my breath honestly
>>
so happy I'm right about anima being a dogshit arch. you finally woke up sheeple
>>
holy schizo freekek seethe
>>
   
>>
The more sophisticated a model gets, the less abstract I can go..
>>
>>108554958
what cloud model is that
>>
>>108554832
nigger learn to create albums i'm not going to open this shit
>>
I know this will get lost in between the deranged ramblings of Julien and the raiders, but I am sincerely disappointed in Preview 3 in a way that I wasn't with Preview 2.
My confidence in the model is shaken. I hope the final version turns out alright but with the current trajectory, we can end up with something worse than Preview 2.
And no I am not interested in Mugen/Chenckin/whatever the fuck cliptranny discordtrannies are shilling, fuck off.
I guess this is just a bad year for local.
>>
File: ComfyUI_08232_.png (1.03 MB, 1024x1024)
>>
>>108555049
> ramblings of Julien and the raiders
you mean of yourself and the raiders
fuck off ani
>>
Damn, preview3 has improved lolis quite a lot.

>>108554981
Seed dance 2.
>>
>>108554958
>the less abstract I can go..
Those look so cool thoughever
>>
>>108555075
the fud must spread
>>
>>108555077
why is it so fried and smeared?
>>
>>108555077
Apparently hags grow beards and mustache
>>
>Can't draw Ryu's gi with correct color consistently
>Hallucinated weird brooch into Lapis's outfit in the canoe gen
>Drew her skirt too short in another (And I made dozens of gens of her in Preview 1 + 2 and neither did anything similar)
>Forgot how to do Batgirl's outfit properly
>Spider-Gwen is completely raped
Seriously it's forgetting characters. I posted evidence besides trust me bro fud but no one cares yet it seems. If I can be arsed, I will sample multiple seeds to demonstrate the problem and make a post in the hf thread.
>>
Also does anyone know if there is a filesize/total number of images limit to how much shit I can attach to an hf post?
>>
https://huggingface.co/circlestone-labs/Anima/discussions/113
>those images
I know you're here
>>
>>108555252
>>108555264
Make sure you include the metadata :-)
>>
>>108555268
>it's so good it can perfectly reproduce the training data
that's bound to happen, the more AI models improve, the more they can do stuff we're asking from them in the first place, which is to reproduce the training data
>>
>>108555271
Of course smileytroon, in particular for you.
>>
>>108555252
>but no one cares yet it seems
The thread is usually slow around this time of day. It'll pick back up in a bit.
>>
File: 8563864373472.jpg (1.47 MB, 1664x2432)
>>
File: 1773706478921640.jpg (703 KB, 1536x1536)
>tfw no zitslop wife
>>
File: HDP2qjwXQAAjpWr.jpg (128 KB, 1179x1127)
>>108555555
HOLY GET
>>
File: WERE SAVED.png (2.45 MB, 2286x1254)
>>108554249
https://xcancel.com/bdsqlsz/status/2041793884146299288
https://happyhorse-ai.com/
>fully open source
>15b
OMG ITS HAPPENING!! OMGOMGOMGOMGOMGOMGOMGOMG
https://www.youtube.com/watch?v=xb2fjZa_L74
>>
File: 1771427149880488.png (1007 KB, 1080x895)
>>108555676
BIG IF TRUE
>>
>>108555676
>>108555682
Do you think it's something like Z-video?
>>
File: file.png (353 KB, 800x450)
>>108555676
I hope it's not a fake website anon, you got my hopes up.
>>
File: 1765500092347293.png (657 KB, 1395x1548)
>>108555676
https://xcancel.com/bdsqlsz/status/2041809530942845107#m
confirmed open source, we'll get the model in 2 days
>>
>>108555676
>>108555707
>Inb4 100gb so that nobody can run it anyway so paypiggy for the API regardless.
>>
Oh website says 15b.
Well I just hope it runs fine on my 3060
>>
File: ComfyUI_20201.png (3.26 MB, 1200x1600)
>>108554832
Why not choose different dimensions if everything is just going to be centered?

>>108555676
Fingers crossed that it fits in 24GB of VRAM...
>>
>>108555676
I'm sorry for doubting you Alibaba, you are the real goat
>>108555769
it's a 15b model, barely bigger than Wan 2.1
>>
>>108555676
>>108554249
This is not on the same level as Seedance 2.0 (obviously) but it's way better than fucking Wan 2.7, why would Alibaba give us that when they didn't want to give us their worse video models??
https://files.catbox.moe/57lulu.mp4
>>
Btw Chinese man says 5 sec in the tweet but artificialanalysis demos are 9sec.
>>
>>108554562
Your points are valid. The things you mention are realistic. Unfortunately, this is a bad place since it is a general of little relevance and very much a shitpost central. I would put your comment on the Huggingface page or on CivitAI itself.

The intelligence of Anima reached its limit in preview 1. In preview 2 and 3 there were aesthetic improvements, but the comprehension failures continue to be the same.
It is a shame, but at the same time I understand why tdrussell chose such a small model for this.

The use cases of Anima and Illustrious are the same. In terms of scene and object character relationships, it is still crippled as hell, like low level Photoshop when you reach a certain degree of complexity.

It is a lightweight goon model updated in terms of architecture. I also do not see it as necessary for it to be a more complex model and the best, since there is no realistic use for an anime model to be that intelligent, because 99 percent of its users are still making single girl cowboy shot or typical NSFW scenes.
>>
File: 1758323415928084.png (350 KB, 1754x1412)
>>108555676
>https://happyhorse-ai.com/
>15b
This is just a finetune of daVinci-MagiHuman, right? I'm sure I'm right...
>>
>>108555833
Ask it how they differ.
>>
>>108555676
Would this run well on a 5090?
>>
File: 1754101408099756.png (301 KB, 1213x1317)
>>108555838
maybe it's daVinci-MagiHuman but it can do native 720p this time
https://xcancel.com/bdsqlsz/status/2041811909884965324#m
>>
File: 1771040277091625.png (426 KB, 1563x1374)
>>108555846
or maybe not
>>
>>108555840
if it's a 15b model you won't have any issue running it on Q8
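A rough back-of-the-envelope check of that, as a minimal sketch: it counts the weights only (no activations, text encoder, VAE, or quantization scale overhead), the 15B figure is just what the site advertises, and the function name is made up for illustration.

```python
def weight_size_gb(params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


# Compare a few common precisions for a 15B-parameter model.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_size_gb(15e9, bits):.1f} GB")
```

By this estimate 8-bit weights come to about 15 GB, which leaves headroom on a 32 GB card and is tight but plausible on 24 GB, while a 12 GB card would need something closer to 4-bit or offloading. Real quantized files run somewhat larger than the bare arithmetic suggests.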
>>
>>108555846
Now you know the magic of asking the right question to get the answer you want.
>>
is 32gb ddr3 fine for comfyui?
>>
File: based.png (431 KB, 800x582)
>>108555676
>Users: "Please Alibaba, release Wan 2.5"
>Alibaba: "How about I give you something even better"
>>
>>108555874
i don't trust chinks
this better might be dog shit
>>
>>108555676
https://files.catbox.moe/cx8cg7.mp4
The audio quality is insane, if this is what we'll get locally then we're back to levels never imagined, holy shit dude
>>
>>108555882
it better might could be probably is
I really hope ltx hasn't become the bar
>>
>>108555676
APIKEKS IN SHAMBLES
>>
>>108555676
kek, I remember saying that davinci would be great if it was trained on more than 256p, looks like Alibaba got my memo
>>
>>108555862
It will slow down loading models significantly.
Also most likely terrible for any offloading.
But if you can fit the models into your GPU comfortably, it should be workable.
>>
File: 1749645077228879.png (893 KB, 1433x1299)
>>108555676
he's not wrong lol
>>
File: 6457257237.jpg (1.96 MB, 1664x2432)
>>
File: Ikr.png (77 KB, 1453x290)
>>108555676
https://xcancel.com/ArtificialAnlys/status/2041591989083500933#m
More examples on this post
>>
>>108554592
nice, catbox?
>>
>>108555555
waste of perfectly good sextuple with fucking gook slop.
>>
>>108556059
>>108555552
>>108555126
>>108555831
Stop posting Anima, old news, if you want to keep shilling or discussing it go to dedicated Anime generals.
Fuck off.
>>
>>108556121
>old news
what is this schizo talking about?
>>
>>108556066
Here is a crazy theory. It's fake, gay and corrupt like anything else in the benchmeme ecosystem?
Like if some company were to approach them and say "We are launching a new model soon. Here is $50k, put us in the top spot of your leaderboard this week to build hype. Ok, thanks bye." why the fuck would they refuse?
>>
>>108556129
I don't know why they would need to do something like that, the simple fact it's way better than ltx 2.3 and will be open source is a good sign it'll get hyped to oblivion anyways
>>
>>108556066
no talking?
they said it's multilingual, did they mean the prompts?
>>
Hello, we discuss upcoming tech here. Anima has already been updated, and anons are discussing it in their respective anime threads. Once Anima is updated, there’s no point in discussing it here. If you don’t know how to read the room, let me say it for you.
>>
>>108556151
there's some examples where it's talking on twitter
https://xcancel.com/Ricardojiang888/status/2041794075779854530#m
>>
File: rug pull?.png (396 KB, 607x505)
>>108555676
https://xcancel.com/EtherCoins/status/2041831371895927068#m
lmaooo
>>
>>108555707
>>108556185
>https://xcancel.com/bdsqlsz/status/2041809530942845107#m
HOLY SHIT HE REMOVED THE TWEET SAYING THAT IT'LL BE OPEN SOURCED, LMAOOOOOOO
>>
>>108555676
are you fucking kidding me
>>
>>108556191
this chinese insider guy is starting to act really sus lately, he said Qwen Image 2.0 would be released at the end of march, it didn't happen, now this? fucking bugs I swear...
>>
how much system does anon have/need?
>>
File: image.png (511 KB, 793x482)
>>108556198
fuck off
>>
>>108556155
>we discuss upcoming tech here
says who?
>>
>>108556221
kek
>>
>>108556155
You don't get to decide what should be discussed here, go fuck yourself.
>>
>>108556191
https://xcancel.com/bdsqlsz/status/2041805114894381334#m
time to remove that one too anime man
>>
File: anima.jpg (1.15 MB, 1664x2432)
>>
File: sad.png (281 KB, 947x899)
>>108555676
It's over...
>>
>>108556249
Hey, I see you’re the only anime poster left. Just so you know, there are dedicated anime diffusion threads where you can talk about anime with others who share your interests. This isn’t the right place or context to keep discussing a model that has already been updated.
>>
>>108556291
>User impersonating 4chan staff.
>>
>>108556298
No, i'm a ldg anon who has been long enought. Dkn' put words in my mouth.
>>
>>108556304
>enought. Dkn'
saar?
>>
Anima P3 looks great, keep posting it
>>
>brown south american calling others indian
lmao
>>
>>108556304
you're a real sad act
>>
>>108556268
https://github.com/brooks376/Happy-Horse-1.0
sus
>>
>the fizzler
>>
>>108556315
... in their respective anime generals
>>
>>108556315
>keep posting it
this, make this retard >>108556330 angrier I want to see some melties
>>
>>108555778
holy based jenner


requesting permission to repost sexo jennies to /r/Realistic Parody AI. Your jennies mog everything they're genning over there
>>
File: Anima3_00048_.png (1.29 MB, 1296x896)
Anima is great
>>
for WAI-Illustrious (or other models that know artists), should I do:
rustle, or rustle_\(artist\)?

and for names with spaces:
murata range, or murata_range
?
>>
Tdrusell belongs in anime generals desu
>>
File: ComfyUI_19716.png (2.23 MB, 1200x1600)
>>108556336
Go for it. Once they're on your computer you can do whatever you want with 'em.
>>
he's running out of FUD energy
>>
>>108556325
> Repository status: The model weights and inference code are marked "coming soon." This README documents the announced architecture, training, and benchmark results. Star and watch this repo to be notified the moment the weights are published. In the meantime, you can try the model and read the latest updates on happyhorses.io.
>>
When is the part where he links his own posts as a proof of other generals (apparently he has a hate boner against hgg and adt) raiding ldg?
>>
kek he's giving up
>>
friday, model released
saturday, comfy implementation
sunday, ai-toolkit integration
monday, everyone's bored
>>
>>108556358
what day do u kill yourself
>>
He's fizzling.
>>
>>108556358
friday - seethe
saturday - meltie
sunday - cry
monday - cope
>>
>>108556358
to be fair, the local models are pretty mid so it's easy to be bored of it quickly, if I had seedance 2.0 at home I would have fun with this thing until the end of my life
>>
>>108556349
>ranjeet
wait, Anifart is also on /lmg/?
>>
monday - no funding
tuesday - no funding
wednesday - no funding
thursday - no funding
friday - no funding
saturday - no funding
sunday - no funding
>>
>>108556351
Why do you namedrop a general who has 2 users samfagging? You are no relevant
>>
monday - no funding
tuesday - no funding
wednesday - no funding
thursday - 1 new star
friday - no funding
saturday - no funding
sunday - no funding
>>
File: anifart.jpg (1.73 MB, 979x2558)
>>108556388
>a general who has 2 users samfagging?
Anifart is here, but who's the 2nd user?
>>
Openanima status????????? (it doesn't exist)
>>
>>108556344
always sharing the top notch gens
>>
>>108556241
Hello, I'm AnimAnon, I'm poor, I can't afford stuff.
>>
>>108556407
why wouldn't you want more local and actually fully free models?
>>
>>108556268
>>108555676
now that it's been revealted to be fake? what model is it then? A new Kling? Veo 4?
>>
>>108556120
1. ur racis
2. you will envy my hapa kids
>>
>>108556545
when was it revealted to be fake?
saas models aren't worth the risk after the seedance 2.0 cease and desist.
make a godly open model and sell server compute, the end result is the same and they avoid getting shut down by hollywood.
>>
>>108556376
tuesday - 35 stars roping
>>
>>108556547

i quite enjoy healthy hapa pics
i see no wrong in that
>>
>>108556580
>when was it revealted to be fake?
here, he based his open source prediction >>108556268 on this completely sus github >>108556325

that guy is also talking about DeepseekV4 (that doesn't exist yet), 99% a scam
https://github.com/brooks376/DeepSeek-V4-AI-Coding-Assistant
>>
>>108556268
china man disappointed me
kijai never disappoints
>>
>>108556604
isn't it already on the leaderboard?
>>
>>108556427
if only the woman was not that ugly and boring
>>
>>108556713
you're talking about deepseek v4 or that happyhorse thing?
>>
File: AAAAAAA.png (58 KB, 636x640)
>>108555780
>I'm sorry for doubting you Alibaba
>>108556268
Well... FUCK YOU CHINA
>>
Is there a difference between batch and global batch in OneTrainer LR scaling?
>>
>>108556810
>thing
the seethe is palpable
>>
>>108556849
why should I care anymore? won't be open sourced anyway
>>
>>108554182

you know that i know right
i has warm and that choko is like big toblerone
>>
>>108556860
>What is Happy Horse 1.0?
>Happy Horse 1.0 is a 15B-parameter open-source AI video generation model that jointly produces video and synchronized audio from text or image prompts.
whats next, complain about 15B params, or open source bad?
>>
>>108556291
YOU ARE MENTALLY ILL. YOU SHOULD REFER YOU ARE SELF TO A MENTAL HEALTH INPATIENT FACILITY. YOU HAVE A BAD KIND OF RETARDATION WITH RETARDS TO YOUR ARE BRAIN. aLSO THIS IS AN ANIME WEBSITE. n-WORD N-WORD N-WORD
i even think anima is plastic slop too but i want them to post it more. I want them to post chungus 'astronaut riding horse on moon holding a sign that says "text conprehenssion" [misspelt on purpose] with a rabbit in the top left, a dog in the top right and three chibi anime girls are watching also' adg slop images here because it owns you to do that
>>108556345
He has a backup supply of CHUD energy and im afraid not an insignificant amount. He'll be going for days
>>
mogao 2.0
>>
>>108556887
>believing a fake website
ngmi >>108556604
>>
>>108556911
is artificialanalysis.ai a fake website too?
>>
>>108556925
did artificialanalysis say it's gonna be open sourced though?
>>
File: FxAzX-SX0AAZ-DV.jpg (9 KB, 210x338)
>https://red.anthropic.com/2026/mythos-preview/
>~1000 open source repos tested
>frontier model discovered 595 basic tier bugs and dozens of severe bugs including 0days.
>>
>>108556940
the LLM believes they are bugs, but are its predictions accurate though?
>>
File: ComfyUI_08619_.png (910 KB, 1024x1024)
>>
>>108556927
>the model is real but the site is fake
ok, that makes sense.
>>
File: mogaokekked.png (39 KB, 1029x229)
>>108555676
Are there genuine, ironic brownoids in this thread who don't realize this is a fake pop-up site just like what happened with mogao (which turned out to be seedream)? I'll be sure to include all the "APIKEKS BTFO" posting in the screencap".
>>108555780
>>108555874
>>108555901
Local is an ABSOLUTE EMBARRASSMENT
>>
What's a good wan lora for quick cut to sex?
>>
>>108556949
>the model is real, but a fake site is claiming it's gonna be open source
yes? can you show me your hands, I wanna verify something
>>
>>108556944
if you read the post, anthropic goes into detail on some of the most severe ones, and some maintainers like the ffmpeg people can confirm they've been getting patches for a short while now.
>>
>they got excited for an API model
this just proves that API is better than local. we all know it. local models are outdated shit.
>>
File: _AnimaPreview3_00003_.jpg (410 KB, 1248x1608)
>>
>>108556951
>mogao (which turned out to be seedream)
ngl, bytedance is good at making models, I remember the images made by seedream and they were pretty realistic
>>
>>108556955
There were several on civitai before the great purging. No I won't share them.
>>
>>108556940
they could ask it to check the comfyui repo if they need a real test
>>
>>108556958
where did artificial analysis get the model? can you link the real site. you seem to know a lot about this happy horse thing.
>>
File: dammit.png (152 KB, 600x600)
>>108556951
Sorry anon, the hopium dose got the best of me
>>
>>108556940
I think it's a better idea to share this on /lmg/
>>
>>108556983
i am the mystery twitter chinese
>>
Comfy was right to switch to API. Nobody likes local models, not even localkeks. So far behind....
>>
>>108557014
he was right to ditch anima as well for an api only noob model
>>
35 stars status?
>>
>>108557040
maybe he can finally implement inpainting once claude mythos is out
>>
https://civitai.com/models/897413/big-love
is there a gofile link to this klein 9b "finetune" somewhere?
>>
>>108557050
it's so plastic, Klein is really not good at realism, thank god Z-image turbo exists
>>
>>108557061
you just dont know how to use it
>>
>>108557073
you don't know how to download that finetune
>>
>>108556949
Why is this so hard to believe? The domain isn't even anything that would lend it credibility, "happyhorse-ai.com". There are about a dozen domain names they probably would have gone for before that if it was real
>>
>>108557082
>Why is this so hard to believe?
he knows, he's just trolling you
>>
>>108556059
phoebox please? so cute
>>
>>108556358
>sunday, ai-toolkit integration
nobody is gonna have vram to train that thing
>>
>>108557082
it doesn't take a genius to realize these companies are going to move towards open models to maintain plausible deniability and just sell server compute.
in the last 4 months we saw the world's richest man have his AI lobotomized, and a 500 billion dollar company brought to its knees with a cease and desist letter.
>>
>>108557158
>in the last 4 months we saw the world's richest man have his AI lobotomized, and a 500 billion dollar company brought to its knees with a cease and desist letter.
that's the thing, if they have that much power, they could destroy the life of someone who's willing to release a model that's too powerful to the nature (local)
>>
>>108553861
no e621 dataset = no fun, pray for a furry+anime (instead of furry only) finetune
>>
>>108557158
Absolute fucking localcope
>>
>>108557176
this. i need my farting_on_prey
>>
Is there a plugin or a node that lets you make these notes of text very minimalistic?
I plan on having a lot of them.
>>
>>108557188
Just use promptcat https://github.com/sevenreasons/promptcat or any other prompt storage tool. Or just fucking plain text file
>>
File: ComfyUI_temp_sjuir_00021_.png (1.88 MB, 1024x1024)
>>108557188
can't get any more minimalistic than label node (rgthree).
>>
File: 7.png (1.37 MB, 960x1400)
>anima
>>
>>108557182
>we can make a lot of money selling server time for our model
vs
>we can get sued by disney and then take our model offline
open models let companies exist in a legal grey area, it has nothing to do with "local".
>>
>>108557201
That's not user-friendly for multiple users.

>>108557206
That's almost perfect, but you can't easily select the text to copy-paste it.
>>
File: cope-maxxing.png (178 KB, 480x270)
>>108556268
Hmpff, it wasn't that good anyway!
>>
File: _AnimaPreview3_00044_.jpg (403 KB, 1248x1608)
>>
File: 1766570287465007.jpg (720 KB, 1536x1536)
>>108556596
1 more 4 u
>>
Russell, you have the patience of a saint for dealing with the retarded shit that gets posted on your HF page kek
>>
File: _AnimaPreview3_00054_.jpg (296 KB, 1160x1696)
>>
Fudding time?
>>
>>108557333
cope seller, gimme ure strongest fuds
>>
>>108557338
free gifts like anima are le bad because i say so grrr >:(
>>
may i ask your settings? My gens are so bad vs this
>>
>>108557050
samples look really bad and overbaked, don't fall for these shit mixes anon, they are just glorified merged loras. those people don't have the resources to train a finetune, they just train a lora with a small dataset and merge it into the base model. they are scamming you anon, you can do so much better, train your own stuff
>>
>happyfake
i don't know if this shit is really open source, but not for this week kek
>>
File: Z-image-Klein_00929_.png (828 KB, 1024x1024)
nyaa
[spoiler]ignore the slopped paw[/spoiler]
>>
>>108557497
Looks good so far but I'm really hoping that it's an API model. Seedance is dead and other API models are too old to bother with now.
>>
>>108557533
it's not even close to seedance, and Sora is also dead soon, it's like replacing Messi and Ronaldo with Mbappé, it's a cope
>>
File: 38057800387796390787934.png (103 KB, 1189x810)
>>108557568
Numbers look fine.
>>
File: 1617234637622.jpg (93 KB, 900x675)
>>108557533
i don't think it's wise to ignore ltx. new loras appear regularly. alibaba has no choice but to attract attention, or risk losing the local market permanently
>>
>>108557596
now use your eyes and look at the videos
>>
>>108557607
>i don't think it's wise to ignore ltx.
it is, ltx sucks lol
>>
>omagawd are those benchmarks???
>>
>>108557610
that's how they are ranked
>Models are ranked using an Elo rating system derived from user votes in blind comparisons. Users compare videos generated from the same input image and choose the result they prefer. Higher Elo scores indicate a model is preferred more often.
>>108557623
you should relax, this might actually be a closed api model. you don't want to be in this thread in two days hyping up the happy horse api.
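
For anyone wondering what "Elo derived from blind votes" means in practice, here is a rough sketch of the textbook Elo update applied to pairwise preference votes. The starting rating of 1000, the K-factor of 32, the model names, and the one-vote-at-a-time update are all assumptions for illustration; the leaderboard may well fit its ratings differently.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Elo-model probability that A is preferred over B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one blind comparison."""
    delta = k * ((1.0 if a_won else 0.0) - expected_score(rating_a, rating_b))
    # Whatever A gains, B loses, so the total rating pool stays constant.
    return rating_a + delta, rating_b - delta


# Hypothetical model names and votes: (left, right, left_was_preferred).
ratings = {"model_a": 1000.0, "model_b": 1000.0, "model_c": 1000.0}
votes = [
    ("model_a", "model_b", True),
    ("model_a", "model_c", True),
    ("model_b", "model_c", False),
]
for left, right, left_won in votes:
    ratings[left], ratings[right] = update_elo(ratings[left], ratings[right], left_won)

for name, rating in sorted(ratings.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {rating:.1f}")
```

A higher rating only says the model won more of the comparisons it was put in, so it depends on who voted and which prompts were used.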
>>
>>108557640
>>108557623
he did the meme lol
https://www.youtube.com/watch?v=nsNrwHA6Big
>>
>>108557648
>people say this model looks better
>uhhh no they didn't
apicucks really are a special breed.
>>
>>108557665
>well the chart says...
>>
>>108555676
>A youtuber specialized on API models talks about that model
it's over... it definitely won't be local
https://www.youtube.com/watch?v=mmk9C6bkV_c
>>
>>108557684
>& likely Open!
>>
>>108557684
>Beats Seedance 2.0
nice cope.
>muh charts
lol
>>
>>108557701
you know it doesn't beat Seedance 2.0 because Holywood hasn't sent a cease a desist letter to them, it's that simple, if you don't make Jewlywood freak out, your video model is certified mid
>>
>>108557050
You should be able to wait 10 days for this jeetmix.
You will probably make 10 gens with it before realizing how fried it is and deleting it from your drive.
>>
>>108557712
good luck sending a cease and desist to an open model.
>reee someone made tom cruise!
>idk probably a lora or whatever, take it up with big_booty_genner69
defeat jews with this one simple trick
>All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
>>
File: anima3preview_.png (1.13 MB, 668x1188)
>>
File: deMS_zi_00027_.png (3.91 MB, 1663x1164)
>>
File: ComfyUI_158183_.png (3.78 MB, 1080x1920)
>>
File: anima3preview_.png (2.91 MB, 1024x1808)
>>
File: anima3preview_.png (2.49 MB, 1024x1808)
>>
https://huggingface.co/black-forest-labs/FLUX.2-small-decoder
>>
File: 00004-4059277889.png (2.17 MB, 1024x1280)
>>
>>108557802
are they fucking serious? vae decoding takes like 2 seconds on klein, why would I want to reduce the quality of my image just to gain 0.5 seconds of waiting, god they are so fucking retarded
>>
>>108557802
I pray for the poor saps who are bottlenecked by the vae, for images no less
>>
>>108557814
>bottlenecked by the vae
it's not even a thing since you can use the vae (tiled) node kek
>>
>>108557684
>>
anona?
>>
>>108557802
national holiday for india
>>
>>108557802
>KV meme and now this
who asked for this?? lmao
>>
>>108557838
what is this shit
>>
>>108557876
a brown person having a meltdown over a new model.
par for the course here at /ldg/
>>
>>108557890
calm down anon
>>
>>108557876
>what is this shit
Happyhorse, your Seedance 2.0 """killer"""
>>108557890
>a brown person having a meltdown over a new model.
you're talking about ani seething about Anima every single day?
>>
>>108557757
Why use anima if you're just going to make it look like SDXL shitmix
>>
>>108557811
For standard 1024px single-image generation on a mid-to-high-end GPU, you're right — it's a modest win. The real value is in: (a) high-res workflows where decode VRAM becomes the binding constraint, (b) batch/iterative pipelines where decode overhead compounds, and (c) tightly-constrained hardware where every GB matters and you'd otherwise fall back to quality-degrading tiling. It's essentially VRAM headroom you can redeploy toward higher resolution or larger batch sizes, which is genuinely useful in production scenarios.
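
To make the tiling fallback concrete, here is a minimal sketch of how a latent grid could be cut into overlapping tiles so that each decode call only sees a small patch. The 128x128 latent size, the 64-cell tile, and the 16-cell overlap are made-up numbers, and the decoder itself is left out; this is not how any particular tiled VAE node is implemented.

```python
def tile_spans(size: int, tile: int, overlap: int) -> list[tuple[int, int]]:
    """Return (start, end) spans of at most `tile` cells that cover [0, size)."""
    if tile <= overlap:
        raise ValueError("tile must be larger than overlap")
    if size <= tile:
        return [(0, size)]
    stride = tile - overlap
    spans = []
    start = 0
    while start + tile < size:
        spans.append((start, start + tile))
        start += stride
    # Final span is pinned to the edge so nothing is left uncovered.
    spans.append((size - tile, size))
    return spans


def tile_boxes(height: int, width: int, tile: int, overlap: int):
    """Return (top, bottom, left, right) boxes covering a height x width latent."""
    return [
        (top, bottom, left, right)
        for top, bottom in tile_spans(height, tile, overlap)
        for left, right in tile_spans(width, tile, overlap)
    ]


# Hypothetical sizes: a 128x128 latent decoded in 64x64 tiles with 16 cells of overlap.
boxes = tile_boxes(128, 128, tile=64, overlap=16)
largest = max((b - t) * (r - l) for t, b, l, r in boxes)
print(f"{len(boxes)} tiles, largest tile has {largest} latent cells instead of {128 * 128}")
```

Each tile would be decoded on its own and the overlapping strips blended back together. Those blended strips are where seams can show up, which is the quality cost mentioned above.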
>>
>>108557862
KV Cache stuff was a meme?
>>
>>108557902
it definitely won't be local
seedance is still king
>>
>>108557890
>a brown person having a meltdown over a new model.
exhibit A -> >>108557905
>>
>>108557909
thank you, chatgpt
>>
>>108557909
>—
>>
>anima is so bad because comfy won't gift me millions reee :(
>>
>>108557919
that anon is clearly not seething at anima specifically
>>
>>108557910
yeah, it degraded the output a lot, and now we have TurboQuant to get better quality KV quants
>>
File: 00007-3140823318.jpg (1.51 MB, 2560x2048)
>>
File: anima3preview_.png (1.5 MB, 1072x1072)
>>
>>108557958
monk
>>
>>108557958
he looks like he is about to tell me local is dead.
>>
Fresh when ready

>>108557992
>>108557992
>>108557992
>>108557992

Fresh when ready
>>
File: anima3preview_.png (2.14 MB, 976x1952)
>>
I am new to local llm/diffusion, I am mostly finding comfyui workflows in random places and playing with them.

I stumbled on this and was wondering if anyone had any other neat workflows.

https://weirdwonderfulai.art/comfyui-workflow/qwen-edit-2509-multiple-camera-angle-lora/


Is there a common repository everyone is using for workflows? Or where does everyone find them?
>>
>>108558000
wtf it got nuked, what did ani do this time?
>>
>>108558274
civitai has random workflows, and most custom nodes you download will have an "examples" folder in the install directory. any site that doesn't strip metadata can/will have the workflow embedded into the image, so you can drag and drop the image into comfy and it will open the workflow.
>>
>>108558274
https://www.comfy.org/workflows/comfyui/
>>
>>108558000
y nuked tho?
>>
>>108558274
>Or where does everyone find them?
I make them myself
>>
File: 00009-4142179651.jpg (1.31 MB, 2560x2048)
>>
>>108558329
janjans are probably tired of the schizo baker making early threads all the time
>>
>>108558374
calm down dweebo
>>
>>108558319
Ah nice, I will look at civitai again, I didn't realize they included examples.

>>108558324

Thanks! I somehow missed this site.


>>108558356

Have you made any neat workflows that do something novel (like the camera angle change example)?
>>
fresh when ready
>>108558395
>>108558395
>>108558395
>>
will update comfy, if god is real nothing will break
>>
it's over



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.