/g/ - Technology



File: collage.jpg (2.93 MB, 3100x3731)
Discussion and Development of Local Image and Video Models

Previous: >>108943765

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
blessed chopper free thread
>>
File: ComfyUI_00659_.png (969 KB, 896x1152)
is nvidia pid worth it with 6 GB of VRAM?
>>
File: 122953CUI_00001_.png (1.53 MB, 1192x1536)
>>
Why is Catjack impersonating people to fake engagement?
>>
>>108948292
Good to know this is how you choose to spend what's left of your worthless life, Julien
It'll only accelerate your already inevitable suicide
>>
>>108948292
autismway isway whyway.
>>
File: 125016CUI_00002_.png (1.88 MB, 1192x1536)
>>
>>108948366
Women never actually do anything. Doing things is transcoded.
>>
>>108948244
someone made a demon deals lora lol?
>>
File: ComfyUI_00662_.png (1.79 MB, 896x1152)
>>
>>108948396
wat
>>
File: ComfyUI_00663_.png (1.26 MB, 896x1152)
>>108948417
huh
>>
File: 2454554.webm (3.69 MB, 384x640)
trying to figure out how to prompt a Phalanx CIWS
https://files.catbox.moe/b8c6nr.mp4
>>
File: 2462555.webm (3.5 MB, 384x640)
https://files.catbox.moe/eiuv2j.mp4
>>
File: 22844.jpg (1.19 MB, 1792x2368)
how hard is it to make 2 niggas kissing? very apparently
>>
>>108948469
I hope you electrocute yourself with a toaster in the bathtub.
>>
>>108948469
more
>>
>>108948455
>>108948466
welcome back KINO anon! what is in the KINO menu today, cheff?^^
>>
>>108948484
fine. you too:
>>108948478

You can share the bathtub, I think. You can ask deepseek and see if it will work.
>>
Now that there's a proper built-in UI with a sane license we should add sd.cpp to OP.
>>
>>108948490
He refuses to do a series of Revelation, because he's jewish.

jews go to hell, by the way. I know this from the Bible, it says so.
>>
just use invoke ai
>>
>>108948490
i have a few ideas
>>
>>108948494
will you ever shut up schizo?
>>
Sean Colmes is in hell. Did you know that? He won't be missed. a jew.
>>
File: 6044328.png (1.47 MB, 1152x1360)
>>108948478
ACK
>>
>>108948518
buy this one
https://www.amazon.com/Cuisinart-CPT-142BK-4-Slice-Compact-Toaster-Black/dp/B00D25P82U/
more contact area, and still maybe enough for the burial. Make sure to write down that you want cremation, so the undertaker doesn't catch aids.
>>
>>108948518
me and who
>>
If i am just starting on this is it better to use comfy or stuff like neoforge?
>>
>>108948577
use stable-diffusion.cpp
>>
>>108948577
if you just want to gen, forge; if you plan to tinker/do complex workflows, comfy
>>
File: ANIMA_bface_bad_00018_.png (1.05 MB, 832x1216)
>>
>>108948589
How complex are we talking here?

Also i ask because i hear comfy is very popular so while its obtuse as hell, its also way easier to find tutorials, right?
>>
>>108948598
>its also way easier to find tutorials, right?
yes, you can easily find simple ready to use workflows for txt2img and img2img and build on top of it if you need/want, the UX isn't nice but yes it can also be easy and simple to use
>>
>>108948598
complex usually means you are a VFX artist and you are doing shit in blender, nuke, maya etc. or you are a snake oil jeet. Cumfart is unstable and forge is too slow to get new models. Sd.cpp is dethroning forge
>>
>julien
>>
>>108948598
you can also use swarmui which is a frontend for comfyui. you can always switch to comfyui node ui if you need to use a comfyui workflow
>>
File: 133605CUI_00002_.png (1.14 MB, 1192x1536)
>>
>>108948610
isnt sd.cpp a literal commandline?
>>
>>108948594
>hasan when the 2 guys kissing get out of their spot on stream
>>
>>108948617
They have an official UI now bundled with it :)
>>
File: ComfyUI_temp_gpctu_00003_.png (3.94 MB, 1152x1664)
Has anyone done comparisons on anima lora baking by gradually increasing epochs?
>>
why the fuck does comfyui use 2 ports if i change it from the default of 8188
wtf is this buggy mess
>>
File: 133754CUI_00001_.png (1.09 MB, 1192x1536)
>>
File: ANIMA_bface_bad_00019_.png (1.03 MB, 832x1216)
>>108948594
>>
(Jesus incognito here)
>>
>>108948633
Time to switch to cpp
>>
>>108948649
but sd.cpp is shit anon
>>
How long would it take to vibecode comfy from pytorchslop to sdcpp backend?
>>
>>108948649
somehow, comfyui was using 7860 800 and 8010

I am so mad at it right now.

Is sd.cpp really good enough or are you memeing me.
>>
>>108948651
According to who? You? The no coder? Bwahahahaha
>>
>>108948662
I think it should have support for any model worth a damn, but you obviously have no nodes and the plethora of snakeoils.
>>
>he's unironically STILL seething
lolcow
>>
>>108948659
ComfyUI_frontend repo is a million lines of code
>>
>>108948662
it's around equivalency in speed but it has native gguf support and has all the latest models. You also don't have to do the cumfart humiliation rituals to update since it's just a binary
>>
>>108948631
? What is there to compare? Train more -> learns more, until it starts to overfit on high timesteps, and when that happens depends entirely on your dataset size (interestingly enough, time to overfit has a linear relationship with dataset size when pre-training diffusion models, which is to say if you double the training data you can double the training steps)
https://arxiv.org/abs/2505.17638
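
Taking the linear relationship claimed above at face value, a step budget that worked on one dataset can be scaled in proportion to a bigger or smaller one. A rough sketch only: the function name and the baseline numbers are made up for illustration and are not taken from the linked paper.

```python
def scaled_step_budget(base_steps: int, base_images: int, new_images: int) -> int:
    """Scale a known-good step count linearly with dataset size.

    Assumes time-to-overfit grows linearly with the number of training images,
    as claimed in the post above. This is a rule of thumb, not a guarantee.
    """
    if base_steps <= 0 or base_images <= 0 or new_images <= 0:
        raise ValueError("all arguments must be positive")
    return round(base_steps * new_images / base_images)


# Hypothetical baseline: 2000 steps looked fine on 100 images.
print(scaled_step_budget(2000, 100, 200))  # doubling the data doubles the steps: 4000
```

Treat the result as a starting point and still check samples per epoch, since the relationship was described for pre-training and may not transfer cleanly to small lora datasets.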
>>
>>108948714
oops, meant to send this one https://arxiv.org/abs/2605.06077 but they say kinda the same thing
>>
File: 1779526013752059.jpg (619 KB, 2400x1600)
surprised how far you can push anima on realism
>>108948631
look up validation sets anon, next to samples and comparing checkpoints across epochs that's your best metric to understand whether the model is still learning
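
One way to turn the validation-set advice above into a checkpoint pick: record the loss on held-out images after every epoch and keep the epoch where it bottoms out. A minimal sketch, assuming you already have one validation loss per epoch (how you compute it depends on your trainer); `best_epoch`, `patience`, and the loss values are hypothetical.

```python
def best_epoch(val_losses: list[float], patience: int = 2) -> int:
    """Return the index of the best epoch by validation loss.

    Scans epochs in order and stops once the loss has failed to improve
    for `patience` epochs in a row (simple early stopping).
    """
    if not val_losses:
        raise ValueError("need at least one validation loss")
    best_idx = 0
    stale = 0  # consecutive epochs without improvement
    for i, loss in enumerate(val_losses[1:], start=1):
        if loss < val_losses[best_idx]:
            best_idx = i
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                break
    return best_idx


# Made-up per-epoch validation losses: improves, then starts creeping up.
losses = [0.120, 0.101, 0.094, 0.097, 0.103]
print(best_epoch(losses))  # index 2, the epoch before the loss turns upward
```

Validation loss rising while training loss keeps falling is the usual sign of overfitting, so the checkpoint at the returned index is the one to compare samples against.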
>>108948469
kino
>>
File: 135745CUI_00001_.png (1.7 MB, 1192x1536)
>>
File: jani_00009_.png (1.48 MB, 896x1152)
>>
File: example 3.jpg (1.4 MB, 2439x1788)
>>
File: latina crawl.mp4 (2.62 MB, 1024x1024)
>>
File: Flux2-Klein_00524_.jpg (678 KB, 1056x1472)
>>
>>108948280
>>108948366
>>108948754
FUCK OFF, CATJACK OR AT LEAST IF YOU HAVE ANY SHRED OF DIGNITY REVEAL YOURSELF.
>>
File: ComfyUI_00664_.png (1.36 MB, 896x1152)
>>
>>108948750
is this Anima?
>>
>>108948902
I think it's Dominos.
>>
>>108948832
sexo
>>
>>108948919
link?
>>
uh oh melty
>>
File: ComfyUI_00666_.png (1.29 MB, 896x1152)
THE NUMBER OF THE BEAST
https://files.catbox.moe/ldgrii.mp4
>>
>>108948919
double pepperoni?
>>
File: 143307CUI_00001_.png (1.39 MB, 1192x1536)
>>108948786
Based.
>>
File: 1771473107396552.jpg (736 KB, 2400x1600)
>>108948902
yes, with a lora which is still cooking. will likely post it somewhere in the coming days
>>108948919
actually it's pizza hut
>>
File: ComfyUI_00667_.png (853 KB, 896x1152)
>>
File: 75629924.png (2.39 MB, 1280x1656)
>>108948533
you and >>108948643
>>
File: ComfyUI_00668_.png (776 KB, 896x1152)
>>
File: ComfyUI_00181_.png (1.62 MB, 864x1536)
>>
File: 145651CUI_00002_.png (1.42 MB, 1600x904)
>>
File: q_k0w8og.png (995 KB, 960x1344)
>>
File: mikuanima.jpg (286 KB, 1168x880)
>>
>>108949136
that text gen is really fucking good
>>
>>108949136
zelda 64 lora?
>>
File: 151654CUI_00002_.png (1.72 MB, 1536x1152)
>>
https://prismml.com/news/bonsai-image-4b
Introducing 1-bit and Ternary Bonsai Image 4B: Image Generation for Local Devices
>>
>>108949300
based
phonechads keep winning
>>
File: 153547CUI_00001_.png (2.05 MB, 1536x1152)
>>
>>108949300
But I can't generate smut on public places and Anima is better.
>>
>>108942745
I have a bad feeling comfyorg will do this for gibs
>>
How long it might take before a model outperforms Anima in both SFW and NSFW?
>>
>>108949358
some teams are starting to push out larger ZIT finetunes now, and the arch would definitely allow it to happen
animas license is garbage too so the motivation is definitely given
>>
>>108949385
But are they really as effective and VRAM friendly as Anima? I seriously doubt it. Most of the local scene (jeets and VRAMlets) are going to stick with Anima unless something better with fewer parameters shows up.
>>
>>108949405
>>108949300
>>
>>108949405
i mean one of the main complaints about anima is how slow it is, even compared to klein and zit. though i dont have exact numbers
>>
>>108949482
there is the turdbo lora tho
>>
need animaXL
>>
File: uq6w27.png (985 KB, 1280x768)
>>
>>108949358
Local diffusion had a before and after Comfy moment, and now it has a before and after Anima moment.
>>108949482
Yeah it's slow, but the prompt understanding is so good it destroys any argument about speed. And on top of that they keep finding out new stuff it can do, like it might work as an edit model too.
>>
File: 161225CUI_00002_.png (1.92 MB, 1536x1152)
>>
File: 32467357.webm (1.43 MB, 256x448)
>>
>>108949538
WRONG. We need anima_pony.
>>
>>108949559
Jihadi propaganda.
>>
>>108949541
it also has the potential to become the best photoreal NSFW model out there, with loras. the shit i can generate with this >>108948750 is pretty insane so far, basically 0 concept knowledge loss from anima and it doesnt become CGI plastic either. meanwhile the current state of the art is still biglust or playing the klein anatomy gacha.

people are STILL sleeping on this model despite it being super popular already
>>
File: 745255.webm (2.99 MB, 256x448)
>>
Seriously, how did BigRuss discover Cosmos? Maybe perhaps an nvidia insider recommended the model to him?
>>
>>108949634
>discover Cosmos
you just have to look up at the night sky mate
>>
>>108949634
you say as if it was a dark unknown model that no one heard before, if you do any research on diffusion models of course you stumble upon a nvidia one
>>
>>108949627
>>108949559
Could you add a LiveLeak watermark? kek
>>
>>108949644
Come on, nobody discussed Cosmos here. We fell for every snake oil model imaginable, yet nobody ever shared a "baby, baby, wake up" Cosmos link.
Desuarchive backs me up on that.
>>
>>108949663
you say that as if this thread had regular, robust posters dedicated to testing models and they failed to find something, which is not the case
>>
>>108949663
we talked about it briefly but the licence was a no go for altering the text encoder directly
>>
File: Qwen_IE__00018_.png (938 KB, 832x1248)
>>
>>108949482
I get like 20s per gen at 1024px. Slower than SDXL and friends, faster than Chroma.

>>108949663
It was listed in the Comfy examples but that's about it.
>>
>>108949685
not that you'd ever want to do that
>>
>>108949645
i added that to the prompt and it put random text
>>
>>108949675
Yeah, in 2025 we had the Miku tester and the Deus Ex anon tester. George Floyd tester in 2026 and *atjack. We have got our own lineup for that kind of thing.
>>
File: 163344CUI_00002_.png (1.71 MB, 1536x1152)
>>
>>108949692
nobody wanted to touch it so discussion died which is the point
>>
>>108948750
Nice, but can you do a medium or full body shot, so we can see how it handles body anatomy. So far all realism anima checkpoints and loras have pretty awful body anatomy.
>>
Anima is such a blessing
Thank you comfy
Thank you bigrussel
>>
>>108949800
In what way is it a blessing, can you clarify? Can you post an example image of something it blessed you with?
>>
>masterpiece, best quality, score_9, score_7
>>
File: 1779241536542806.png (3.73 MB, 1600x2400)
>>108949761
what are you looking for specifically? hands/face?
>>
>>108949873
The slop gauntlet. Overhead shot lying on grass.
>>
File: 170838CUI_00001_.png (2.03 MB, 1536x1152)
>>
>>108949894
A pelican riding a bicycle is the new an astronaut riding a horse.
>>
>>108949916
it can never surpass dog wearing sunglasses
>>
File: 246245.webm (1.26 MB, 256x448)
>>
File: 1776071758847180.png (1.58 MB, 832x1216)
>>108949894
definitely not fully consistent. this one is okayish and body generally is fine, but other gens fucked a bit with the hands even more, and also some face details. though to be fair thats more of an issue with eating upside-down than the pose kek
lets see whether this gets better over the remaining epochs
>>
>>108950004
shovel faced human
>>
>>108950004
why do even modern models still have problems with people displayed upside down
>>
Why aren't we engaging? :(
>>
File: ComfyUI_00150_.png (2.05 MB, 864x1536)
>>
File: 174258CUI_00001_.png (931 KB, 1152x1536)
>>
>>108949873
Just the overall body anatomy, checking if any body part is too big or too small.
Looks pretty nice. Perhaps the head size is a bit too big.
>>
so much for the creativity that anima would enable, every single gen has been 1girl, standing since it dropped, fucking dregs
>>
>>108950164
this happens with literally every model anima is nothing unique in that regard. they could drop the best SOTA local model tomorrow and everyone would still be genning 1girls with it.
>>
File: 43453453534543.png (193 KB, 1545x869)
Another slightly better lora of my favourite influencer crush created, one step closer to perfection and death from cooming
>>
File: reCOOM.png (793 KB, 952x720)
>>108950237
is it possible to train LoRA of irl people on civitai for offline use? training and downloading but not publishing them?
>>
File: 3575365.webm (1.71 MB, 256x448)
new biblical event just dropped
>>
File: 180754CUI_00001_.png (782 KB, 1152x1152)
>>
>>108950253
>biblical event
>is fake
fitting
>>
File: 365757457.jpg (72 KB, 800x805)
>>108950248
idk I'm training locally
>>
>>108950248
Yes
>>
>>108950253
you should make it drop a biblically accurate angel instead of the helicopter, I wanna see that
>>
Anyone ever got something useable out of bigASP 2.5 and derived models? I find it interesting to use just to see what it spits out but the results are usually kind of garbage and not very useable
>>
>>108950368
>not very useable
What is it supposed to be useable for?
>>
>>108950412
generating decent looking images
>>
>>108950368
it had higher prompt adherence but worked worse with LoRas. never bothered training on it, was considering it for 2.6 but seems like these were abandoned

snakebite and the few other models based on it are complete dogshit from my experience
>>
File: wNtgCaTXR1-1779828361683.png (1.17 MB, 750x1050)
applying the knowledge

https://uwucards.com
>>
>>108948455

phalanx or oerlicon just look at wikipedia description
>>
>>108949597

not sure
do you like google trends
how would you see red lightning
>>
>>108950527
retarded ass concept
>>
https://huggingface.co/circlestone-labs/Anima/discussions/37#6a1c7679824a08c2594c4d42
What a piece of shit
>>
>>108950527
Go ahead and bring on "popular AI artists" to make exclusive cards and you'll have something worth using.
>>
>>108950253

a massive directed energy beam they wrote about it in early 1980s
>>
>>108950527
Find a real job
>>
>>108950527
>https://uwucards.com
is this you?
>>
>>108950614
yup I'm boycotting this now
redownloading illustrious as we speak
>>
>>108950527
nice when can I buy merch
>>
>>108950614
Look I don't want to be rude but here's a quick reality check.
Community SDXL finetunes were mostly dogshit. The only reason any of them passed into common usage was because:
A: The average user is totally retarded, sees WAIslop and clicks gen with buzz.
B: SDXL trains (for example Noob vPred) were so nuclear deep fried that you needed stuff like r3mix to fix it.
C: There were so many fuckin checkpoints that people tried them out of insane cope because NAI was pulling ahead.

Anima has no fundamental issues like SDXL/vpred did. Therefore your ideals of community checkpoints is *solely* based on personal gain.
Shove them up your ass, nobody wants them, they're shit.
You cannot hope to compete with actual model trainers and large scale training operations, stop pretending. You just want to make fake gay internet points and get money, literally zero people want your crap.
>>
>>108950527
>needs account
>cloud game
>web based
Where is the local aspect of it, loser?
>>
>>108950647
You are trash Turd Russell Ani was right not to give you an inch you are a greedy bastard.
>>
>>108950527
>https://uwucards.com
you need to market this to school kids. That being said, be careful with servicing kids with gambling mechanics.
>>
>ani was right
kek, good one
>>
>>108950614
It's always like this. They are so much holier than anyone else except their work is still based on copyrighted art so to speak. Just because he managed to shit out a model doesn't mean much else legally speaking.
>>
File: 1779409250465154.png (2.97 MB, 1664x2432)
>>108950647
>Anima has no fundamental issues
ehh the LLM fuckery tdrussell did for a while is unironically a bit of a mess. you wont really notice it in base anymore, but its there
>>108950614
yeah we wont get MIT licensed anime slop, especially with current GPU prices holy FUCK i hate crypto bros so much
>>
>>108950614
L-A-X was more chill with the licence, maybe it's a western thing being obsessed with money.
>>
Is anon still upset that tenorart jewed out with Anima? That's not Russ God's fault.
>>
I don't care about gay anal drama I just want checkpoints capable of nsfw
>>
File: image-34.png (78 KB, 3325x216)
>>108950614
Looooooooool
>>
File: Radiance_00007_.png (598 KB, 512x512)
Is there any way to get lodestones radiance working or is it just not ready yet? It outputs weird shit both with his own workflow and with any attempted changes

https://files.catbox.moe/15dkl4.png
>>
>>108950748
to be fair he had a bit of a shitty timing with qwen3.5 and z-image
>>
>>108950748
Man trained a cutting edge local model for 50k
AngelBussyless burned 300k on nothing whatsoever
poetry.
Even LoadStores made Chroma etc.
Quality of trainers is going up over time!
>>
File: 1735064915544710.png (224 KB, 533x608)
>>108950748
>50k to train a model
Why is training stuff so expensive?
>>
>>108950753
>Is there anyway to get anything lodestones working
no
>>
>>108950614
Find a real and honorable job, you faggot. Stop trying to profit from gooners.
>>
>>108950748
>50k
Damn, that's pretty cheap. Anyone with a job could make their own personal Anima-equivalent?
>>
>>108950766
prices on vast.ai for GPUs unironically doubled this week because some shitcoin came out that was/is profitable i shit you not
also everyone is out of GPUs and anthropics and openais code slop pockets are deeper than our gooner pockets
though ill also say i would have thought its more than just 50k
>>
>>108950766
More than half of that was given for free because of the Comfy grant
>>
>>108950748
bro just open a patreon or something
there are plenty of paypigs who'd throw money at him for what he's doing
>>
>>108950795
unironically this. lodestone made well over 20k with his ko-fi or whatever
just add the non slop data from e621 and youre golden
>>
>>108950748
embarrassing desu
>>
>>108950527

i did not fall to wallpaperengine i will not fall to uwucards
>>
>>108948244
Why can zimage generate nipples?
>>
>>108950748
Why don't you go all the way like NAI? Or do the same thing as Illustrious, but in a more honorable way?
>>
>>108950748
Wait its unironically only like 50k to create such a model? Interesting
>>
>>108950836
50k from his own pocket, Comfy granted him the other half.
>>
>>108950614
>What a piece of shit
Honestly, why do you say this? idgi
>>
>>108950844
he's selling a model made with stolen art and yet he acts like he has some sort of moral point
>>
>>108950842
That's still interesting then, i thought it was much more expensive
Wondering if tdrussell and the other guys kept some kind of blog of their findings when training their models
>>
>>108950844
>>108950855
Also his model can do (anime) child pornography
>>
>>108950855
>stolen art
>you wouldnt download a picture
you still care about that?
>>
>>108950855
>he's selling a model
Huh? But i didn't spend a dime to use his model. That's weird
>>
Complaining about stolen art is luddite coded
>>
File: 07547-967078330.png (492 KB, 512x560)
what is happening? why are Anima users having a melt down?
>>
i for one am happy for every artist that starves or every 3rd world child dies of thirst because my 4090 chugs up their last drip of water and makes me not pay for their 4th rate gooner slop
>>
>>108950855
just download the weights yourself from the HF repo its free
>>108950879
its not very convincing to pretend like the opposite of something is happening which some anons seem to do frequently
>>
>>108950879
anima is claiming ownership of all the loras you train for it
you aren't even able to make them exclusive to your patreon because of the license
>>
>>108950237
>>
>>108950748
Locals don't deserve you. Remove Anima from Hugging Face, make it subscription based, and forget about money problems. Don't give locals an inch. They will ask you for more and won't give anything in return.
>>
Russel is sitting in his Lamborghini right now reading this thread and laughing at you...
>>
>>108950906
With the fenecc fox sucking his dick
>>
so its another nothing burger then? shame
>>
>>108950889
very based
>>
File: 1767406007300115.png (382 KB, 522x711)
I love genning but I make sure to hide any AI tags on every site I browse. It's like smelling your own farts vs other people's.
>>
>>108950748
>I need the money otherwise you wont get Anima 2
Ani was right
>>
>>108950889
>anima is claiming
but its a file on my computer it cant talk or do anything on its own
>>
>>108950953
So much this
>>
>>108950953
Ani was right about everything. Fuck comfyui and fuck russ
>>
>>108950889
>anima is claiming ownership of all the loras you train for it
NVIDIA right? Not Anima. And perhaps it's a good thing. It keeps the loras free for everyone.
>>108950888
wat?
>>
>>108950753
Radiance is mothballed. Zeta Chroma is the only project still training.
>>
>>108950966
no it comes from the anima side. Nvidia can change the licence if they want but that hasn't happened (yet)
>>
We should apologize to Ani
>>
Sorry, I'm still gunna use Anima... Ugh, I know, I know, sorry, but I'm still gunna use it!
>>
>>108950748
So basically the TLDR is that he spent more money than he expected because he trained on flawed architecture, and now he expects us, the users, to cover that cost so he can build another version of Anima using a better model?
>>
File: uwssad.png (1.24 MB, 1280x768)
>>
Can we go back to the "stolen art" thing instead? That was funny
>>
>>108950998
yes and cumfart technically does the same thing
>>
>>108951010
I was thinking about that the other day when I saw a image on gelbooru with visible anti-AI glaze thing kek
>>
>>108950998
its almost as if nobody in this space including us has any idea wtf theyre doing and we get better by trial and error
the one red flag here is, we have yet to see any model of this kind whose sequel didnt end up being a complete shitshow
>>
>>108951039
literally ponydev v2 all over again
>>
>>108950614
from what i understand any output created by anima or loras from it can be used commercially, you can get donos too, its just paywalling loras and limiting where they are hosted that is forbidden.

so whats the issue?
>>
>>108948989
not bad at all anon
>will likely post it somewhere in the coming days
i would love to try it
>>
>>108951007
this one is exceptionally pretty. I like the contrast of the orange to the more desaturated landscape.
When I get a chance I might make another "MAINTAiN thread Quality" video from it.
>>
>>108951059
Honest and hardworking people like Sarah Peterson can't make a living creating exclusive content.
>>
>>108951059
>so whats the issue?
Anon is upset that russel is getting attention for training a model while he is not
>>
>>108951071
You've always been on the wrong side of history catjak
>>
>>108951059
>paywalling loras is forbidden
*without getting the commercial license

which i assume isnt gonna be that expensive, i mean its free for anima creator to make a deal with you and if you are already about to profit from his model then sharing some profit isnt a problem.
>>
File: 3efu8p.png (605 KB, 1280x768)
>>108951064
thx. it's public domain so do whatev.
>>
>>108951085
it's $500 for the first 10000 images then $0.01 per gen after. I dunno how you enforce that on a local lora
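
For scale, here is what those numbers work out to. A quick sketch that takes the figures quoted above at face value (they are not checked against the actual license text); the function name is made up.

```python
def license_cost_usd(num_images: int) -> float:
    """Cost under the pricing quoted in the post above (unverified figures):
    a flat $500 covering the first 10,000 images, then $0.01 per image.
    """
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    flat_fee_cents = 50_000      # $500 flat fee
    included_images = 10_000     # images covered by the flat fee
    per_image_cents = 1          # $0.01 per additional image
    extra_images = max(0, num_images - included_images)
    # Work in integer cents to avoid floating-point drift, convert at the end.
    return (flat_fee_cents + extra_images * per_image_cents) / 100


print(license_cost_usd(10_000))  # 500.0
print(license_cost_usd(25_000))  # 650.0
```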
>>
File: 1759075099818875.png (3 KB, 255x31)
Reminder that /ldg/ is an anti Sarah Jeeterson general.
>>
>>108951039
>we have yet to see any model of this kind whose sequel didnt end up being a complete shitshow
Why does it seem impossible for one-man teams to follow up their initial success? It's so sad :(
>>
File: Flux2-Klein_00509_.jpg (455 KB, 1104x1408)
>>108949044
>>
>>108951086
>it's public domain so do whatev.
now that's a good attitude to have. ;)
thanks though
>>
>>108951099
he said it in the first line. nobody knows what they are doing
>>
>>108949903
Artist? That's a sweet style.
>>
>>108951099
they spend their trust fund thinking they'll recoup their losses later. they won't.
Welcome to the dog-eat-dog world of diffusion arts.
>>
>>108950748
Probably NAI or PixAI will hire him like they did with Laxhar
>>
im waiting to see what lodestone is cooking
>>
>>108950998
This is a good thing. I'd hate to hear "he did everything perfectly, this is as good as it gets for local".
I'm glad to hear "yes things can get way, way better". Thanks, that's reassuring.
>>
>>108951143
how much money do you think comfyorg burns on dumb bullshit?
>>
>>108951107
would
>>
>>108951158
>>108951138
I like how the paid Anima shills try to change the subject when their beloved Turd Russell gets exposed.
>>
>>108951158
good one
>>
File: Flux2-Klein_00533_.jpg (403 KB, 1248x1248)
>>
File: TrueKleinV2_00463_.png (1.85 MB, 1424x944)
alright lads
>>
>>108951210
All right!
>>
>>108951166
>how much money do you think comfyorg burns on dumb bullshit?
Idk. But I think comfyorg might be on to something sustainable. He was at the right place at the right time. And I have to hand it to him he's got vision, and he's taking bold action to try and get a hold in a very uncertain market. He's uniquely positioned in all of this. I wish him all the best tbqh.
>>
If i'm trying to keep a specific face from pictures, is textual inversion still the way or is lora better?
>>
>>108951254
textual inversion is in no way capable of doing that, has to be lora
>>
My prompts are textual perversion.
>>
>>108951249
being an API wrapper first and putting everything into that as the only revenue is a bit sketchy
>>
>>108951225
poor bob :(
>>
>>108951249
have they ever reported revenue?
>>
>>108951286
>bit sketchy
bit risky. perhaps. but that's business. nothing ventured, nothing gained.
>>
https://huggingface.co/spaces/webml-community/bonsai-image-webgpu

Does this count as local?
>>
File: 1779059379290.gif (776 KB, 168x128)
How powerful would a model made by an autistic furry transsexual brony sonic fan be?
>>
has the apache2 anima project released any progress images yet? it was announced a while ago and im wondering how far along its come
>>
>>108948244
I just noticed that some diffusion models have a fixed resolution.
Is there research on (ab)using models to generate larger images than they were designed for, at a cost of visual fidelity? I think it'd be really cool to take a 512x512 model and make like, 4096x4096 or so images with it. I like bringing artefacts to the fore, makes systems more transparent.
>>
>>108949115
>>108949115
>Because those of us that use to got hated by the mentally ill faggots that only ever post their cartoon slop. Just look at the articles section over at civitai, its full of these mentally ill freaks, they've infected everything with their filth.

Genuinely dont get what you mean. You got bullied out of posting 1girls?
>>
>wake up
>read ai news
>no local seedance 2.0
Well I guess thats it for today, now off to gooning for 16 hours to pass the time again.
>>
>>108950368
Assuming you are trying to gen realism, dmd2 lora 6 steps, portrait orientation, start prompts with something like "sfw flickr high quality professional photograph," Possibly "bokeh" can help, but also has a possibility of just adding meaningless fireflies everywhere. "perspective, foreshortening" may help. Make sure you have the ModelSamplingSD3 node connected (I always leave it on 1.0). I don't recommend the PAG node unlike what the model creator says. But also, it's SDXL so there will always be issues. I think anima has obsoleted SDXL and I hope we get a halfway decent realism finetune soon, I don't think it will take much to eclipse this shit completely.
>>
I've trained models and I'm realizing that my generations' colors are duller than my dataset's. They look like plaster colors. My images are greyish and darker than the dataset, which is more vivid and colorful. I don't understand why, because my dataset doesn't have these color palettes.

The skin tones are greyer and less shiny, there is less light and overall the images have a darker color palette.

Do you know why this happens? Has this happened to you too?
>>
>>108951506
https://civitai.com/articles/18193/hey-diffusion-models-your-colors-are-wrong
>>
>>108951210
lmao
>>
File: 1764341432047660.jpg (150 KB, 1599x531)
>>108951517
Holy fuck anon yeah, this is exactly my problem, this is a very good example
>>
>>108951542
>>108951506
>>108951517
I'm retarded, what does this mean?
>>
>>108950795
+1, i would donate a bit to tdrussell. anima is pretty good and we need more like it
>>
>>108950614
tdrussell is right desu
>>
>>108951555
tl;dr: most trainers do not properly support color if it's not in sRGB, you need to convert your pics to sRGB before training or you will get fucked up colors.
>>
>>108951555
basically if you set the color profile on your output images right they will look good, that's the quick hack. it's like rendering HDR images in a program that doesn't understand HDR, same kind of bug, colors come out wrong.

but really your input pipeline is the cause and you should make all inputs in one color space and then also output in that same color space.
(multi-colorspace is probably also possible and even better than either, but if you can't do A or B then you won't ever get around to this much more refined detail work.)
>>
cozy
>>
What if you're not the right color to begin with?
>>
>>108951577
>>108951604
So I have to convert all of my images' color profiles to sRGB and then train again? Fuuuuck

What about already trained models, anything I can do?
>>
>>108951503
Tried with and without dmd. It feels super finicky and erratic and strange overall, changing a single word can have totally unexpected consequences and radically change the whole output, it feels interesting on a technical level and to play around with but not for genning. Idk it kinda feels like the model is at its limits and overwhelmed by the information crammed into it and bursting at the seams
>>
>>108951635
>What about already trained models, anything I can do?
train again
KEK
>>
File: x.png (1.97 MB, 1400x704)
>>
i've noticed qwen is really bad at using multiple loras at once. is there anything that helps, like different settings or samplers?
>>
>>108951635
option A in my post, already told you that.
just embed a color profile that mostly fixes the colors.
>>
>>108951663
so, you need a high-rank adaptation to make your low-rank adaptations work?
>>
>>108951663
it probably doesn't help that people are training them with bad settings, but using multiple loras together doesn't really work correctly anyway, you need this https://arxiv.org/abs/2311.13600 https://arxiv.org/abs/2412.04465
>>
>>108951657
beautiful. looks like a post-card from Spore: the high res remake
>>
>>108951708
the dino turd fossils
>>
>>108951676
I'm checking my dataset images and they are all sRGB except a couple of untagged ones. I don't understand why they're a bit grey then.
>>
>>108951708
thx.
>>108951751
i'll fix the raptor craptor.
>>
>>108951771
My models are base trained anima, I'm both using character loras and style loras, and I'm using forge if that info is useful.
>>
>>108951693
Can you do a tl;dr? Mixing loras is bad? I've been mixing loras, am I doing it wrong?
>>
All Anima 2 needs is a bump in text encoder size and some well captioned real porn
>>
>>108951771
can't help you with live debugging, ask someone else.
>>
>>108951771
Check your prompt then and any other loras you're using. I remember this lora https://civitai.com/models/2583128/anima-rl made stuff look kind of dark and less vivid, could be prompts you're using that do the same thing.
>>
>>108951771
at least check that your output images have the same profile then.
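
A rough way to do that check for PNGs, stdlib only. This is just a sketch: `png_color_chunks` and `make_chunk` are made-up helper names, and it only reads the PNG color chunks (`iCCP`, `sRGB`, `gAMA`, `cHRM`). JPEGs keep their ICC profile in APP2 segments, which this does not look at. An empty result means the file is untagged, so whatever opens it has to guess the color space.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
COLOR_CHUNKS = (b"iCCP", b"sRGB", b"gAMA", b"cHRM")


def png_color_chunks(data: bytes) -> list:
    """Return the names of color-related chunks found before the image data."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = []
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk starts with a 4-byte big-endian length and a 4-byte type.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        if ctype in COLOR_CHUNKS:
            found.append(ctype.decode("ascii"))
        if ctype in (b"IDAT", b"IEND"):
            break  # color chunks have to come before the first IDAT
        pos += 8 + length + 4  # header + payload + CRC
    return found


def make_chunk(ctype: bytes, payload: bytes) -> bytes:
    """Build one PNG chunk (hypothetical helper, only used for the demo)."""
    crc = zlib.crc32(ctype + payload) & 0xFFFFFFFF
    return struct.pack(">I", len(payload)) + ctype + payload + struct.pack(">I", crc)


# Demo: two 1x1 RGB PNGs built in memory, one untagged and one tagged as sRGB.
ihdr = make_chunk(b"IHDR", struct.pack(">IIBBBBB", 1, 1, 8, 2, 0, 0, 0))
idat = make_chunk(b"IDAT", zlib.compress(b"\x00\x00\x00\x00"))
iend = make_chunk(b"IEND", b"")
untagged = PNG_SIGNATURE + ihdr + idat + iend
tagged = PNG_SIGNATURE + ihdr + make_chunk(b"sRGB", b"\x00") + idat + iend

print(png_color_chunks(untagged))
print(png_color_chunks(tagged))
```

To audit a folder, read each file's bytes and pass them in, once for the dataset and once for the gens, then compare what comes back. If the two sides disagree, that mismatch is a likely suspect.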
>>
>>108951785
>Mixing loras is bad ?
yes, but if it works it works, if it doesn't it wasn't supposed to anyway
>I've been mixing loras, am I doing it wrong?
If it works no, if it doesn't you can try the ziplora thing to merge loras in a way that makes them conflict less
>>
File: file.png (42 KB, 1757x367)
is this normal? I use default settings, 14B WAN2.1, 16GB RAM, 12GB VRAM.
>>
>>108951826
yes. we all wait 7h between gens. it helps retain your cum
>>
>>108951826
without sage attention or any of the other speedups, yeah it takes a long time.
>>
File: file.png (84 KB, 2012x125)
>>108951835
It already uses sage attention.
>>
>>108951836
use the distilled lora and lower the cfg
>>
File: file.png (412 KB, 1855x1932)
>>108951840
are these settings OK? still default, all I changed was the inference steps
>>
>>108951815
They are untagged. I changed them to sRGB and the color is still the same
>>
>>108951845
the Lora is cfg 1 and 4 steps minimum. search it up, then add it in the Lora loader with like 8 steps so it's not complete shit
>>
File: 1779579074398347.jpg (1.89 MB, 3333x3317)
>>108951447
pls respond, someone
>>108951846
I'd say it's over if it were my project...
you will have to make a custom ICC profile to color correct them if you want results asap, and still debug and fix your model.
>>
Btw training a lora with the exact same parameters, dataset and tags should give you the exact same Lora right? Or are there any random values you don't control?
>>
>>108951850
this lora?
https://civitai.com/models/1786270/dmd2-lighting8stepcfg15
is there a rentry I can read through? all the ones in OP are severely out of date (2025)
>>
>>108951447
You can expand or cut the image or use simple background, white background to generate extra space
>>
>>108951826
I mean I got good results with default settings, 21 inference steps, but zamn 7 hours?
>>
>>108951855
if not the exact same, an extremely close one. if they are very different, something significant changed
>>
>>108951826
>16GB RAM
oh no no no no no no
>8h
and you just know bro is swapping to his one and only system sata 300mb/s max ssd
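
Back-of-envelope for why this swaps. This is only a sketch: it counts raw weights and nothing else (text encoder, VAE, activations and the OS all come on top), and it treats the "12GB" and "16GB" from the post as GiB, which is an assumption. `weights_gib` is a made-up helper name.

```python
GIB = 1024 ** 3


def weights_gib(params: float, bytes_per_param: float) -> float:
    """Size of the raw weights in GiB, ignoring everything else."""
    return params * bytes_per_param / GIB


PARAMS = 14e9   # "14B" from the post
VRAM_GIB = 12   # assumption: the post's "12GB" treated as GiB
RAM_GIB = 16    # assumption: the post's "16GB" treated as GiB

for label, nbytes in (("fp16/bf16", 2), ("fp8", 1)):
    need = weights_gib(PARAMS, nbytes)
    if need <= VRAM_GIB:
        where = "fits in VRAM"
    elif need <= VRAM_GIB + RAM_GIB:
        where = "needs offload to system RAM"
    else:
        where = "spills to disk"
    print(f"{label}: {need:.1f} GiB of weights, {where}")
```

At 2 bytes per parameter the weights alone come to roughly 26 GiB, which is nearly all of VRAM plus RAM combined. Once the OS and everything else take their share, disk swapping is a plausible explanation for multi-hour gens.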
>>
>>108951855
the GPU can run things in a different order even with everything else being equal, so the result may not be bit-identical, but it shouldn't make a noticeable difference
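
Toy illustration of both halves of that, in plain Python with no real trainer involved. `fake_training_run` is a made-up stand-in for a training loop. The first part shows that a fixed seed reproduces the same "random" choices, the second shows that floating-point sums depend on the order of the additions, which is where the tiny run-to-run differences on a GPU can come from.

```python
import random


def fake_training_run(seed: int, steps: int = 5) -> list:
    """Stand-in for a training loop: all randomness comes from one seeded RNG."""
    rng = random.Random(seed)
    weight = 0.0
    history = []
    for _ in range(steps):
        weight += rng.gauss(0.0, 1.0) * 0.01  # pretend noisy gradient step
        history.append(weight)
    return history


same_a = fake_training_run(seed=42)
same_b = fake_training_run(seed=42)
other = fake_training_run(seed=43)
print(same_a == same_b)  # same seed, same sequence of updates
print(same_a == other)   # different seed, different sequence

# Floating-point addition is not associative, so summation order matters.
left = (0.1 + 0.2) + 0.3
right = 0.1 + (0.2 + 0.3)
print(left == right, abs(left - right))
```

So if your trainer exposes a seed, pin it, and expect "extremely close" rather than byte-identical files.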
>>
>>108951868
I put my goon material on external SSD, yes. RAM is soldered, I run this on tablet PC with external graphics card.
>>
>>108951858
that is for xl. you want https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_HIGH_fp16.safetensors
>>
>>108951865
that makes sense but how would that guarantee that the tiles actually fit together?
I was thinking more about a solution somewhere right in the inference layer, not pre- or post-processing.
hence "research paper", I haven't heard of this existing yet and it's a bit of an obscure feature to want (deliberately shitty "enhance enhance enhance" without a second model).
>>
>>108951875
Yknow what, feels like I'm going to rebuild my abandoned PC back in covid era. fuck.
>>
>>108951875
at that point it would be faster to get a simple job and work for a month than wait for a gen.
>>
>>108948816
>someone saved my pic
t-thanks
>>
>>108951875
>I put my goon material on external SSD
So did Epstein.
>>
>>108951893
Yea, I'm considering rebuilding my old PC right now. It still has a nice 32GB DDR4 stick, but the motherboard is fucked and the CPU is ancient i5 crap.
>>108951897
Wow I didn't know that.
>>
>>108951885
Dunno I've never had the need for larger images, I usually use upscaling models
>>
>>108951854
https://github.com/Merserk/ComfyUI-PiD
it takes a lot of VRAM but you can get 2K or 4K images with it. You can download it with comfy manager, but you also need to navigate to its custom nodes folder and install the requirements - see the github readme for instructions
apart from that you really should google and research hires-fix if you're not already familiar with it.
Waldo is there?
>>
>>108951883
Thanks I will take a look
>>
>>108951930
>>
File: 1.jpg (80 KB, 560x350)
>>108951751
Spore had some pretty planets.
>>
>>108951906
yeah so that's not an approach that interests me, I want to push a lil tiny model further at the inference layer in a fun way to learn more about the tech.
>>
>>108951908
thanks, that may be a step in the right direction. I want it to get real fucking silly though, like 512^2 to 1M^2.
>>108951947
I really preferred the first stage (amoebae and so on) visually, the rest of the game was a little disappointing after that.
>>
>>108951657
nigger you on fire. i save a lot of your shit btw.
>>
>>108951966
I like the space stage. The flight mechanics felt really good. I loved warping in and out of the atmosphere with the scroll wheel.
>>
>>108951988
I get that. I've dev'd space games before so it looked same-ish to me, but I also recall it having really smooth mechanics.
>>
>>108951953
The problem is that generating a bigger image requires more computational power, dunno how much you need for 4000x4000
>>
>>108952035
More power in total, but if it can be generated piecemeal it's CSI zoom for gennies. Even cooler if it's consistent between zoom levels.
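
The piecemeal part can at least be planned like this. It's a sketch with made-up names (`tile_starts`, `tile_boxes`) and arbitrary defaults (1024 px tiles, 128 px overlap). It only lays out overlapping tiles over the canvas. Making the tiles actually agree with each other (blending in the overlap, sharing context between tiles, consistency across zoom levels) is the hard part and is not shown here.

```python
def tile_starts(size: int, tile: int, overlap: int) -> list:
    """Start offsets of overlapping tiles covering [0, size) along one axis."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be in [0, tile)")
    if size <= tile:
        return [0]
    stride = tile - overlap
    starts = list(range(0, size - tile, stride))
    starts.append(size - tile)  # last tile sits flush with the far edge
    return starts


def tile_boxes(width: int, height: int, tile: int = 1024, overlap: int = 128) -> list:
    """(left, top, right, bottom) boxes in row-major order, clamped to the canvas."""
    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in tile_starts(height, tile, overlap)
        for x in tile_starts(width, tile, overlap)
    ]


boxes = tile_boxes(4000, 4000)
print(len(boxes), boxes[0], boxes[-1])
```

Each tile stays at a size the model can handle, so per-tile memory stays flat while total compute grows with the number of tiles.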
>>
>>108948244
I haven't run difffusion models since 2023.
What is the goto way to take clothes off of real photos of women in 2026? Is it still inpainting?
If yes, is with comfyui? And what model for realism?
Is there a specific workflow?
pls respond (I have autism)


