/g/ - Technology
File: collage.jpg (3.44 MB, 5000x3145)
3.44 MB JPG
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109157211

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: image.jpg (609 KB, 1792x1344)
609 KB JPG
>>109161087
I'm not an anime fag
>>109161031
>>109161069
https://civitai.red/images/135210245
same deal
>>
2026 sucks, the only decent model we had was Flux.2 Klein
>>
File: ComfyUI_00104_.png (3.77 MB, 1347x1888)
3.77 MB PNG
This.
>>
File: krea2_00135_ .jpg (249 KB, 1600x1920)
249 KB JPG
>>109158678
its peak
>>
I meant to say
>>109161136
^this
>>
File: ComfyUI_00095_.png (2.5 MB, 1499x1199)
2.5 MB PNG
>>109161136
^
This.
>>
File: image(1).jpg (827 KB, 1792x1344)
827 KB JPG
>>109161121
Last test. Left: SDXL. Right: Anima.
>>
File: 00006-3203376286.png (1.49 MB, 1536x1536)
1.49 MB PNG
>>
File: 1764679340320763.png (1.1 MB, 578x1024)
1.1 MB PNG
Krea2 is probably the easiest it's ever been to make comics/manhwa/manga etc. Especially with some of those complicated compositions where one scene blends into the next one - something even cloud models fail to do.

The text generation is lackluster but honestly you can just have the model generate the text boxes and then composite the text yourself.
>>
>>109161179
dumb tests
>>
>>109161179
https://civitai.red/images/135210265
prompt and settings
>>
>>109161121
>I'm not an anime fag
Then don't fucking use anime as a comparison if you can't tell whether it generates the characters correctly.
>>
>>109161184
You have a point. To be fair, I should at least be generous to SDXL and pick better checkpoints. The truth is, I'm an Anima fanboy, so I deliberately picked some of the worst CivitAI shitmerges, I practically selected them without even looking because I want Anima to win.
>>
File: 1769247704197017.png (226 KB, 371x371)
226 KB PNG
>>109161136
Local has been thriving. It's cloud models that have stagnated into complete shitty uselessness.
That includes LLMs too unless you're a vibecoding grifter. Gemma 4 has been more useful for me than shit ass fucking Claude
>>
>>109161196
Yeah, but like I said during that test, it never really mattered much to me anyway. Character knowledge is arbitrary in the grand scheme of things. What matters is quality, and in that regard, I like Anima more.
>>
>>109161136
https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V3
New and improved model just dropped.
>>
Interesting that 7 months after Anima’s release, there still hasn’t been a comparison with SDXL, while Krea, Klein, and Z Image were compared from day one.
>>
File: 1771801968096198.jpg (1.27 MB, 3840x1376)
1.27 MB JPG
>>109161273
>https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V3
Interesting
>>
>>109161273
>no int8
yawn
>>
>>109161278
Anima is shit at sex positions which is the entire point of the vramlet slop models. Russ cannot into datasets or use a proper fucking captioning tool. fucking Gemma? seriously?!?!
>>
File: ComfyUI_temp_bkmie_00044_.png (1.76 MB, 1024x1024)
1.76 MB PNG
loving krea so far.
>>
>>109161292
>Gemma
kek fucking retard. he only used it for the example style lora
>>
File: img_00580_.jpg (336 KB, 984x1264)
336 KB JPG
>>109161282
convrot version pls
>>
>>109161317
he used it for the captioning
>>
File: kek.png (367 KB, 800x452)
367 KB PNG
>>109161318
I like how the "wen gguf" meme will become "wen convrot" from now on
>>
>>109161328
for the captioning of what
>>
>>109161334
Anima previews and base
>>
File: ComfyUI_temp_amqvn_00144_.jpg (436 KB, 1728x1152)
436 KB JPG
>>
>>109161333
kind of. a lot of people still need gguf because int8 is still too big or dynamic vram is fucking them over
>>
>convrot
someone explain this meme to me
>>
>>109161341
anima's training began long before the release of gemma 4 fucking retard
>>
>>109161328
>>109161292
Yeah. No wonder Anima can't understand proper positioning. Vision AI is shit at describing anime pictures. Especially anything related to sex.
>>
File: img_00588_.jpg (457 KB, 984x1264)
457 KB JPG
>>
File: 1754480267106556.jpg (2.86 MB, 5726x3072)
2.86 MB JPG
>>109161350
2x faster than fp8, better quality, it's a miracle
>>
>>109161365
>rtx3090
yawn
what are the numbers on ada and blackwell?
>>
>>109161365
is it faster than fp16?
>>
>>109161378
fp16 and fp8 have similar speed, so yeah
>>
>>109161377
from what I found, convrot is specifically for the 3090 because it doesn't have fp8 "support"; since the 4090 has it, convrot doesn't do anything for it
>>
>>109161384
alright, so it's a boost for poorfags
sad and gay
>>
>>109161389
aww cheer up buddy, you still have fp4
>>
>>109161389
>it's a boost for poorfags
so it's a boost for 95% of users, who the fuck has a 4090 or a 5090 nowadays?
>>
File: kekekekeek.png (354 KB, 500x500)
354 KB PNG
>>109161389
you're bitter because older cards can now have similar speed to your 3000 dollar card, no refund btw
>>
>>109161384
>>109161389
it's also giving a speed boost to more recent cards (albeit a more modest one), and since the quality of int8-convrot is also better than fp8, there's still no reason to keep using fp8 from now on
>>
>>109161419
alright already, I'm sold
>>
>>109161365
just make a int6, int5 and int4 and gguf are dead, Comfy was right to not support it after all
>>
>>109161434
not how that works
>>
>>109161434
comfy didn't invent convrot retard and gguf is still superior for llms
>>
File: ComfyUI_00004_.png (2.46 MB, 1536x1024)
2.46 MB PNG
>>109161282
Thanks for sharing, very interesting.

btw, happy monday /ldg/ any new cope of the week?
>>
>>109161453
>comfy didn't invent convrot
I never said he did, are you mentally challenged?
>gguf is still superior for llms
who gives a fuck about llms this is a diffusion model general, oh god you're so fucking dumb
>>
>>109161434
Yannik fluoride stare
>>
File: 00010-1571398177.png (2.17 MB, 1344x1728)
2.17 MB PNG
>>
>>109161282
The loss of quality is minimal in relation to speed. Thank you for sharing, kind anon. One small step at a time is all we need. ^^
>>
File: ZiT still da goat.png (1.12 MB, 643x1284)
1.12 MB PNG
>>109161454
yup thats me, you're probably wondering how i got here
>>
File: ComfyUI_temp_amqvn_00148_.jpg (468 KB, 1728x1152)
468 KB JPG
>>109161342
next one is in the oven
>>
>>109161454
>>109161472
bot
>>
>>109161384
No, it boosts 40xx and 50xx as well, to the tune of 20-30% depending on resolution
>>
>>109161473
ZiT is still great for realism, and it's easy training loras for people and have them come out great, NSFW training is much harder but you can still get good results, biggest problem is that once you try to use more than two loras at the same time, the quality goes to crap.

Krea 2 is almost as good at training persons, better at artstyles, MUCH better at NSFW and it can handle more than two loras without any problems
>>
>>109161562
unfortunate then cuz I can't get that shit to work, made black images here, will look around/update later and try again
>>
>>109161592
i hope you're lying
>>
>>109161599
You are most likely using the wrong model file, use:

Krea2_Turbo_convrot_int8mixed.safetensors from here: https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8/tree/main
>>
File: new.jpg (2.92 MB, 1504x2000)
2.92 MB JPG
>>109161273
>>109161282
>>109161472
mxfp8
>>
File: das right.png (114 KB, 640x640)
114 KB PNG
>>109161282
no int8 no try
>>
gguf status?
>>
>>109161282
>why... so... buttchin...?
>>
>>109161622
why use this over silveroxide
>>
File deleted.
>>109161604
The only lies I make are those towards the unpaid janitors. I had 52 days of sheer contempt and no distractions to make ammo/AI gens, even with a shitty 3090 laptop in the field and at sea.
>>109161454
May I ask what is the work flow for this or catbox? I personally hate "Jacks with a passion and anything to do with sharty but this looks like it has promising use.
>>
>>109161679
why not, if it works it works, I don't give a fuck who's making it
>>
>>109161434
CUTIE ALERT
>>
>>109161686
you really should look up how vaginas work at some point catpissjulien
>>
>>109161636
is it with an image reference, or is it just a simple text2image? looks good either way
>>
>>109161592
>>109161686
Did the blog factory explode
>>
File: 1774519468358841.png (3.88 MB, 1450x1700)
3.88 MB PNG
>>109161282
one of the biggest issues with this model is prompt adherence and this feels a lot better but I dunno
>>
File: ComfyUI_01649_.jpg (3.58 MB, 3456x2688)
3.58 MB JPG
Whatever happened to Ani is he still here?
>>
>>109161282
Klein 9b is the reason why I don't take licences seriously, BFL's licence is terrible yet you have two big finetune projects on that model, it just shows that when a base model is good and has potential, people will go for it, regardless of everything else
https://huggingface.co/fancyfeast/bigasp-3
>>
>>109161754
No clue
>>
File: 9849945581.jpg (772 KB, 2688x1344)
772 KB JPG
int8convrot vs fp8
both from https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8/tree/main
er sde 16 steps beta, wtf?
>>
>>109161773
you need to show bf16 too to see which one is the closest to it
>>
File: 1763508946676222.jpg (611 KB, 1328x1744)
611 KB JPG
soulful zitslop
>>
File: 035806.jpg (748 KB, 2688x1344)
748 KB JPG
>>109161781
I don't have that, reduced to 8 steps (I remember klein turbo also fried if you went too much over the turbo number of steps), now convrot just looks more detailed
>>
>>109161806
>convrot just looks more detailed
I guess that's good
>>
>>109161282
I think bigasp has more potential since it can do coom
>>
>>109161773
>>109161806
I am noticing the residual noise in the convrot more than the others but the others smooth everything out. probably not the best sampler/scheduler combo or maybe not enough steps
>>
Is the janitor x tohsaka guy like targeting a guy who waifus tohsaka or whats happening here.
>>
>>109161832
>I am noticing the residual noise in the convrot more than the others
for Krea 2 it's good since that model tends to make everything look like some studio photoshopped edits
>>
>>109161679
I don't know what silveroxide's version is, but this one works with Comfy int8, that I know
>>
>>109161827
Potential to be banned since BFL can revoke his right to distribute the model if they don't like what it does, as per the Klein 9B license
>>
>>109161708
t2i
>>
File: 1759111161491318.png (1.34 MB, 1000x600)
1.34 MB PNG
>>109161861
he took the risk, we'll see if he'll get rewarded for it or not
>>
>>109161865
can you try with an image reference and see if it keeps the people's face better or not
>>
>>109161843
I think it's some kinda humiliation fetish, correct me if I'm wrong.
I mean if you hate the jannies that much why would post pictures of them fucking girls? Right?
>>
>>109161861
which is why I earlier said he's a fucking moron for not just training anima

if you're training a model with a non commercial license anyway, then compared to klein 9b, anima has:
no rugpull terms in the license
comes with full booru knowledge and nsfw
smaller and faster
surprisingly good realism already using some shitty loras + prompt wrangling

imagine what happens if anima gets a few million high res photos trained into it
>>
File: 1781575153094056.png (111 KB, 498x204)
111 KB PNG
>>109161937
tranima only has anime knowledge, is a fucking 2b model and has fucking Qwen vae, and you think this will be enough to get good coom realism? he's right for going for Klein 9b, he's not doing it for the money...
>>
>>109161937
>imagine what happens if anima gets a few million high res photos trained into it
we don't have to. The catastrophic forgetting meme turned out to be real
>>
>>109161937
just use cosmos predict 2 if you like realism anima that much
>>
>>109161951
Did you ever find proof for this or are you still memeing
>>
>>109161937
you lost turd russel
>>
>>109161937
I agree that it's a really weird choice since all that time and money spent on training can be worth nothing should BFL decide that they don't like the type of images being generated by a model that is a derivative of theirs, particularly since they are oh-so gung-ho about SAFETY.

I mean yes, it's his time and money to spend as he see fit, but it seems stupid...
>>
File: ZIT_INT8_CONVROT_00011_.jpg (522 KB, 1152x1760)
522 KB JPG
>>
>>109161951
>catastrophic forgetting
how was anima trained to know as much as it does in the first place, if it just forgets everything when you try to train it? nobody can answer this question. it's almost like people are simply frying it with dogshit sdxl training settings and then blaming the model
>>
>>109161960
>Did you ever find proof for this
no he did not
>>
>>109161960
he forgot where
>>
>>109161754
i was thinking about visiting him in carla street
>>
File: ZIT_INT8_CONVROT_00020_.jpg (553 KB, 1152x1760)
553 KB JPG
>>
>>109161970
not stupid at all, this gigachad is just not afraid of BFL, so far they haven't asked him to remove his model so...
>>
>>109161937
I don't think anima was released when he started tho, right?
>>
>>109162085
possibly, but for every past bigasp model he rented multiple h100 nodes and training only took like 2 weeks. if it's the same here then the training run would have started post-anima
>>
File: krea2 eceleb machine.jpg (1.07 MB, 3840x1642)
1.07 MB JPG
>>
>>109162126
>eceleb
those are all just regular celebs
>>
>>109162126
so slopped
>>
GIVE ME NVFP4!!!! FUCK ALL YOU POORFAGS!!!
>>
>>109162126
they put a safety filter just so that this fucking model can render some celeb slop, let that sink in
>>
>109162209
:skull:
blud thinks this is true
XD
>>
>>109161974
>>109162008
wood berry
>>
File: give ani your seed.jpg (126 KB, 1866x682)
126 KB JPG
which one of you is this
>>
Q8 or death
>>
>>109162236
lmao, link?
>>
>>109162236
>tagexplorer.io anon
kek he must be meming
>>
File: flowers1.jpg (472 KB, 1448x1086)
472 KB JPG
>>109162230
know your Ramona
>>
>>109162236
he didn't react to my mails so i'm trying like this kek
>>
>>109162236
>>109162251
I thought you didn't like him. But more importantly, are you gonna update tagexplorer for anima base?
>>
>>109161183
ideogram is easier both for the composition and text tho.

not saying krea2 is bad
>>
>>109162266
>I thought you didn't like him.
yeah i think he's a pedo
>>
>>109162236
Higher chance on return of investment than buying OpenAI stock though
>>
What's the goto for disabling the filter in ideogram4? Skipping one step ruins the image entirely. Not even nsfw shit, just get rid of it entirely.
>>
>>109162277
4-5 bounding boxes
>>
>>109162277
IIRC people just used a few bboxes until the censorship is ignored
>>
File: that's good.png (270 KB, 500x500)
270 KB PNG
>>109162236
https://github.com/FizzleDorf/AniStudio/issues/89
>What is the status of this project?
lmaoooooo
>>
>>109162277
a guy posted a nsfw one that only worked when you included a big enough bbox with text somewhere, like, it could even be empty
>>
>>109162236
I came across some old screenshots i had of anistudio and was reminded of how cozy it is. Shame the development pretty much halted. Just a couple QOL bugs and some other stuff needs to be ironed out desu.
>>
Stop bullying Ani, degenerates. He just wanted to make own gui and model.
>>
>>109162327
this is honestly why is is better than most faggots that post here, myself included
>>
desu if you cant handle a bit of banter then you shouldnt be a 4chan dev
>>
explain to me like i'm retarded (I am)
what benefit does raw have for inference over turbo? I understand turbo is distilled or whatever but does that drastically change quality of output?
>>
>>109162287
>>109162289
Huh, that works. It's a powerful tool, just annoying for simple images.
>>
>>109162327
>>109162333
he shitposted on this general way too hard to deserve any respect
>>
>>109162343
You are replying to him, but you certainly knew that. On another topic, what's the point of impersonating me?
>>
>>109162340
The benefit of raw is that it wasn't raped with aethetic tuning. More varied outputs than turbo.
>>
>>109162348
>impersonating me
who?
>>
>>109162340
depends on the model, in general the base model would be more capable, more adhering to the prompt overall whereas turbo step diffusion just takes the fast shortcuts

but then there are the turbo checkpoints that got RL training at the same time where the base model did not and so on.

it's a difficult thing in detail the same way as explaining how many steps on what model do what

>>109162342
agreed.
>>
>>109162273
But what about the other thing
>>
>>109161590
Krea 2 trains like a champ and takes on several loras like a cheap whore.
>>
>>109162327
He shat on his former friend for literal months
He's literal human garbage (not only because of that) and deserves all the bullying
>>
>>109162374
this
>>
File: 1782504039418349.png (335 KB, 1674x1389)
335 KB PNG
https://xcancel.com/LiangZheng_06/status/2071374479406813197#m
uh oh, pixel sissies, our answer?
>>
>>109162327
He never showed any interest or initiative in making a model until the moment Anima dropped. Then he suddenly kicked into high gear trying to spitebake a clone. And it truly was a pure spitebake attempt. If you track down some discord messages, the latest plan before it all fizzled out was to copy anima's architecture exactly, same LLM adapter and text encoder, same VAE, same everything, and retrain from the original cosmos weights just to have a different license.
>>
>>109162409
Dis nigga don't know bout the editing? HOLY LEL
>>
File: 1769211685348070.png (3.09 MB, 935x1600)
3.09 MB PNG
>>109162271
True - if you're making entire pages ideogram is definitely better, for example pic related.
But I just dont have as much success on a per generation basis with ideogram, plus it takes longer.
>>
>>109162409
does he mean he has BOTH a good vae and a main model trained faster and better than a pixel space model with all the same resolution flexibility etc?

i mean... most want the vae out of the way because it gives weird compression errors and figuring out a great vae itself also isn't easy... maybe that's an overly simple way to describe this and someone has better wording, but either way the interaction of the vae training and model training is a bit bothersome
>>
>>109162419
that's still something, but yeah
>discord drama
I hate how newfags get 0 memos on discord cancer, we're not supposed to join any discord ever, if you had seen all the bullshit that's been going on there, you'd avoid it like the plague
it's not le ebin 4chins frens hangout they want to sell you it is, it's full of setups through and through >>109162419
>>
chinese vae's just kicked my dog
>>
>>109162454
That's nice
>>
>>109162305
Is this the part where you cry about how "he" was "bullied so hard he had quit thanks to these threads"?
>>
>>109162474
>and figuring out a great vae itself also isn't easy...
it definitely isn't, the krea fags said that they spend the first 3 months trying to find the right vae and text encoder, if you can simplify the architecture by removing the vae, it'll make the life easier for everyone involved
>>
>>109162305
It's a featureless wrapper that crashes all the time anon
>>
>>109162490
No he's a pussy who can't hang i just like the concept of what he was trying to do. I already vibecoded by own UI.
>>
>>109162490
I wonder if he unironically thinks that "schizo anon" (the made up person in his head) is the reason his project failed
>>
>>109162409
>pixel sissies, our answer?
a 1 parameter model converges the fastest of all of them so, nice metric
>>
>>109162546
uhm.... that's wrongthink!
>>
File: radiance.jpg (308 KB, 1024x1536)
308 KB JPG
>>109162493
right? for now this seems much easier. idk how that guy doesn't know this despite probably being more skilled in this field overall.
>>
File: new.jpg (2.61 MB, 4538x4057)
2.61 MB JPG
>>109161887
like this?
>>
>>109161454
>any new cope of the week?
Yeah, a not so good style transfer for Krea, a not so good improvement for Klein
>>
File: 00016-2926467915.png (2.76 MB, 1248x1824)
2.76 MB PNG
>>
>>109162490
In the discord he said he's not "supporting" (= pulling sd.cpp) anima on purpose so his "users" (lol) wouldn't use it
Yes, he's that much of a faggot
>>
>>109162646
based billionaire jew mindset.
>>
>>109162305
>>109162327
>>109162333
Imagine not only being subhumanly retarded, but also a rape victim
Oh right, that's something Julien doesn't have to imagine LMFAO
>>
>You should see an update in the next 24 hours.
Prepare for this week to be full of Anima and Comfy fuding. And not the smart kind.
>>
>>109162698
>an update in the next 24 hours.
what? there will be a new version of anima or something?
>>
>>109162698
aren't they just releasing their updates? fud how
>>
>>109162703
Learn to read
>>
>>109162719
read what?
>>
File: 5678867484888.jpg (913 KB, 3714x1676)
913 KB JPG
I used ChatGPT to change her race, original on the left, is there anyway to change someones skin colour in a video but keep everything else the same? I have a 5070 ti and ComfyUI already installed but I don't really know how to use it.
>>
>>109162725
gross and disgusting,
yummy,
yummy,
oooo mama yummy she makes my fa lala go la la la
>>
>>109162703
>>an update in the next 24 hours.
do you think he'll finally fix wrap around text in the prompt boxes kek or add some "new" samplers and schedulers
>>
>>109162725
who cares - what you just made is useless shit and you could probably put it out completely deformed and it wouldn't change a thing
>>
File: 309t093094ti3t.png (130 KB, 326x419)
130 KB PNG
>>109162725
Is it possible to do it with picrel?
I had a look at the nodes and it wants a reference video and image, if I use another AI to alter her appearance in a screenshot of the first frame and use that as the reference image and the original video as the reference video, will that work?
>>
Using other models after Krea feels like absolute shit.
>>
>>109162792
I deleted anima because of this
>>
kemonotech.ai
>>
>>109162792
>Using other models after Krea feels like absolute shit.
>Krea
*Z-image turbo
>>
>>109162800
>trannytech.ai
no thanks.
>>
>>109162725
"sexy" ayys crack me up
>>
>>109162800
Do you want to get the IP of "singular schizo anon"?
Because you will never catch him lolcow
>>
File: 1754210051685050.png (548 KB, 645x678)
548 KB PNG
>>109162800
>wasting money on a .'ai' to post a shitty obviously claude vibe coded website.

baka....
>>
We at /ldg/ eagerly anticipate FurkUI
>>
>>109162837
Prompt and model?
>>
Feels fizzly here
>>
>>109162792
Ideogram still beats it for realism lora training (even NSFW) provided you have enough data. And you don't have to deal with the stubbornness and slight slop of a turbo model.
>>
>>109162862
bbox a shit on my cock; very smelly
>>
File: 00018-1774082390.png (2.96 MB, 1248x1824)
2.96 MB PNG
>>
File: 1764487473081519.png (2.29 MB, 1500x998)
2.29 MB PNG
>>109162854
https://civitai.red/models/2210210/zootopia-1-and-2-style-krea-2-il-anima?modelVersionId=3079184

>>109162862
Yeah but I dont make shitty porn addiction images so I'm good lil bro
>>
>>109162845
You will only be able to prompt 'ohwx man' and the only image it will gen is a sweaty Furkan
>>
>>109162880
What more could you possibly need
>>
wait is the ani guy the guy who made swarmUi? that shit is baller as fuck
>>
File: ZIT_INT8_CONVROT_00040_.jpg (585 KB, 1152x1760)
585 KB JPG
>>
>>109162942
Yes
>>
File: 23424u0t034g.png (186 KB, 304x371)
186 KB PNG
>>109162725
>>109162766
Is this a better choice?
>>
35 stars status?
>>
File: comfyui_00024_.png (1.26 MB, 1152x896)
1.26 MB PNG
>>
technology is amazing. this nigga made a television display an image of cheese
>>
>>109161590
>ZiT is still great for realism
In a single stock style that's very close to slop, with Asian faces that look extremely similar*

Outside of it, not so great. Plus there's no seed variety. I will never get the "Z is great for realism" meme.
>>
File: animatunetest1_00007_.jpg (67 KB, 512x512)
67 KB JPG
>>
>>109163055
let's see your krea cope screenshots
>>
File: ComfyUI_00001_.png (2.23 MB, 1088x1440)
2.23 MB PNG
Anyone else getting png transparency backgrounds with id4?
>>
>>109161773
>>109161832
Winnougan is and always has been a retard and quantized the entire damn thing without utilizing a proper layer filter.
This is a way better version: https://huggingface.co/silveroxides/K2Q/tree/main
>>
>>109163055
Also similar poses regardless of how hard you prompt. Same pose, same face, same style. Maybe would've been decent with the equivalent of a reconditioning node there to decensor its slop, but that would only expose that it can only do those poses, faces and style well.
>>
>>109163074
safety filter
>>
File: ComfyUI_00003_.png (2.12 MB, 1088x1440)
2.12 MB PNG
>mfw my date asks for a second happy meal

>>109163082
Weird filter.
>>
>>109163108
mcdonalds ceo needs to take notes on how a real man eats a borger
>>
>royally fuck up the python embedded env while updating torch without realizing shit ain't compatible yet
>decided to move to UV
I graduated ;)
>>
>>109163130
>>109163016
>>
File: ss_20260629_134800.png (84 KB, 1253x335)
84 KB PNG
>>109163134
119k, troontard
>>
>>109163073
I don't consider ZiT slop realistic. The closest I could find to its aesthetic in the wild is some heavily edited kpop images or kdrama screenshots. No DSLR or smartphone camera takes pictures or anything like it, because, why would they? It lacks dynamic range, and Z is at best pseudo-realistic with the same faced humans it's depicting.
>>
>>109163134
38 stars now bitch
>>
>>109163147
Then again, maybe if we go back to the 70s, suddenly Z images are realistic.
>>
File: 14138.jpg (349 KB, 2688x1344)
349 KB JPG
silveroxides vs Winnougan convrot
>>
Give Ani a star to show your support!
>>
>>109163075
>Winnougan is and always has been a retard
almost as retarded as naming your repo "K2Q" with no readme and no explanation whatsoever of what the files are or how to use them or why you would use one over the other. and then doing that for every single model you upload. and then crashing out on discord if anyone asks "what is this? what version am I supposed to use?"
>>
>>109163176
make it a femboy and maybe i'll look at your image
>>
forgive ani
>>
>>109163185
Yea I kind of agree with this. Like if it was a girl it would be easy of course, but a femboy is also quite easy on the eyes. These muscular guys in feminine clothing make it really difficult to analyze the image properly, because I become actively averse to the image. No offense to you posting the image of course, if that's what you like, that's what you like. Just giving my two cents as well.
>>
>>109163075
>Winnougan is and always has been a retard
but naming a repo "K2Q" isn't? give me a fucking break
>>
>>109163185
Sorry, only high T images
>>
Is there any node that allows to merge klein with ratios given block by block?
>>
>>109163074
transparency was a mistake, ideogram has a lot of shit ideas desu, but I respect that they tried something new, but c'mon, forcing bboxes isn't the way to go
>>
File: ZIT_INT8_CONVROT_00077_.jpg (425 KB, 1152x1760)
425 KB JPG
>>109163147
It's always uphill battle against zit gens. So close to greatness until you look closer.
>>
File: debo_sf_k2_uv_00001.jpg (3.15 MB, 6192x2580)
3.15 MB JPG
>>
>>109163263
Fuck off debo
>>
>>109163147
>I don't consider ZiT slop realistic.
ZiT is not perfect, but it's way closer to realism than Krea 2, and if you say otherwise you need to get your eyes checked
>>
>>109163263
Melty gen
>>
>>109163248
I think some people just forget what photos look like in the spirit of image gen. By good "realism" they mean it gens neat images with few mistakes in that style (exactly as intended, by overfitting it to the same face, same pose, same style). If you were to remind them what photos actually look like, they'd understand why Chroma is one of the greatest models ever released, but of course, they're not after actual realism, but slop, so they will shit on Chroma etc...
>>
>>109163282
Chroma was never criticized for being a slopped model, the problem with Chroma was its consistency. Z-image turbo and Krea 2 can give you coherent images 90% of the time, with Chroma it's more like 30% of the time. It's a frustrating model to play with, it has the highest highs, but also really low lows
>>
>>109163248
>>109163176
what is convrot? a new cope quant?
>>
>>109163300
>what is convrot?
probably the best thing we got this year >>109161365
>>
>>109163266
Of course. It gives you neat images in what the model thinks is realism. Still not more coherent than Krea though, you can't even properly do a flip or place a model upside down while controlling the position of their feet with Z because the devs didn't intend you to.
>>
ZiT was only "good" at realism when it first released and maybe a couple weeks after. Its distilled nature made it very boring very quick desu. Base did not have this problem.
>>
>>109163308
Alright, now give me the bitter pill. This is local, so you always end up paying it back double or triple in some other cost, right?
So tell me: do you have to retrain the models from scratch? Does it not work with any LoRA at all? Does it load the whole model into RAM? Does it hammer your SSD with extra writes and shorten its lifespan? Does it need some janky dependency that only runs on one specific, opinionated version of Linux?
>>
Ideogram can do 4k, I think that's neat
>>
>>109163308
Does it work with stable-confusion.cpp or is it mutually exclusive with what ggufs do?
>>
>>109163357
no bitter pill anon, it's easy and fast to convert, it can be loaded with the normal comfyui loader, it's as big as fp8 while having a better quality and is 2x faster, this convrot method (thank you google for inventing that) was already a big deal on the LLM space, it was used to quant the KV cache, I didn't expect it to be a hit on the diffusion models as well but here it is
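
For anons wondering why a rotation would help int8 at all, here is a minimal toy sketch. It assumes (the thread never spells this out) that the "rot" in convrot means rotating the weights with an orthogonal matrix before rounding, as rotation-based quantization schemes do. The Hadamard matrix, the 4-element weight vector, and every function name below are illustrative assumptions, not the actual convrot code.

```python
import math

# Sylvester Hadamard matrix; dividing by 2 makes it orthogonal and its own inverse.
H4 = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]


def rotate(v):
    """Apply the normalized 4x4 Hadamard rotation (hypothetical stand-in for the 'rot' step)."""
    return [sum(h * x for h, x in zip(row, v)) / 2.0 for row in H4]


def quant_dequant_int8(v):
    """Per-tensor absmax int8 round trip: scale, round to integer levels, scale back."""
    scale = max(abs(x) for x in v) / 127.0
    return [round(x / scale) * scale for x in v]


def l2_error(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Made-up weights with a single large outlier.
weights = [100.0, 0.3, -0.2, 0.1]

# Plain int8: the outlier sets the scale, so the small weights collapse to zero.
plain = quant_dequant_int8(weights)

# Rotate, quantize, rotate back: the outlier is spread over all four slots first.
rotated = rotate(quant_dequant_int8(rotate(weights)))

err_plain = l2_error(weights, plain)
err_rot = l2_error(weights, rotated)
print(f"plain int8 error: {err_plain:.3f}, rotated int8 error: {err_rot:.3f}")
```

In this toy case the rotated round trip loses less than the plain one because no single value dominates the scale. Whether real checkpoints behave the same way depends on the model and on which layers get quantized.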
>>
>>109161248
Local was, is, and will forever be shit
>>
>>109163365
sd.cpp is super behind in basically every way
>>
>>109163371
Alright, so which models actually support that? Guess I'll migrate my LoRA training and finetunes over to something better then, Anima's been nothing but a headache.
>>
>>109163384
But on the other hand I don't need a gorillion python dependencies and venvs.
It's also got Anima, Ideogram4 and Krea2 “already”, so it's no longer behind on the SOTA models
>>
>>109163384
If that's the case why do some anons think everyone will switch to it from others?
>>
>>109163392
>Alright, so which models actually support that?
all the relevant models I guess
https://huggingface.co/models?search=convrot
>>
File: ZIT_INT8_CONVROT_00113_.jpg (425 KB, 1152x1760)
425 KB JPG
>>109163282
I'd rank Zbase higher than Chroma, Ideogram probably beats both
>>
>>109161773
Only valid comparison is to the original fp16. It's making it more realistic (probably by actually bringing out the sampler)... try fewer steps. Different samplers will probably look like the right btw.
>>
>>109163384
in what? It supports all the models that matter and had int8 for over a year
>>
File: 1757362329419529.jpg (817 KB, 1073x1332)
817 KB JPG
can krea 2 have the subject be off center?
>>
File: 14138 copy.jpg (250 KB, 1920x1408)
250 KB JPG
>>
File: krea_00107_2.png (3.18 MB, 1014x1659)
3.18 MB PNG
is it genuinely unironically over for anima?? krea just does everything better and hasn't even been finetuned yet.
>>
>>109163416
Yeah just prompt for it
>>
>>109163426
no danbooru artists doe
>>
>>109163426
krea is too slow
an extensive fine-tune would render the turbo unusable, and the new raw checkpoint would be colossally slow because of CFG, unless it's both finetuned and a distill is also provided
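The CFG cost can be sketched roughly like this. It assumes the turbo runs without guidance (one model pass per step, as distilled models usually do) while the undistilled model runs a conditional and an unconditional pass on every step. The 30-step figure and the function names are hypothetical.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: move the prediction away from the unconditional one."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]

def model_calls(steps, uses_cfg):
    """CFG needs a conditional and an unconditional pass on every step."""
    return steps * (2 if uses_cfg else 1)

# Step counts are for illustration only.
print(model_calls(steps=8, uses_cfg=False))   # distilled, no CFG: 8 passes
print(model_calls(steps=30, uses_cfg=True))   # undistilled with CFG: 60 passes
```

Some backends batch the two passes together, but the compute per step is still roughly doubled.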
>>
>>109163433
>krea is too slow
Do you have 2gb of vram or something
>>
>>109163438
I'm on a 4090, have you tried generating an image on the undistilled model?
>>
>>109163438
saaar please no make fun of my financial status
>>
turbokekking is the new sabotage. guarantees the model will never be tuned properly
>>
Can local finally make gens of a character with just an image reference or you still coping with loras?
>>
>>109163426
>>109163432
Computer, calculate how much it would cost to train Krea Raw on danbooru
>>109163450
True. But all the poorfags go crazy for it. Some have also tricked themselves into liking the aesthetics and rigidity as well.
>>
>>109163429
i hadn't downloaded the model yet i was wondering if people had examples cause my internet kinda shit
>>
>>109163371
>it can be loaded with the normal comfyui loader
No it can't. If it works, you're just using regular INT8.
>>
I'm not noticing any difference in video quality on Wan 2.2 with turbo mode enabled, does it affect quality that much?
>>
why not just finetune turbo? if loras work, why wouldn't a finetune work?
>>
>>109163471
>if loras work, why wouldn't a finetune work?
because... it wouldn't? If you can lift 1gram why can't you lift 1ton?
>>
>>109163462
Models that use LLMs as encoders like krea have spatial understanding
>>
if anima2 is trained on krea2 ill use it regardless of the 5min gen times
>>
>>109163503
there will never be an anima2 ya duhmass
>>
File: 1753668858373629.png (61 KB, 1013x464)
61 KB PNG
>>109163392
Most models should work. You can convert and use them with this node
https://github.com/BobJohnson24/ComfyUI-INT8-Fast
Picrel is how you convert.
>>
>>109163503
*forgot to add, "if Anima 2 is trained without tdrussell in charge and using the Noob dataset..."!
>>
>>109163464
yes it can, what are you talking about?
https://github.com/Comfy-Org/ComfyUI/pull/14636
>>
the noob dataset is unironically garbage. chinks have no idea how to train a model
>>
>>109163526
they view life in widescreen. dunno why anybody thought they could put together a good dataset.
>>
File: 1757250655473709.png (13 KB, 334x177)
13 KB PNG
>>109163524
Again, that's regular INT8, not convrot. If you try to run an INT8-convrot model with the normal loader, you get this error.
You need this node >>109163514 to run and quant to convrot.
>>
>>109163459
lol
lmao even
>>
>>109163526
Funny you say that. Never once needed 'ai generated' in my negative prompt with Noob, but with Anima it's mandatory thanks to that built in slop bias. Never got that paint splattered hair look with Noob, never got that WAI sheen either, those are an Anima and pony score exclusive.
>>
ah. woops.
>>
It was a lot simpler when you just picked the .gguf that fit in your vram
>>
Inability to properly comprehend English led to China completely falling behind in the natural language era. This is why they have been unable to match nano banana, gpt, flux, krea, ideogram, etc. They have the most dogshit captions imaginable, just slopped through JoyCaption-tier (noob team showed these off themselves btw, they were terrible).
Local captioners (like most local models in general) are really bad. If you want to train a proper model you need to shell out for top-tier captioning through Gemini, Claude, etc.
>>
>>109163405
Flux.2 VAE with a decent realism tune will always beat Chroma at realism, but the problem is that Ideogram lacks variety in styles, and always adds too much detail. Besides, you'd be surprised what you can push Krea 2 combined with Chroma to do with the right settings
>>109159306 despite lacking the Flux.2 VAE... the model suddenly punches way above its weight with a different sampler. ID4, F2 Klein just enhance noisy details in images which is why they get that photoreal effect, but the styles being enhanced matter just as much as the enhancement, enhanced plastic will still look like plastic.
>>
>>109163545
that's because you didn't run the right quant, go for that one for example and it'll work
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8/blob/main/Krea2_Turbo_convrot_int8mixed.safetensors
>>
>>109163580
tf is wrong with you
>>
>>109163564
>Chroma to do with the right settings
It's time to Old Yeller it.
>>
>>109163590
what's wrong?
>>
>>109163562
Whatever >>109161121 >>109161179
Look at the right picture of these images and tell me with a serious face that tdrussell didn't train Anima with CivitAI slop and /edg/ highlights in the name of "stability"
>>
>>109163545
some int8 convrot quants were converted before comfyui's official implementation, so the layer names don't match and you'll get incompatibilities. don't go for older int8 convrot quants, try to find ones that were made after the merge and it'll be fine
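If you want to check what a given file actually contains before blaming the loader, the safetensors header can be read with nothing but the standard library: the file starts with an 8-byte little-endian length, followed by that many bytes of JSON. A minimal sketch: `tensor_summary` is a made-up helper, and it makes no claim about which key names ComfyUI expects for convrot. It only lists what is in the file so two quants can be compared side by side.

```python
import json
import struct

def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without loading any tensor data."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))  # little-endian u64 header size
        return json.loads(f.read(header_len))

def tensor_summary(path):
    """List (name, dtype, shape) for every tensor in the file, sorted by name."""
    header = read_safetensors_header(path)
    header.pop("__metadata__", None)  # optional free-form string metadata, not a tensor
    return [(name, info["dtype"], info["shape"]) for name, info in sorted(header.items())]

# Usage (the path is a placeholder):
# for name, dtype, shape in tensor_summary("some_quant.safetensors")[:20]:
#     print(name, dtype, shape)
```

Running it on an older quant and a newer one and diffing the names should show whether the layer naming is what differs.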
>>
>>109163564
>Flux.2 VAE with a decent realism tune will always beat Chroma at realism
it's already beating it, kekestone in shambles
https://huggingface.co/fancyfeast/bigasp-3
>>
>>109163514
>can quantize any model for 2x speed gains
>not Qwen
>not Krea 2
>not Ideogram
but Chroma
>>
What's the convrot cope?
Is it another case of vramlet poorfags coping that "FP4 is no different than FP8 is no different than FP16, why yes I use nunchaku, sage attention, and 4 step turbo lora on top and you can't tell the difference!"
I'm seeing people claim it's a free speedup but the actual images look significantly different like >>109161773
>>
>>109163580
ejaculation
>>
>>109163588
>>109163604
I see. How do you quant them without that node then?
>>
>>109163602
Would be more convincing if anon posted metadata desu, without that the tests are worthless
>>
>>109163616
everyone forgetting >>109161806
>>
>>109163616
>I'm seeing people claim it's a free speedup but the actual images look significantly different like
as they should, fp8 is quite different to bf16, so if convrot is close to fp16, then fp8 will be quite different to convrot, just look at the metrics, convrot is always superior to fp8
https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md
>>
>>109163602
Nta and I'm also not an anime fag but
>Yes, I will retarded prompt a model without a reference to styles, and expect to get a stylistic result

SDXL would flop compared to Anima when the latter is prompted simply because it's a DiT model, and also because you can prompt for painterly styles fine.
>>
>>109163618
seems like you have to use that to convert to convrot
https://github.com/silveroxides/convert_to_quant
>>
>>109163618
>>109163611
Quanting without the node is pretty easy, just ask Claude how the node does it, it's a single installation and command (which is how I did it for Chroma with convrot and then T5). If you do it from the node I don't think it's compatible with Comfy's actual code anyways.
>>
>>109163635
i would've guessed this was a real image
>>
File: .png (16 KB, 485x222)
16 KB PNG
>>109161365
What is this bullshit?
>>
>>109163611
Lodestone single-handedly bred more Stockholm syndrome than most cult leaders manage, he missed his calling as a Pentecostal pastor.
>>
>>109163635
slop
>>
>>109163672
There is literally only a single chromakek left in these threads shilling it. He refuses to let go and somehow sees the disgusting noise artifacting as 'realistic analog detail'
>>
>>109163660
It's alright alright
>>
>>109163666
update_troonfyui_and_dependencies.bat
>>
>.bat
FILTHY SCUM
>>
>Anima, 30 steps, 1024x1024: 9 seconds
>Krea2 turbo int8 convrot, 8 steps, 1024x1024: 4 seconds
IT'S OVER
>b-but 30 steps vs 8
using recommended steps for both. anima suggests 30-50. I can double Krea's to 16 and still gen faster.
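Per-step math on those numbers, assuming time scales linearly with step count. It ignores fixed costs like text encoding and VAE decode, so treat it as a rough check rather than a benchmark.

```python
def seconds_per_step(total_seconds, steps):
    """Average wall time per sampling step."""
    return total_seconds / steps

anima = seconds_per_step(9, 30)   # Anima: 9 s for 30 steps
krea = seconds_per_step(4, 8)     # Krea 2 turbo int8 convrot: 4 s for 8 steps

# Linear extrapolation of Krea 2 turbo to 16 steps.
krea_16 = krea * 16

print(f"Anima: {anima:.2f} s/step, Krea 2 turbo: {krea:.2f} s/step")
print(f"Krea 2 turbo at 16 steps: about {krea_16:.0f} s vs Anima's 9 s at 30 steps")
```

So Krea is slower per step (0.50 s vs 0.30 s) but needs few enough steps that even the doubled run comes in around 8 s.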
>>
>>109163635
>>109163742
Can do these out of the box with Chroma. Lora coping leads to less prompting control.
>>
chromakek, you're the only one coping here. it's time to move on
>>
>>109163768
>Can do these out of the box with Chroma
No you won't
>>
>>109163742
can juggernaut do coom?
>>
>>109163672
>>109163681
You can't convince me you're not just an anti-Chroma cult because you dislike its creator. I'm not even posting raw Chroma outputs anymore.
>>
we are on /h/ levels of image to troll ratio
>>
>>109163790
Or perhaps the creator is disliked because he shitbakes the model, throws actual logic aside, and burns through hundreds of thousands of dollars to produce a mediocre underwhelming result?
>>
>>109163828
>>
>>109161148
this indeed
>>
>>109161299
i dunno.. seems like it just blurs and adds splotchy shit just to make it seem 'realistic' but actually it just looks muddy and blurry
>>
>>109161359
cute
>>
>>109162619
yoooo what did you use to make this lol
>>
>>109163405
o snap




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.