[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>108972752

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
1st for Ani.
>>
so like. where are the 1girls yo. like I came for the hot pointy chinned 1girls
>>
sfw vageen is ascended
>>
File: 85358566.webm (2.37 MB, 256x448)
2.37 MB
2.37 MB WEBM
>bullet impacts sounding like drums
made me leap out of my chair and scream "KINO!!!!!!!!!!!!!!!!!" 3 times and then do a partial backflip
https://files.catbox.moe/qjzu25.mp4
>>
File: 3089031.jpg (38 KB, 540x540)
38 KB JPG
In the standalone anima trainer, does flash attention made things faster in exchange for quality hit, or just faster?
>>
File: 1777497626575143.png (1.29 MB, 2259x1315)
1.29 MB PNG
>Scorsese uses FLUX
zit faggots keep seething
>>
>>108976848
ask AI
>>
File: wan22 Scail vs Bernini.mp4 (3.98 MB, 1920x1080)
3.98 MB
3.98 MB MP4
Tested Wan22 Bernini. Here are my initial result on the single test case.

R2V: Subject to Video generation.
Best: 0.8 Megapixel 81 frames at 30 FPS, OOM on higher res/frames length on a 5090/128gb RAM. Heavily dependent on subject resolution, so best results may varies. Most accurate at 30FPS, lowering FPS seems to degrade reference accuracy. Accuracy also degrades after 81 frames, just like Wan22 base I guess. Bernini can be extended if you are determined to stich 81 frames video together. Seems to lose out against SCAIL on ease of use, VRAM requirements, but SCAIL can only do rigid open pose reference. Bernini can supposedly can do more things, need to test further.

>vid related, Bernini 81 + 81

https://github.com/Comfy-Org/ComfyUI/pull/14216

https://bernini-ai.github.io/
>>
>>108976878
everyday I hate myself for being a VRAMlet
>>
I don't get the appeal of video generation
>>
>>108976878
make moot do cute things
>>
>>108976889
its ok, its not your fault you were born brown
>>
>>108976889
making porn of unsuspecting women
>>
File: technologyidea.mp4 (660 KB, 720x480)
660 KB
660 KB MP4
>>108976887
>>
Is it possible to train a lora on small (<64x64) sprites?
>>
File: bernini_s.png (1.67 MB, 871x1080)
1.67 MB PNG
>>108976878
>>
>>108976783
I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT
>>
File: ComfyUI_00724_.png (894 KB, 896x1152)
894 KB PNG
>>
>>108976998
ma'am, i need that for studying
>>
>>108976889
for realistic nsfw, nothing else has the prompt understanding and adherence
>>
>>108976889
Its only good for porn. For other stuff its cringe
>>
File: ZIT.png (2.71 MB, 1536x1536)
2.71 MB PNG
Surely Krea 2 open release won't be like their previous open release and will be better than ZIT, a model from half a year ago, right?
>>
>>108975467
not that hard, though 100k seems a bit thin for a full finetune. get a regularisation data set at the very least
LR at batch size 1 around 6e-6 to 8e-6, scale up from there correspondingly
captioning is the painful part, what i did is run through WD14 or animetimm first, then filter out false positives that pop up when one uses these models on photos (asian, realistic, etc), then gemma4 31b with grounding from these tags and a good system prompt

i recommend to not tune existing photography tags like photo (medium) or cosplay girl as your main triggers, but do something fresh like an artist tag. trying to build atop the existing ones only resulted in slop semi realism for me
>>
File: ComfyUI_00725_.png (965 KB, 896x1152)
965 KB PNG
>>
File: 324654.webm (3.42 MB, 256x448)
3.42 MB
3.42 MB WEBM
pretty good seed for the plane
>>
File: 584565.webm (2.86 MB, 256x448)
2.86 MB
2.86 MB WEBM
american ship cloaking technology captured on film
https://files.catbox.moe/nw00el.mp4
>>
File: t.mp4 (1.33 MB, 480x720)
1.33 MB
1.33 MB MP4
>>108976889
1girl, plot
>>
>>108977124
Hot glue gun to ass? I'd rather take a tattoo
>>
>>108976783
Baker, next OP, please:
"Discussion and Development of Local Image, Video, Music and Anime Models"
>>
File: 1.jpg (418 KB, 1552x832)
418 KB JPG
>>
>>108977173
MEW my beloved
>>
>>108977019
You have Anima, why care?
>>
File: file.png (3.7 MB, 1328x1776)
3.7 MB PNG
>>108977173
stop posting my gf
>>
File: 3.jpg (309 KB, 1136x1136)
309 KB JPG
>>108977182
>>108977193
>>
File: ComfyUI_00722_.png (1.36 MB, 896x1152)
1.36 MB PNG
>>108977215
ZIT? My ZIT celeb LoRA aren't this good
>>
File: 4.jpg (309 KB, 1552x832)
309 KB JPG
>>108977229
>>
>>108977278
where do I find Klein 9b loras?
>>
>>108976878
> 81 frames
What's the point?
>>
File: 1772684675740695.png (2.81 MB, 1256x1672)
2.81 MB PNG
>>108977229
dont use the malcolmrey slop in case you do, he literally doesnt use any captions
>>
>>108977149
> Music
Is there a development? What's to discuss?
>>
>>108977324
>What's to discuss?
Discussion
>>
>>108977324
Local stagnated in ZImage and here we are discussing nothing, so why not add music?
>>
>>108977229
Emma's 18.00000001 yo sfw vageen
>>
File: 1754853732531427.png (2.64 MB, 1928x1088)
2.64 MB PNG
lore accurate
>>
why is civitai lgbt now
>>
>>108977462
always has been
>>
File: 102135CUI_00001_.png (1.72 MB, 1536x1152)
1.72 MB PNG
>>
>>108977477
Hi Catjack ^^!
>>
File: 104227CUI_00002_.png (1.6 MB, 1152x1536)
1.6 MB PNG
>>
File: 1756903890834544.png (1.08 MB, 1536x1152)
1.08 MB PNG
>>108974879
catbox plz? what lora was being used here? (picrel)
>>
File: file.png (2.2 MB, 1056x1584)
2.2 MB PNG
>>108977312
>>108977278
Was experimenting with Qwen earlier today, the likeness is there, but the detail isn't.
>>
File: 042513CUI_00001_.png (1.27 MB, 1536x1152)
1.27 MB PNG
>>108977559
this anon gave me the link for the lora, dunno if he was the one who made it: >>108964312

https://files.catbox.moe/kle4bj.png
>>
>>108976889
It's another fun toy, things don't have to have a purpose. That said, local vidgen is absolutely fucking terrible while saas vidgen is so censored it's barely usable so it's in a dubious spot as of now.
>>
VR use case?
>>
>>108977604
final frontier for goonmaxxing when video gen hits realtime speeds for 1080p
>>
>108977559
>108977594
Why he is samefagging?
>>
>>108977636
what makes you think he is samefagging?
>>
>>108977627
final frontier for nuclear kinos
>>
>>108977604
imagine once things are powerful enough and compact enough
you can literally have vr worlds being generated in realtime by your ai, anything you can imagine
>>
File: 031741CUI_00001_.png (1.03 MB, 1536x1152)
1.03 MB PNG
You know what would be cool? A DQ8 style lora. How many screencaps would I need to make one?
>>
>>108977741
100 to 300
>>
File: 095815CUI_00001_.png (1.72 MB, 1248x1824)
1.72 MB PNG
>>108977773
Seems doable. Although, back when I last played it on an emulator, it had some minor visual artifacts. It's been a long time though, maybe it's fixed now. I seriously doubt I can find 100 good screencaps on the internet.
>>
You know, I'm thinking of making a Vixen media group LoRA
How many images will it need?
>>
>>108977741
sometimes styles work with 20 images on some models and training, sometimes you need hundreds of images.

loha/lokr trainings may be relevant for styles, could also try those not just standard lora
>>
>>108977869
I'll take a look. Never trained anything before.
>>
>>108977825
20-80 just get good quality diverse ones
And train a Z Image Turbo LoRA for it first
https://github.com/ostris/ai-toolkit/
>>
>>108977594
thank you anon
>>
File: xena(y).png (194 KB, 483x425)
194 KB PNG
>>
>>108977923
oops, forgot to quote >>108977908
>>
File: 115300CUI_00001_.png (1.87 MB, 1192x1536)
1.87 MB PNG
>>
>>108977583
i have no experience with qwen as a model but klein and zit should have no issues
maybe a data set issue? what resolution are you training at? and how many images? regularisation?
>>
>>108977393
Flux Klein.
>>
I'm going to let Microsoft 360 AI walled garden generate my images. No more venvs for me! Please strip CR/LF off of them if you move the data to Linux.
>>
>>108977975 >>108977583
qwen [/edit] also is *really* good at learning realistic characters, potentially better than zit/zib even. but slower.
>>
>>108977324
Either the ltx guy who is actually doing interesting stuff sparked more interest in the OP or a very lonely loser doing ultra gay OPs and by Ops I mean operations. On that note there was a good talk about music generation from anons last thread
>>
File: 1760500639134785.png (707 KB, 1740x1191)
707 KB PNG
>>108962693
>>108962836
tested this with default setting and the result is quite good actually

>>108977741
DQ8 would be great
>>108977927
np
>>
>>108976878
in the worfklows on the github, i see them loadign the image for the anime girl but not the video for the woman dancing. is it missing? there's an input that says "reference video" where i assume it would go. could you share your workflow?
>>
File: i1uqnk.png (1.54 MB, 1024x1024)
1.54 MB PNG
>>
>>108978025
> ltx guy who
Who?
>>
>>108978259
Some anon is making music with ltx
>>
File: 1756865503853071.jpg (994 KB, 1248x1824)
994 KB JPG
>>
File: e73h2y.png (1.48 MB, 1024x1024)
1.48 MB PNG
>>
File: 424324.webm (1.73 MB, 448x256)
1.73 MB
1.73 MB WEBM
mansley launched the missile
>>
File: 141728CUI_00001_.png (1.2 MB, 1536x1152)
1.2 MB PNG
>>
File: 5.jpg (265 KB, 1312x976)
265 KB JPG
>>108977583
yeah klein needs lora stack before it works, but its fast so iteration is easy
>>
File: fish.png (2.39 MB, 1024x1536)
2.39 MB PNG
>>
File: Krea 2 anime.png (864 KB, 1360x800)
864 KB PNG
open weights when
>>
>>108978808
>>108978879
>>108978885
>>108978888
>>
>>108978903
which ones are you using?
>>
>>108978967
trash, almost embarrasing, if they release anything other than krea 2 large its not even funny how dead will it be
>>
>>108978996
the ones I've trained!
>>
Is it possible to create gens with a transparent background? Like just a character on a 100% alpha background?
>>
File: 153226CUI_00002_.png (1.9 MB, 1152x1536)
1.9 MB PNG
>>
>>108979139
It's called "layer diffuse". Implemented in all engines I believe.
>>
>>108979307
>all engines
name all
>>
>>108979068
share the celeb ones on hf fren
>>
just checking to see if you faggots have cracked the filter on that new model yet
>>
I want to split my prompt window into a separate one just for quality tags and other for the rest. Can I use conditioning concatenate node to merge them or text concat?
>>
File: wan bernini.jpg (42 KB, 402x477)
42 KB JPG
>>108978127

Videos files are not provided. You have to use your own data.
>>
>>108979501
Even when cracked it's not a very interesting model desu
>>
>>108979521
oh ok. but it works as i thought then. they should've had a note there explaining it but whatever
>>
>>108979564
tb h i kinda want a krea type model for design-slop cases.
i only care about filters because they do genuinely get in the way of higher end design stuff where a bit of titty or w/e is nbd
>>
>>108976878

 You are a helpful assistant specialized in subject-to-video generation.

Change the girl in the video with the girl in image0, image1. Short hair with twin tails, two white pom poms on her head. Remove ponytail.

Shot 1 - She is blinking.
Shot 2 - She is laughing and covering her mouth.
>>
>>108979844
What is the ref image? From what I've seen, it greatly changes the style, making it useless for anything good.
>>
cozy bread
>>
>>108979844
impressive now replace the girl with tony soprano
>>
File: 260605-011345 Svi 00001.mp4 (1.77 MB, 640x1024)
1.77 MB
1.77 MB MP4
fucking random debris just appear randomly, ruining the whole gen
>>
>>108979893
yeah, not her trying to remove her skin though, that also has nothing to do with the gen being shit
>>
File: latent upscale.png (309 KB, 3072x652)
309 KB PNG
HOLY FUCK IT IMPROVES THE QUALITY
>>
Was there a single diverse tune since pony diffusion v6?
>>
i trained a lora on anima with several characters in it, and now when i prompt it with several of them at once it merges them together instead of making two distinct ones.
its not my prompt, because if i prompt for characters anima already knows it works perfectly fine.
is this a known issue? anyone has advice on this?
>>
>>108980007
>so comfortable
>>
>>108980032
I've never trained a lora but it seems obvious you need to be extremely careful when associating a word with an image, and so in your case the word became associated with multiple characters.
>>
>>108980070
or maybe you need both, a trigger word for each character separately AND a trigger word for images of the characters together.
>>
>>108980007
is this an anima upscale? can you share it? i don't know how to upscale anima
>>
how do I become a memeber on civitai?
>>
>>108980147
You need to be made first. Most people get whacked before this happens.
>>
>>108980147
Send $50 to the CEO of India
>>
>>108980070
> it seems obvious you need to be extremely careful when associating a word with an image
depends on the model and training data size

it's hard on various models to predict how bad it would be to have some incorrect or shoddy caption/tags, but it's certainly something to check

sometimes the tolerance is really quite large tho, doesn't have to be this

>>108980032
>anyone has advice on this?
it's not that easy to say what you should do but I'd say try supplying more training data with just one character each and maybe a different optimizer with different settings.

It easily may also be a good situation to try a additonal not necessarily large number of regularization images or one of the techniques that is sort-of like regularization (e.g. ai-toolkit's DOP or blank prompt preservation).

All assuming the captions are not extremely wrong.
>>
File: Untitled.png (596 KB, 3712x934)
596 KB PNG
>>108980108
I'm still working on it. But you can have a look at some of these settings.
Believe it or not, that 6 steps at 0.50 denoise actually works. bong_tangent is a strange one.
>>
>>108980245
Isn't bislerp better?
>>
>>108980259
bislerp isn't as good as gayswallow
>>
>>108980259
I don't know yet, I'll check it out. I'm probably going to create a custom node that does the repetitive Upscale By in here: >>108980007 since it seems to improve quality.
>>
>>108980245
have you tried the PiD model nvidia released this week for the qwen vae?
>>
>>108980245
Why do I feel instinctual disgust whenever looking at none based interfaces? I'm European and both my grand parents had the greater aryan certificate btw
>>
>>108980280
>t. mutt without bidet
>>
>>108979893
>random debris
it's sea foam because of the waves. it doesn't understand where to put it.
>>
>>108979893
gross
>>
>11 days till I'm a 30 year old NEET virgin 1girl prompter
>>
File: 1771410632598012.png (164 KB, 1212x801)
164 KB PNG
>ideogram shills are flooding the subreddit
>mods are in on it
>redditors have had enough
holy fucking shit
>>
File: Jennaiian Dance.webm (3.93 MB, 720x1280)
3.93 MB
3.93 MB WEBM
>>108976889
You can make your images move... what's not to get?
>>
>>108980592
I also don't like the fact the cumfart CEO is allowed to have duplicate posts across subreddits when that shit got me banned before for sharing papers
>>
Thank you for keeping us apprised of the plebbit situation, anon!
>>
>>108980592
It's just indians chasing izzat, ok?
>>
>>108980592
>reddit mods being corpo mercenaries
In other news shit smells like shit more news at 11
>>
>>108980517
i'd just like to let this general know I have had sex, twice, at 29, and it was with a cute asian girl
>>
File: 20655.png (1.85 MB, 1088x1368)
1.85 MB PNG
>>108980517
its fine fren, you aren't alone
>>
>>108979893
which Lora? it's been a long time, since i've had any weird situations
>>
Anima's pretty good.
>>
>>108980592
whats with this pozzed era of nu-local? cumrag API shilling, local models more censored than grok, workflow grifting, restrictive licenses.
>>
File: 1511667108879.png (298 KB, 512x512)
298 KB PNG
>>108980007
How exactly does it differ from oneshotting it? You are not running any denoise after each step.
>>
File: 9rj92mypyh351.jpg (94 KB, 1125x813)
94 KB JPG
>>108980592
i'm so fucking sick of reddit mods

>le using reddit!!!!
yes it's where everyone is and I fucking hate it, some fucking losers who have been on reddit for a decade working for free control the biggest hubs of discourse for every single fucking hobby I hate them so much
>>
>>108980745
>steals artists work
>thought pony tags were a good idea
>multiple character loras don't work without mixing into each other
>no controlnet
>snake oil projects
>slop without loras
>russ owns the rights to your loras and finetunes and no you aren't getting paid
lol. good one
>>
guise... GUISE!!!!!!
anima is bad okay???????
>>
>>108980845
it's ok but it's not the generational leap everyone hypes it out to be. It's still using outdated tech
>>
>>108980834
?? I got paid today
>>
>>108980845
it's fine, but illustrious (nova anime, etc) works well and has controlnets, adetailer, and many more tools and it is effective.
>>
>>108980834
yeah but like, futa on male.
>>
>>108980921
that's illegal. I am reporting you to Russ's lawyer
>>
just STOP using anima okay??? STOP. IT.
DO NOT USE IT. okay?!?!?
>>
>>108980923
weird since I see the most amount of complaints coming from the /d/ thread because danbooru has the most vanilla futa and not much else
>>
>>108980922
You don't need that tooling with Anima if you're skilled enough thoughever unironically. Manual inpainting works and we have cnets.
>>
File: output_1780602935.png (1.81 MB, 832x1216)
1.81 MB PNG
Anyone else practice memory photography? When a woman walks by, I try to memorize what she looks like and how she's dress. I enter that into my phone, and then look one more time to see if I got it right.

Example (not a real one):
>On the sidewalk of a retail space in front of a nail salon, there is a brown skinned woman with long black wet recently cut hair, she has a red-ish tan off the shoulders short dress with a bright thick yellow belt, and she's wearing flip-flops. She's talking on a cellphone and appears to be a transgender thai woman.

pretty annoying how all the jeets of civitai want the pointy witch chin in every image. kind of get tiredl of witch thin.
>>
>>
>>108980915
>It's still using outdated tech
t. 4ch VAE with T5 encoder illustrious giga jeetslop merge
>>
And, if you watch, zit will start with a non-pointy chin.
>>
>>108980961
listen man, I haven't seen anything it can do that il with loras couldn't already do. If anima was great it wouldn't need lora shitpiles like sdxl
>>
>>108980915
It is a generational leap for what it is, especially for the size.
>>
>>108980954
>most amount of complaints coming from the /d/
Anima can do futa on male cowgirl, which I can't do with Illustrious so idk.
>>
>>108980938
aaaaaw come now, sis. Not Russ' lawyer.
>>
>>108980990
>It is a generational leap for what it is
out of all the new models he had to choose the warehouse robot one with the worst licence imaginable
>>
>>108980985
I've never seen a troon kill xirself irl but that doesn't mean it doesn't happen
>>
dont worry anonie just a few more "its a bad license" posts and im sure someone will help you make apache 2 anima
>>
>>108980995
go elsewhere with your homo stuff

God damned faggots
>>
>>108981011
you should kys tonight in the mirror and you'll know for sure :)
>>
>>108980995
have you never asked an anon in the /d/ thread?
>>
>>108981010
realistically speaking what WAS available? keep in mind anima released almost on the same day as z-image base, klein 9b came out like 2 weeks before that
>>
35 star status?
>>
>>108981044
yeah and comfyorg knew they had a better vae so what gives?
>>
>>108981044
anima is good and the benchods are unhappy (good)
>>
>>108981021
Nyo
>>108981037
I didn't even know there was a /d/ thread
>>
File: 01644.png (2.42 MB, 1616x912)
2.42 MB PNG
>>108981010
>warehouse robot one
kek, next local sota: Animazon Diffusion by Jeff Bezos
>>
really weird how many anima shills are here all the time
just sayin'
>>
>>108981010
>warehouse robot one
its pretty incredible that he was able to wrangle it into local SOTA anime and local SOTA realism with a light lora
>>
>>108981058
just because comfy org gets fed some information in advance to implement their day 0 nodes doesnt mean tj russia has access to the full model to finetune on it anon
>>
>>108981081
Not really surprising since this is the cutting edge /g/ imagen thread
>>
>>108981091
it's literally because he wanted to spend the least amount of money so he could get paid more. why wasn't it zim?
>>
File: 1752777789732944.png (1.39 MB, 1024x1024)
1.39 MB PNG
klein edit distilled is so good, even at 4 steps.
>>
>>108981118
because he cant time travel i presume
>>
>>108981118
>spend the least amount of money so he could get paid more.
sounds like a smart man especially since it became local SOTA
>>
>>108980981
No
>>
>>108981134
>SOTA
it's not even an edit model. don't kid yourself
>>
go back to bed bluvoll
>>
>>108981066
No really, we will take back over, and the death penalty for homosexuality will be implemented per the Bible.
>>
>>108981137
I don't believe your country should be allowed to have any transistors.
>>
File: 202657CUI_00001_.png (1.27 MB, 1152x1536)
1.27 MB PNG
>>
>>108981047
Is the project dead?
What even is the point now that the main project has it's own UI?
>>
File: zImageturbo_00010_.jpg (757 KB, 1624x1944)
757 KB JPG
>>
>>108981173
leprosy
>>
File: zImageturbo_00017_.jpg (583 KB, 1280x1760)
583 KB JPG
>>
>ANOTHER POINTY CHIN

our benchods sure love to tag those pointy chins "beautiful"
>>
chin chin pointy chin
chiny chiny point chiny chin
>>
>>108981158
t.closet homosexual futa enjoyer
>>
File: 204804CUI_00001_.png (1.23 MB, 1152x1536)
1.23 MB PNG
Karin DLC for SF6 never ever. Gayest game of all time.
>>
I tried using the Flux checkpoint but Forge was throwing me an error, what gives
>>
File: 1774892689973600.png (1.35 MB, 1024x1024)
1.35 MB PNG
>>108981129
>>
you didnt even link the error, RETARD
>>
>>108981237
@grok help him
>>
>>108981240
>>108981241
It was giving me that 'Type None' bull shit when i would click generate
>>
>>108981237
What checkpoint (quant/precision)? What error? Is this an updated forge fork?
Learn some basic troubleshooting, nobody can offer you a solution if you just say "it no work!!!"
>>
>>108981245
This is a very common error with Flux in Forge — the "Type None" (usually TypeError: 'NoneType' object is not iterable) is a generic catch-all that often hides the real problem (bad model, missing files, outdated Forge, etc.).
Quick Fixes to Try (in order)

Use the right Forge version
Download/re-download the recommended one-click package: webui_forge_cu121_torch231.7z from lllyasviel's GitHub releases.
Extract it fresh if possible.
Run update.bat (very important) before launching.

Use the correct Flux model
Download from lllyasviel's repo (not random ones from Civitai/HF unless you know they're Forge-compatible):
Best balance: flux1-dev-fp8.safetensors https://huggingface.co/lllyasviel/flux1_dev
Faster/smaller (if your GPU supports it): flux1-dev-bnb-nf4.safetensors (or v2) https://huggingface.co/lllyasviel/flux1-dev-bnb-nf4

Put it in: webui\models\Stable-diffusion\

Add the required support files (this fixes a lot of NoneType errors)
VAE: ae.safetensors from Black Forest Labs place in webui\models\VAE\
Text Encoders (CLIP + T5):
clip_l.safetensors
t5xxl_fp8_e4m3fn.safetensors (or fp16 version)

Put them in webui\models\clip\ or wherever Forge asks in the VAE / Text Encoder dropdown.

In the UI:
Select the Flux model.
Set Sampling method to something like Flux or Euler / Simple.
Steps: 20-30 is usually enough for Flux dev.
CFG: 1.0 (Flux doesn't like high CFG).
Make sure you're not using any incompatible extensions or old SD 1.5 settings.


If it still fails

Close Forge completely delete the webui folder if you have a backup, re-extract, run update.bat again.
Check the full console output (the terminal window) when you hit Generate — the real error is usually printed there before the NoneType one.
Try switching between fp8 and nf4 versions.

Flux in Forge can be finicky, but once you have the right files + updated Forge it works reliably. Let me know what exact model you're using and what the full console error says if these don't fix it.
>>
File: zImageturbo_00037_.jpg (425 KB, 1280x1760)
425 KB JPG
>>
>>108981251
Thanks a bunch
>>
>>108981239
>tfw when the cube is more fuckable than the female protag
>>
File: 1773288903331042.png (1.33 MB, 1024x1024)
1.33 MB PNG
>>108981129
>>
i'm using anima and it's great
>>
holy moly...
https://x.com/elonmusk/status/2062337074368508253
>>
File: zImageturbo_00048_.jpg (546 KB, 1280x1760)
546 KB JPG
>>108981296
anima 2 when
>>
File: zImageturbo_00063_.jpg (692 KB, 1280x1760)
692 KB JPG
>>
File: aylameo.png (167 KB, 1777x868)
167 KB PNG
Ideogramesque
>>
Does BREAK work on Anima?
>>
>>108981401
what was the question?
>>
>>108981447
Yes
>>
>>108981383
ZiT is still the GOAT for realistic generations, and it's also insanely fast
>>
>>108981447
No
>>
File: onnesm.png (1.8 MB, 1216x832)
1.8 MB PNG
>>
File: zImageturbo_00073_.jpg (558 KB, 1280x1760)
558 KB JPG
>>108981449
Was asking for prompts for a woman that looks similar to Ramona Flowers

>>108981461
It's great, nothing more to say really
>>
File: ComfyUI_28998.jpg (3.6 MB, 1500x1920)
3.6 MB JPG
>>108981129
NGL, I liked the reveal despite being tired of the GoW gameplay longer than most people ITT have been alive... it was more visually imaginative than everything else in the show.
>>
>>108981461
>GOAT for realistic generations
What is the GOAT for surrealistic generations?
>>
>>108981556
Chroma
>>
>>108981599
There's deliberate surreal and accidental surreal.
>>
>>108981609
Chroma is both.
>>
File: zImageturbo_00077_.jpg (550 KB, 1280x1760)
550 KB JPG
>>
File: 1777883526026493.png (1.49 MB, 1024x1024)
1.49 MB PNG
now this is a better game.
>>
ramona anon, please link your ZiT/Flux Klein workflows, your work is the greatest i've ever seen in these threads holy shit
>>
File: 173289074289343245.png (18 KB, 1187x48)
18 KB PNG
>>108981251
This is the error im getting after all of that
>>
>>108981650
https://huggingface.co/xixxix-HF/RamonaFlowers_ZImageTurbo/tree/main
Workflow is anons dual sampler one, should be there few threads back. I'll try to make model card with examples
>>
File: ComfyUI_00250_ayakon.jpg (3.56 MB, 4096x2650)
3.56 MB JPG
>>
Catjak is our special little troon lolcow
>>
File: AnimaBase_Output_262627.png (2.83 MB, 1248x1824)
2.83 MB PNG
>>108981010
What else would he have used to get the same results in terms of quality to speed balance?
>>
>>108981804
kek I thought this was a chick getting railed by a wolf for a sec
>>
>>108981819
at that point just use the zim arch with less params. It only cost them 200k so less params would have been in the ballpark
>>
>>108981839
Z arch is Lumina 2 lol, it's slow as fuck
>>
>>108981743
niiiice thanks.
>>
>>108981839
Where is your model?
>>
>>108976998
how much vram do i need to make this?
i dont really play vidya i have rx6600
>>
>>108981804
that's a big ... tail
>>
can someone make a booru with images from this thread
>>
>>108981899
For the shiny jeetslop you quoted? You can generate it with like 4gb vram
>>
>>108981907
no
>>
>>108981890
I don't suck cumfart cock. sorry
>>
>>108981915
what models should i play around with and what front end to use
i never really genned anything beacuse i didnt have patience to wait a minute for a single picture but i suppose it's a bit more optimized by now?
>>
>>108981743
Neat, ty
>>
>>108981926
anima model and forge neo ui
>>
>>108981917
you tried to though until he left you on the side of the street like garbage. you could even say human garbage.
>>
>>108981907
Fine or generate for me a script to scrape images based on thread name from the archive and I'll set one up
>>
>>108981970
I at least respect ani for not wanting to be a part of enshitifying everything. he isn't like comfyui or russ that wants to turn everything into a grift scam. he isn't like you who tries to ruin the thread 24/7
>>
File: mknkes.png (1.16 MB, 1216x832)
1.16 MB PNG
>>
>>108981998
You will continue to cry
>>
>>108982038
look around you. the community is weeping. everything fucking sucks and it's only getting worse
>>
>>108982043
Where are these crying people? To me average user seems pretty happy
>>
>>108981815
why do you post about your schizo friend every thread
>>
>>108982043
No amount of FUDing will get anyone to join your "team"
>>
Maintain Thread Frivolity.
>>
>>108981729
Queen why are you sending this to me instead of a chatbot or searching the github issues? No way
This is a GPU architecture compatibility issue — very common with RTX 50-series cards (5060, 5070, 5080, 5090 etc.) on the standard Forge one-click package.
The default Forge package uses Torch 2.3.1 + CUDA 12.1, which doesn't include the kernel for Blackwell GPUs (compute capability sm_120). That's why you get "no kernel image is available."
Quick Fix (Recommended)

Go to your Forge folder open the venv\Scripts folder.
Hold Shift + Right-click "Open PowerShell window here" (or Command Prompt).
Run these commands one by one:

Bash.\python.exe -m pip uninstall -y torch torchvision torchaudio
Bash.\python.exe -m pip install --pre --upgrade --no-cache-dir torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu128

Close everything and restart Forge with run.bat.

This installs the nightly Torch with CUDA 12.8 support for 50-series cards.
Alternative / More Stable Method
Some users prefer installing a specific wheel. After uninstalling:

Download the latest compatible wheel from: https://download.pytorch.org/whl/nightly/cu128/torch/
Then install it with .\python.exe -m pip install torch-....whl (matching your Python version).

Extra Tips

Make sure your NVIDIA drivers are up to date (Studio Driver or Game Ready, latest).
Add this to your webui-user.bat launch arguments if needed: --disable-cuda-malloc
After updating, check in console: python -c "import torch; print(torch.__version__); print(torch.version.cuda); print(torch.cuda.is_available())"

Let me know your GPU model (run nvidia-smi in a terminal) and if this works or throws a new error. This fixes it for most people on 50-series + Forge + Flux.
>>
File: Z_Image_Turbo_00002_.png (2.98 MB, 1536x1536)
2.98 MB PNG
ZAMN z image tardbo is fast, this dual sampler workflow is perfect.
>>
File: zImageturbo_00140_.jpg (483 KB, 1840x1152)
483 KB JPG
>>
is PiD good? my webui got updated to add it
>>
File: 1755996921456834.png (3.33 MB, 2048x1024)
3.33 MB PNG
>>108982124
for qwen and sdxl i assume you mean?
yes, though ive only gotten it to work nicely so far for realism or detailed styles
>>
>>108982141
well i think it's broken for me cause it only outputs black images. i have to wait for the developer to fix it
>>
File: zImageturbo_00152_.jpg (438 KB, 1840x1152)
438 KB JPG
>>108982112
res_2s / simple, only one step with the second ksampler. Retains lora effect a bit better and still has smoothing effect
>>
File: q_ezd5yo.png (1.63 MB, 1344x960)
1.63 MB PNG
>>
File: 1774093051029869.png (1.51 MB, 1024x1024)
1.51 MB PNG
>>
>>108982193
God of War : Tranae
>>
File: ComfyUI_00246_.png (1.23 MB, 1024x1024)
1.23 MB PNG
comfy update.....................................................................................................................................................................................................................................................................................................................................................................................................................
>>
>>108982177
there was this one uploader on huggingface that posted a ton of celebrity loras, would you happen to know who that was? having a hard time tracking down links.
the guy who did that ramona flowers is really good.
>>
>>108982229
https://huggingface.co/malcolmrey
be warned, its fucking slop because he trains without any captions last i checked
>>
File: 1762681823068442.png (1.2 MB, 1024x1024)
1.2 MB PNG
I think edit models have made photoshop mostly obsolete. except for maybe touchups or specific tasks. the time invested to shoop all the stuff out would be too long, even with the clone stamp tool or whatever.
>>
>>108982233
ssheeiiit. noted.
>>
>>108982233
it seems to me that some of his models are trained on only faces only because they get the body wrong often.
>>
>>108982242
adobe will just integrate it into photoshop. a lot of the appeal for the CC suite is all of their other software being linked together, so i don't think much will change for them as long as they add some AI features
>>
i tried this guy's loras but couldnt get them working, even with the activation tag. or im just using it wrong.

https://huggingface.co/SDim1973/Z-Image-Loras/tree/main
>>
>>108981926
Anima is the best model for anime right now but in terms of interfaces avoid forgeshit like plague and use either comfyui or swarmui (which is basically comfy with a very usable base interface on top)
>>
>>108982260
https://huggingface.co/nphSi/Z-Image-Lora/discussions/32
> For characters its often enough to use a trigger and the upper class like "person" or "animal" or "cartoon" for best flexibility.
yeah i have no high hopes here, though it seems like he's at least doing more than malcolmrey
also im always surprised hf hasnt cracked down yet on these in general
>>
>>108982261
use claude code to make your own interface and use sdcpp. we're living in the agentic ai age.
>>
How do I inspect an embed to see what its full of? Like ppls meme pos/neg ones.
>>
>>108981743
Sweet, thanks kind anon!
>>
>>108982330
You mean the workflow? Drag and drop it into comfy, but it doesn't work on 4chan pics.
>>
>enabled torch compile
>each generation takes 5 times longer now
who came up with this?
>>
>>108982107
Thanks I got it working
>>
File: 1718045134048117.png (308 KB, 498x469)
308 KB PNG
how should I go about updating comfy? do I git pull and then install requirements.txt or should I update it through the manager? or both? ironunderstan
>>
>>108982461
The instructions are written under your foreskin. Check there.
>>
>updating cumfartui in the big 2026
>>
im gonna update my drivers. i hope i dont lose everything like last time
>>
>update comfy
>no visible improvements, random things break
haha thanks
>>
>>108982480
mustard gas just came out of my computer
>>
File: 010724CUI_00001_.png (1.27 MB, 1536x1152)
1.27 MB PNG
dayum
res_2s + bong_tangent is extremely fast
someone recommended checking it out in the other thread
thanks my negro
>>
>>108982524
See you on the other side anon
>>
>>108982467
Are they also available in braille? I can read it with my tongue
>>
>>108982461
Go into your ComfyUI directory, do 'git pull' then when it is done, do 'source venv/bin/activate' followed by 'pip install -r requirements.txt'

Now re-start Comfy and profit or suffer, depending on the updates.
>>
>>108982461
comfy has bat files for updating
>>
im noticing its hard to get nsfw images using flux. just ignores the prompts for the most part
>>
>>108982529
Fast? Are you high? It's good but it's slow as fuck.
>>
>>108982529
singlestep samplers are supposed to be outperformed by multistep samplers when given the same compute, 2s takes double the time per step, 3s 3x, etc, 30 steps on 2S takes the same time as 60 steps on multistep
>>
>>108982594
>he is not aware
>>
File: SH3-Heather-00001.png (161 KB, 378x394)
161 KB PNG
>>108982599
I meant to say it needed fewer steps but >>108982605 made me realize It took the same amount of time but at half the steps kek
>>
>>108982611
Doesnt work on my machine :(
>>
>>108982572
what does the manager update do then?
>>
>>108982623
The entire flux family is safetyslopped even towards softcore lewds. You can use loras, but don't expect anything from the bare model.
>>
>>108982637
crashes your install with no survivors
>>
>>108982623
>>108982594
in case this is not a shitpost
anything flux is heavily censored, though theres some halfway decent loras
>>
I'm about a week into building my manual tag correction program for LoRA training datasets, and I'm starting to think about how you fix e.g. 10,000 tagged images at a time. Since one anon talked about directories with 10,000 training images.

I have folder view, it can load folders with 20k+ images with thumbnails, it's fast (caching thumbnails on hard drive. It is what it is. Hard to go anywhere near that fast without doing this.)

I think the next part will be doing: select a tag (from an autocomplete tag search widget, already built), all images turn green border if they have that in their tags. Shift-clicking on image adds the tag to its tags (unless already in there), alt-clicking removes. Mouse-over with control held loads image larger in a floating tooltip for quick-look ("are her eyes green? I can't remember...")

Also, filter by tags, ofc. So you *just* look at everything tagged "blue eyes", then go through adding "aqua eyes".

This further implies: auto-replace. A text widget where you specify all tags to be removed when you add the current tag. Implementing already-used wildcards like '%color', so you can say '%color eyes' and remove any eye color when you add aqua eyes.

Letting users save these rules and load them again next time would save a lot of their trouble, so I think I should do that too.

This would still be a ton of fucking work for 10k images, but it's never *not* going to be a lot of work to manually correct 10k+ images.

Anyway, much to think about
>>
File: 121364342_p1.png (229 KB, 857x598)
229 KB PNG
>>108982642
>>
>>108982652
Forgot to mention: it's vibecoded Claudeslop, of course

I think it will be rather good vibecoded Claudeslop but feel free to /spit
>>
reddit spacing, makes you think
>>
>>108982664
You were all one-shotted by this stupid meme from 2017, one common way of formatting a post (with a long history of use on 4chan) became de facto illegal punishable by mandatory thread derailment because of a few paranoid faggots on /pol/ trying to figure out who was a reddit refugee from the subreddit closures and who wasn't
>>
>>108982669
I normally don't use the meme, but that post is so egregious lol.
>>
>>108982652
>Shift-clicking on image adds the tag to its tags (unless already in there), alt-clicking removes
could just have it detect existence on shift click and remove or add automagically without needing another hotkey
>>108982652
>Mouse-over with control held loads image larger in a floating tooltip for quick-look ("are her eyes green? I can't remember...")
i like this. and could even show the full prompt alongside the expanded preview
the rest is really good too
>>
What's the best perceptual hashing algorithm not taking speed into consideration, I tried PDQ and PHash and they're both kinda crap
>>
File: IMG_6535.png (204 KB, 2403x988)
204 KB PNG
Pls help, for a style lora im use came/cosine with restarts and 3 cycles, and it seems overtrained even with 0.00002 lr. What coul i use instead
>>
>>108982687
that LR is too high for anima. rule of thumb is to take 6e-6 to 8e-6 at batch size 1 and then upscale by times the square root of your effective batch size. so try 8e-6 or 1e-5 at the very least
ive never used came but heard it wants even lower LRs typically so even this might be way too high.
>>
>>108982687
>cosine with restarts
Have this ever given anyone good results ?
>>
Fresh
>>108982713
>>108982713
>>108982713
Fresh
>>
>>108982677
>could just have it detect existence on shift click and remove or add automagically
Could. I guess with an obvious enough visualization of 'present or not' the risk of getting the wrong result isn't there.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.