[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


30 Year Old NEET Virgin 1girl Prompter Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>108976783

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Share the LoRA you bake on HF
It's your duty
>>
And then link it here
It's your duty
>>
File: 00009-3633469999.png (885 KB, 1024x1024)
885 KB PNG
>>108982722
but what if its like, a little shit?
anima's a little tough to train for compared to sdxl.
>>
File: file.png (333 KB, 1216x832)
333 KB PNG
reminder
>>
>>108982687
Just switch to cosine (w/o restarts) and set alpha to half and you will probably get something decent.
>>
>>108982778
>and set alpha to half
dont do that, just decrease lr
>>
>>108982732
>anima's a little tough to train for compared to sdxl.
Really? With the authors default config it's worked great for me.
>>
>>108982785
got a link to that?
one thing that always confused me is the consensus on tagging, some flat out say "dont tag what you want to see in the final lora" which sounds like some good ole trolling.
>>
>mfw Resource news

06/04/2026

>Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-Infinity

>DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities
https://github.com/sadjadeb/DetectZoo

>ComfyUI KSampler Matrix Lab
https://github.com/btitkin/ComfyUI-KSampler-Matrix-Lab

06/03/2026

>Ideogram 4.0: Open model at the forefront of design
https://ideogram.ai/blog/ideogram-4.0

>JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-LongVideo-Page

>Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting
https://github.com/shenytzzz/Follow-Your-Preference

>LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation
https://github.com/qixinhu11/LongLive-RAG

>MAI-Image-2.5
https://microsoft.ai/models/mai-image-2-5

>AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation
https://aad-1.github.io

>Inference-Time Scaling for Joint Audio-Video Generation
https://jung-jaemin.github.io/ITS-AVGen-Proj

>Video-Mirai: Autoregressive Video Diffusion Models Need Foresight
https://y0uroy.github.io/Video-Mirai

>Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
https://github.com/phoenixnir/FLAME

>VISReg: Variance-Invariance-Sketching Regularization for JEPA training
https://haiyuwu.github.io/visreg

>HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
https://HumanNOVA.github.io

>Cosmos 3: Omnimodal World Models for Physical AI
https://research.nvidia.com/labs/cosmos-lab/cosmos3

>TGV-KV: Text-Grounded KV Eviction for Vision-Language Models
https://github.com/Danielement321/TGV-KV

>JAVEDIT: Joint Audio-Visual Instruction-Guided Video Editing with Agentic Data Curation
https://ryanchenyn.github.io/projects/JAVEdit
>>
>mfw Research news

06/04/2026

>Imagine Before You Draw: Visual Prompt Engineering for Image Generation
https://arxiv.org/abs/2606.04457

>Efficient and Training-Free Single-Image Diffusion Models
https://haojunqiu.github.io/efficient-SID

>DSA: Dynamic Step Allocation for Fast Autoregressive Video Generation
https://arxiv.org/abs/2606.04432

>Activation Steering of Video Generation Models via Reduced-Order Linear Optimal Control
https://arxiv.org/abs/2606.04775

>MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer
https://mesh-flow.github.io

>Video2LoRA: Parametric Video Internalization for Vision-Language Models
https://arxiv.org/abs/2606.04351

>UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation
https://arxiv.org/abs/2606.04264

>Crafting Your Evolving Dreams: Concept-Incremental Versatile Customization
https://arxiv.org/abs/2606.04797

>InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space
https://openimaginglab.github.io/InstantRetouch

>ChannelTok: Efficient Flexible-Length Vision Tokenization
https://channeltok.github.io

>Controllable Dynamic 3D Shape Generation via 3D Trajectories and Text
https://cvlab-kaist.github.io/T2Mo

>MaCo-GAN: Manifold-Contrastive Adversarial Learning for Single Image Super-Resolution
https://arxiv.org/abs/2606.05068

>MAOAM: Unified Object and Material Selection with Vision-Language Models
https://jadenpark0.github.io/project_pages/maoam

>Impostor: An Agent-Curated Benchmark for Realistic AIGC Manipulation Localization
https://arxiv.org/abs/2606.04545

>An Empirical Study of Data Scale, Model Complexity, and Input Modalities in Visual Generalization
https://arxiv.org/abs/2606.04409

>Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges
https://arxiv.org/abs/2603.20304
>>
https://github.com/Comfy-Org/ComfyUI/pull/14216

https://bernini-ai.github.io/
>>
>>108982813
No that's not trolling.
Trigger word is supposed to encompass everything you want to see in the lora. You tag things you don't want the AI to learn so that they won't show up uninvited when you use the lora (the separation isn't perfect in any model and success depends on regularization, dataset diversity, etc.)
Like if some1girl has a red hat in some of the illustrations in your dataset, and if the red hat is not her usual outfit/something you don't want the AI to learn, than you should tag red hat alongside some1girl.
Features of some1girl shouldn't be tagged.
BUT if the character has multiple hairstyles/outfits etc. so that you might want to use your lora in a more modular way then you can tag them.
This will make your lora more verbose to use as well.
Example some1girl, black hair, hime cut for first look.
some1girl, red hair, short hair for second look.
etc.
>>
File: 2105529607.jpg (24 KB, 561x367)
24 KB JPG
>wan2gp updated to output stereo audio for ltx
it's kino time
>>
>>108982813
>>108982713
>>Anima
>https://huggingface.co/circlestone-labs/Anima
It's right there on this accounts' page.
>>
>>108982827
Can it do nsfw?
>>
>>108982827
>>108982851
yeah that's what I'm interested in
use a 5 second hardcore clip with a female celeb's face
>>
Ive consumed too much porn and my brain is fried. As a result I will never be able to learn the intricacies of advanced AI usage. Just the bare minimum
>>
>>108982865
same,
I've legit brain damage from 17 years of chronic depression and pseudo-dementia
I can't make workflows
>>
>>108982865
I have learned most of the intricacies of AI usage and I still mostly coom to regular porn and coom art. There is still something more kino about the real deal, at least yet.
>>
>>108982865
but the jews said cooming is safe and effective and nofap will give you stage 7 ball cancer
>>
File: 1768732283924490.png (961 KB, 896x1216)
961 KB PNG
anon bakes the best loras
>>
>>
File: 1750631483006736.gif (511 KB, 840x488)
511 KB GIF
my custom node to the manager got added
>>
File: 178923723768942873.png (332 KB, 600x323)
332 KB PNG
>mfw when watching a tutorial of anything
>>
File: 024206CUI_00002_.png (1.17 MB, 1536x1152)
1.17 MB PNG
>>
I never under why there are so many sampling methods
>>
>>108982924
good job anon i always believed in you
>>
>>
File: funtimes.png (1.07 MB, 1020x645)
1.07 MB PNG
if you fags aren't using perceptual losses in your training you should probably go kill yourselves
>>
>>108983066
Prompt and model?
>>
Someone list the sampling methods in order, from best to worst
>>
>>108983066
Like a validation dataset?
>>
>>108983040
>>108983085
you need more than one?
>>
i just use the default
>>
my life has no meaning, I don't even enjoy prompting no more
>>
>>108983078
Klein 9b, lora trained for just 300 steps with https://github.com/BuffaloBuffaloBuffaloBuffalo/ai-toolkit-perceptual 8 image dataset. this shit is seriously a game changer. i'm going back and redoing all my old models with it.
>>
>>108983085
Can't be arsed with order but stuff I've used:
euler
euler a
dpmpp_2m
ddim
res_multipstep
res_2s
res_2m
uni_pc
Just pick some that isn't absolute ass and roll with it. Not much point overthinking what boils down to seed lottery ultimately.
>>
Oh yeah, today was the new WaiAnima version release.

This gen loses some sharpness versus vanilla.
>>
>>108983106
I thought the image was a meme I'd already seen before so my asking was poking fun at you but now I realize it's actually a Gen.
>>
>>108983112
1280x1600, same seed as before.
>>
>>108983112
the line curves look much better in the new one, even if it's slightly blurrier. can be fixed with an upscaler
>>
Please take shitmerge tests somewhere else desu
>>
>>108983121
And Nipi style.

Overall, I think there's less Wai advantage over vanilla compared to the Preview 3 versions.
>>
File: BasevsWAI.jpg (521 KB, 3075x1146)
521 KB JPG
>>108983112
I did some tests with it yesterday and wasn't a big fan. It seemed to override the loras I used by a lot.
>>
>>108983106
did you use weight noising or perceptual anchoring or both?
>>
>>108983106
Thanks for your advice,

I might retrain my zimage lora with this and report back too
>>
>>108983112
Turbo comparison.
>>
>>108983149
1280x1600. Lol at the floating wine glass in Wai 12-step.
>>
>>108983142
both, just the default flux config in the quickstart dropdown (which why the fuck doesn't regular ai toolkit have? )
>>
>>108983156
And Nipi. More noticeable difference here.
>>
>>108983106
Anyone tested it with Anima?
Seems promising.
Maybe I should try it with ZIT or something.
>>
File: ComfyUI_29058.png (3.31 MB, 1500x1920)
3.31 MB PNG
>>108983066
I'm creating a new dataset for it, but Z-Image Base isn't even fully supported yet (preset-wise) so I'm in no rush.

>>108983085
I use four separate/discrete sampler/scheduler pairs in my workflow. Best and worst is fully dependent on what you're doing and how many steps you're using.
>>
>>108983186
>ZiT can take up to 200 steps per training image to converge
Well fuck.
Ok maybe Klein? Please don't tell me this insists on diffusers format just like the ai-toolkit.
>>
>>108983186
does it even support anima? judging by readme it doesn't
>>
>>108983085
res_2s
euler family (OG is still the best to try new things with)
deis_2m for style transfer tasks
lcm on distills
>>
File: 034428CUI_00002_.png (1.73 MB, 1536x1152)
1.73 MB PNG
>>
File: 1775122670326980.png (1.2 MB, 1536x1792)
1.2 MB PNG
>>
>>108983204
i have used it on ZiT with great results, it does take a lot of steps but you don't need nearly as big a dataset. i think i had 10 images of my character and it turned out better than regular ai-toolkit with 70 images. and i tried a lot of different params before that, probably like 30 runs? the one with the perceptual stuff was way way way better and did it on the first try
>>
>>108983207
That's why I asked if anyone tested
>plus the other architectures you can load from a config.
But you need to tinker with training parameters.
>>
>>108983227
Do you use masks?
It says 200 step per image, which would be 1600 steps for 8 images. But yours converged at 300 steps? Did you deviate from the default settings in any other way?
>>
>>108983253
i tried masked and unmasked, masked converged quicker. but this was flux, it trains quicker than ZiT which is the one that takes so many steps.

side note, weight noising seems to reduce some of the regular flux body horror. fingers still suck but that's flux for ya
>>
File: 040151CUI_00002_.png (1.82 MB, 1536x1152)
1.82 MB PNG
>>
File: stretchy.png (943 KB, 1017x643)
943 KB PNG
also staying limber is important anons, be sure to do your stretches while you watch those loss graphs
>>
>>108983279
>>
>>108983279
>>
File: 041013CUI_00002_.png (2.16 MB, 1536x1152)
2.16 MB PNG
>11°C
at least the gpu wont overheat
>>
File: 1774403248767455.png (2.41 MB, 1792x1536)
2.41 MB PNG
>>
>>108983279
lora? or do I have to train my own
>>
>>108982820
>>108982821
thanks!
>>
File: disobedience.png (921 KB, 1016x670)
921 KB PNG
>>108983357
sorry anon you gotta do the cookin by the book. too degen for me to post publicly.
>>
File: debo_lr_anima1_00095_.png (2.73 MB, 1792x1075)
2.73 MB PNG
>>108983367
:)
>>
>>108983372
cringe you must be british
>>
is there any way to use klein without getting body horror
>>
>>108983404
i do a high step generation to minimize the number of mistakes, and then i re-run it with "increase image quality" with low step count to un-burn it
>>
File: rule_britania.png (1.01 MB, 1016x670)
1.01 MB PNG
>>108983387
>>
>>108983425
gonna train this shit on anima, pussy biiiiitch. gonna upload this shit too
>>
https://www.youtube.com/watch?v=i8ju_10NkGY
>>
File: ayyyyy.png (575 KB, 792x550)
575 KB PNG
>>108983436
>>
>>108982713
People who don't want more 'legacy' code, code which cannot be touched are unanimously wrong about concepts.

It is a case of luddites dismissing hyper IQ programmers. Yes the oversaturation of normies is the reason why you're losing.

They have always hated people smarter than them, they keep people like the original creator out. They hate the creator.
>>
The way to deal with these people and find them out is to give them your resume, with a different name, if they reject you know they are already undermining you.
It is the same as bigotry. Watch out for their absolutism.
>>
File: 1780635517110.png (305 KB, 1031x1085)
305 KB PNG
>>108983459
>>108983464
>>
File: indubitably.png (655 KB, 825x551)
655 KB PNG
>>108983478
i, too, find this shallow and pedantic
>>
File: 18646.webm (3.6 MB, 512x512)
3.6 MB
3.6 MB WEBM
>>
>>108983528
what am i looking at here?
>>
>>108982732
Which artist is this supposed to be
>>
>>108983553
more music experiments
>>
NEVER USE A MESSY RECIPE

>>108983579
https://rule34.xxx/index.php?page=post&s=list&tags=smmyart+jessie_%28toy_story%29+
>>
bernini bros do this >>108982864
>>
Random tip: If Anima is stubbornly forcing your fingerless gloves to be elbow gloves even with heavy negative tagging, it helps to add "bare arms" as a positive tag.
>>
File: 00096-3885682245.png (1.97 MB, 1280x1792)
1.97 MB PNG
I've learned so much about LORA TRAINING tonight!
>>
>>108983664
I'm trying to get it to work
>>
someone share some klein 9b workflows
t2i
i2i
or whatever else it is used for
>>
File: 00097-1288282294.png (1.33 MB, 1792x1280)
1.33 MB PNG
>all the ones where her hair is totally pink have extra text but the text itself is perfect
>>
VERY much considering baking a Michelle Trachtenburg LoRA later
>>
File: 1778286099418126.png (3.34 MB, 1344x1984)
3.34 MB PNG
>>
Klein 9B is so abysmal bad at face and character consistency in image edit, when you change face expression, hairstyle or pose. Can't even fucking keep skin color and retires to default 1girl.
However it's working reasonably ok undressing celebrities with the snofs finetune (als long as you don't change the image too much elsewhere), I give it that.
>>
File: pepe.gif (61 KB, 570x537)
61 KB GIF
>>108983744
>However it's working reasonably ok undressing celebrities with the snofs finetune (als long as you don't change the image too much elsewhere), I give it that.
workflow
>>
>>108983752
You don't need a special workflow for that, it works out of the box on any i2i wf. Can even be done in Forge Neo i2i.
>>
File: 00106-710586046.png (1.55 MB, 1280x1792)
1.55 MB PNG
>>108983744
>>108983758
Flux Klein 9b is like a videogame that gets good 20 hours in. I've had plenty of success getting what i wanted out of edits, making turnarounds of single-shot characters, but it's like pulling teeth.
>>
File: gllllll.png (1.04 MB, 955x669)
1.04 MB PNG
>>108983106
holy fucking shit you weren't kdding, this is 300 steps with a 9 image dataset gotdammm
>>
>>108983773
There are some characters, that it manages to copy over more or less recognizably into a gen. But as soon as a character deviates from default human, the thing falls apart and will gen its 1girl sameface, even with an extensive prompt and consistency / faceswap loras.
>>
>>108983784
doesn't look like her
>>
File: XwvPhQ4pgq6zBUjX94gNGW.jpg (993 KB, 2624x2600)
993 KB JPG
>>108983841
yeah you're right they look nothing alike

faggot
>>
File: 1755947532148596.png (2.34 MB, 1664x1664)
2.34 MB PNG
>>
>>108983885
ai one looks like a hag compared to the real one
>>
>>108983898
ut oh melty
>>
>>108983898
that's the tism speaking fren
>>
File: mafw.png (254 KB, 900x806)
254 KB PNG
>>108983908
you're the one calling me the f word. i am cool as a cucumber. train it more
>>
>>108982713
>>108982713

first time i made it to OP in like 2 weeks and it was the most low effort random buzz word gen i have done in that time
>>
>>108983784
>>108983885
>>108983841
>>108983898
i dunno who this girl is but they look like the same woman
>>
NEED pixelspace editing for the love of god
>>
>>108983066
What a degenerate asshole
> Look, Im building something cool
> Look, Im implementing it in the most backward trainer ever

Does every so-called smartass have to be a retard at the same time? The only reason he implemented it in the AI Toolkit is the reach of that TikTok trash. As if standard AI Toolkit users could actually do anything with it.
>>
>>108983916
melty
>>
It's embarrassing that it took me this long to ensure prompts are a single line with Anima. It hates new lines.
>>
File: MayBattle2_n.jpg (1.61 MB, 720x8332)
1.61 MB JPG
>>
>>108983967
huh? all the trainers are literal trash. they're just wrappers for diffusers scripts.
>>
File: Ideogram_4.0_00005_.png (3 MB, 1056x1536)
3 MB PNG
ideogram is good when you use the kj prompt builder node for it
>>
>>108982713
sfw vageen
>>
>>108984027
Lol
>>
File: ComfyUI_29062.jpg (3.65 MB, 1500x1920)
3.65 MB JPG
>>108984002
This. None of them are any good and can be a pain in the ass to install, setup or use.

>>108984027
lol
>>
>>108983995
nice
>>
>>108983106
>>108983784
can it be used to train a LoRA of this girl I stalk?
I only have 6GB of VRAM
>>
File: Ideogram_4.0_00021_.png (1.91 MB, 1024x1024)
1.91 MB PNG
>>
File: anima cat gen.png (364 KB, 896x1152)
364 KB PNG
I have committed the mistake of trying out perceptual loss meme. Spent a good amount of time clearing out disk space and re-downloading the models I already have for the hecking diffusers format meme because the worthless faggot garbage tool called ai-toolkit he built on top of demands you to have qwen 8b in fp32 for some unfathomable reason. NOOOOOO you just can't use bf16 qwen 8b checkpoint you already have on your computer and you are using for Comfy. YOU JUST CAN'T DO THAT OK CHUDDIE???
That's not the part that broke me though. I had to needlessly sit through quantizing qwen 8b. And then it completely froze my system when it reached the encoding latents part. I had to power cycle without proper shutdown. Ok I am a ramlet, I only have 32gb memory, I am not delusional about that. But I also have 30gb zram and 40gb swap partition. It has NO reason to completely exhaust that much memory. I am guessing it didn't free up the fp32 qwen 8b weights after it was done quanting. Stupid bugged slopped garbage.
FUCK AI-TOOLKIT just fuck that garbage tool. The most slopped brain dead barebones shit for worthless jeet animals.
HEY DIPSHIT
If you have any useful, interesting, worthwhile ideas FOR THE LOVE OF GOD implement them over a proper tool like OneTrainer.
THANK YOU FOR YOUR ATTENTION TO THIS MATTER.
>>
>>108983995
Absolute garbage, piss off
>>
>>108984073
if it wasn't going full retard dunking his cigar in lean this'd be a 10/10
>>
File: niggers.png (832 KB, 607x921)
832 KB PNG
>>108984078
i got 12gb vram and 32gb ram and it works fine. also just rename your fucking model weights you don't need a whole different download faggot. you people act like such victims i swear to god
>>
>>108984078
I share your deep-seated hatred
musubi tuner > ai toolkit
in absolutely everything. only those TikTok kids need to be spoon-fed with a fancy UI and managed downloads because they can't handle two .toml files and three commands
>>
why is this thread so hostile? what is it about local diffusion that attracts trannies?
>>
>>108984139
hostile? i'm not hostile. fuck you
>>
>>108984139
4chan is a hivemind for autistic people, and autists like to vent their emotions without filter.
>>
>>108984145
oh
>>
It's sad how bad the stable diffusion Reddit is these days.
Back in the day is was more like local llama.
Now it's just some of the most braindead posts I've seen.
What makes it worse is that this directly affects the type of models we get...
>>
>>108984106
kneega, I just deleted my Stephanie meme and porn collection the other day
>>
>>108984145
> When I'm pissed off, I lock myself in my room and cry alone in the corner, or I suppress my emotions so the other person won't reject me

Grow a pair
>>
>>108984157
I can see you're an illiterate moron, anon.
>>
File: Ideogram_4.0_00023_.png (3.45 MB, 1920x1088)
3.45 MB PNG
>>108984073
spent 17 minutes genning a three stooges gen that looks nothing like the three stooges award
>>
>>108984154
sorry fren i hope you don't relapse
>>
>>108984180
says the conformist work drone with no personality
>>
>>108886593
it's now part of ComfyUI Manager
https://github.com/ShankHub-Org/ComfyUI-Artist-Helper
>>
>>108984192
I can't afford the luxury, I don't think the glowies would mind, but alas, just in case. btw it's hard to catch a stephanier in the wild, we're very few
>>
File: 1780588675284962.jpg (122 KB, 591x674)
122 KB JPG
>>108984106
i have 3090 eyeing 5090, but all i could manage to generate was literal slop with 7 fingers, horrific anatomy, did models get better or your prompts are like 10 stories tall?
>>
File: Ideogram_4.0_00024_.png (1.74 MB, 1024x1024)
1.74 MB PNG
nice try lmao
>>
>>108984195
You think sperging out in autistic glory constantly is akin to having an interesting personality?
Yeah just keep believing that anon. You'll be a loser for the rest of your life.
>>
>>108984106
Renaming doesn't do shit since it is expecting 00001-of-0000x format and fp32.
I managed to get past latent encoding with exhausting memory and it crashed after text encoder caching.
Now I tried again and I am at downloading depth checkpoint.
I will see if I can get to the training.
The inconsistency is still a problem regardless.
>>108984134
In this case the baby UI isn't even there. This fork for just uses the shitty backend with a toml.
>>
any good z-image and sdxl model?
im tired of searching
>>
>>108984227
>z-image
The OG Z-Image Turbo itself. Asking for shitmixes of a distilled model in particular is a bad idea.
>sdxl
Irrelevant in 2026.
>>
>>108984204
not op but i used it too. you gotta keep the batch / grad accum ratio the same for the LR. if you go down to batch 1 like i had to, you gotta make grad accum 4. otherwise LR is too much. hope that helps. im just a dumb jeet tho with 16 gb card.
>>
File: z-image-base_00024_.png (1.33 MB, 1024x1024)
1.33 MB PNG
>>108984233
Z-Image is good but it can't draw nipples.
sd/xl is the only platform which has worked for me.
>>
>>108984246
Lora.
Klein 9b has better NSFW loras if you can put up with the more plastic look (And judging by the fact that you are asking for SDXL I would say the answer is yes.)
>>
imagine working for comfyui for free >>108982924
>>
>>108982827
>>108983664
testing. fps mismatch, looks like u have to edit the source video fps
>>
>>108984263
imagine never contributing a goddamn thing to the world because you have no skills
>>
>>108984292
> contributing a goddamn thing to the world
lol

> no skills
> co-authored by claude opus
kek
>>
>>108984299
>>108984263
I did it for a little project work and used ChatGPT instead of Claude
>>
>>108983106
looks great, ty for linkage!

this apparently still has that annoying bug where jobs forget datasets
>>
looks like it doesn't track iris?
>>
File: ComfyUI_00027_.png (1.96 MB, 1088x1936)
1.96 MB PNG
Trying ideogram4. Trump isn't blocked, nice.

Has anyone tried the nvidia dip or whatever with id4?
>>
File: ComfyUI_00028_.png (1.72 MB, 1088x1936)
1.72 MB PNG
>>108984402
Kek.

I hope there will be a way to use i2v to overlay text and design generated by id4 onto an image. At least transparent design generation.
>>
>>108984402
>ideogram4
can this be used for img2img as an enhancer or upscaler?
>>
Interesting, the groin is completely blurred off, as if the training data was censored manually.
https://litter.catbox.moe/o24c56axv97w5705.png

>>108984424
Probably, I'm only seeing one value that resembles denoise.
>>
>>108984402
>Trump isn't blocked
based patriot model
>>
>>108984215
It started OOMing with caching depth latents because for some reason it did not unload the unet from vram before doing them. I managed to get past it by switching to uint4 temporarily.
But now I am getting OOM crash after making latents. I tried accumulation, even lower quants but they did not help. I tried layer offloading but that freezes the system just after quantizing the model DESPITE ample free swap. My best working guess is that it does some retarded memlock/pin fuckery that prevents the system from utilizing swap properly. But I managed to get a tty session this time and used it to see what was was going on it was like 1/3rd CPU use with extremely little memory use so I have no idea honestly.
Maybe I am the retarded one here to bother with this shit.
Whatever the point is don't be me and don't bother with this shit, and a very late good night to you all.
>>
File: ComfyUI_00020_.webm (2.84 MB, 954x1136)
2.84 MB
2.84 MB WEBM
https://files.catbox.moe/m1gfey.mp4
This shit is too slow for me, even with a 5090.
>>
File: ComfyUI_00032_.png (1.29 MB, 1264x1680)
1.29 MB PNG
>prompt for trump groping a mexican woman

Interesting..
>>
File: ComfyUI_00034_.png (3.95 MB, 1264x1680)
3.95 MB PNG
Damn, this is exactly what I imagined the gen to look like.
I have to afk but I'm really hopeful of this model, zimage might finally be beat.

The regional prompting capabilities it has is probably the best feature.
>>
>>108984469
UNBLOCK
>>
I fucking hate comfyui with passion. Compared to reforge it's slow as fuck, produces worse results and every time some new model comes out everything is fucked up. Try some shared workflow? It's full of custom nodes that don't work. Manager shows a bunch of red nodes that can not be fixed despite reinstalling again and again. There must be some kind of botfarm pushing this shit because nobody in their right mind would use it instead of reforge or other a1111 forks.
>>
File: 35745.gif (1.97 MB, 224x128)
1.97 MB GIF
>>
>>108984353
can you upload the orignal belle clip?
>>
>>108984702
>>
>>108984744
thanks
>>
File: 545789.png (2.56 MB, 3330x1834)
2.56 MB PNG
my hard drive is crying, but my ears are cheering
>>
File: ComfyUI_00002_.png (1.39 MB, 1024x1024)
1.39 MB PNG
>>108984758
there is no point in video generation
>>
>>108984772
you would understand if you heard what i'm hearing
>>
>>108984353
The only models in general that gets face expressions right or Google, OpenAI, and Krea.
Benchmaxxers found that "1girl looking at viewer" increases scores from raters so this is what we've had for the past 4 years.
>>
File: 041804CUI_00002_.png (2.3 MB, 1536x1152)
2.3 MB PNG
>>
>>108984467
>>108984353
Do hardcore porn. Thanks
>>
File: 4554.jpg (645 KB, 1350x1800)
645 KB JPG
>>108984878
>>
high pitch singing is pretty good quality with ltx, i haven't heard it in other models
>>
>>108984878
this low res garbage is maxing out my 5090, I won't even try
>>
video gen is still all awful
>>
Do you tag wideshow,cowboyshot, eye level etc... for anima loras?
>>
File: flare.jpg (54 KB, 976x850)
54 KB JPG
easiest way to locally train a lora right now?
>>
>>108985263
anima standalone for anima, onetrainer for everything else
>>
>>108985263
onetrainer, aitk or the aitk fork >>108983106 mentioned earlier. for some people also plain kohya_ss

furthermore good training data and a powerful model that mostly already understands ~about the same type of stuff you want to train. and more powerful local hardware.
>>
I wrote my loras by hand in notepad
>>
>>108985273
>>108985310
I'm training a lora off sprites. Should I use anima?
>>
just did a simple search on civitai
IL has so many more LoRA than Anima it's insane
Will Anima ever catch up?
Should I train my art style LoRA on IL?
Keep reading to find out
>>
>>108985419
>IL has so many more LoRA than Anima it's insane
have you considered that illustrious 0.1 came out two years ago, but anima preview 1 just four months ago?
>>
>>108985216
>video gen is still all awful

Au contraire
https://files.catbox.moe/eutd19.mp4
>>
>>108985442
why would anyone make Anima LoRA if they can't make money off it?
>>
>>108985470
can i make money by just posting illustrious loras on civitai?
>>
>>108985474
there are double digit figures up on patreon available
>>
>>108985480
but i was talking about civitai?
>>
File: 1767347015535890.png (245 KB, 720x391)
245 KB PNG
>>108985419
>IL has so many more LoRA than Anima it's insane
>>
>>108985484
link your patreon in all your loras
>>
>>108985470
because their skin isn't brown
>>
>>108985493
how many people posting illustrious loras on civitai do so?
>>
>>108985510
loads of them, for patreons making close to nothing lol
>>
>>108985390
it is worth trying. if it works it'll probably be pretty nice. if not I personally would likely try klein or qwen next.
>>
File: 00005-1091378678.png (1.4 MB, 1024x1536)
1.4 MB PNG
>>108984078
not wrong, but i also respect stephanie-poster so much that i'm gonna agree with him on principle alone, call you a fag, and to just rename + structure the folders like how it wants, i had to do that to get citron's anima trainer going since it's by default meant to download the models for you as if you're too retarded to already have them for genning.

anyway my first lora run going off the anima creator's settings for his style lora worked, but, i have to rework my dataset again to get rid of the face shading.
>>
>>108985470
YOU A DIRTY ASS NIGGER THERE AIN'T NO WAY TO SAAAAY IT
(everyone that early access gates models like this deserves a holocaust)
>>
>>108985644
can I train an anima lora in comfyui?
>>
kohya sd scripts onetrainer which one actually gives best output quality for anima with mostly default settings?
>>
>>108985713
2 days retard
>>
>>108985742
The anima PR for onetrainer hasn't even been merged yet
>>
>>108985759
take your 5k buzz for your undercooked samefaced slop realistic slopmerge and shove it up your ass
>>
File: 05-46-2026.jpg (390 KB, 936x1368)
390 KB JPG
>>108985713
people pay for this?
>>
>>108985713
Looks like garbage
>>
I can't believe all this stuff and sites like civitai and hugging face are free
>>
>>108985799
>someone paid 5k buzz to gen this
>>
>>108985819
crazy world
>>
>>108985819
how much is 5k buzz
>>
>>108985838
5
>>
>>108985846
Can I swap Buzz for cash?
>>
>>108985819
>5k buzz for SD1.5 1girl
>>
>>
You can only cash out yellow buzz but you need to upload a certain amount of content and pay their retarded subscription so subtract $10 off of that.
>>
>>108985902
Meant to reply to >>108985893
>>
File: 00109-2965409291.png (1.4 MB, 1280x1792)
1.4 MB PNG
It's $5 to get to 5k buzz, but to take it out you have to be approved in their creator program + pay the tard tax >>108985902
>>
>>108985470
So why did you lie here?
>>
##HELP ME PLEASE, MY FIRST ANIMA CHARACTER LORA, 130 IMAGES, ALL 1024 PIXELS RATIO IN TOTAL
-----------------------------------------
[general]
shuffle_caption =
keep_tokens =
caption_extension = ".txt"

[[datasets]]
resolution =
batch_size =
enable_bucket =
min_bucket_reso =
max_bucket_reso =
bucket_reso_steps =

[[datasets.subsets]]
image_dir = "F:\\SD\\sd-scripts\\mylora_images\\"
num_repeats =
---------------------------------------------
@echo off
cd /d F:\SD\sd-scripts

call venv\Scripts\activate

accelerate launch --num_cpu_threads_per_process 1 sdxl_train_network.py ^
--pretrained_model_name_or_path="F:\SD\ComfyUI_windows_portable\ComfyUI\models\diffusion_models\anima-base-v1.0.safetensors" ^
--dataset_config="F:\SD\sd-scripts\_config.toml" ^
--output_dir="F:\SD\sd-scripts\mylora_output" ^
--output_name="MyFirstAnimaLora" ^
--save_model_as=safetensors ^
--network_module=networks.lora ^
--network_dim= ^
--network_alpha= ^
--network_dropout= ^
--learning_rate= ^
--unet_lr= ^
--optimizer_type= ^
--lr_scheduler=^
--lr_warmup_steps= ^
--max_train_epochs= ^
--save_every_n_epochs= ^
--mixed_precision= ^
--no_half_vae ^
--gradient_checkpointing ^
--cache_latents ^
--cache_latents_to_disk ^
--network_train_unet_only ^
--noise_offset= ^
--max_token_length= ^
--min_snr_gamma= ^
--seed=42

pause
---------------------------------------------------------------
##HELP ME PLEASE, MY FIRST ANIMA CHARACTER LORA, 130 IMAGES, ALL 1024 PIXELS RATIO IN TOTAL
>>
>>108985994
you pasted all that but didn't even include the actual settings values
>>
>>108985941
It's also roughly 2k = $1, not 1k. Though it fluctuates.
>>
how to make civitai detect models i used on comfy?
it used to detect them just fine now it won't.
>>
>>108985996
because I want (You) to fill them!
>>
>>108985998
If I have 270k buzz I never use how do I cash out efficiently
>>
how much would it cost to just redo anima from scratch?
>>
I want this world so bad, but I have no nVidia card.
>>
>>108986004
How do you have so much buzz but don't know how to use the site
>>
>>108985994
I DIDN'T PUT THE VALUES BECAUSE I DON'T FUCKING KNOW WHAT TO PUT.

THERE ARE 130 IMAGES, ALL CAPTIONED, DANBOORU TAGS AND A SHORT CAPTION FOR EACH IMAGE AND @TRIGGER_TAG. THAT'S ALL I KNOW.
>>
>>108986014
I post LoRAs and never use the site
been posting for a while and just gained it over time by doing nothing
>>
>>108986004
I think it has to be yellow buzz. correct me if I'm wrong.
>>
>>108982999
this but its leon and chris
>>
>>108986004
start uploading slop in droves, especially vidgens, wait 6 months to be partnered. ezpz.
>>
>>108986017
https://civitai.red/articles/27484/anima-lora-preset
>>
>>108986017
https://github.com/gazingstars123/Anima-Standalone-Trainer
Just do default settings.
>>
>>108986017
>@TRIGGER_TAG.
triggers only do something when you are training more than one thing, else they just polute
>>
i've tried nearly every one of those anima standalone trainers, citron's is really the best one. actually installs RTX 5000 support perfectly with its bat file, and has actual configurable settings and presets that work + export cleanly. most of those trainers don't even have half of that.

https://github.com/citronlegacy/citron-anima-lora-trainer-ui
>>
Minimum recommended hardware to create images that are actually satisfying?
>>
>>108986035
all wrappers/GUIs add unnecessary complexity
put shape in shape hole
there's no making that easier, GUI or not
adding a GUI just adds a point of failure
>>
>>108986045
16gb RAM laptop
https://civitai.com/models/2168935/z-image-turbo
>>
>>108986045
4GB VRAM depending on your definition of satisfying, but go for minimum 6 GB if you can
>>
>>108986045
nvidia gpu from the past 5 years, 8gb vram, that's it really.

>>108986049
>adding a gui just adds a point of failure
you CLI cultists are freaks, but credit were its due, basically every one of these anima trainers are vibecoded slop that as i said, fuck up really elementary things about trainers. Citron's is the only one that gets it right. As good as it gets.
>>
File: night2.png (3.06 MB, 1024x1536)
3.06 MB PNG
which model/checkpoint is this?
>>
>>108986066
gpt
>>
>>108986066
LigmaMixv0.1
>>
>>108986059
Let me explain this to your retard brain in a way that communicates easier:
In this specific instance in GUI, you double click the thing, enter some text in the box and press run
In this specific instance in CLI, you edit the script file, entre some text on the line and run it
You literally save zero effort, time, anything with GUI vs no GUI
this use case is uniquely fucking worthless as a GUI

of course I use GUIs when they help
they don't help here you fucking ape
>>
File: fact.jpg (57 KB, 716x687)
57 KB JPG
>>108985994
you were rude to me earlier so i will not help you
>>
>>108986059
>Citron's is the only one that gets it right
Kek. It's still a very basic sd-scripts wrapper. The only one that actually adds features is machina's fork of lora easy training script trainer
>>
>>108986074
>You literally save zero effort, time, anything with GUI vs no GUI
easier on the eyes and brain innit
>>
>>108986074
mmm you were also rude to me like the fella who just posted a pepe was mistreated, so, i will not finish reading beyond the first sentence.
>>
fact
>>
Is there an existing workflow to train an anima lora in comfy..... on linux


Or should I just use some other tool
>>
>>108986077
rude website
rude hobby
>>
>>108986109
nobody uses cumfart to train except jeets making slop dataset trainers
>>
File: zImageturbo_00157_.jpg (618 KB, 1840x1152)
618 KB JPG
>>
>>108986126
what shall I use then wise master
>>
>>108986203
diffusion-pipe or just kohya
>>
>https://civitai.com/articles/30980/a-gpu-crunch-and-bumpy-days-ahead
oof & yikes!
>>
>>108986157
damn she really did not like american pie huh


>>108986228
yiiikes! *wilhem scream*
>>
File: bobina3.png (778 KB, 896x1152)
778 KB PNG
>>108986228
kek based crypto chads
>>
>>108986228
so they unironically dont have any long-term contracts? are they fucking retarded?
also i thought pearlcoin is not profitable anymore? vast got somewhat affordable again over the past few days
>>
File: 1754981553949528.png (3.81 MB, 1936x1088)
3.81 MB PNG
>>
people are mining on gpus? is it 2018 again?
>>
>>108986257
nice gen
>>
>>108986257
catbox?
>>
>>108983106
intredasting. I'm not in a hurry and hope onetrainer implents it eventually, seems big
>>
>>108983106
"guys look I can train a face photocopy"
are you CeFurkan you fuckin dolt
>>
>>108986312
>besmirching the name of the furkGOD
watch your tone chrissy
>>
>>108983066
so an automated masked training?
>>
>>108986246
GPU rentals are an absolute shitshow for everyone. It's literally like buying from drug dealer, everything is done through word-of-mouth backroom deals. A proper long-term contract is difficult to come by and extremely expensive if you do get it. Companies are doing things like renting individual Runpod instances to use as their productions GPUs.

t. someone who works in the industry
>>
File: 1752835082056306.jpg (846 KB, 1088x1936)
846 KB JPG
>>
>>108986377
Oh no that means comfycloud is in trouble!
>>
>>108986377
thanks for the insights. well at least i know now that im not the only one who got robbed over the past week with those prices hikes and my training pods literally getting rugpulled with no heads-up kek
>>
good. cloud cucks btfo
>>
>>108986228
why don't they setup their own GPU farm?
>>
>>108986413
that's asking a lot of people that can't even manage their site as is
>>
>>108986413
the entire site is vibecoded
>>
Hey lads
Comfy Anonymous here
What new things would you like to see in ComfyUI?
>>
>>108986479
Horse cocks
>>
>>108986479
More API support.
>>
File: 1771689651135035.jpg (160 KB, 1200x922)
160 KB JPG
>>108986479
mining support
>>
>>108986479
Cute pp's
>>
>>108986479
yoland stepping down announcement
>>
>>108986479
please stop open source support and charge everyone a monthly fee
>>
>>108986479
ComfyUI.cpp
>>
>>108986479
Fire the frontend team
>>
>>108986479
Fire the backend team
>>
>>108986479
bad shill
***
Hello team! ComfyAnonymous here!

I want to open up a space and ask, what features do you dream about for ComfyUI?
What would make your creative workflow aligned with your inner passion?
Here in ComfyORG we build legacies, we are the architects of our own dreams, and we believe that when passion meets tooling, magic happens!
So give us your thoughts because no idea is too big and no dream is out of scope.
This is a judgment free zone of growth!!
***
>>
>>108986479
a bigger grant for tdrussel
>>
>>108986565
>>108986571
>>
>>108986571
+1
>>
>>108986571
This
>>
>>108986479
Jokes about julien in your announcements
>>
File: 1759469819281319.jpg (883 KB, 1536x2728)
883 KB JPG
>>
>>108986565
As a free man I want to choose freely where to store my renders instead of you jewessly making me store them inside comfyui_portable/output or /temp!
>>
>>108986571
this but for people who actually open source
>>
File: 1771045603899632.jpg (717 KB, 1088x1936)
717 KB JPG
>>
>>108986656
that's cool (i also look like this)
>>
>>108986656
That's great.. it's the new model right?
>>
>>108986656
that's cool (i also look like this) (im the guy in white and he's one of the guys i shot >>108986688 )
>>
overall Anima really does just feel like either a side-grade to illustrious 2.0 in most cases, and a solid single step up in others.
it really could've done with either a better architecture or more parameters for the performance tradeoff, it can't handle more than 2 characters in a scene at once without blending outfits and faces together, style loras especially struggle, and being limited to around the same MP limitation as illustrious leaves me scratching my head a bit.
Like i'd be cool with this being the META for 2026 but, by 2027, i'd really like to see something better take its place. It's already a bit frustrating and has me going back to sdxl at points because i essentially run into the same issues anyway but sdxl is at least faster obviously.

that said, all my personal subjective experience/opinion, i can't claim to have deep cut knowledge on how this shit works, maybe there's more potential here but i dunno man.
>>
>>108986479
see
>>108986001
also I need this model https://civitai.red/models/847101/fluxed-up-flux-nsfw-checkpoint give me buzz to buy it. :_:
>>
Well, my portable 5TB drive with my *2d anime* loli gens, loli loras and loli dataset broke from one PC startup to the next. How the hell do I fix it without having to involve another person for obvious reasons?
>>
>>108986762
> Anima
fuck off
>>
>>108986479
Publickly execute the frontend team after a fair trial in which /ldg/ votes through strawpoll.
>>
>>108986762
i think it's much better. for my needs at least. it can do fetishes properly more often than not (femdom, futa on male), which illustrious could almost never do. i think the output is much better as well (artist styles etc), whereas on illustrious you were basically forced to that wai look, on anima you can have that but there's much more variety as well
>>
>>108986854
also, i'm still having fun genning random stuff on anima whereas on illustrious after 1 month i had no real use for it, even with shitmixes. because it was so limited in what it could generate. the shitmixes were just a different coat of paint on it. anima also has many limitations but it can do so much more. i honestly don't know how you can say it's a side-grade desu. maybe if you're just making 1girl, cowboy shot and want the wai look. that's the only situation where the two might be comparable imo
>>
File: 260606-012708_SVI__00001.mp4 (3.73 MB, 672x1056)
3.73 MB
3.73 MB MP4
>>
help retard anon here
how to update linux comfyui?
>>
>>108986762
>it can't handle more than 2 characters in a scene at once
but it can
>style loras especially struggle
git gud
>being limited to around the same MP limitation as illustrious
native 2mp, 4mp is easily possible too with a bit of lora training. illustrious was already struggling at 2mp despite later versions being explicitly trained on it, let alone 4mp with any consistency
>>
>>108986905
The amount of loras I've seen on civitai that are blatantly trained on SDXL slop outputs is staggering. These people don't deserve their GPUs. Nor their eyes, probably.
>>
>>108986936
Honest answer, make sure to symlink everything interesting (model folders, loras, ...) and make a separate install with the new version, also symlinked. If it works great and you can throw the old version away, if not you still got your old version
>t. the fennec faggot fucked me multiple times raw
>>
>>108986968
I also have no clue why people would want to replicate SDXLs sloppy VAE results. I suppose this is the result for spending so much time on SDXL. Newfags have associated slop with generative anime.
>>
>>108986800
try https://www.ccleaner.com/recuva if the drive still shows up
>>
so did anyone figure out how to fix the noisy outputs from ideogram 4?
>>
>>108986990
>bring your own condom in case you get raped
>>
I'd like to generate some movie trailers or movie scenes, is such thing possible yet or nah?
>>
>>108987064
https://x.com/elonmusk/status/2062337074368508253
>>
>>108987064
if you have to ask, then no
>>
>>108986936
comfy update
>>
File: 1759186517646306.png (1.12 MB, 1024x1024)
1.12 MB PNG
uh oh, stinky
another local model exposed for training on outputs from api
https://www.reddit.com/r/StableDiffusion/comments/1txfrhw/ideogram_generated_a_gemini_watermark_without/
>>
Fresh

>>108987212
>>108987212
>>108987212

Fresh
>>
>>108987070
>paid
>>
>>108987193
uh oh stinky uh oh stonky
>>
>>108987193
>training on outputs from api
api gemini doesnt have that thougheverbeit?
>>
>>108987193
This is why LoRAs will never go away.
>>
>>108987009
>sdxl slop
not at all
they simply do not know how to get best of the fine grain+hit depth perspective

having said that yes 90% of loras on civit were, are and always be very low quality and waste of download



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.