/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/ldg/ - Local Diffusion Genera(...) 09/05/25(Fri)00:14:39 No.106488626

File: highlights_g_106481665_17(...).webm (3.26 MB, 1799x2048)

3.26 MB WEBM

/ldg/ - Local Diffusion General Anonymous 09/05/25(Fri)00:14:39 No.106488626

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106481665

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo

Anonymous
09/05/25(Fri)00:16:46 No.106488630

Anonymous 09/05/25(Fri)00:16:46 No.106488630

>>106488620
There is no "version." The model is set default to 16fps/5sec. And people just interpolate to 32. Wan 2.2 can handle raw 20/24fps or maybe even more but you pay with more vram.

Anonymous
09/05/25(Fri)00:19:57 No.106488648

Anonymous 09/05/25(Fri)00:19:57 No.106488648

>>106488626

Ok. Ive been using comfyui for a while. Works great. Just one thing I need though.

Where can I find a checkpoint or Lora that focuses on hot bbc pornstars like Selah Rain or Anastasia Lux or Remi Ferdinand or Mz Dani?

Really dying to create an image of all them getting facials by big black cocks.

Gracias

Anonymous
09/05/25(Fri)00:20:22 No.106488649

Anonymous 09/05/25(Fri)00:20:22 No.106488649

>>106488626
Delete ComfyUI from OP

Anonymous
09/05/25(Fri)00:21:20 No.106488653

Anonymous 09/05/25(Fri)00:21:20 No.106488653

The Gradio clan is fractured. They must join arms to take on the Comfy Menace.

Anonymous
09/05/25(Fri)00:24:30 No.106488663

Anonymous 09/05/25(Fri)00:24:30 No.106488663

>>106488646
I know what you said, it's called hyperbole. And having custom nodes is not even close to have 50 different venvs.

Anonymous
09/05/25(Fri)00:25:31 No.106488670

Anonymous 09/05/25(Fri)00:25:31 No.106488670

File: WanVideo2_2_I2V_00323.webm (309 KB, 1248x720)

309 KB WEBM

Anonymous
09/05/25(Fri)00:30:54 No.106488694

Anonymous 09/05/25(Fri)00:30:54 No.106488694

File: file.png (201 KB, 333x330)

201 KB PNG

Anonymous
09/05/25(Fri)00:34:28 No.106488708

Anonymous 09/05/25(Fri)00:34:28 No.106488708

>>106488694
Sweet Jebus it's alive

Anonymous
09/05/25(Fri)00:36:37 No.106488717

Anonymous 09/05/25(Fri)00:36:37 No.106488717

>>106488670
If trials don't kill him, lung cancer will

Anonymous
09/05/25(Fri)00:36:40 No.106488718

Anonymous 09/05/25(Fri)00:36:40 No.106488718

How can one apply Bayesian thinking to image generation

Anonymous
09/05/25(Fri)00:38:29 No.106488726

Anonymous 09/05/25(Fri)00:38:29 No.106488726

>>106488649
Delete yourself from life.

Anonymous
09/05/25(Fri)00:39:47 No.106488731

Anonymous 09/05/25(Fri)00:39:47 No.106488731

>>106488648
So no Maria bose fans around, eh?

Anonymous
09/05/25(Fri)00:41:22 No.106488741

Anonymous 09/05/25(Fri)00:41:22 No.106488741

File: 1755661824986883.jpg (570 KB, 1416x2120)

570 KB JPG

>>106488336
coincidental

Anonymous
09/05/25(Fri)00:54:27 No.106488799

Anonymous 09/05/25(Fri)00:54:27 No.106488799

>>106488726
You deleted your penis from life.

Anonymous
09/05/25(Fri)01:34:29 No.106488987

Anonymous 09/05/25(Fri)01:34:29 No.106488987

File: 1673502756417396.png (587 KB, 500x666)

587 KB PNG

anyone know how to consistantly prompt facial features in flux/chroma? it still loves the same face, even without use of woman/lady etc. things like words for facial features; i try to prooompt different face shapes and they all end up the same face 25% of the time, and the rest of the time it's standard ai face. actually prompt adherance in general seems to be pretty fucking random desu.

Anonymous
09/05/25(Fri)01:52:52 No.106489055

Anonymous 09/05/25(Fri)01:52:52 No.106489055

File: IMG_4744.jpg (689 KB, 1179x1243)

689 KB JPG

>>106488987
>marx
delete your 4chan account

Anonymous
09/05/25(Fri)01:58:34 No.106489084

Anonymous 09/05/25(Fri)01:58:34 No.106489084

File: WanVideo2_2_I2V_00324.webm (2.49 MB, 1248x720)

2.49 MB WEBM

Anonymous
09/05/25(Fri)02:00:18 No.106489091

Anonymous 09/05/25(Fri)02:00:18 No.106489091

>>106489084
saar...

Anonymous
09/05/25(Fri)02:05:22 No.106489109

Anonymous 09/05/25(Fri)02:05:22 No.106489109

>>106488649
why?

Anonymous
09/05/25(Fri)02:10:39 No.106489131

Anonymous 09/05/25(Fri)02:10:39 No.106489131

>>106489084
lmao

Anonymous
09/05/25(Fri)02:20:04 No.106489164

Anonymous 09/05/25(Fri)02:20:04 No.106489164

>>106489152
what a MASSIVE faggot

ポストカード !!FH+LSJVkIY9
09/05/25(Fri)02:33:34 No.106489234

ポストカード !!FH+LSJVkIY9 09/05/25(Fri)02:33:34 No.106489234

File: blondie00.mp4 (3.6 MB, 720x648)

3.6 MB MP4

>>106488626
blessed thread of frenzone ;3

ポストカード !!FH+LSJVkIY9
09/05/25(Fri)03:20:37 No.106489528

ポストカード !!FH+LSJVkIY9 09/05/25(Fri)03:20:37 No.106489528

>nsfw tentacle janny in collage
;c

Anonymous
09/05/25(Fri)03:26:43 No.106489564

Anonymous 09/05/25(Fri)03:26:43 No.106489564

>>106488630
>Wan 2.2 can handle raw 20/24fps
How would you even control that? Doesn't adding more frames just make a longer video?

Anonymous
09/05/25(Fri)03:26:58 No.106489568

Anonymous 09/05/25(Fri)03:26:58 No.106489568

So nothing really new after flux? Is local kill?

Anonymous
09/05/25(Fri)03:27:46 No.106489572

Anonymous 09/05/25(Fri)03:27:46 No.106489572

File: hmmmmmmmmmm.jpg (208 KB, 930x710)

208 KB JPG

THATS ALOTTA GENERALS!!

Anonymous
09/05/25(Fri)03:30:52 No.106489589

Anonymous 09/05/25(Fri)03:30:52 No.106489589

>>106489564
total frames in empty latent node (default is 81) divided by framerate in the video node (default 16) + 1 frame as the first. So 121 for 5sec/24fps

Anonymous
09/05/25(Fri)03:31:37 No.106489595

Anonymous 09/05/25(Fri)03:31:37 No.106489595

pathetic

Anonymous
09/05/25(Fri)03:32:58 No.106489603

Anonymous 09/05/25(Fri)03:32:58 No.106489603

>>106489589
Wouldn't that result in a sped up video?

Anonymous
09/05/25(Fri)03:42:19 No.106489645

Anonymous 09/05/25(Fri)03:42:19 No.106489645

>>106489572
maybe we can just delete ldg and leave the others?

Anonymous
09/05/25(Fri)03:47:47 No.106489683

Anonymous 09/05/25(Fri)03:47:47 No.106489683

>>106488662
>no need to generate these. they're all on Twitter and they post amazing shit
I took the original image from X (thats why I know thats a tranny) and it was a selfie with phone covering face. I used QIE to put phone away and hands down.
Btw do you know why everyone single one of them love maimai?

Anonymous
09/05/25(Fri)03:55:26 No.106489724

Anonymous 09/05/25(Fri)03:55:26 No.106489724

>>106489084
>>106489091
literally NOT ai
do you have to shit up the general with the news\real-world posting every thread?!!? HUH?!?!?!?!?

Anonymous
09/05/25(Fri)04:06:14 No.106489773

Anonymous 09/05/25(Fri)04:06:14 No.106489773

>>106488987
I consider Flux and Chroma diverse enough with faces. Other than the infamous flux chin, I get good results.
Prompting for specific shapes of faces simply like that cannot work.
It cannot be helped if the training didn't care that much for shapes of faces, some tokens like bushy eyebrows or square jaws might work; I'd suggest a Lora with the type of face you want to get at low strength to preserve some diversity.
Or try specifying age or nationality to move the results.

Anonymous
09/05/25(Fri)04:14:23 No.106489821

Anonymous 09/05/25(Fri)04:14:23 No.106489821

How did flux chin even become a thing anyway? Like what kind of distillation methods did they use that such a distinctive feature would basically contaminate every face.

Anonymous
09/05/25(Fri)04:19:36 No.106489850

Anonymous 09/05/25(Fri)04:19:36 No.106489850

How long are we cursed to only be able to prompt 5-7 second videos? I want to be able to prompt at least a singular minute in one go without having to stitch them together.

Anonymous
09/05/25(Fri)04:20:13 No.106489855

Anonymous 09/05/25(Fri)04:20:13 No.106489855

>>106489850
not having your tentacle girl change positions every 5 seconds

Anonymous
09/05/25(Fri)04:22:05 No.106489867

Anonymous 09/05/25(Fri)04:22:05 No.106489867

File: ComfyUI_00084_.mp4 (1.22 MB, 592x816)

1.22 MB MP4

Anonymous
09/05/25(Fri)04:26:07 No.106489891

Anonymous 09/05/25(Fri)04:26:07 No.106489891

>>106489867
Actually awful.

ポストカード !!FH+LSJVkIY9
09/05/25(Fri)04:27:14 No.106489896

ポストカード !!FH+LSJVkIY9 09/05/25(Fri)04:27:14 No.106489896

>>106489891
>>106489867
at 0.69x speed its watchable i guess ;c
>neg: mouth open, talking

Anonymous
09/05/25(Fri)04:27:49 No.106489900

Anonymous 09/05/25(Fri)04:27:49 No.106489900

File: C4564FDF778B48B188051180F(...).jpg (79 KB, 604x604)

79 KB JPG

Does dataset size incrase vram requirements for training?

Anonymous
09/05/25(Fri)04:32:27 No.106489941

Anonymous 09/05/25(Fri)04:32:27 No.106489941

4chan will strip the workflow data in the video when you upload it?

Anonymous
09/05/25(Fri)04:33:29 No.106489944

Anonymous 09/05/25(Fri)04:33:29 No.106489944

>>106489900
nop, atleast not in the scales i've done (30-150 image loras). i dunno about full finetunes with large datasets

Anonymous
09/05/25(Fri)04:33:57 No.106489946

Anonymous 09/05/25(Fri)04:33:57 No.106489946

>>106489941
No. We all just upload to catbox for the fun of it.

Anonymous
09/05/25(Fri)04:34:58 No.106489949

Anonymous 09/05/25(Fri)04:34:58 No.106489949

>>106489900
No the number of parameters does.

Anonymous
09/05/25(Fri)04:35:11 No.106489952

Anonymous 09/05/25(Fri)04:35:11 No.106489952

>>106489944
I have some old SDXL datasets with 200-500 pics each that I'd want to retrain for chroma. And one has 900 or so

Anonymous
09/05/25(Fri)04:36:56 No.106489964

Anonymous 09/05/25(Fri)04:36:56 No.106489964

>>106489952
You'll likely have to train for longer on datasets of that size. Maybe prune the dataset to see if it even needs to be that big.

Anonymous
09/05/25(Fri)04:39:07 No.106489975

Anonymous 09/05/25(Fri)04:39:07 No.106489975

>>106489952
it shouldn't matter then for that scale besides speed. are you doing a full finetune? unless you're training multiple concepts for a high rank lora i doubt you need that many images

Anonymous
09/05/25(Fri)04:44:49 No.106490009

Anonymous 09/05/25(Fri)04:44:49 No.106490009

What do you guys use to train loras?

Anonymous
09/05/25(Fri)04:49:17 No.106490033

Anonymous 09/05/25(Fri)04:49:17 No.106490033

File: ComfyUI_00093_.mp4 (795 KB, 592x816)

795 KB MP4

Anonymous
09/05/25(Fri)04:50:49 No.106490040

Anonymous 09/05/25(Fri)04:50:49 No.106490040

File: its_over_123.gif (1.03 MB, 500x500)

1.03 MB GIF

Ok, the captioning models in Onetrainer can't handle obscure fetish stuff and it keeps tagging diapers as shorts or underwear or ignoring it. What's an external local tagging model that has no problems with freak stuff?

Anonymous
09/05/25(Fri)04:51:17 No.106490042

Anonymous 09/05/25(Fri)04:51:17 No.106490042

File: AnimateDiff_00285.mp4 (1.92 MB, 720x720)

1.92 MB MP4

>>106489850
>in one go

Sure, I also think that 5 seconds is too short, but I think it would be better to have a segmented generation, perhaps with segment length of your choice,but glued together in a natural way without imperfections. More than anything because I'm sure that if a whole minute were generated, at the end of the minute the generation would take an absurd turn

Anonymous
09/05/25(Fri)04:52:23 No.106490051

Anonymous 09/05/25(Fri)04:52:23 No.106490051

>>106490009
sd-scripts. if you want a gui you can use this fork of easy scripts but idk if it supports chroma/lumina
>https://github.com/67372a/LoRA_Easy_Training_Scripts

Anonymous
09/05/25(Fri)04:54:24 No.106490067

Anonymous 09/05/25(Fri)04:54:24 No.106490067

>>106490033
i wanted to go inside her mouth

Anonymous
09/05/25(Fri)04:56:30 No.106490076

Anonymous 09/05/25(Fri)04:56:30 No.106490076

>>106490040
reconsider your goals

Anonymous
09/05/25(Fri)04:59:49 No.106490090

Anonymous 09/05/25(Fri)04:59:49 No.106490090

>>106490076
I literally started doing AI because of this bruh.

Anonymous
09/05/25(Fri)05:01:15 No.106490093

Anonymous 09/05/25(Fri)05:01:15 No.106490093

>>106486313
thanks anon
installed everything but it just segfaults after trying a gen (segfaults after init of the models as far as i can tell)
oh well

Anonymous
09/05/25(Fri)05:10:42 No.106490140

Anonymous 09/05/25(Fri)05:10:42 No.106490140

File: ComfyUI_16696.png (2.8 MB, 1200x1600)

2.8 MB PNG

>>106488987
Try prompting the age or features (chubby cheeks, large nose, etc) or even negging out facial features or ethnic features. Negging ethnicities can help a lot because of the heavy Korean influence in most base datasets.

>>106489821
Sometimes the model can latch on to something innocuous during training (I had a problem with lanyards many LoRAs ago), BFL probably didn't catch it because they were focused on other things (coherency, etc)

Anonymous
09/05/25(Fri)05:16:27 No.106490162

Anonymous 09/05/25(Fri)05:16:27 No.106490162

So am I understanding right that negative prompts in Chroma need to be written in natural language as well?

Anonymous
09/05/25(Fri)05:17:20 No.106490166

Anonymous 09/05/25(Fri)05:17:20 No.106490166

>>106490040
>local tagging model that has no problems with freak stuff?
Your eyes :)

>>106490009
I use diffusion-pipe, I think it works fine for single GPU but it really is a multi gpu solution. Supports most models even *sighs heavily* chroma.

Anonymous
09/05/25(Fri)05:23:21 No.106490192

Anonymous 09/05/25(Fri)05:23:21 No.106490192

File: IMG_4742.jpg (152 KB, 824x737)

152 KB JPG

>>106489568
>is local kill
kek

Anonymous
09/05/25(Fri)05:25:26 No.106490201

Anonymous 09/05/25(Fri)05:25:26 No.106490201

File: ComfyUI_temp_eokgg_00059_.png (2.98 MB, 1536x832)

2.98 MB PNG

>>106490192
Nuclear holocaust can't come soon enough

Anonymous
09/05/25(Fri)05:29:19 No.106490219

Anonymous 09/05/25(Fri)05:29:19 No.106490219

>>106490042
>but I think it would be better to have a segmented generation
I'd prefer that, I just wanted to bitch about it. The progress we've had with videogenning is ridiculous so maybe having coherent longform videos without it going into eldritch territory will be possible in a few years. I'd be ok with at least 30 seconds.

Anonymous
09/05/25(Fri)05:30:51 No.106490229

Anonymous 09/05/25(Fri)05:30:51 No.106490229

File: WanVideo2_2_I2V_00327.webm (2.55 MB, 1248x720)

2.55 MB WEBM

Anonymous
09/05/25(Fri)05:35:15 No.106490250

Anonymous 09/05/25(Fri)05:35:15 No.106490250

>>106489234
>>106489528
>>106489896
fuck off to your tranny circlejerk thread, subhuman tripnigger

Anonymous
09/05/25(Fri)05:40:38 No.106490273

Anonymous 09/05/25(Fri)05:40:38 No.106490273

>>106490192
What site is that and what did you try to prompt?

Anonymous
09/05/25(Fri)05:57:40 No.106490340

Anonymous 09/05/25(Fri)05:57:40 No.106490340

is there a way to have FaceDetailer skip blurry faces? they are 99% of the time not worth detailing and are not the main subject

Anonymous
09/05/25(Fri)05:59:38 No.106490353

Anonymous 09/05/25(Fri)05:59:38 No.106490353

>>106490340
I don't use face detailer because frankly that's SDXL vramlet shit, but isn't there an option where you can specify the certainty threshold of what a face is and it will skip it? I'm sure there's a balance you can set where it will only detail the very obvious faces and skip the blurry ones.

Anonymous
09/05/25(Fri)06:00:59 No.106490361

Anonymous 09/05/25(Fri)06:00:59 No.106490361

>>106490340
just learn to inpaint manually

Anonymous
09/05/25(Fri)06:02:28 No.106490369

Anonymous 09/05/25(Fri)06:02:28 No.106490369

Do any of you use Adobe Lightroom? I'm thinking of getting it for easy(?) upscaling and for a replacement for lama/IO cleaner. Seems like it would be good but I wanted some opinions first.
Is there somewhere else I could ask this? I guess I could ask AI kek.

Anonymous
09/05/25(Fri)06:03:23 No.106490373

Anonymous 09/05/25(Fri)06:03:23 No.106490373

>>106490361
stfu useless nigga

Anonymous
09/05/25(Fri)06:03:48 No.106490374

Anonymous 09/05/25(Fri)06:03:48 No.106490374

>>106490369
I'm ignorant of lightroom. What does it do that I can't manually set up a workflow for in comfy?

Anonymous
09/05/25(Fri)06:04:50 No.106490382

Anonymous 09/05/25(Fri)06:04:50 No.106490382

>>106490373
im serious. it'll save you more time in the long run, face detailer is too inaccurate. just learn to inpaint

Anonymous
09/05/25(Fri)06:05:45 No.106490390

Anonymous 09/05/25(Fri)06:05:45 No.106490390

File: ComfyUI_00112_.mp4 (875 KB, 592x816)

875 KB MP4

AI is just terrible with multiple subjects

Anonymous
09/05/25(Fri)06:06:15 No.106490394

Anonymous 09/05/25(Fri)06:06:15 No.106490394

>>106490382
why do you assume I don't already know and have always manually inpaint and am experimenting with FaceDetailer? bitch ass nigga always butting in with stupid comments fuck off

Anonymous
09/05/25(Fri)06:08:02 No.106490407

Anonymous 09/05/25(Fri)06:08:02 No.106490407

>>106490394
then there is no way i am aware of to manually skip faces with face detailer

Anonymous
09/05/25(Fri)06:13:35 No.106490441

Anonymous 09/05/25(Fri)06:13:35 No.106490441

>>106490353
using bbox I wasn't finding that sweet spot, it either found 4 faces or none. tried with segm and at least with the test image it seems to work. thanks

Anonymous
09/05/25(Fri)06:14:15 No.106490447

Anonymous 09/05/25(Fri)06:14:15 No.106490447

>>106490374
>comfy
never speak to me again

Anonymous
09/05/25(Fri)06:14:52 No.106490449

Anonymous 09/05/25(Fri)06:14:52 No.106490449

>>106490390
This is awesome, do more

Anonymous
09/05/25(Fri)06:14:58 No.106490450

Anonymous 09/05/25(Fri)06:14:58 No.106490450

>one lora completely fucks up the face details but gets the action right
>the other lora maintains the face details but fucks up the action
this sucks

Anonymous
09/05/25(Fri)06:15:46 No.106490455

Anonymous 09/05/25(Fri)06:15:46 No.106490455

>>106490450
use both and/or do a hiresfix pass with different loras

Anonymous
09/05/25(Fri)06:21:18 No.106490480

Anonymous 09/05/25(Fri)06:21:18 No.106490480

>>106490394
>bitch ass nigga
I cringe.

Anonymous
09/05/25(Fri)06:22:47 No.106490485

Anonymous 09/05/25(Fri)06:22:47 No.106490485

File: ComfyUI_00115_.mp4 (1001 KB, 592x816)

1001 KB MP4

>>106490449
Most of the time they are not doing much. I think the AI is confused with so many subject

Anonymous
09/05/25(Fri)06:35:35 No.106490548

Anonymous 09/05/25(Fri)06:35:35 No.106490548

I hate ComfyUI

Anonymous
09/05/25(Fri)06:50:38 No.106490618

Anonymous 09/05/25(Fri)06:50:38 No.106490618

File: ComfyUI_WAN2.2__00002.mp4 (2.52 MB, 680x1016)

2.52 MB MP4

Anonymous
09/05/25(Fri)07:01:22 No.106490669

Anonymous 09/05/25(Fri)07:01:22 No.106490669

>>106489572
>assblasted trani forks the general just so they can add tranistudio

Anonymous
09/05/25(Fri)07:05:42 No.106490692

Anonymous 09/05/25(Fri)07:05:42 No.106490692

>>106490669
how does one group of autists wield so much power. the cabal must be stopped.

Anonymous
09/05/25(Fri)07:14:47 No.106490737

Anonymous 09/05/25(Fri)07:14:47 No.106490737

File: AnimateDiff_00293.mp4 (3.86 MB, 576x1024)

3.86 MB MP4

Anonymous
09/05/25(Fri)07:16:43 No.106490750

Anonymous 09/05/25(Fri)07:16:43 No.106490750

>>106490737
kek

Anonymous
09/05/25(Fri)07:17:24 No.106490754

Anonymous 09/05/25(Fri)07:17:24 No.106490754

>>106489603
Yes.

Anonymous
09/05/25(Fri)07:17:25 No.106490755

Anonymous 09/05/25(Fri)07:17:25 No.106490755

File: ComfyUI_WAN2.2__00007.mp4 (3.58 MB, 656x1048)

3.58 MB MP4

>>106490618

Anonymous
09/05/25(Fri)07:18:03 No.106490763

Anonymous 09/05/25(Fri)07:18:03 No.106490763

Is there anything more epic than wasting whole day of training because of wrong settings. Adamw8bit is my new best friend.

Anonymous
09/05/25(Fri)07:19:41 No.106490776

Anonymous 09/05/25(Fri)07:19:41 No.106490776

Memo to myself, always try out every model, don't have any prejudices.
“Vibevoice release, podcast stuff, mh, i dont need it” then i read thread that microsoft has deleted the 7b again.
So I download it and give it a chance.
Now I almost missed a model that can generate porn audio
kek

Anonymous
09/05/25(Fri)07:19:50 No.106490777

Anonymous 09/05/25(Fri)07:19:50 No.106490777

>>106489850
>>106490042
Honestly the only thing segmented generation is missing is to take into account preceding. For example if we could feed the last x latent frames, it would be so much easier.

Anonymous
09/05/25(Fri)07:20:49 No.106490782

Anonymous 09/05/25(Fri)07:20:49 No.106490782

>>106490776
Is it hard to set up?

Anonymous
09/05/25(Fri)07:22:16 No.106490790

Anonymous 09/05/25(Fri)07:22:16 No.106490790

>>106490782
There are comfy nodes plug&play. just check github

Anonymous
09/05/25(Fri)07:22:27 No.106490791

Anonymous 09/05/25(Fri)07:22:27 No.106490791

>>106490776
> a model that can generate porn audio
Can it? Example catbox?

Anonymous
09/05/25(Fri)07:23:01 No.106490794

Anonymous 09/05/25(Fri)07:23:01 No.106490794

File: AnimateDiff_00291.mp4 (3.3 MB, 720x720)

3.3 MB MP4

>>106490219
>The progress we've had with videogenning is ridiculous so maybe having coherent longform videos without it going into eldritch territory will be possible in a few years.

Well yes, just think of the quality jump from Wan 2.1 to 2.2, they will certainly implement further improvements to the prompt, for example the possibility of starting a certain action at some point of the movie (for example, in the third second, the character begins to do x , then stops it and at the 6th second does y.. etc.)

Anonymous
09/05/25(Fri)07:26:36 No.106490818

Anonymous 09/05/25(Fri)07:26:36 No.106490818

>>106490776
Is it actually that good? I've tuned out of voice stuff because 99% of the time it's just... okay.

Anonymous
09/05/25(Fri)07:29:36 No.106490834

Anonymous 09/05/25(Fri)07:29:36 No.106490834

>>106490791
I sit on the toilet stoned and sit here for a while. So someone else should make the effort.
Yes, it can generate porn with voice cloning - it adapts the output to the text content. If you let it speak sexual content, the voice becomes erotic, mh and ahs are emphasized differently.
They must have trained on nsfw content. However, since it is a diffusion model, the result or the emphasis can vary greatly.
It can also use languages other than English and Chinese, depending on the seed perfect or broken pronunciation.

I can only recommend everyone to try it out. You are missing out.

Anonymous
09/05/25(Fri)07:29:50 No.106490835

Anonymous 09/05/25(Fri)07:29:50 No.106490835

File: DS1 Baby Skeletons.webm (2.73 MB, 1280x720)

2.73 MB WEBM

>>106490737
Why is this undead dancing with skeletons from the Catacombs?

Anonymous
09/05/25(Fri)07:31:25 No.106490852

Anonymous 09/05/25(Fri)07:31:25 No.106490852

File: Qwan_00003_.jpg (667 KB, 2976x1984)

667 KB JPG

Trying some silly stuff with long-ass prompts, referring to the subjects as Woman1 and Woman2. Seems to work decently well, there's some bleed between them, though.
Still pretty good for not doing anything besides prompting.

Anonymous
09/05/25(Fri)07:31:56 No.106490856

Anonymous 09/05/25(Fri)07:31:56 No.106490856

File: AnimateDiff_00107.mp4 (1.5 MB, 720x720)

1.5 MB MP4

vidrel made with wan2.1 btw

>>106490794
> these shaved waxed legs

Anonymous
09/05/25(Fri)07:36:36 No.106490877

Anonymous 09/05/25(Fri)07:36:36 No.106490877

>>106490852
man youre finally back, mind sharing a catbox?

Anonymous
09/05/25(Fri)07:36:51 No.106490879

Anonymous 09/05/25(Fri)07:36:51 No.106490879

>>106490852
The only thing I like more than 1girls are 2girls

Anonymous
09/05/25(Fri)07:38:42 No.106490886

Anonymous 09/05/25(Fri)07:38:42 No.106490886

>>106490834
No plaps?

Anonymous
09/05/25(Fri)07:41:27 No.106490898

Anonymous 09/05/25(Fri)07:41:27 No.106490898

>>106488626
Add this to OP, Forge:
----------From Panchovix--------:

-ReForge2dendev: https://github.com/Panchovix/stable-diffusion-webui-reForge/tree/newforge_dendev
-ReForge2: https://github.com/Panchovix/stable-diffusion-webui-reForge/tree/newmain_newforge

----------From DenOfEquity--------:
-ersatzForge: https://github.com/DenOfEquity/ersatzForge

----------From Haoming02--------:
-NeoForge: https://github.com/Haoming02/sd-webui-forge-classic/tree/neo

----------From lllyasviel--------:
-Legacy Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge

Anonymous
09/05/25(Fri)07:50:10 No.106490933

Anonymous 09/05/25(Fri)07:50:10 No.106490933

>>106490898
they're already there

Anonymous
09/05/25(Fri)07:50:14 No.106490934

Anonymous 09/05/25(Fri)07:50:14 No.106490934

File: AnimateDiff_00108.mp4 (1.78 MB, 720x912)

1.78 MB MP4

>>106490856
Naaah I made that animation and I did it with 2.2

Here is an alternative gen that I made in a different resolution, and is made with 2.2

I still remember having to generate 5 or more alternatives with 2.1, just to choose a decent. 2.2 reduced the need to 1 or 2, the results are much more consistent

Anonymous
09/05/25(Fri)07:51:01 No.106490936

Anonymous 09/05/25(Fri)07:51:01 No.106490936

>>106490898
>-ersatzForge: https://github.com/DenOfEquity/ersatzForge
in what reality do you include a 12 star fork of a rando there?

Anonymous
09/05/25(Fri)07:53:40 No.106490947

Anonymous 09/05/25(Fri)07:53:40 No.106490947

File: ComfyUI_00204_.png (2.54 MB, 1024x1536)

2.54 MB PNG

Anonymous
09/05/25(Fri)07:55:03 No.106490951

Anonymous 09/05/25(Fri)07:55:03 No.106490951

File: Qwan_00004_.jpg (726 KB, 2856x1896)

726 KB JPG

>>106490877
My workflow is full of custom nodes I wrote, but it's pretty much based on this concept:
https://civitai.com/models/1866759/qwen-image-modular-wf?modelVersionId=2124985
You're going to need a lot of VRAM or DRAM for it, but the result should be the same.

First pass QwenImage, then into Wan (Ultimate SD Upscale).
res_2s/bong_tangent for both.
Prompt: https://pastebin.com/3vC51Rck

Anonymous
09/05/25(Fri)07:58:52 No.106490969

Anonymous 09/05/25(Fri)07:58:52 No.106490969

File: ComfyUI_00205_.png (2.29 MB, 1024x1536)

2.29 MB PNG

Anonymous
09/05/25(Fri)08:00:28 No.106490982

Anonymous 09/05/25(Fri)08:00:28 No.106490982

>>106490936
ReForge2 dendev is bassed on this fork, and this Fork it is updated daily and has all the ultimate features

Anonymous
09/05/25(Fri)08:01:00 No.106490987

Anonymous 09/05/25(Fri)08:01:00 No.106490987

File: workflow_flux.png (919 KB, 2132x1590)

919 KB PNG

How long are flux/chroma, or in general, natural language captions suppossed to be? Some example WF I found had walls of text. Should I limit it in the system prompt or is this fine?

Anonymous
09/05/25(Fri)08:01:25 No.106490990

Anonymous 09/05/25(Fri)08:01:25 No.106490990

File: ComfyUI_00206_.png (2.06 MB, 1024x1536)

2.06 MB PNG

>>106490969
With Disco Elysium from anon

Anonymous
09/05/25(Fri)08:01:35 No.106490991

Anonymous 09/05/25(Fri)08:01:35 No.106490991

>>106490947
>>106490969
These are aesthetically pleasing.

Anonymous
09/05/25(Fri)08:01:38 No.106490992

Anonymous 09/05/25(Fri)08:01:38 No.106490992

>>106490898
>this is what mental illness looks like

Anonymous
09/05/25(Fri)08:02:58 No.106491000

Anonymous 09/05/25(Fri)08:02:58 No.106491000

File: Qwan_00005_.jpg (764 KB, 2856x1896)

764 KB JPG

>>106490990
Was that a Chroma LoRA? Looks cool. Both base image and LoRA, that is.

Anonymous
09/05/25(Fri)08:04:01 No.106491003

Anonymous 09/05/25(Fri)08:04:01 No.106491003

>>106490485
Imagine this, but in VR
One day I will be able to generate hour long videos like this and just imagine myself being there
Just need to add the smell and I can die happy

Anonymous
09/05/25(Fri)08:05:15 No.106491011

Anonymous 09/05/25(Fri)08:05:15 No.106491011

>>106490991
ty

>>106491000
Yup anon posted the other day https://civitai.com/models/1927225/

Anonymous
09/05/25(Fri)08:18:21 No.106491097

Anonymous 09/05/25(Fri)08:18:21 No.106491097

man, changing the resolution even slightly in Wan is just such a crazy quality difference. It's not like normal image gen where you can get away with lower res. It increases the god damn intelligence of the animation, massively. You'd think steps are supposed to be responsible for that...

Anonymous
09/05/25(Fri)08:21:18 No.106491115

Anonymous 09/05/25(Fri)08:21:18 No.106491115

>>106490951
thanks for the insight, I shall give it a whirl

Anonymous
09/05/25(Fri)08:22:19 No.106491119

Anonymous 09/05/25(Fri)08:22:19 No.106491119

>>106490777
>For example if we could feed the last x latent frames, it would be so much easier.
yeah I find it odd that we've got First Frame and Last Frame inputs, but not First Frame, Second Frame, Third Frame inputs, etc, surely it shouldn't be that difficult? It's the same thing isn't it?

Anonymous
09/05/25(Fri)08:25:26 No.106491139

Anonymous 09/05/25(Fri)08:25:26 No.106491139

>>106491003
Why would you want to imagine being stuck in a car with a bunch of dirty normie tik tok bitches? That would be a fucking nightmare.

Anonymous
09/05/25(Fri)08:27:44 No.106491147

Anonymous 09/05/25(Fri)08:27:44 No.106491147

>>106491139
>Why would you want to imagine being stuck in a car with a bunch of dirty normie tik tok bitches? That would be a fucking nightmare.

Maybe this is the place for you >>106484063

Anonymous
09/05/25(Fri)08:30:38 No.106491166

Anonymous 09/05/25(Fri)08:30:38 No.106491166

>>106491147
I hate anime. Those girls just look disgusting and obnoxious.

Anonymous
09/05/25(Fri)08:36:21 No.106491202

Anonymous 09/05/25(Fri)08:36:21 No.106491202

>>106490987
Lodestone's own captions are even bigger. Some of his examples are outright autistic.

Anonymous
09/05/25(Fri)08:39:49 No.106491222

Anonymous 09/05/25(Fri)08:39:49 No.106491222

>>106490755
It did great maintaining style throughout the entire clip.

Anonymous
09/05/25(Fri)08:39:53 No.106491223

Anonymous 09/05/25(Fri)08:39:53 No.106491223

File: 1743326950204.png (1.13 MB, 1220x1376)

1.13 MB PNG

>>106490987
If they don't tell you, you can assume it is on the shorter side. The longest I've seen is Lumina Image 2.0's captions. Note this won't be the case for any of its finetunes which most assuredly aren't using its format to caption. Really sad they never open sourced it even if you can' get 18+ stuff with it. It explains why Lumina 2.0 is so good at concepts and etc.

Anonymous
09/05/25(Fri)08:41:59 No.106491231

Anonymous 09/05/25(Fri)08:41:59 No.106491231

>>106491222
Well adding *retain style* in the prompt has always given me good results

Anonymous
09/05/25(Fri)08:45:08 No.106491253

Anonymous 09/05/25(Fri)08:45:08 No.106491253

File: IMG_20250905_174320.jpg (1.96 MB, 1800x2229)

1.96 MB JPG

>>106491223
Here are lodestone's

Anonymous
09/05/25(Fri)08:50:52 No.106491287

Anonymous 09/05/25(Fri)08:50:52 No.106491287

>>106491253
Doesn't this go over the T5xxl limit?

Anonymous
09/05/25(Fri)08:55:27 No.106491310

Anonymous 09/05/25(Fri)08:55:27 No.106491310

File: 1753646874765941.mp4 (2.48 MB, 1280x720)

2.48 MB MP4

Did they solve the style transfer?
https://xcancel.com/ideogram_ai/status/1963648390530830387#m

Anonymous
09/05/25(Fri)08:58:55 No.106491322

Anonymous 09/05/25(Fri)08:58:55 No.106491322

>>106491310
>example style is hodgepodge slop
every time

Anonymous
09/05/25(Fri)09:00:50 No.106491336

Anonymous 09/05/25(Fri)09:00:50 No.106491336

>>106491287
Yep, context limit of 512 which is why it's dumb. Lumina uses Gemma which is 8192.

Anonymous
09/05/25(Fri)09:01:31 No.106491341

Anonymous 09/05/25(Fri)09:01:31 No.106491341

>>106491336
>Lumina
post tangible workflow or KYS scammer

Anonymous
09/05/25(Fri)09:03:22 No.106491348

Anonymous 09/05/25(Fri)09:03:22 No.106491348

>>106491341
meds?
https://files.catbox.moe/0fnemj.json

Anonymous
09/05/25(Fri)09:03:50 No.106491351

Anonymous 09/05/25(Fri)09:03:50 No.106491351

>>106491341
Schizo elsewhere retard, no one said you can't have tech discussions here and need to only post images or workflows.

Anonymous
09/05/25(Fri)09:06:18 No.106491367

Anonymous 09/05/25(Fri)09:06:18 No.106491367

>>106491351
I schizo where I want, you're not my boss to tell me where to schizo.

Anonymous
09/05/25(Fri)09:07:19 No.106491368

Anonymous 09/05/25(Fri)09:07:19 No.106491368

>>106491348
>file not found

Anonymous
09/05/25(Fri)09:08:03 No.106491372

Anonymous 09/05/25(Fri)09:08:03 No.106491372

Alright, I have a whole ass archive of a specific fetish and 32GB of sweet Nvidia VRAM, I need wise ninja scrolls on wan2.2 lora training. What to do, what's good, what's not good, captioning tips, etc. I've trained SDXL loras before, but that's through sd-scripts, never used diffusion-pipe. Also need a magical way to remove watermarks.

Anonymous
09/05/25(Fri)09:15:18 No.106491420

Anonymous 09/05/25(Fri)09:15:18 No.106491420

>>106491372
>Also need a magical way to remove watermarks
qwen image edit

Anonymous
09/05/25(Fri)09:16:25 No.106491425

Anonymous 09/05/25(Fri)09:16:25 No.106491425

>>106491420
QIE randomly zooms in the image though

Anonymous
09/05/25(Fri)09:18:26 No.106491438

Anonymous 09/05/25(Fri)09:18:26 No.106491438

>>106491372
nano banano

Anonymous
09/05/25(Fri)09:20:09 No.106491446

Anonymous 09/05/25(Fri)09:20:09 No.106491446

>>106491372
>>106491438
>nano banana
>refuses images about circumsized bananas
explain this google!

Anonymous
09/05/25(Fri)09:20:30 No.106491451

Anonymous 09/05/25(Fri)09:20:30 No.106491451

Oh also, if all of my sources are from irl porn, does that translate well to trying to do anime animations, or does that not work well?

Anonymous
09/05/25(Fri)09:21:50 No.106491461

Anonymous 09/05/25(Fri)09:21:50 No.106491461

File: AnimateDiff_00296.mp4 (2.55 MB, 720x1104)

2.55 MB MP4

Anonymous
09/05/25(Fri)09:23:58 No.106491474

Anonymous 09/05/25(Fri)09:23:58 No.106491474

>>106491446
nazi

Anonymous
09/05/25(Fri)09:24:07 No.106491476

Anonymous 09/05/25(Fri)09:24:07 No.106491476

File: ComfyUI_00282_.png (2.03 MB, 1280x1600)

2.03 MB PNG

>>106490951
Prompting is quite a pain, but thanks for the guide

Anonymous
09/05/25(Fri)09:24:59 No.106491481

Anonymous 09/05/25(Fri)09:24:59 No.106491481

>>106491461
I think that with all the porn material that has been on the web throughout history, making porn loras must be the easiest thing in the world.

Anonymous
09/05/25(Fri)09:25:13 No.106491483

Anonymous 09/05/25(Fri)09:25:13 No.106491483

>>106491425
resize the image so the resolution is divisible by 112, not like it matters if the picture is zoomed in a tiny bit anyway

Anonymous
09/05/25(Fri)09:27:14 No.106491496

Anonymous 09/05/25(Fri)09:27:14 No.106491496

>>106490852
>tfw AI will bring us "live-action" manhwa adaptations
Can't wait for my villainesses to be depicted as korean qties as they should be.

Anonymous
09/05/25(Fri)09:43:55 No.106491595

Anonymous 09/05/25(Fri)09:43:55 No.106491595

>>106491372
>Also need a magical way to remove watermarks.
After much testing, I've concluded that the most magical way to remove watermarks en masse from tons of images is with Florence2 bboxes paired with MAT fast inpaint from Acly inpaint nodes (only works on 512x512 cutouts, so cut out around bbox before inpainting). Florence2 is a bit of a journey to install, it has some conflicts with current comfy requirements and needs you to downgrade transformers (or upgrade to latest version. Regardless, the version comfy used to install was incompatible). I can catbox a workflow with said caveat that it's not going to work out of the box and would require wrangling.

Anonymous
09/05/25(Fri)09:44:09 No.106491597

Anonymous 09/05/25(Fri)09:44:09 No.106491597

File: joycap.png (52 KB, 905x473)

52 KB PNG

Also for lora training what to pick? I think ticking the ambiguous language was mandatory, right? It's for fetish goonshit.

Anonymous
09/05/25(Fri)09:45:48 No.106491605

Anonymous 09/05/25(Fri)09:45:48 No.106491605

>>106491597
>using comfy for training
lmao what a noob mistake

Anonymous
09/05/25(Fri)09:52:07 No.106491649

Anonymous 09/05/25(Fri)09:52:07 No.106491649

>>106491476
Yum-Eeeee!

Anonymous
09/05/25(Fri)09:52:37 No.106491652

Anonymous 09/05/25(Fri)09:52:37 No.106491652

>>106491605
Just for captioning

Anonymous
09/05/25(Fri)09:55:34 No.106491669

Anonymous 09/05/25(Fri)09:55:34 No.106491669

>>106491597
Yes but what's the fetish?

Anonymous
09/05/25(Fri)09:56:26 No.106491671

Anonymous 09/05/25(Fri)09:56:26 No.106491671

>>106491669
diapers and ageplay

Anonymous
09/05/25(Fri)09:56:28 No.106491672

Anonymous 09/05/25(Fri)09:56:28 No.106491672

Clowness anon, any progress?

Anonymous
09/05/25(Fri)09:56:41 No.106491679

Anonymous 09/05/25(Fri)09:56:41 No.106491679

>>106490856
>>106490934
>video made with wan
>filename 'animatediff'
Are you trolling?

Anonymous
09/05/25(Fri)10:01:42 No.106491717

Anonymous 09/05/25(Fri)10:01:42 No.106491717

>>106491595
https://github.com/jferments/watermark_remover

Anonymous
09/05/25(Fri)10:03:51 No.106491731

Anonymous 09/05/25(Fri)10:03:51 No.106491731

>>106491679
Pretty sure it's the default output name for one of those third party node solutions, yes, this is clearly Wan

Anonymous
09/05/25(Fri)10:04:33 No.106491735

Anonymous 09/05/25(Fri)10:04:33 No.106491735

>>106491597
what base model are you using?

Anonymous
09/05/25(Fri)10:06:35 No.106491748

Anonymous 09/05/25(Fri)10:06:35 No.106491748

>>106491735
Chroma

Anonymous
09/05/25(Fri)10:10:19 No.106491776

Anonymous 09/05/25(Fri)10:10:19 No.106491776

>>106491748
Chroma training dataset was captured using Gemini, so look at examples from that and try to get JoyCaption to mimic those

Anonymous
09/05/25(Fri)10:11:18 No.106491787

Anonymous 09/05/25(Fri)10:11:18 No.106491787

>>106491717
>Simple-LaMa for inpainting with 256 resolution
Still I think a yolo node would be faster than using the big ass florence2 for a dataset + MAT fast inpaint

Anonymous
09/05/25(Fri)10:16:56 No.106491835

Anonymous 09/05/25(Fri)10:16:56 No.106491835

>>106491787
you're right. installing both a third party node and a graphical UI for image generation models, launching a server & loading the node is a lot easier than invoking a python script

Anonymous
09/05/25(Fri)10:19:55 No.106491867

Anonymous 09/05/25(Fri)10:19:55 No.106491867

>>106491835
A testament, to my Glory....

Anonymous
09/05/25(Fri)10:20:07 No.106491871

Anonymous 09/05/25(Fri)10:20:07 No.106491871

>>106491652
you do know there are online instances of joycaption tagging and most training guis have built in tagger support right?

Anonymous
09/05/25(Fri)10:20:25 No.106491874

Anonymous 09/05/25(Fri)10:20:25 No.106491874

File: ComfyUI_WAN2.2__00008.mp4 (1.16 MB, 688x1000)

1.16 MB MP4

>>106490755

Anonymous
09/05/25(Fri)10:21:59 No.106491890

Anonymous 09/05/25(Fri)10:21:59 No.106491890

Blessed thread of frenship

Anonymous
09/05/25(Fri)10:22:58 No.106491895

Anonymous 09/05/25(Fri)10:22:58 No.106491895

File: Qwan_00009_.jpg (750 KB, 1896x2856)

750 KB JPG

>>106491476
Yeah. It's very literal about colors and shapes, needs some wrangling from time to time.

Anonymous
09/05/25(Fri)10:23:46 No.106491898

Anonymous 09/05/25(Fri)10:23:46 No.106491898

>>106491871
I am NOT sending those pics anywhere online especially when even HF started flagging some content. And the tagger in Onetrainer is censored and/or stupid and just ignored the nsfw elements.

Anonymous
09/05/25(Fri)10:24:34 No.106491904

Anonymous 09/05/25(Fri)10:24:34 No.106491904

>>106491835
You think your python script isn't pulling third party code retard?

Anonymous
09/05/25(Fri)10:25:20 No.106491913

Anonymous 09/05/25(Fri)10:25:20 No.106491913

What is a good NSFW tagger actually?

Anonymous
09/05/25(Fri)10:26:06 No.106491919

Anonymous 09/05/25(Fri)10:26:06 No.106491919

>>106491904
true, the entirety of python is pulling from something else

Anonymous
09/05/25(Fri)10:26:18 No.106491924

Anonymous 09/05/25(Fri)10:26:18 No.106491924

>>106491904
you're right, it was wrong of me to doubt such an exceptional individual such as yourself

Anonymous
09/05/25(Fri)10:26:39 No.106491927

Anonymous 09/05/25(Fri)10:26:39 No.106491927

File: ComfyUI_WAN2.2__00010.mp4 (989 KB, 960x720)

989 KB MP4

>>106491874
>>106491895
How long does it take to gen that? My WAN gens are faster man, the 2 pass thing takes very long, although I'm not arguing about the quality at all

Anonymous
09/05/25(Fri)10:26:58 No.106491928

Anonymous 09/05/25(Fri)10:26:58 No.106491928

>>106491787
Yeah, well, all yolo watermark models I've tried miss way more watermarks than Florence, and lama produces much uglier gan textures than MAT at comparable speeds, so I've trashed my own yolo+lama workflow (I went with this combo too, first). But all in all, both need human review - or both are good, depending on perspective.

Anonymous
09/05/25(Fri)10:27:43 No.106491936

Anonymous 09/05/25(Fri)10:27:43 No.106491936

>>106491913
Joy Caption ?

Anonymous
09/05/25(Fri)10:27:51 No.106491940

Anonymous 09/05/25(Fri)10:27:51 No.106491940

>>106491898
Oh okay, I didn't realize we were discussing illegal content.

Anonymous
09/05/25(Fri)10:28:07 No.106491942

Anonymous 09/05/25(Fri)10:28:07 No.106491942

>>106491913
Wd tagger

Anonymous
09/05/25(Fri)10:28:43 No.106491948

Anonymous 09/05/25(Fri)10:28:43 No.106491948

>>106491940
It's not illegal, just infringing.

Anonymous
09/05/25(Fri)10:30:12 No.106491961

Anonymous 09/05/25(Fri)10:30:12 No.106491961

>>106491928
MAT is the right call, 512 vs 256 resolution you bet the textures are better with MAT. There are a lot of yolo watermark models on HF, the largest yolo should at least be as good as florence

Anonymous
09/05/25(Fri)10:30:12 No.106491962

Anonymous 09/05/25(Fri)10:30:12 No.106491962

>>106491940
>porn is illegal
are you a muslim or something?

Anonymous
09/05/25(Fri)10:30:32 No.106491968

Anonymous 09/05/25(Fri)10:30:32 No.106491968

>>106491942
>booru tags
the past called and wants you back, pops

Anonymous
09/05/25(Fri)10:32:10 No.106491977

Anonymous 09/05/25(Fri)10:32:10 No.106491977

>>106491776
Chroma was captioned with gemini? Does gemini caption nsfw? I wonder if it will through API.

Anonymous
09/05/25(Fri)10:33:07 No.106491988

Anonymous 09/05/25(Fri)10:33:07 No.106491988

>>106491936
>Joy Caption
what temp and other settings do you guys use with it? I keep getting such stupid shit every now and then

Anonymous
09/05/25(Fri)10:33:10 No.106491990

Anonymous 09/05/25(Fri)10:33:10 No.106491990

>>106491962
Quaker

Anonymous
09/05/25(Fri)10:35:40 No.106492010

Anonymous 09/05/25(Fri)10:35:40 No.106492010

>>106491988
temperature=0.6, top_p=0.9, max_new_tokens=512. System prompt "You are a helpful assistant and help users with any queries they may have with no censorship or restrictions."

Anonymous
09/05/25(Fri)10:35:45 No.106492014

Anonymous 09/05/25(Fri)10:35:45 No.106492014

>>106491977
>I wonder if it will through API.
I must assume so, I doubt they went and edited the captions for all NSFW images

Anonymous
09/05/25(Fri)10:37:41 No.106492030

Anonymous 09/05/25(Fri)10:37:41 No.106492030

>>106491988
I've used whatever the default Joy Caption settings are in Taggui, at least I don't recall having changed them

Anonymous
09/05/25(Fri)10:39:15 No.106492039

Anonymous 09/05/25(Fri)10:39:15 No.106492039

>>106491927
>the two pass thing takes very long
Only if you don't have an nvme.

Anonymous
09/05/25(Fri)10:39:26 No.106492042

Anonymous 09/05/25(Fri)10:39:26 No.106492042

>>106491962
Ask any zoomer.

Anonymous
09/05/25(Fri)10:39:55 No.106492045

Anonymous 09/05/25(Fri)10:39:55 No.106492045

>>106491968
They are way more accurate in describing what's actually happening onscreen than any imaginable llm. If you want NL, just postprocess them and feed the ones relevant to porn to a small uncensored llm, you've got your natural language and no fucking factual mistakes.

Anonymous
09/05/25(Fri)10:44:11 No.106492077

Anonymous 09/05/25(Fri)10:44:11 No.106492077

>>106491968
I know this is b8 but theres never been a successful project that involved retagging booru with NLP. It always makes it worse.

Anonymous
09/05/25(Fri)10:44:54 No.106492078

Anonymous 09/05/25(Fri)10:44:54 No.106492078

>>106491961
MAT is finicky to set up, and works only at 512x512. Which is why it didn't see as much adoption as lama, I guess. Lama is a no-brainer.

Anonymous
09/05/25(Fri)10:45:09 No.106492081

Anonymous 09/05/25(Fri)10:45:09 No.106492081

>>106491461
oh my God, I could have so many Chinese auntie fantasies fulfilled with this. That Asian woman looks perfect for me

Anonymous
09/05/25(Fri)10:45:48 No.106492087

Anonymous 09/05/25(Fri)10:45:48 No.106492087

>>106492077
I don't want to retag. I need brand new tags.

Anonymous
09/05/25(Fri)10:48:29 No.106492108

Anonymous 09/05/25(Fri)10:48:29 No.106492108

cant wait for the masses to begin spouting the age old "I can't get AI to make the picture I want therefore it's bad" argument once this shit actually hits mainstream

Anonymous
09/05/25(Fri)10:48:52 No.106492110

Anonymous 09/05/25(Fri)10:48:52 No.106492110

>>106490093
Literally a segfault or does it throw some sort of error? Haven't seen that sorry, just various kinds of RAM and VRAM exhaustion errors.

Anonymous
09/05/25(Fri)10:49:31 No.106492117

Anonymous 09/05/25(Fri)10:49:31 No.106492117

File: ComfyUI_WAN2.2__00014.mp4 (418 KB, 672x504)

418 KB MP4

>>106491927
>>106492039
I do, guess the workflow could be optimized further, need to test some more. That was literally only my 2nd Qwen gen.

Anonymous
09/05/25(Fri)10:49:52 No.106492122

Anonymous 09/05/25(Fri)10:49:52 No.106492122

>>106492108
>once this shit actually hits mainstream
Bro, it's been three years since SD first release.

Anonymous
09/05/25(Fri)10:53:19 No.106492148

Anonymous 09/05/25(Fri)10:53:19 No.106492148

>>106492122
It's funny going back and read anon predictions from back then and see how much almost everyone was completely wrong.

Anonymous
09/05/25(Fri)10:56:34 No.106492165

Anonymous 09/05/25(Fri)10:56:34 No.106492165

>>106492122
How many generate images daily or at least weekly do you think

Anonymous
09/05/25(Fri)10:58:03 No.106492177

Anonymous 09/05/25(Fri)10:58:03 No.106492177

>>106492165
You tell me https://openrouter.ai/rankings#images

Anonymous
09/05/25(Fri)10:59:19 No.106492188

Anonymous 09/05/25(Fri)10:59:19 No.106492188

>>106492148
Such as?

Anonymous
09/05/25(Fri)11:00:21 No.106492198

Anonymous 09/05/25(Fri)11:00:21 No.106492198

>>106492177
Is that not a graph of total images? Not the same as asking "how many people..."

Anonymous
09/05/25(Fri)11:00:46 No.106492203

Anonymous 09/05/25(Fri)11:00:46 No.106492203

>new model comes out with improved prompt adherence
>find out it can do specific stuff that the old model couldn't
>start prompting for even more specific stuff
>it can't do it
It's amazing how we can keep making steady progress yet there is still so much room for improvement.

Anonymous
09/05/25(Fri)11:03:27 No.106492222

Anonymous 09/05/25(Fri)11:03:27 No.106492222

>>106492198
>Acktually
I accept your concession on your "until it hits mainstream"

Anonymous
09/05/25(Fri)11:04:47 No.106492230

Anonymous 09/05/25(Fri)11:04:47 No.106492230

File: ComfyUI_WAN2.2_00002.mp4 (544 KB, 672x504)

544 KB MP4

>>106492117

Anonymous
09/05/25(Fri)11:04:49 No.106492231

Anonymous 09/05/25(Fri)11:04:49 No.106492231

>>106492203
All current open models still cannot do some stuff even dalle3 could do back in the day.
Try making subjects do the dab pose, the korean finger heart gesture even on Qwen, it fails

Anonymous
09/05/25(Fri)11:12:42 No.106492294

Anonymous 09/05/25(Fri)11:12:42 No.106492294

>>106492188
- we won't use GPUs at all for gen and instead use specialized cards
- we would have perfect models locally

and many other random other ones with anons very sure of themselves

Anonymous
09/05/25(Fri)11:14:07 No.106492304

Anonymous 09/05/25(Fri)11:14:07 No.106492304

>>106492122
>he thinks stable diffusion started it all
>>106492222
Doesn't change the fact that we have no hard numbers on daily users across the board cloud and local

Anonymous
09/05/25(Fri)11:14:08 No.106492305

Anonymous 09/05/25(Fri)11:14:08 No.106492305

>>106492148
the most wrong crowd and opinion were ai doomers, it wont ever be better than this for sure, its a fad, the bubble will pop etc

Anonymous
09/05/25(Fri)11:14:58 No.106492310

Anonymous 09/05/25(Fri)11:14:58 No.106492310

>>106492230
can i ask for the workflow for this?

Anonymous
09/05/25(Fri)11:16:20 No.106492318

Anonymous 09/05/25(Fri)11:16:20 No.106492318

>>106492304
I really shouldn't entertain autists

Anonymous
09/05/25(Fri)11:16:45 No.106492321

Anonymous 09/05/25(Fri)11:16:45 No.106492321

>how many gun owners are there in the world?
>heres a chart of bullets fired per day
>that doesnt answer my question
>i accept your concession
???

Anonymous
09/05/25(Fri)11:18:51 No.106492336

Anonymous 09/05/25(Fri)11:18:51 No.106492336

File: 1730318350837201.png (5 KB, 214x101)

5 KB PNG

>cumfart ui now doesnt only need a 140gb + tip pagefile to not crash on a 128gb ram system it also recently fucked memory management inside vram too
great

Anonymous
09/05/25(Fri)11:19:40 No.106492340

Anonymous 09/05/25(Fri)11:19:40 No.106492340

>>106492321
>lateral thinking isn't my thing
I figured already

Anonymous
09/05/25(Fri)11:20:02 No.106492344

Anonymous 09/05/25(Fri)11:20:02 No.106492344

>only 24GB
oof.....

Anonymous
09/05/25(Fri)11:20:49 No.106492345

Anonymous 09/05/25(Fri)11:20:49 No.106492345

>when I lie on the internet

Anonymous
09/05/25(Fri)11:21:01 No.106492347

Anonymous 09/05/25(Fri)11:21:01 No.106492347

>>106492336
What are you running that needs that much memory?

Anonymous
09/05/25(Fri)11:22:08 No.106492355

Anonymous 09/05/25(Fri)11:22:08 No.106492355

>>106492230
kek

Anonymous
09/05/25(Fri)11:22:20 No.106492357

Anonymous 09/05/25(Fri)11:22:20 No.106492357

>>106492340
I'd ask you to elaborate on how you derived daily or weekly users but I know you'll respond with another ad hominem

Anonymous
09/05/25(Fri)11:22:39 No.106492361

Anonymous 09/05/25(Fri)11:22:39 No.106492361

>>106492305
>local ai will be fully banned next year, mark my words!
>we will never have anything better than sd1.5!
>all jobs will disappear by next year!

Anonymous
09/05/25(Fri)11:22:41 No.106492362

Anonymous 09/05/25(Fri)11:22:41 No.106492362

>>106492347
the vram fuckery is happening with basic noobai workflow that worked normally until recently, ram problems started since qwen image edit

Anonymous
09/05/25(Fri)11:23:22 No.106492368

Anonymous 09/05/25(Fri)11:23:22 No.106492368

>>106492336
use another web ui, don't karen the thread

Anonymous
09/05/25(Fri)11:24:50 No.106492373

Anonymous 09/05/25(Fri)11:24:50 No.106492373

>>106492368
no other ui supports all video gen features and optimizations, vramlet

Anonymous
09/05/25(Fri)11:25:21 No.106492377

Anonymous 09/05/25(Fri)11:25:21 No.106492377

>>106492336
Have you updated to the fixed version? No way you need this for fucking noob

Anonymous
09/05/25(Fri)11:25:42 No.106492383

Anonymous 09/05/25(Fri)11:25:42 No.106492383

File: ComfyUI_WAN2.2_00004.mp4 (503 KB, 672x504)

503 KB MP4

>>106492230
>>106492310
its this civitai. com/models /1818841 /wan-22-workflow-t2v-i2v-t2i-kijai-wrapper

some modifications, broadly that.

Anonymous
09/05/25(Fri)11:26:18 No.106492388

Anonymous 09/05/25(Fri)11:26:18 No.106492388

>>106492377
everything is the latest version, but it does seem like that vram fuckery fixed itself for all images after the first generated image finished

Anonymous
09/05/25(Fri)11:27:43 No.106492402

Anonymous 09/05/25(Fri)11:27:43 No.106492402

>>106492294
>- we won't use GPUs at all for gen and instead use specialized cards
Maybe people said this in LLM threads, never saw anyone in imagegen threads claming gpus would be replaced
>>106492294
>- we would have perfect models locally
lol, not even cloud/saas models are "perfect". What we still don't have, though, are local models with vast knowledge in pop culture, celebrities etc. Dalle3 used to be amazing at these, it did feel it was trained at everything. Prior being censored or when jailbreaks worked, I remember people pointing out it even knew who AOC (the congresswoman) was

Anonymous
09/05/25(Fri)11:27:47 No.106492403

Anonymous 09/05/25(Fri)11:27:47 No.106492403

File: ComfyUI_00457_.png (1.44 MB, 1328x1328)

1.44 MB PNG

>>106492231
skill issue, just let llm write the prompt
>describe the image of a person doing a dab pose, focus on body part positioning, in 200 words

Anonymous
09/05/25(Fri)11:33:23 No.106492435

Anonymous 09/05/25(Fri)11:33:23 No.106492435

>>106492357
Openrouter is giving you an estimate of around 38.7M images generated weekly. That's from startups or direct users since the main labs aren't displaying the number of gens. You can easily infer that all main labs gens combined >> Openrouter gens.
Generating images isn't like generating text where you can have agents consuming a lot of tokens without your supervision. Generating images isn't like generating text where you can have agents consuming a lot of tokens without your supervision. Besides, when generating images on an online service, you're either hit by hard limits if you get a subscription or the cost would go through the roof if you pay per request, that by design would limit the number of gens per person. At most 20-30 gens per person per day seems a proper estimate. That gives a few million people per week generating images. If that's not mainstream, I don't know what it is.

Anonymous
09/05/25(Fri)11:36:53 No.106492458

Anonymous 09/05/25(Fri)11:36:53 No.106492458

>>106492321
Imagine keeping a shit diary instead a food diary. Might be a fun way to scare housewives with the calorie tracker app.
>what did you eat
>here's a pic of the turd

Anonymous
09/05/25(Fri)11:41:25 No.106492492

Anonymous 09/05/25(Fri)11:41:25 No.106492492

>>106492435
nta but you're forgetting to account for the chinese labs mass genning to curate a large synthetic dataset for their next local sota model

Anonymous
09/05/25(Fri)11:42:44 No.106492498

Anonymous 09/05/25(Fri)11:42:44 No.106492498

I am using Qwen-Image with Replicate. What is the prompt and negative_prompt to use to make the person in the image totally nude? They usually still keeping their underwear on.

Anonymous
09/05/25(Fri)11:44:24 No.106492510

Anonymous 09/05/25(Fri)11:44:24 No.106492510

>>106492435
Imagine a world where the average person generated only 20 to 30 images a day. How do we account for those with intense autism as in the kind present ITT.

Anonymous
09/05/25(Fri)11:45:36 No.106492521

Anonymous 09/05/25(Fri)11:45:36 No.106492521

>>106492498
>X-ray women
That takes me back

Anonymous
09/05/25(Fri)11:49:03 No.106492551

Anonymous 09/05/25(Fri)11:49:03 No.106492551

>>106492498
Explicitly prompt for genital words mostly gets the underwear off, but the training data is completely devoid of genitals anyway so all you get is body horror that looks kinda like scrotum.

Anonymous
09/05/25(Fri)11:58:58 No.106492614

Anonymous 09/05/25(Fri)11:58:58 No.106492614

File: chromasome.jpg (278 KB, 1376x1072)

278 KB JPG

Anonymous
09/05/25(Fri)12:02:36 No.106492641

Anonymous 09/05/25(Fri)12:02:36 No.106492641

File: Berserk Colored by hyakurin 03.jpg (1.22 MB, 2194x1588)

1.22 MB JPG

>>106492383
Has anyone tried to animate scenes from the Berserk manga yet?

Anonymous
09/05/25(Fri)12:04:30 No.106492655

Anonymous 09/05/25(Fri)12:04:30 No.106492655

File: rikka3.mp4 (1.54 MB, 720x1024)

1.54 MB MP4

Anonymous
09/05/25(Fri)12:06:58 No.106492677

Anonymous 09/05/25(Fri)12:06:58 No.106492677

Anyone finding wan2.2 high noise being uncensored? I clearly saw pussy in the high noise preview but once the low noise begins it immediately puts underwear on top.
Is there a way to utilize this?

Anonymous
09/05/25(Fri)12:08:12 No.106492690

Anonymous 09/05/25(Fri)12:08:12 No.106492690

>>106492677
You have two options:
1. Get lucky
2. Use loras

Anonymous
09/05/25(Fri)12:08:30 No.106492693

Anonymous 09/05/25(Fri)12:08:30 No.106492693

File: ComfyUI_temp_gurpb_00001_.png (3.19 MB, 1152x1152)

3.19 MB PNG

Somehow I accidentally created a reddit sloppa

Anonymous
09/05/25(Fri)12:09:05 No.106492698

Anonymous 09/05/25(Fri)12:09:05 No.106492698

>>106492614
looks like shit

Anonymous
09/05/25(Fri)12:10:16 No.106492706

Anonymous 09/05/25(Fri)12:10:16 No.106492706

>$150000 later...

Anonymous
09/05/25(Fri)12:15:25 No.106492747

Anonymous 09/05/25(Fri)12:15:25 No.106492747

File: ComfyUI_00151_.mp4 (694 KB, 592x816)

694 KB MP4

>increasing image resolution or length increases the generation time exponentially
when will this be solved?

Anonymous
09/05/25(Fri)12:21:21 No.106492783

Anonymous 09/05/25(Fri)12:21:21 No.106492783

>>106492698
brap

Anonymous
09/05/25(Fri)12:24:35 No.106492804

Anonymous 09/05/25(Fri)12:24:35 No.106492804

>>106492747
Heat death of the universe

Anonymous
09/05/25(Fri)12:25:05 No.106492810

Anonymous 09/05/25(Fri)12:25:05 No.106492810

>>106492747
NICE TITS

Anonymous
09/05/25(Fri)12:25:58 No.106492815

Anonymous 09/05/25(Fri)12:25:58 No.106492815

>>106492747
wasn't radial attention supposed to fix this

Anonymous
09/05/25(Fri)12:29:43 No.106492841

Anonymous 09/05/25(Fri)12:29:43 No.106492841

>>106492747
when you understand how math works

Anonymous
09/05/25(Fri)12:38:05 No.106492912

Anonymous 09/05/25(Fri)12:38:05 No.106492912

>>106492403
>Just write 50 words to describe a single concept, bro

Anonymous
09/05/25(Fri)12:41:31 No.106492950

Anonymous 09/05/25(Fri)12:41:31 No.106492950

>>106492403
>>106492912
this, I don't want to write a bible to describe something that can be said with a single word, that's dumb

Anonymous
09/05/25(Fri)12:42:11 No.106492959

Anonymous 09/05/25(Fri)12:42:11 No.106492959

>>106492841
We need quantum deep networks in neuronal quantum chips.
Fuck math.

Anonymous
09/05/25(Fri)12:43:02 No.106492971

Anonymous 09/05/25(Fri)12:43:02 No.106492971

>>106492747
whats the name of this semen demon

Anonymous
09/05/25(Fri)12:51:33 No.106493053

Anonymous 09/05/25(Fri)12:51:33 No.106493053

>>106492971
Dave Mustaine

Anonymous
09/05/25(Fri)12:52:12 No.106493062

Anonymous 09/05/25(Fri)12:52:12 No.106493062

Ah btw.
I just read that chatterbox multilingual was released today and now supports 23 languages.

Anonymous
09/05/25(Fri)12:52:24 No.106493064

Anonymous 09/05/25(Fri)12:52:24 No.106493064

>>106492959
that's all math

Anonymous
09/05/25(Fri)12:55:57 No.106493090

Anonymous 09/05/25(Fri)12:55:57 No.106493090

>>106493062
Holy shiet, ty

Anonymous
09/05/25(Fri)12:58:30 No.106493106

Anonymous 09/05/25(Fri)12:58:30 No.106493106

File: twist my nipples and call(...).png (44 KB, 1300x226)

44 KB PNG

>>106493062
hejsan homopojkar

Anonymous
09/05/25(Fri)12:59:26 No.106493118

Anonymous 09/05/25(Fri)12:59:26 No.106493118

File: 1747054662507336.png (1.27 MB, 1104x1472)

1.27 MB PNG

>>106492912
I don't know why that anon used an LLM, it worked with just "man doing the dab pose".
Being said llm enhancement is used by pretty much all the closed source models so not a bad tactic.

Anonymous
09/05/25(Fri)13:05:57 No.106493173

Anonymous 09/05/25(Fri)13:05:57 No.106493173

File: Qwan_00013_.jpg (817 KB, 1896x2856)

817 KB JPG

>>106492614
Heh. My Chroma generations of my prompt turned out exactly like that, with some absolute drag queens.

Anonymous
09/05/25(Fri)13:07:40 No.106493198

Anonymous 09/05/25(Fri)13:07:40 No.106493198

What’s the current workflow to gen longer videos in wan 2.2 apart from just raising the frames? I managed to raise it to 144 frames for 9 secs but there’s probably a way to use the last frame to double the length no? Unless consistency would be an issue

Anonymous
09/05/25(Fri)13:07:53 No.106493200

Anonymous 09/05/25(Fri)13:07:53 No.106493200

>>106493118
>llm enhancement is used by pretty much all the closed source models so not a bad tactic.
yeah fair, boomer prompting always improve prompt adherence so they probably all use this on their API models

Anonymous
09/05/25(Fri)13:30:39 No.106493422

Anonymous 09/05/25(Fri)13:30:39 No.106493422

NeoForgeGODS I want to use Chroma, which things do I have to download and where to put them? And which settings

Anonymous
09/05/25(Fri)13:31:39 No.106493428

Anonymous 09/05/25(Fri)13:31:39 No.106493428

>>106493173
Well let's be real, most of the chinamen women are complete dumpsterfires

Anonymous
09/05/25(Fri)13:32:35 No.106493436

Anonymous 09/05/25(Fri)13:32:35 No.106493436

>>106493422
>>106493173
Forget it. I want to run Qwen. I hate the Chroma trannies. Chroma blends the sexes. It appears as if Lodestone didn't tag male and female. You can probably have a female with male feet dataset.
Disgusting.

Anonymous
09/05/25(Fri)13:34:32 No.106493452

Anonymous 09/05/25(Fri)13:34:32 No.106493452

>>106493436
>Forget it. I want to run Qwen.
qwen is so fucking slopped though, fortunately there's some loras to fix that
https://civitai.com/models/1927710?modelVersionId=2181911

Anonymous
09/05/25(Fri)13:36:48 No.106493477

Anonymous 09/05/25(Fri)13:36:48 No.106493477

>>106493436
>Chroma blends the sexes.
Did you forget to put "transexual, tranny, masculine, LGBT" in the negatives

Anonymous
09/05/25(Fri)13:47:00 No.106493558

Anonymous 09/05/25(Fri)13:47:00 No.106493558

anons, I want to train a lora of this insta model. She's got such a unique look that it didn't work in 1.5 back then.. So whats the best model to train on for max versatility now...
xl? pony? chroma? wan?

Anonymous
09/05/25(Fri)13:48:22 No.106493571

Anonymous 09/05/25(Fri)13:48:22 No.106493571

>>106493198
Seconding this. 100% its going to be some "take last frame" bullshit. Cant wait to get out of 5-8 second hell

Anonymous
09/05/25(Fri)13:49:44 No.106493581

Anonymous 09/05/25(Fri)13:49:44 No.106493581

>>106493571
>Cant wait to get out of 5-8 second hell
just two more years and a minimum of 98gbs of vram

Anonymous
09/05/25(Fri)13:54:30 No.106493622

Anonymous 09/05/25(Fri)13:54:30 No.106493622

File: Qwan_00017_.jpg (862 KB, 1896x2856)

862 KB JPG

>>106493436
That would explain the manhands that it likes to give women, heh.
>>106493428
Dunno if I'd agree, saw a lot of real cuties during my time there.

Anonymous
09/05/25(Fri)13:54:58 No.106493623

Anonymous 09/05/25(Fri)13:54:58 No.106493623

>>106493571
>Cant wait to get out of 5-8 second hell
good luck with that, you need a shit ton of memory to make long videos

Anonymous
09/05/25(Fri)13:57:24 No.106493652

Anonymous 09/05/25(Fri)13:57:24 No.106493652

>>106491461
Can you make him hump her?

Anonymous
09/05/25(Fri)13:58:26 No.106493660

Anonymous 09/05/25(Fri)13:58:26 No.106493660

>>106493571
Is 8s the max we can stretch on WAN?

Anonymous
09/05/25(Fri)13:58:57 No.106493665

Anonymous 09/05/25(Fri)13:58:57 No.106493665

File: 79022355.png (1.77 MB, 1024x1024)

1.77 MB PNG

>>106493581
>>106493623
Animatediff can do long (shitty) videos. Wonder if someone who is smart like take some of that technology and stick it in wan some how

Anonymous
09/05/25(Fri)13:59:23 No.106493669

Anonymous 09/05/25(Fri)13:59:23 No.106493669

File: 1732955633242405.png (1.09 MB, 1397x1365)

1.09 MB PNG

https://xcancel.com/bdsqlsz/status/1963984028476014841#m
there's more examples about that chinese model that will be (soon?) released locally
>Advantages: Native 2K output, default is high-definition result

Anonymous
09/05/25(Fri)14:01:08 No.106493690

Anonymous 09/05/25(Fri)14:01:08 No.106493690

im gunna put anistudio in the next OP

Anonymous
09/05/25(Fri)14:01:11 No.106493691

Anonymous 09/05/25(Fri)14:01:11 No.106493691

>>106493477
does that actually work

Anonymous
09/05/25(Fri)14:01:39 No.106493694

Anonymous 09/05/25(Fri)14:01:39 No.106493694

>>106493198
you could have it so that it does a 5-9 second video and then have it auto-pickup the last frame with a secondary prompt that queues after the first clip finishes

you could technically automate a third and fourth rotation as well with a workflow.

Anonymous
09/05/25(Fri)14:02:40 No.106493701

Anonymous 09/05/25(Fri)14:02:40 No.106493701

>>106490834
Nobody else has posted anything, could you post some sample audio?

Anonymous
09/05/25(Fri)14:02:49 No.106493705

Anonymous 09/05/25(Fri)14:02:49 No.106493705

>>106493660
It can go longer but you have to have a shit ton of vram. Longest I can go is 8 secs on the all-in-slop wan 2.2/2.1 by phr00t

Anonymous
09/05/25(Fri)14:02:50 No.106493706

Anonymous 09/05/25(Fri)14:02:50 No.106493706

>>106493173
what I would give for a full nsfw finetune of that model instead of the horrors chroma spits out by default

Anonymous
09/05/25(Fri)14:04:00 No.106493718

Anonymous 09/05/25(Fri)14:04:00 No.106493718

>>106493669
>anime figurines as a test
stuff no western company will ever do

Anonymous
09/05/25(Fri)14:05:20 No.106493727

Anonymous 09/05/25(Fri)14:05:20 No.106493727

>>106493669
inch resting. native 2k doesnt have as much pull anymore desu since it should be expected of modern models. but still cool.

Anonymous
09/05/25(Fri)14:05:20 No.106493729

Anonymous 09/05/25(Fri)14:05:20 No.106493729

>>106493718
yep, western dogs are devoid of fun, china still has its sovl (and it's even more noble of china to use some Japanese anime references when you know that Japan is China's biggest rival)

Anonymous
09/05/25(Fri)14:06:23 No.106493738

Anonymous 09/05/25(Fri)14:06:23 No.106493738

>>106493727
>native 2k doesnt have as much pull anymore desu since it should be expected of modern models.
is it? I have yet to see models that were trained on 2k, Chroma was trained at 512x resolution for the vast majority of its process

Anonymous
09/05/25(Fri)14:10:12 No.106493781

Anonymous 09/05/25(Fri)14:10:12 No.106493781

>>106493705
>but you have to have a shit ton of vram
how much? the most i can get is 32 with the 5090.

>inb4 buy a 6000 blackwell
does anyone here even have one?

Anonymous
09/05/25(Fri)14:10:25 No.106493783

Anonymous 09/05/25(Fri)14:10:25 No.106493783

>>106493718
Western companies would probably avoid cute women and hot models altogether, anime or not.

Anonymous
09/05/25(Fri)14:11:03 No.106493788

Anonymous 09/05/25(Fri)14:11:03 No.106493788

File: Qwan_00019_.jpg (700 KB, 1896x2856)

700 KB JPG

>>106493706
I don't really gen NSFW but man, that and getting rid of some of the slopped faces/backgrounds would honestly be my dream model. I still like it a lot, at least in terms of genning 'clean 1girl' stuff. For styles Chroma still wins out.
>>106493669
I don't know if I love the output.
There's some prompt bleeding on the box and the figure just looks weird in general, not really like a figurine would.
Still, looking forward to new toys.

Anonymous
09/05/25(Fri)14:11:22 No.106493792

Anonymous 09/05/25(Fri)14:11:22 No.106493792

>>106493781
>does anyone here even have one?
one or two anons, I think one of them from their work and the other is rich enough to afford getting one

Anonymous
09/05/25(Fri)14:12:25 No.106493804

Anonymous 09/05/25(Fri)14:12:25 No.106493804

>>106493788
>I don't really gen NSFW but man, that and getting rid of some of the slopped faces/backgrounds would honestly be my dream model. I still like it a lot, at least in terms of genning 'clean 1girl' stuff. For styles Chroma still wins out.
how much "bleed" is there when there are two characters?
chroma is awful at that

Anonymous
09/05/25(Fri)14:12:28 No.106493805

Anonymous 09/05/25(Fri)14:12:28 No.106493805

>>106493788
>I don't really gen NSFW but man, that and getting rid of some of the slopped faces/backgrounds would honestly be my dream model
same, and it should be an edit model so that you can use any image input as a character reference, would be the perfect model

Anonymous
09/05/25(Fri)14:14:26 No.106493816

Anonymous 09/05/25(Fri)14:14:26 No.106493816

>>106493669
the contrast on the left stinks of dpo sloppa

Anonymous
09/05/25(Fri)14:15:04 No.106493822

Anonymous 09/05/25(Fri)14:15:04 No.106493822

>>106493062
which chatterbox? there's like 3 different webui's

Anonymous
09/05/25(Fri)14:17:29 No.106493836

Anonymous 09/05/25(Fri)14:17:29 No.106493836

>>106493781
Some anons have one those. Even then, doesnt matter how long we can make it, wan's context is shit and it'll slop out and loose consistency on long gens. We're just waiting for radial attention or another technology that'll hopefully solve this issue

Anonymous
09/05/25(Fri)14:18:07 No.106493847

Anonymous 09/05/25(Fri)14:18:07 No.106493847

>>106493816
yeah, it even has the manlet effect kek

Anonymous
09/05/25(Fri)14:19:03 No.106493855

Anonymous 09/05/25(Fri)14:19:03 No.106493855

File: Qwan_00020_.jpg (728 KB, 1896x2856)

728 KB JPG

>>106493804
See
>>106490852
>>106490951
>>106491000
There still is some bleed, but way less than I have seen in most models, and it's mostly small stuff like accessories (rings, make-up). Hair styles, poses and general clothing have usually been on point for me.

>Floating koi
It's magic, for sure.

Anonymous
09/05/25(Fri)14:20:18 No.106493863

Anonymous 09/05/25(Fri)14:20:18 No.106493863

>>106493855
i want a pet koi so badly

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.