Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106447640

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Samplers: https://stable-diffusion-art.com/samplers/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
genuine thread
untouched by discordtranny hands
I shall be using this one
>>106451254
Catbox please I want to gen more of these.
>NSFW on a worksafe board
>>106452041
I'm all cosy in my blankey right now but the prompt was something like: large African American man wearing nothing but a towel approaches from behind and violently pushes the woman's head under the water.
>>106451834
>>Miku Hatsune does some stupid action
say no more senpai
>>106452065
Thanks, I'll give it a try. Goodnight sir.
Blessed thread of frenship
>>106452071
>>106452077
Finally some good anime
Just train a lora anon
>>106452071
ponder the aroma
>>106451972
based, fuck those guys pushing for their stupid shitty barely functioning guy frfr no cap
>>106452178
i dont have the hardware :(
>>106452178
Been thinking about trying that, just for the fun of it, but I wouldn't even know what for (or what model to use). And, to be honest, I don't even know how to build a decent dataset.
>>106452189
What hardware do you have?
Is it possible to gen videos with RTX 5070 TI? Also can SwarmUI do it?
>>106452231
>I don't even know how to build a decent dataset.
its as easy as grabbing the highest quality images of the thing you want to train. include variety of course. then use ai to caption the images and make adjustments. it's as simple as that.
some people let the lora trainer bucket their images (bucketing means the trainer crops/resizes the images to the closest resolution it supports), but i've found I got much, much better results by manually cropping/resizing the images myself.
some autistic anons use charts and other bullshit to analyze loss over time but all that shit is unnecessary.
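That manual crop/resize step can be sketched in a few lines. A minimal sketch assuming Pillow is installed; the bucket list here is just an illustrative SDXL-style set, not what any particular trainer actually uses:

```python
from pathlib import Path
from PIL import Image

# Hypothetical bucket list; real trainers (kohya, OneTrainer) use their own sets.
BUCKETS = [(1024, 1024), (832, 1216), (1216, 832), (896, 1152), (1152, 896)]

def closest_bucket(w, h):
    """Pick the bucket whose aspect ratio is nearest the image's."""
    ar = w / h
    return min(BUCKETS, key=lambda b: abs(b[0] / b[1] - ar))

def crop_resize(img: Image.Image) -> Image.Image:
    """Center-crop to the bucket aspect ratio, then resize to the bucket."""
    bw, bh = closest_bucket(*img.size)
    w, h = img.size
    target_ar = bw / bh
    if w / h > target_ar:          # too wide: crop width
        new_w = int(h * target_ar)
        left = (w - new_w) // 2
        img = img.crop((left, 0, left + new_w, h))
    else:                          # too tall: crop height
        new_h = int(w / target_ar)
        top = (h - new_h) // 2
        img = img.crop((0, top, w, top + new_h))
    return img.resize((bw, bh), Image.LANCZOS)

def prep_folder(src: str, dst: str):
    """Run every image in src through crop_resize and save as PNG in dst."""
    Path(dst).mkdir(parents=True, exist_ok=True)
    for p in Path(src).glob("*"):
        if p.suffix.lower() in {".png", ".jpg", ".jpeg", ".webp"}:
            crop_resize(Image.open(p).convert("RGB")).save(Path(dst) / f"{p.stem}.png")
```

Doing this yourself instead of letting the trainer auto-bucket means you control exactly what gets cropped out of frame.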
>>106452231
In terms of hardware demands in training, Wan / Qwen needs the most, then Flux, then Chroma, then SDXL, then SD15.
As for dataset, it depends very much on what you want to train.
>>106452178
i transform photos into anime in qwen edit, run several upscales with illustrious/pony and different artist loras and sort out extremely strictly - to create high-quality pairs. I add nsfw, landscapes, objects, people and various artists.
Two loras are planned: anime > photo and photo > anime.
A bit tedious, but I'm making good progress.
Is this welcome in the community?
>>106452261
>Is it possible to gen videos with RTX 5070 TI?
yes
>>106452178
Ahhhhg manly hands, let me guess... Chroma right?
I'm so tired of 5070ti. It can't run qwen q8 fast. I demand more vram
>>106452261
>16gb vram
Should be fine, don't let the vramlet sayers get you down.
>>106452189
>dont have the hardware
There's always renting
>>106452231
>what for (or what model to use)
Train for what you want to see, but cannot do currently. Chroma1-HD is the best base model for local training imo.
>>106452283
>Two loras are planned for anime > photo and photo > anime.
Nice
>>106452301
>Chroma right?
Yes, perspective issue. It has trouble sizing things realistically. Is that what he trimmed off and didn't realize it?
thoughts on anisora v3? kijai released fp8 scaled models but I'm not sure how to implement them or what the best settings are
https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled/tree/main/I2V/AniSora
Why the fuck is a 3090 so much better than modern cards?
>>106452342
by modern you mean like a 5060 or some shit? 3090 was top of the line and has 24gb vram
Hey hey Anon, Anon here.
Gave Chroma HD1 another run with some sampler/schedulers I wanted to try out and I like the way skin detail/face variance has been heading in compared to older versions. I really like it, apart from the usual hands and body horror sometimes.
I think genning at higher res (just used the Qwen default for 9:16) really helps.
Anyway, nothing new, but here's some plots.
Full sizes
Titty Fairy: https://file.garden/aIdN6xfH0QVghCy0/ChromaHD1-tests/ComfyUI_00001_.jpg
Trash girl: https://file.garden/aIdN6xfH0QVghCy0/ChromaHD1-tests/ComfyUI_00002_.jpg
Rat girl: https://file.garden/aIdN6xfH0QVghCy0/ChromaHD1-tests/ComfyUI_00003_.jpg
>>106452342
modern cards?
4090 & 5090 are better. 3090 is better than 5070ti/etc because of the 24gb vram. vram is king for wan gens. less vram means some of the model has to be offloaded to system ram which makes gens significantly slower.
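Back-of-the-envelope math for that offloading tradeoff; the bytes-per-param figures and the activation headroom are rough assumptions, not measured values:

```python
def offload_estimate(params_b: float, bytes_per_param: float,
                     vram_gb: float, overhead_gb: float = 4.0):
    """Rough split of model weights between VRAM and system RAM.

    params_b: parameter count in billions (e.g. 14 for one Wan 2.2 expert)
    bytes_per_param: 2.0 for fp16/bf16, 1.0 for fp8, roughly 0.56 for Q4 GGUF
    overhead_gb: assumed headroom for activations/latents/VAE (a guess)
    """
    weights_gb = params_b * bytes_per_param
    usable = max(vram_gb - overhead_gb, 0)
    in_vram = min(weights_gb, usable)
    offloaded = weights_gb - in_vram
    return round(in_vram, 1), round(offloaded, 1)

# 14B fp16 on a 16GB card: more than half the weights spill to system RAM
print(offload_estimate(14, 2.0, 16))   # -> (12.0, 16.0)
# same model at fp8 on a 24GB 3090: fits entirely, no PCIe shuffling
print(offload_estimate(14, 1.0, 24))   # -> (14.0, 0.0)
```

Every gigabyte in the "offloaded" column gets streamed over PCIe each step, which is where the slowdown on smaller cards comes from.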
>>106452178
I've been using it to gen FB MILFs I know being freaks, I can't stop genning.
>>106452189
You can train on as low as 12GB VRAM comfortably on onetrainer, they even have presets to use 8GB.
>>106452231
A dataset can legitimately be made out of like 12 images, my convergence point for Chroma has been around 1800-ish steps for 12/13 images so far.
>>106452261
16GB is more than enough. Just use comfy.
>>106452354
GOAT!!!! I was waiting for this. it seems like chroma performs better if the smallest edge is at least over 1024.
>>106452342
Ehh... I have a 3090, and a 5090 is about 2x its performance on the same ai workload.
I would gladly trade. That said, my dusty old 3090 is still going strong, it's a really good card. Overall the 30xx era was the best, with the 3060 Ti being the best bang for buck of all Nvidia cards to date.
>>106452399
if only it was just 2x the price
>>106452399>3060 Ti being the best bang for buck of all Nvidia cards to date1080GODS, what did this nigger mean by this?
>>106452397
>they even have presets to use 8GB.
Gotta hand it to OneTrainer, they do some magic with offloading where it allows for extremely low vram requirements and yet very little performance penalty.
>>106452418
old and busted
>>106452418
I speak the truth, do not reject my message!
>>106452399
All that time and the new card is only twice the performance; hardly great. Especially for the price.
>>106452354
ty, always interesting! beta seems to win. Have you tried OSS?
>>106452342
>3090
It's nice being in the 24GB club right? This 4090 I bought has paid for itself so many times.
>>106452397
>I can't stop genning.
Same. I've been quiet since the release because there isn't much I can share
>>106452433
performance uplifts are getting harder to achieve
Hardware can be the difference between a job and unemployment. Secure your future as a prompt engineer by purchasing the RTX 5090 today.
>>106452448
yeah, it's so hard a random chink in his basement can install 96gb vram onto a card that has 24gb without touching anything about the core of the gpu
I tried to use wan 2.2 but it goes to shit pretty quickly. I followed the rentry guide, and on workflow ldg_2_2_i2v_14b_480p.json, I get this error when I run a job:
"Set LoRA node does not use low_mem_load and can't merge LoRAs, disable 'merge_loras' in the LoRA select node."
It goes through when I disable it but the generation is fucked. Any idea what the issue could be? Running a rtx 5090
>>106452458
that's not a performance uplift you mouth breathing retard
>>106452433
True, it's two generations apart, Nvidia sure is coasting on the lack of competition.
I use ComfyUI. How can I make images with more than one character? When I do, it mixes the prompts of the characters... so I think there has to be a better way than just putting everything in CLIP Text Encode?!
>>106452458
speaking of, I wonder if nvidia is changing anything about their board designs or drivers to prevent that from happening again, or if they're avoiding it in order not to tank their Chinese partners.
>>106452469
>running a model almost 4 times the size that is much much better at fast speed that was impossible before since it didnt fit into vram is not a performance improvement, only core clock ghz improvements count!!!
talk about a mouthbreathing retard, lmao
>>106452447
>Same. I've been quiet since the release because there isn't much I can share
My guilty pleasure is genning them in NSFW I2V and then adding a bit of bitrate compression and throwing it on chatpic lol, nobody can even sus out that they're AI.
>>106452480
I doubt it's a huge enough industry to be a threat to their dominance
if they did implement counter measures it would be incredibly petty
>>106452478
regional prompting. generally this is the shortcoming/flaw of SDXL.
>>106452469
I know what you meant, but being able to load a lot more of, or even an entire model into vram is a massive performance uplift.
The less you need to offload, the faster things go.
>>106452505
you, you're the retard that doesn't understand what like for like comparisons are
>>106452499
>regional prompting
elaborate? I'm a total beginner
>>106452468
well did you disable merge_loras?
>>106452512
>that performance improvement doesn't count!!!
close your mouth when breathing, npc retard
>>106452358
>>106452512
I did, and it goes through but then the colors are all weird.
>>106452505
but then you need to make clear you're talking about two very different scenarios
the card with lower VRAM can be 1.5x/2x/3x/4x faster for models that fit in it, which is better then?
no matter what just don't be a retarded baboon like >>106452515
>>106452550
>the card with lower VRAM can be 1.5x/2x/3x/4x faster for models that fit in it, which is better then?
this is moving the goalpost from
>performance uplifts are getting harder to achieve
which got proven wrong, showing that they can be achieved with ease, almost x4 vram
so now adding more arbitrary qualifiers is just cope from a retard that talked big before he got btfod like the retarded baboon that you are
>>106452562
I'm sorry you're too stupid to understand what a controlled variable is.
For both our sakes I concede, you're right, all cards should be tested with a 50 trillion parameter model and nothing else.
I wish you nothing but a good rest of your life.
>>106452546
huh? merge_loras is disabled by default. i don't know why you even touched that
>>106452397
but i only have 16gb vram, ive been told im a vramlet and i need at least an h100 :(
>>106452572
>I'm sorry you're too stupid to understand what a controlled variable is.
yeah, it needs to be a controlled variable after your initial statement got disproven so you can ad hoc rescue your position
>the card with lower VRAM can be 1.5x/2x/3x/4x faster
notice the phrase "can be"? you're imagining a scenario that doesnt need to happen in order to make you right, appeal to possibility is a fallacy, retard
>all cards should be tested with a 50 trillion parameter model and nothing else.
ad absurdum fallacy of something newer said from your brain that is in cognitive dissonance to help you cope
>>106452592
>newer said
never said
>>106451219
Guise help
>>106452618
https://huggingface.co/api/resolve-cache/models/lodestones/Chroma/9540b7a813c3e06ca8eb0f01c25f3e76f931c08e/ChromaSimpleWorkflow20250507.json
>Still no Wan 2.2 VACE
It's not coming, is it?
>>106452592
>yeah, it needs to be a controlled variable after your initial statement got disproven so you can ad hoc rescue your position
performance comparisons require that you control those variables, it's a worthless comparison otherwise
so it was not ad hoc, it was always there you dimwit
>notice the phrase "can be"? you're imagining a scenario that doesnt need to happen in order to make you right, appeal to possibility is a fallacy, retard
I'm sorry, are you denying that this is possible? that if you slap 96GB on a GTX 960 it will beat a 5060? oh right, you don't care about comparing apples to apples, you'd take whatever model size gives the 960 the advantage which leads us to
>ad absurdum fallacy of something newer said from your brain that is in cognitive dissonance to help you cope
do tell us what is the RIGHT model size to test cards with. the upper bound seems to be 50 trillion
>>106452327
did you make the large Watson lora?
>>106452418
Nah man, 1080 ti was better.
>>106452580
I didn't, I swear. I redownloaded the json and looks like it's working fine now. don't know wth happened. Thanks for the help.
truth is joycaption is a weak captioning model and anything trained with it is doomed to have large flaws
>>106452468
>>106452618
Was going to load some stuff up for you guys but catbox is being funky to me right now and won't resolve.
chroma is pissing me off. feels like I have to go through 100 seeds to get something that isn't a body horror
>he fell for it
>he fell for "he fell for it"
>>106452486
No share, no problems. ez pz
>>106452618
anon catbox is down but try this https://iili.io/KfhE4KN.png
>>106452668
Maybe. Why?
>>106452679
Hold on there buddy. You are going to have to upload some of your new gens as payment for the tech support. I look forward to it.
>>106452666
>performance comparisons require that you control those variables, it's a worthless comparison otherwise
no, what matters is how fast a gpu can run X model, arbitrarily saying that more vram doesnt count as an improvement despite it improving performance in all models that couldnt fit before but now do is just false
>I'm sorry are you denying that this is possible?
strawman, i never denied its possible, i specifically said that appeal to possibility is a fallacy, i argued that it doesnt NEED to be the case, and isnt
holy shit you really are a low iq retard that cant follow basic conversation
>do tell us what is the RIGHT model size to test cards with. the upper bound seems to be 50 trillion
there doesnt need to be a specific limit, you made up that requirement, if there is an open source model that is good at something, people are gonna care if that model can fit into X gpu, simple as
given your general retardation and how much you were proven wrong already i will not shit up the thread anymore by allowing your npc brain to pollute it more with low iq fallacies, feel free to continue to cope in the replies without daddys attention anymore
>>106452337
If it's anything like the versions released for 2.1, it's extremely censored.
>>106452709
>Maybe. Why?
what did you use for captioning, it works so well? Just about to do a test run with OneTrainer. also: please give me all your loras
>>106452337
in fact i just realized this trash is still 2.1 trained.
into the garbage can. never going back to wan2.1
With wan, if I want to get genitals working with a lora, is it important that the base gen with just the model generates blank genitals first?
I remember playing with loras in 2.1 and it still generated gore even with loras at high weight a lot of the time. I'm thinking you still have to avoid certain keywords because a lora cannot fix badly mutated genitals.
>>106452740
>what did you use for captioning
InternVL3-8B outputs mostly correct captions. I have it describe the entire scene, lighting, etc, and name the subject - close to the 512 token limit.
>Just about to do a test run with OneTrainer
One of us!
>also: please give me all your loras
I've shared too much already. Plus, there's over 60 and I wouldn't know where to start
>>106452723
>strawman, i never denied its possible, i specifically said that appeal to possibility is a fallacy, i argued that it doesnt NEED to be the case, and isnt
>and isnt
it clearly fucking is? lmao
>there doesnt need to be a specific limit, you made up that requirement
it was an absurd value to make you realize a point but you failed
you're right, you are shitting up the thread with your fallacy fallacy stupidity
>>106452397
Qwen 9:16 is 928x1664 which already seems to help. My usual testing resolution looked worse, that's for sure.
So, 'go big' is the take-away, which most Anons already figured out.
Here's another quick set of gens at 1.15 times the resolution to take the smallest edge over 1024.
Should be expected that an arbitrary resolution like that would kinda ruin the output, didn't expect it to ruin it by that much, though.
>>106452447
I only used OSS for face detailing, not for entire gens. I'll plot some Euler/OSS comparisons. But I'm not all too interested in OSS since I guess the main point is getting faster gens. I don't really care too much about speed if the output is good, but for face detailing in a longer pipeline it kinda makes sense.
In general, I still haven't looked at NAG along with Chroma, so there's still a lot of way to go as far as testing goes. Which kinda sucks because I've been sucked into the Qwen/Wan rabbithole along the way. Feels like leaving an old friend behind, almost.
>>106452709
thanks anon
>>106452588
>at least an h100
Don't listen to Nvidia shills
You can train a lora in as little as 8gb, lora is the great equalizer
>look through old folder
>hours and hours of guy walks in and kisses anime girl gens
I only make them because they are cute.
>>106453058
We are not talking about sdxl, kiddo
>>106453121
You can do a Chroma lora with 8gb, using OneTrainer
They even include an 8gb preset in the trainer
>OneTrainer
>NotImplementedError: Loading of single file Chroma models not supported. Transformer-only safetensor files can be loaded by using the diffusers base model and overriding the transformer.
I can't fit this shit into Windows cache. Why can't I choose the dl directory if it downloads models straight from Huggingface?
>>106453232
Did you clone the whole https://huggingface.co/lodestones/Chroma1-HD or just dl the safetensors? I think you need configs included for OT
If I want to train a LoRA and the subject has a specific body type, should I include both full body pictures and detailed pictures of the face and tag accordingly? Will clothing be an issue?
>>106453270
Just downloaded the safetensor https://huggingface.co/lodestones/Chroma/blob/main/chroma-unlocked-v48.safetensors since I wanted to test training on an earlier version first
>>106453298
The v48 equivalent diffusers-style repo would be https://huggingface.co/lodestones/Chroma1-Base
>>106453322
I don't get it. Does it require transformer files too? I'm just so used to just choosing the base model that I can't wrap my head around this
>>106453377
It needs the model_index.json and for the subfolders to match and also have their own config.json
You could git clone the whole repo (re-downloading everything), or download the .json files and re-create the structure. The OT error says it doesn't support choosing the model as an individual file without the structure around it
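If you rebuild that structure by hand, a quick script can tell you what's still missing. This is a generic diffusers-layout sanity check I'm sketching, not something OneTrainer itself ships; the list of config filenames it accepts is an assumption based on typical diffusers repos:

```python
import json
from pathlib import Path

def check_diffusers_repo(repo_dir: str):
    """Return a list of paths missing from a diffusers-style repo.

    Reads model_index.json to learn which subfolders the pipeline expects,
    then checks each one contains some kind of config file.
    """
    repo = Path(repo_dir)
    index = repo / "model_index.json"
    if not index.exists():
        return [str(index)]
    spec = json.loads(index.read_text())
    missing = []
    for name, value in spec.items():
        # skip metadata keys like _class_name / _diffusers_version
        if name.startswith("_") or not isinstance(value, list):
            continue
        sub = repo / name
        has_config = any((sub / c).exists()
                         for c in ("config.json", "tokenizer_config.json",
                                   "scheduler_config.json",
                                   "special_tokens_map.json"))
        if not has_config:
            missing.append(str(sub))
    return missing
```

An empty return means every subfolder named in model_index.json has at least one config, which is roughly what the OT loader is complaining about when it's absent.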
I have an sdxl dataset with caption files. What LLM will let me turn the booru tags into natural language? Preferably in batches.
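No specific model was named in-thread, but any local OpenAI-compatible server (llama.cpp, ollama, etc.) can batch this. A sketch where the endpoint URL, model name, and prompt wording are all placeholders; only the prompt-building part is meant to be load-bearing:

```python
from pathlib import Path

# Hypothetical instruction; tune the wording for your LLM.
SYSTEM = ("Rewrite the following booru tags as one natural-language "
          "image caption. Keep every visual fact, invent nothing.")

def tags_to_prompt(tags: str) -> str:
    """Turn a comma-separated tag line into an instruction for the LLM."""
    clean = ", ".join(t.strip() for t in tags.split(",") if t.strip())
    return f"{SYSTEM}\nTags: {clean}\nCaption:"

def gather_jobs(dataset_dir: str):
    """Pair each .txt caption file with the prompt to send."""
    return [(p, tags_to_prompt(p.read_text()))
            for p in sorted(Path(dataset_dir).glob("*.txt"))]

# Sending the batch is backend-specific; against an OpenAI-compatible local
# server it would look roughly like this (endpoint/model are placeholders):
#
#   import requests
#   for path, prompt in gather_jobs("dataset"):
#       r = requests.post("http://localhost:8080/v1/completions",
#                         json={"model": "local", "prompt": prompt,
#                               "max_tokens": 256})
#       path.with_suffix(".caption").write_text(
#           r.json()["choices"][0]["text"].strip())
```

Writing to a new `.caption` extension keeps the original tag files intact so you can rerun with a different prompt.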
>>106453106
uwaaaaaa put a smile on my face
>>106453377
No way to screw it up if you:
git clone https://huggingface.co/lodestones/Chroma1-Base
and point OT to that Chroma1-Base folder
>>106453423
a 5090 is still in the VRAMlet caste
>>106452543
A Kia?
>>106453415
>>106453497
TY dudes
>>106453527
nah, but I do think 24~32gb would be more akin to merchants/commoners. knights would be 6000 pro users. aristocrats would be h100/h200 users. anything below 16gb is peasant tier.
onetrainer loras do not work in chromaforge (the superior way to generate for chroma)
>>106453717
how is it superior if it literally doesn't work
>>106453749
chromaforge has hiresfix, so it wins by default.
imagine having to tard wrangle a handful of nodes when grugg press buton for gud imaeg
>muh gimmick upscaling method
>>106453497
nta but as someone who wants to train an assload of loras on onetrainer, why choose base over HD? was HD not further trained on higher res images?
>>106452723
>no, what matters is how fast a gpu can run X model
in that case the 5090 does not have just 2x the performance of a 3090
>>106453828
thats 1 node in comfy and it takes all of 2 seconds to set up. you really dont even need hiresfix with chroma depending on the sampler
>>106453851
anon probably thinks v48 is still better than HD
Hey /g/uise, does anyone have a retard proof tutorial on how to get video generation? I would also like to add voice and lip sync.
Preferably, I would like to use my Mac Mini M4 Pro 64GB but if not I have a 4090.
>>106453851
HD resolves details much better imo. anon asked how to make v48 work and that's what Chroma1-Base ended up being.
>>106452354
>I really like it, apart from the usual hands and body horror sometimes.
>I think genning at higher res (just used the Qwen default for 9:16) really helps.
What resolutions do you recommend for chroma hd?
Got inspired by anon, Chroma run go!
>>106453232
If you git clone (or manually download all files) the https://huggingface.co/lodestones/Chroma1-HD you can put it anywhere and just set the model path in OneTrainer to it.
>>106453298
You still need the Chroma1-HD repo, because the safetensors file is just the transformer
However if you have the Chroma1-HD repo, and want to train on another transformer, put the path to that transformer in the 'Prior model' in the OneTrainer model setting
So first: Chroma1-HD, then under it in Prior Model, put the path to chroma-unlocked-v48.safetensors (or whatever other version you want to use)
>>106454231
I guess that's pretty nice considering one repo takes around ~84gb disk space. Is it a workaround so people can train with under 24gb cards?
>>106454294
what. the HD repo is barely 30gb. even smaller if you delete the safetensor file
>>106454294
>~84gb disk space
??? My Chroma1-HD repo uses ~25gb, you can delete the .safetensors file in the repo if you don't need it since it's not used for training, only for inference (and perhaps you use another version for that, like I do)
But that doesn't account for ~84gb, are you sure you don't have git binary blob directories left from cloning? If so you can delete them.
>>106454313
NTA, but if you git clone a repo, the ".git" folder itself contains a bunch of shit.
for example, git cloning the Wan2.2-T2V-A14B repo is like 235 GB.
Are you >>106453481 /ldg/? Or are you a generate batch sloppa?
>>106454192
Based anon, godspeed!
>>106454342
>the ".git" folder itself contains a bunch of shit.
Isn't that just some cached crap you don't need?
>>106454359
you won't be able to do git pull to update, not that you'd need to since it's a finished model.
>>106454338
>are you sure you don't have git binary blob directories left from cloning? If so you can delete them.
Indeed I had. Thanks anon!
>>106454342
Why the hell does it hoard this crap
>>106454359
Yes, I mean unless you are going to make changes and upstream them, for end user purposes like training etc, just delete them.
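Before deleting, it can help to see how much space those `.git` directories actually eat; a generic sketch, nothing model-specific:

```python
import shutil
from pathlib import Path

def git_dir_sizes(models_root: str):
    """List every .git directory under models_root with its size in GB."""
    results = []
    for git_dir in Path(models_root).rglob(".git"):
        if git_dir.is_dir():
            size = sum(f.stat().st_size
                       for f in git_dir.rglob("*") if f.is_file())
            results.append((str(git_dir), round(size / 1024**3, 2)))
    # biggest offenders first
    return sorted(results, key=lambda x: -x[1])

def prune(models_root: str, dry_run: bool = True):
    """Print each .git dir and optionally delete it (you lose `git pull`)."""
    for path, gb in git_dir_sizes(models_root):
        print(f"{gb:>8.2f} GB  {path}")
        if not dry_run:
            shutil.rmtree(path)
```

Run `prune("/path/to/models")` first to see the list, then again with `dry_run=False` once you're sure you never need to `git pull` those repos again.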
>>106453853
Yeah, for sdxl 1girl stuff it's faster than that. Heck, for that simple stuff I think my 5070ti is faster too, not vram limited there
8gb vram isn't nearly enough for video generation is it
>>106454443
It's doable with quants and a good workflow, I was genning with an 8GB 2070S before I upgraded.
Checking in after 2 months. Is Wan still limited to 5 seconds for no reason?
>>106454462
Not unless you like waiting + dogshit quality. You can technically "do video gen" on 2gb vram if 90% of the model is offloaded to sys ram + light2x lora + torch/mag/tea/nag/easy cache + Q2 quants.
>>106454462
if it is what it was trained on, how is it for no reason?
is Topaz still the upscaler for video?
Which model/models are considered the highest quality and most versatile and general purpose, being able to follow detailed prompts to as high a degree as possible, and able to run on gtx 1070 8gb vram 16gb system ram?
>>106454608
>the highest quality
>on gtx 1070 8gb vram 16gb system ram
what's the best "high resolution fix" for comfy ?
>>106454462
I do 7-8 no problem.
I am trying to chain videos together with end frames, but the color of the next video changes slightly. Is there a way to prevent that?
>everything i gen comes out as slowmo
just one of those days i guess
Jesus christ, I can delete this shit, right?
>>106454608
Perhaps try Q6 or Q8 GGUF quants of wan2.2 and/or chroma/hidream/qwen/sana/[...] via the distorch multigpu loader
Unfortunately I don't know exactly what fits in 16GB system RAM (+much worse, swapping to SSD/HDD). But picking a quant and the offloading in the aforementioned node gives some flexibility.
I'm not sure you wouldn't enjoy SDXL or other smaller models that generate faster, tho.
>>106454746
did you not read the previous posts? yes.
>>106454746
yes, but then you may not be able to update via git.
you actually might just want to switch to a ssd/hdd with the appropriate size for modern bloat
>>106454598
Yes.
>>106454735
I know you can use latent instead, apparently even the last few latent frames (for ex 16) with some custom nodes to give info about movement, but I didn't try so I have no idea outside of knowing it exists.
>>106454742
stop using lightv2x
>>106452511
>>106452499
also curious about the best way to go about this
>>106452478
try hidream, wan (can also be used for images), qwen, chroma
it generally works better there. some may of course not have trained your characters and then for 2 characters at least you obviously still get better results with noob/illustrious if they HAVE trained the characters
>>106454885
>>106452511
NTA, but take a gander at this workflow. It is simple enough to understand.
https://civitai.com/models/1080711/comfyui-regional-prompter-workflow
Wake me up when we get something like Flux or Chroma but working locally and having the braindead image editing of Gemini Banana.
>>106452478
wait for someone to crack NovelAI's character separation technology
>>106454947
instead of sleeping you could be working to get the nano banana weights
>>106454968
>>106454906
>>106454762
>sana
Maybe skip that one. I still cannot believe they made the VAE worse than what SDXL had
>>106454979
cry about it, 'jesh
>>106454762
I'm not interested in generating videos though. mainly high quality realistic images following long and detailed prompts as close as possible with the most amount of context awareness possible. I know it's possible to tweak the workflow by splitting things up to create better context awareness, but I'm interested in a model that is as context aware as possible from the get go.
Right now I'm using SDXL base 1.0 without any loras or anything, and the output is just not very good. It doesn't adhere to the prompt very well and there are often weird artifacts and smudgy stuff. I've tried different samplers and schedulers, and it's just not very good at adhering to more complex/longer prompts.
Right now I'm not using any launch arguments or extensions, and one gen of a 1024x1024 on SDXL Base is taking almost 2 minutes. I feel like it shouldn't take that long even with how low my specs are. Maybe I'm doing something wrong. I'm not sure which arguments I could launch with to make it faster. If I can't get it any faster, I'll have to live with it for now.
But SDXL base is just not cutting it for me. It's just not a good enough standard output that I think it will help a lot with using loras and whatever else to tweak it. I'm feeling there must be some better models than SDXL base at the same level of hardware requirements, maybe some finetuned checkpoints based on SDXL or some other type of model.
Best practices for captioning for style LoRAs for Qwen? I'm training on 19th-century Academic Orientalism paintings. Is it better to caption all the images as "Orientalism paintings", or to use a variety of more generic terms like "painterly" or "realistic oil painting" across the dataset?
>>106455034
I can get everything with qwen image and go for some wan passes.
In open source you can't get better results
Anyone know anything about the audio generators? Anything local that tries to compete with ElevenLabs?
I have not been here for a month.
I have only one question:
Did the Mayli anon deliver? do we have a Mayli NSFW folder?
>>106455130
Been using Chatterbox. Here's Emma reading some Orwell https://voca.ro/1ikmIkpzsLHX
>>106455077
If the pattern is easy for the ai to 'grok', there are arguments for just describing the contents of the image in generic terms, without mentioning style, medium etc; likewise there are arguments for adding those in order to piggyback on the knowledge already in the base model, thus giving faster convergence.
I don't think there's a real consensus, you probably have to decide for yourself and your specific training data.
>PAWG wan lora>use it with lolieeehehehe
>>106455200
Too bad there's no easy answer. I love to experiment but training Qwen takes way too damned long on my 3090. Maybe I'll just do a runpod or Civitai or something.
>>106454715
Hiresfix like auto1111 is just an additional sampler pass at a larger size of your image, so just stick an extra KSampler node between your first pass and image output
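One detail worth keeping in mind for that second pass: the target resolution should stay divisible by the VAE's latent stride. A small helper sketch; the 8-px-per-latent-cell assumption holds for SD-family VAEs:

```python
def hires_pass(width: int, height: int, scale: float = 1.5,
               multiple: int = 8):
    """Target resolution for the second (hires-fix) sampler pass.

    Latent dimensions must divide cleanly, so each side is snapped to
    the nearest multiple (8 px per latent cell for SD-family VAEs).
    """
    def snap(x: float) -> int:
        return max(multiple, round(x * scale / multiple) * multiple)
    return snap(width), snap(height)

# 1024x1536 at x1.5 -> 1536x2304, both divisible by 8
print(hires_pass(1024, 1536))  # -> (1536, 2304)
```

Feed the snapped dimensions to an upscale-latent (or upscale-image) node before the second KSampler; the denoise on that pass is the knob discussed further down the thread.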
>>106455268
>Too bad there's no easy answer.
AI training in a nutshell, I'm afraid. Particularly when you want to reach the best quality.
>>106452178
i tried to do one of Mila Kunis and it took forever and came out shitty.. i have no idea how to do one properly.. i followed some guide online but there's a billion options and no one covers even 1/10th of them
>>106452399
just bought a 5090 last saturday.. noticeably faster than my 4090
>>106455231
makes zero sense, it's a t2v lora
>>106455034
wan for a single frame is an image.
the advantage to this is that wan understands multiple characters and spatial instructions and such much better than SDXL
else hidream, chroma, qwen, ... are also better or much better
>>106455268
Yeah I got it, thanks.
>>106455090
what do you mean with open source? you mean with distorch?
>>106455276
silly naive anon
you know nothing
>>106455290
great explanation anon, thanks
>>106455287
I think he means local
>>106455282
Oh, ok. Thanks. I didn't know that.
I'll try it out.
>>106455160
That's pretty good. Have you tried any of the others?
Where the fuck is the refresh nodes button? I updated comfyui but the UI changed and the button disappeared.
I know the hotkey is R, but I would like to know where it is in case comfy changes the hotkey too
>>106455308
yeah. Sorry. I know what he means now. I got confused. I thought he said can instead of can't.
>captioning art with Gemini
>a handful of the images include women with unshaved armpits
>the armpit hair is always the first thing mentioned in the captions even though it isn't the focus of the image at all
So basically Gemini looks at images exactly the same way I do.
>genning gooning during day
>training when I'm sleep
This shit will fix my fucked up sleep schedule, dang
>>106453613
>>106455318Nope, just chatterbox. Havent had the time
>>106455391kek, but knights would 48gb vram. definitely not the same power as the king himself
>>106455268What denoise value do you recommend in the second ksampler?I've gone 1024x1536 -> x1.5 -> denoise 0.5
>>106455378more like training when I'm sleep + at work. my computer has been on for 2 weeks now working non-stop at max gpu. sometimes I queue up 30x wan gens during work and get notifications to my phone when each is finished. they're like little treats.
>>106455160
That's pretty good compared to all the meme ones we were doing with that online service a few years ago. I listened to the whole thing and didn't hear any errors or weirdness. Maybe a little too monotone if anything. I really should look into audio, make my own asmr sleepy time shit
>>106455443
This. I thought genning would be my huge time sink, turns out it's training that drives the majority of my interest.
>random pic of girl at grocery store
>wan2.2-i2v-high-oral-insertion-v1.0
>wan2.2-i2v-low-oral-insertion-v1.0
>old photo of crush
>oral-insertion-v1.0
>comfy dev
>oral-insertion-v1.0
>>106455425
There's no "correct" number, you gotta futz with it. One value might be fine for one image and give you flesh monsters for another, or if it's not denoised enough then it'll just be splotchy. Too much and it'll change the image too much, and so on. Plus, it depends on your sampler and your model and your seed and blah blah blah. Like everything else in this forsaken hobby, you have to roll it like a dozen times just to see.
>>106455488
Just tried. I was using chroma because nsfw; 0.5 gives a blurry mess, 0.7 seems better. I think I'll try again with 0.8-0.9.
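If you want to sanity-check what a given denoise value actually buys you in that second pass, here's a minimal sketch. To be clear, this is not ComfyUI's internals: it assumes the diffusers-style img2img convention where denoise (strength) d at s scheduled steps actually runs roughly round(s * d) of them, and `second_pass_plan` is a made-up helper name just for illustration.

```python
def second_pass_plan(width, height, upscale, steps, denoises):
    """Return (new_w, new_h, [(denoise, effective_steps), ...]) for a
    hires second pass. Effective steps use the diffusers img2img
    convention: round(steps * denoise)."""
    new_w = int(width * upscale)
    new_h = int(height * upscale)
    plan = [(d, max(1, round(steps * d))) for d in denoises]
    return new_w, new_h, plan

# The 1024x1536 -> x1.5 example from the thread, sweeping the values discussed:
w, h, plan = second_pass_plan(1024, 1536, 1.5, steps=20,
                              denoises=[0.5, 0.7, 0.8, 0.9])
print(w, h)   # 1536 2304
print(plan)   # [(0.5, 10), (0.7, 14), (0.8, 16), (0.9, 18)]
```

The takeaway: at low denoise the model barely gets any steps to repaint the upscaled latent, which is one plausible reason 0.5 comes out blurry while 0.7-0.9 looks better.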
>>106455482
oooooooooooo he's COOKIN now!
>wonderbread
>oral-insertion-v1.0
>>106455482
Qwen image edit and Wan2.2 certainly fucked up my mind. So I am not the only one.
>>106455136
nope, hasn't posted for a long time either.
>>106455572
>Qwen image edit
no nsfw so not sure how
>>106455572
qwen image edit? don't you mean chroma. qwen sucks at realism.
What are people using for character consistency nowadays? Is it QIE or something else?
>>106455623
>What are people using
alcohol
what to use to inpaint realistic genitalia?
I'm way too used to convenient tools for anime styles
>>106454350
>>106453851
HD is more temperamental to prompt with and struggles with smaller resolutions. I'm not at my PC but I have some comparisons I could make once I am.
>>106454443
I get by on q8 with ram offloading just fine, and if you're less picky q4 is as low as you should go. You won't be outputting huge resolutions but it's doable.
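For anyone wondering why q8 needs RAM offloading in the first place, here's the back-of-the-envelope arithmetic. This is a hedged approximation: it assumes quantized weight size is roughly params * bits/8, and ignores GGUF per-block overhead, activations, and the text encoder, so real files run a bit larger.

```python
def quant_gib(params_b, bits):
    """Approximate weight size in GiB for a model with params_b billion
    parameters stored at the given bit width (params * bits/8 bytes)."""
    return params_b * 1e9 * bits / 8 / 2**30

# Rough sizes for a 14B-parameter video model (e.g. a Wan-class DiT):
for bits in (16, 8, 4):
    print(f"q{bits}: {quant_gib(14, bits):.1f} GiB")
```

So q8 of a 14B model is ~13 GiB of weights alone, which is why it spills past a 16GB card once activations are counted, while q4 at ~6.5 GiB fits with room to spare at the cost of quality.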
>>106455722
Nice nightmare fuel
Picked up the Giger style well, combining airbrush with sharp acrylic painting
>>106455755
It's not bad since I just ran it with default values. Need to increase rank and learning rate.
>>106455780
Yeah it looks good. If you're using OneTrainer, the best lora preset IMO is 'blocks'. It's a bit slower than 'attn' or 'attn,mlp' but it picks up more detail at the same number of trained epochs. I wouldn't do 'full' though, it doesn't seem to have any benefit over 'blocks' and is slower.
Overall when it comes to art style training, I really like how Chroma picks up the 'texture', particularly when compared to Flux dev
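Rough idea of what those presets mean, for anyone who hasn't poked at them: they decide which module names get LoRA adapters attached. The sketch below is an illustrative assumption, not OneTrainer's actual filter strings or module names — just substring matching over dotted module paths to show why 'blocks' covers more (attn + mlp + norms) than 'attn' alone.

```python
# Hypothetical layer-filter presets; real trainers typically use regexes.
PRESETS = {
    "attn": ["attn"],
    "attn,mlp": ["attn", "mlp"],
    "blocks": ["blocks"],  # whole transformer blocks: attn, mlp, norms
}

def select_modules(module_names, preset):
    """Keep modules whose dotted name contains any of the preset's patterns."""
    patterns = PRESETS[preset]
    return [n for n in module_names if any(p in n for p in patterns)]

modules = [
    "blocks.0.attn.qkv",
    "blocks.0.mlp.fc1",
    "blocks.0.norm1",
    "final_layer.linear",
]
print(select_modules(modules, "attn"))    # ['blocks.0.attn.qkv']
print(select_modules(modules, "blocks"))  # the three block modules
```

More targeted modules means more trainable parameters per step, which lines up with 'blocks' being slower but picking up more detail at the same epoch count.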
>>106455780
>that wasn't a microdose
>>106455482
>this post
>oral-insertion-v1.0
https://file.garden/aIdN6xfH0QVghCy0/thispost/thispost.mp4
>>106455809
Thanks for the tip, I'll check the blocks setting. I wanna try if it works with lycoris+came+cosine.
>I really like how Chroma picks up the 'texture'
That seems to be the case. I'll upload this lora once I've retrained it
>>106455812
Giger never took a microdose of anything!
>>106455866
Not sure about lycoris/lokr support in OneTrainer, never used it, but CAME is supported
>>106455866
this looks like herpes
is there a reliable way to make sure wan gives female characters flat chests or small breasts? Specifically clothed characters that don't already have their breasts exposed in the input image.
Too many times wan gives the girl big boobs when she is supposed to be flat/petite. I've tried various different forms of conditioning in the prompt but nothing is consistent.
>>106455866
Here you go Gigerbro
https://files.catbox.moe/8n4c3d.mp4
>>106454947
WAKE UP BITCH
https://files.catbox.moe/3oic8d.mp4
>>106455959
This lora must have been downloaded more than twice at this point, right?
>>106455939
I'm purely guessing here, but they probably either didn't train that or maybe even trained against it, because for all the SDXL base models flat chest was basically the cheat code for ToT. And they don't wanna make it easy to do that sort of thing with a realistic model.
>>106455866
>>106456029
>>106456031
>>106456044
based gens
>>106455881
>Giger never took a microdose of anything!
I bet he did after the Aliens 3 clusterfuck
>>106455908
It has lycoris support and 8bit came, should be doable. Just have to figure out where to put extra arguments if it even needs them
>>106456044
New Space Jockey anime when? Untapped potential!
>>106456110
>I bet he did after the Aliens 3 clusterfuck
More like a gigadose, maybe even a mega pint
Nice gen
>>106456044
This is me btw
>>106456029
>vore gens in my /ldg/?
More likely than you'd think
a*****dio will redeem
>>106456161
ty!
does anyone really think they could keep up with nvidia, who earn enough to invest 10x as much in research as all competitors worldwide combined?
>>106456499
Too much money on the table to let Nvidia walk away with it all.
>>106455422
surely the king would be a B200 at 180GB VRAM
>>106452740
Here's one more https://gofile.io/d/W8rk7u
>>106454192
Flip those bits anon
>>106455270
>it took forever and came out shitty
Unfortunate. I usually bake 8-10 hours.. unless low image count, then 100 epochs.
>>106455722
>>106455780
>>106455866
Nice
>>106456031
stop making everyone suck dick :(
>>106456029
>>106456751
holy shit real captain falcon
>>106456750
Seek medical help for your limp dick
>>>/vt/104115369
Can I use AI somehow to remove the demented watermarks the schizo AI-anti artists use?
>>106456741
>Here's one more https://gofile.io/d/W8rk7u
Thanks for sharing. celebrity?
>>106456884
can easily be done with flux kontext or qwen image edit. kontext seems easier to wrangle for this purpose but it produces jpeg-like artifacting, pic related
why has diffusion gotten less popular lately? there seems to be a large dropoff in interest. probably a lot of things: models being too big, the edit models not being a bigger deal than what we ended up getting, the software committing seppuku every update or breaking itself with memory leaks. what do you guys think?
>>106456971
i guess qwen edit works just as well or better
>>106456971
>>106457055
How do you prompt the edit models? Legit boomer prompting like "remove xy, do wz, turn a into b"?
>>106456971
>>106457055
can they remove "adversarial noise" like this?
>>106456750
>stop making everyone suck dick :(
Everyone and everyTHING will submit to BWC, as nature and God intended.
https://files.catbox.moe/ziriy0.mp4
>>106457055
for that it was just "remove the watermark in the bottom right and on the girl's shoulder". tabbing between the two it's clear that qwen did a better job at removing the watermark and the colors are more vibrant.
>>106457106
he can't keep getting away with it
>>106456988
the reddit community seems to be going apeshit with it. I think the dopamine hits you get from genning are so far off the charts, way higher than we could have bargained for. I think that if you ever got into genning, not genning anymore probably isn't a concept that exists for you.
>>106457138
reddit is going nano bananas but a lot of that is also bots shilling it. I don't think there is much interest in local diffusion
>>106457157
yeah, I'm sad to say genning has completely taken over as my main hobby. i haven't done fucking anything else besides ai gen for the past 6-7 months. not remotely close to 'burning out'. still so many concepts, ideas and things to try, it feels near infinite.
>>106457098
yerp. I didn't even realize what you meant at first until I zoomed into the image. I just prompted "remove the watermark in the top right and bottom right" and it drew the image without the noise.
>>106457113
meant for >>106457082
>>106457157
It's what people that learn to draw have always experienced
>>106457157
I'm jealous. I seem to be a creatively bankrupt person incapable of producing original ideas. I frequently get really excited about some ideas or settings but when it comes to actually creating something I draw a blank. I learned so much and have gigabytes of catboxes but I mostly just produce slop every now and then, and not even that anymore.
>>106457126
i'd personally do this but sadly I am in the middle of doing a wan gen
>>106457151
The closest thing to organic on reddit is the hustlers trying to monetize the waves of astroturfing. The answer is much simpler in that all the moneyed interest has shifted to video. It's a newer set of keys to jingle in front of increasingly skeptical investors. Image generation is not very useful, is saddled with big lawsuits, and is viewed negatively by the public. But hey, it's great for making weird porn.
>>106457225
I don't see how video would be more enticing for said investors if they already understand the limitations of image gen, it has a lot of the same issues for them if not more
>>106456988
Diffusion has taken a backseat to video generation, this is 100% expected since video has such a broader reach.
That said, the endless possibilities of i2v mean diffusion will remain in wide use.
Any hope for a NSFW lora/finetune for Qwen image edit, or no? The licence is much better than Kontext's, but a lora won't cut it if the model has 0 nsfw trained in, will it
>>106457243
Yeah, but investors are also basically stupid animals that act entirely on emotions, hence Nvidia stock dropping every time a shitty AI company gets bad press.
>>106457157
Same, except I'm not sad to say it. I've practically stopped gaming, and I've totally stopped doing stuff in Blender (unless I'm doing something to use with AI). It's a seemingly endless journey into what you can make this technology do, and as soon as you think it's reached a plateau, something like Wan drops and you're just like WTF?
>>106457243
it's new. not really, but it's still got the wow factor, for now
that's what this whole industry is propped up by, potential
>>106457261
It's not really practical with how big and slopped the model is. Chroma only exists because of a big community funding effort and one autistic guy, and schnell was a much smaller model.
>>106457283
I wasn't even done exploring SDXL. In fact I was doing some nice art that people in a discord I'm in enjoyed, but I don't even have time to do SDXL art anymore because WAN has taken over. I really need dual GPUs so I can do the art stuff while WAN is processing.
However I'm not dumb enough to blow my money on another GPU yet. I still need to max out my Roth IRA for the year and reach my investment goals for the year. Finances always come first.
>>106457225
>Image generation is not very useful, is saddled with big lawsuits
Same will be true for video, but it's certainly harder to monetize image generation than video generation; the latter has a much broader appeal.
I don't think services like Dall-E have a chance of making a profit even if you don't count the initial cost, they're just company tech demos.
>>106457126
you were asking for it
https://files.catbox.moe/fn03ja.mp4
>>106457293
Welp, guess I will just use it to get more pics for a character dataset, and remove watermarks then
>>106456890
Kreuk
>>106457342
Smallville sex cultist
>>106457342
>kristin kreuk
based anon
>>106457330
The studios have huge dicks to swing around while also having dumb suits that want to train their *own* models. It's just impossible to monetize this stuff in its current form in any way.
>>106457248
while this is true it's not what I mean. yes I see more YouTube AI vid grifting but I don't see any wider interest in people doing it themselves. when wan came out the threads were moving fast but nowadays it just seems like people are burnt out
I don't think they will see any money until they can make entire movies that look good
>>106457379
>nowadays it just seems like people are burnt out
I think it's what you brought up before. everyone got obsessed with trying to make it faster for so long but updates and snake oils made it extremely frustrating for a prolonged amount of time. comfyui is definitely worse than it was a year ago, forge is dead, wan2gp is a bit limited and anistudio is updating at a snail's pace
bit of a niche here, but anyone got/know a wan lora for just the body teleporting out of their clothes? or maybe one where just the body turns invisible? trying to experiment with living clothes
>>106457379
These threads have been much deader in the past. This is an expected lull after a release, and even then it still moves decently fast. That being said, I think video really is the future. Wan 2.2 is already excellent at single frame generation. I think we're only scratching the surface of what's possible with video at this point. I sometimes run my gens through video models to get more interesting and natural posing.
I need a couple links for AnimateDiff model download, as well as a simple workflow
Thanks in advance
>>106457379
This stuff is too technical for 95% of people and 4chan is slowing down a lot on its own for other reasons. It's definitely a niche space.
>>106457379
>seems like people are burnt out
More like genning stuff you can't share online, as in hot real world people in NSFW gens.
You can't share that online since it's illegal, for good reason, but I'm thinking a LOT of gpu time has been spent on that since Wan came out.
>>106457433
>AnimateDiff
Halt. This is a suspicious request. gonna need to ask why you need that model.
>>106457433
no one uses animatediff. that's just the default filename for the video combine node
>>106457437
Wan truly is wantastic. If you're willing to spend the time on the gen, the outputs are genuinely usable depending on what your use case is. Like if you animate sprites right now, you're eating good.
We really are about to go a whole year without any new big anime model since noob, aren't we
Weeb models are really getting outdated
>>106457379
Everyone was hyped about wan, and although it did deliver it wasn't on the level and quality everyone imagined and had expected. That's why it died down over time, when everyone realized it was just average slop.
>>106457455
lel
>>106457455
Wan sucks because it doesn't fit in <my current VRAM size> and/or is too slow on <my current card>
>>106457439
r-reasons
>>106457455
>it wasn't on the level and quality everyone imagined and had expected
Huh? It's genuinely fantastic?
Anyone tried booru captions with a Chroma lora? Should work since it was trained on that, but I'm sceptical.
>>106457455
No, wan really is pretty good. I just think that it takes a little more effort than most people anticipate. Diffusionists are crackheads conditioned to instant gratification. As soon as something takes more than 5 mins to get results, forget aboud it
>>106457489
>I swear bro 4 steps is more than enough
>>106457489
Patience is a virtue. My WAN2.2 gens take 2 hours to complete (80 steps, no optimizations or lightx2v lora). I've got no problem waiting for quality.
>>106457471
>>106457478
>>106457489
I said it did deliver, and I am happy with what it can do, but it's not on a level where I would say it's perfect like everyone expected.
It still can't do anime well, still has issues with spatial directions, still has noisy slop quality.
>>106457512
what is this mental illness called?
>>106457529
>spend 2 hours baking a prompt
>get trolled by your lora adding in something completely different in the middle of it
>>106457530
>>106457530
>>106457530
>>106457530
>>106457529
yeah, it has many times in the past. thankfully the high noise (the motion) only takes about 15 mins to finish, so if something looks wrong in the high noise I can just cancel and not waste 2 hours. This was a huge problem in wan2.1 where you had to wait for the entire thing to finish
real bake
>>106457557
>>106457557
>>106457557
>>106457557
>>106457512
>Nothing like a good smoke after railing Meg
Based Herc
>>106457532
>AniStudio in the OP
18 star github repo.. is this a troll bake?
>>106457532
>>106457560
Begun, the bake wars has
>>106457563
a butthurt bake, it's funny though, I expected them to remove comfy's link out of spite.
>>106457510
I get annoyed that the current workflow I'm using takes 4 mins a gen, would rather get it down to 2 mins. Then again mostly I just want fast slop to fap to or post, same as I did for image gens.
>>106457578
>butthurt
the second bake looks like the butthurt one to me desu
>>106457588
why don't you just fly to julien and suck his cock?
>>106457597
concession accepted
>>106456029
prompt?