/g/ - Technology


Thread archived.
You cannot reply anymore.


File: collage.jpg (1.64 MB, 3814x1984)
1.64 MB
1.64 MB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107145378

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1751284365805507.png (1.73 MB, 1152x896)
1.73 MB
1.73 MB PNG
>>
From "Localsong" + a lora:

https://voca.ro/1cbIetpoY6Gv

I am telling ya, this shit has potential
>>
File: no lora vs lora.jpg (1.67 MB, 1968x2528)
1.67 MB
1.67 MB JPG
>>
>>107154861
based
>>
File: SDXL_00012_.jpg (469 KB, 1984x2480)
469 KB
469 KB JPG
blessed bred
>>
>>107154883
that's no language I've ever heard, sounds like gibberish
>>
File: 1762099606807161.png (2.4 MB, 1152x896)
2.4 MB
2.4 MB PNG
>>107154885
thanks. any OCR or VLM anons want to see if their model can read these?
>>
File: ComfyUI_temp_qmfoy_00048_.jpg (1.14 MB, 1824x1248)
1.14 MB
1.14 MB JPG
https://files.catbox.moe/9egs1f.png
>>
File: Bloodborne..jpg (679 KB, 1366x768)
679 KB
679 KB JPG
What program/model do I use to gen cool landscape images?
>>
>>107154891
Who cares when the melody sounds cool
Modern music is garbage precisely because artists put way too much emphasis on the lyrics
>>
File: ComfyUI_temp_qmfoy_00003_.jpg (823 KB, 1824x1248)
823 KB
823 KB JPG
https://files.catbox.moe/cqg2n9.png
>>
For those who missed it:
https://github.com/Lakonik/ComfyUI-piFlow
https://huggingface.co/spaces/Lakonik/pi-Qwen
https://huggingface.co/Lakonik/pi-Qwen-Image
https://huggingface.co/Lakonik/pi-FLUX.1
>>107154174
>Ok this thing is kind of insane. I made a workflow to compare it with normal Qwen, and it's basically the same level of quality while taking less than 10% of the time. Works out of the box with loras also. In fact, with a custom lora on a mediocre quality dataset, the results are arguably better with this thing at 4 steps. It is partially counteracting the shitty quality of my dataset. Absolutely the new meta for using Qwen, it will be impossible to go back with how fast it is.
>>
File: SDXL_00013_.jpg (501 KB, 1984x2480)
501 KB
501 KB JPG
>>107154886
>>
File: ComfyUI_temp_qmfoy_00043_.png (3.63 MB, 1824x1248)
3.63 MB
3.63 MB PNG
>>107154908
You can try regional prompting, so that one region of the image follows one prompt and another region follows a different prompt (conceptual sketch at the end of this post). You can also try inpainting

----
https://files.catbox.moe/2zhb62.png
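For the curious, a toy sketch of what regional prompting boils down to conceptually (not any specific node's implementation): the model's prediction under each prompt only applies inside that prompt's mask.

import torch

# Toy illustration of regional prompting: blend two noise predictions by a spatial mask.
# eps_a / eps_b stand in for the model's output under two different prompts.
latent = torch.randn(1, 4, 64, 64)
eps_a = torch.randn_like(latent)           # prediction conditioned on prompt A
eps_b = torch.randn_like(latent)           # prediction conditioned on prompt B

mask = torch.zeros(1, 1, 64, 64)
mask[..., :, :32] = 1.0                    # left half of the image follows prompt A

eps = mask * eps_a + (1.0 - mask) * eps_b  # combined prediction for this denoising step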
>>
File: ComfyUI_00018_.png (935 KB, 832x1216)
935 KB
935 KB PNG
>>107154918
>20s qwen gen
not bad, i would still give a little denoise with something to tidy it up.

if you gen with qwen then do wan denoise, where do you even post that on civit?
>>
>>107154937
>>107154904
how long do your WAN gens take anon?
>>
>>107154937
I just wanna go:

>landscape, big castle, atmospheric, dark clouds, lightning, mountains

What does that?
>>
File: ComfyUI_temp_qmfoy_00028_.png (3.28 MB, 1824x1248)
3.28 MB
3.28 MB PNG
https://files.catbox.moe/e3dk4s.png
>>
File: SDXL_00017_.jpg (460 KB, 1984x2480)
460 KB
460 KB JPG
>>107154920
>>
File: ComfyUI_00033_.png (1.2 MB, 832x1216)
1.2 MB
1.2 MB PNG
>>107154918
>6s flux gen with 4steps
>>
>>107154943
Takes about 3 mins per generation. The workflow I use has an upscaler that basically generates the image twice

>>107154944
Hmm, I see. Any image generator can do that. I thought you were going for a specific composition, etc

https://files.catbox.moe/tcgxrp.png
>>
>>107154883
alright, i'm gonna give this a try with some instrumental tracks and see what happens. this was convincing, lyrics aside (i know the page said it wasn't trained on lyrics)
>>
>>107154972
you got the patience for that? asking coz I don't. I can gen a 540p WAN video at least twice in that time. I know your gens are super good. it's just too long I feel.
>>
>ram prices skyrocketing
>rumors of 5000 series supers being delayed

bros... I'm about to give in. I'm tired of waiting. Should I buy a used 3090 or 5070ti? they are about the same price
>>
File: ComfyUI_temp_qmfoy_00023_.png (3.74 MB, 1248x1824)
3.74 MB
3.74 MB PNG
https://files.catbox.moe/sp4jkj.png

>>107154981
Yeah, I actually set up a bunch of them in a row and then go eat a snack or something, lol. Thanks for the compliment btw.
Also: You can cut generation time by half by skipping the upscaler/upres part of the workflow
>>
>>107154997
I'll give it a shot. I haven't reached your level of realism till now.
>>
File: wbkfmb.png (3.38 MB, 1824x1248)
3.38 MB
3.38 MB PNG
>>107155004
WAN is perfect to recreate the "modern digital" photography style, that you see with most photojournalism and some photographers
Also, it has pretty much perfect anatomical precision, but adding loras (e.g. porn loras) decreases this precision

https://files.catbox.moe/wbkfmb.png
>>
>>107155038
oh yeah the military ones look damn good.
>>
File: ComfyUI_00095_.png (1.58 MB, 1216x832)
1.58 MB
1.58 MB PNG
https://files.catbox.moe/rc3h45.png
>>
>>107155038
can you do images like this but with bikini thighhighs girls?
>>
File: ComfyUI_00009_.png (1.77 MB, 1216x832)
1.77 MB
1.77 MB PNG
>>107155050
I can, but I don't wanna get the banhammer. Also, I don't have access to the 5090 I use to generate the imgs rn.
I'll post some NSFW next post. I'll just post the catbox link, i won't upload the img in the thread
https://files.catbox.moe/3jpm5w.png
>>
>>107155050
+1
>>
File: ComfyUI_00024_.webm (958 KB, 480x768)
958 KB
958 KB WEBM
>>107154920
nta making the other wan gens
>>
>>107155050
>>107155072
I don't have access to the 5090 I use to generate images rn, sorry. The porn images I do have are mostly artsy-fartsy ones
>>
>>107155050
>>107155069
This gen is a rare one made in the "digital photojournalistic" style that I have on hand rn

https://files.catbox.moe/lei0s5.png
>>
>>107155050
>>107155069
An example of my typical "artsy fartsy" gens. lmk if you guys want more

https://files.catbox.moe/y93k43.png
>>
File: 3641827738.png (589 KB, 1216x832)
589 KB
589 KB PNG
>>
>>107155166
can you generate feminist protesting free nipples or something feminist but are actually hot babes with big tiddies in underwear and wearing thighhighs?
>>
>>107155178
Yess ofc definitely!
>>
man, all these dit models kinda suck. was raping ram really worth having nlp? everything was just fine if not better when we used controlnets and ipadapter. edit models were a mistake
>>
File: 1752442757507484.jpg (804 KB, 2048x2048)
804 KB
804 KB JPG
>>
>>107155069
Nice Redditor Gold there, kind stranger!
>>
>>107155178
i too would like more
>>
File: tmp7vfgm7y6.mp4 (1.87 MB, 832x576)
1.87 MB
1.87 MB MP4
>>
File: lora_00033_.jpg (367 KB, 1336x912)
367 KB
367 KB JPG
>>107155166
>>107155178
These are great
>>
File: WAN2.2_00472.mp4 (3.95 MB, 872x592)
3.95 MB
3.95 MB MP4
>>107155048
>>
>>107155234
What track is this?
>>
>>107155240
le circuit de wan
>>
>>107155240
this is going to be the first playable "world simulator" game. just an infinite race track. probably releasable by someone like deepmind right now
>>
>>107155217
>>107155190
https://files.catbox.moe/32hb6v.png
>>107155188
can't, sorry. this machine can't gen imgs
>>
>>107155225
Thanks a lot, fren!

>>107155234
>>107155222
Awesome gens, fren! Loved how the lead car went to the F-Zero shield recharge strip at the end there, lmao

>>107155240
Reminds me of the start/finish line from Imola, but it's not any particular track
>>
File: ComfyUI_00442_.jpg (30 KB, 1216x262)
30 KB
30 KB JPG
Fencing duel gens, complete pic(s) in the catbox
https://files.catbox.moe/10dpcm.png
https://files.catbox.moe/9g7xb8.png
>>
TW: suifuel (contains happy couple)
https://files.catbox.moe/ngt115.png
>>
last one for now, gtg work. another duel, this time to the death
https://files.catbox.moe/y7jlxy.png
>>
Blessed thread of frenship
>>
>>107155204
recipe for this bread?
>>
>>107154958
Does it work with Chroma since it supports Flux?
>>
>>107155437
try it and find out
>>
File: 1742420329343968.png (287 KB, 635x563)
287 KB
287 KB PNG
>>107154896
>>
Sega Genesis Sonic-style track on "LocalSong":

https://voca.ro/13U9LKll5na4

Things got a bit bad in the end, but overall pretty good
>>
File: ComfyUI_00060_.png (1014 KB, 832x1216)
1014 KB
1014 KB PNG
>>107155437
>60s with (30s -> face detailer), 12steps using 8step lora. no dice on chroma, it has hardcoded qwen and flux in the loader
>>
Need a wan lora from the Tylers poop festival video
>>
>happily gen some cute anime 1girls at the start of the year
>look away from the screen for a moment
>Huge fucking pile of optimizations happen
I feel like unless you're keeping up with this daily, you're just hopelessly left behind, because it's impossible to find information on whatever sage attention and these other -attention fixes are, how to use them, or what they're for; it all gets buried under a sea of new or conflicting information.
>>
>>107155866
that would be the case if anyone used said optimizations. unless it's merged into mainline comfyui, most of the good optimizations (both for speed and quality) just get ignored/forgotten.
>>
>>107154100
>>107154342
Nope, doesn't build with downgraded toolkit:(
Yaps about nvcc not existing after idling for half an hour. I guess the other anon who warned about incompatibility was right.
Gonna wait™ for official support or make a separate docker for it later.
>>
File: 1552572011.png (1.07 MB, 1152x896)
1.07 MB
1.07 MB PNG
>>
What do you want the most for a local model?

https://poal.me/7udx6s
https://poal.me/7udx6s
https://poal.me/7udx6s
https://poal.me/7udx6s
>>
>>107156022
anyone voting for anything other than video is retarded, images are already mostly there, the biggest thing we still need there is an edit model without a vae, while video has a long way to go in comparison
>>
>>107156045
>anyone voting for anything other than video is retarded
*or vramlet
>>
>>107156045
yep this was my take too
>>
Retards rise up
>>
>>107156045
Video models are less suitable for prompt alignment for a single frame
>>
>>107156045
I'm excited for video because I know video brings audio along with it immediately as well. ASMR, braps, sound effects, short dialogue lines, memes, swears and so much more get solved before we even get a good standalone text-to-audio model
>>
You know deep in your hearts that you will not be able to run Sora 2 grade stuff without 48gb vram and waiting 10+ minutes per video even with distillation and quants
>>
>>107154918
>ctrl f "edit"
>zero results
does it work for qwen-e
>>
>>107156130
correct, we will have something much better than dogshit sora lol
>>
>>107155799
Lame ty. Glanced at the code and it seems like there's a few places that would need adapting
>>
>>107156157
I am an openai hater as well, but come on anon, let's not cope that way
>>
File: sora 2.png (293 KB, 549x617)
293 KB
293 KB PNG
>>107156170
toy model for memes whose only great thing is the fact that they trained on the entire youtube dataset, without that its literally worse than wan 2.2
>>
>>107155614
well it got the genesis instruments right for sure
>>
>>107154918
Loaded this up and I'm getting 20 second Qwen gens even with my shitty setup, what sorcery is this
>>
>>107156269
vram?
>>
What is the current meta lora for speeding up wan 2.2 14b i2v?
>>
>>107156269
16GB, RX 9070 XT.
>>
>>107156194
It's still superior to any open video model in existence by a country mile, and that will remain true for a long time. To this day, there isn't a single local model that can pull off some of the stuff that dalle3 could in 2023
If you cherrypick things, Wan does mangled outputs just as often
>>
>>107154918
does it work with gguf?
>>
>>107156310
>If you cherrypick things, Wan does mangled outputs just as often
not by a mile
sadly for you, the apicuck model cant be tested 1:1 with local because its locked into a chastity cage, like all who shill for it
>>
>>107156335
>sadly for you, the apicuck model cant be tested 1:1 with local because its locked into a chastity cage, like all who shill for it
You do realize there are other possible prompts other than porn and politically incorrect stuff, right? So yes, they can be compared
>>
>>107156282
Let me be more clear.
Apparently I am still using this from 3 months ago:
https://huggingface.co/lightx2v/Wan2.2-Lightning/blob/main/Wan2.2-I2V-A14B-4steps-lora-rank64-Seko-V1/high_noise_model.safetensors
Is this:
https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main
Or anything else better than it?
>>
>>107154918
>uses own ksampler
>uses own model loader
INTO THE TRASH IT GOES
>>
>>107156393
NTA compared =/= 1:1
>>
>>107156393
Wow. I didn't know that. You're telling me now for the first time
>>
>>107156485
You're welcome anon. It's enlightening indeed to know there are more prompts other than "1girl big bobs and vagene", who would have guessed!
>>
>>107156458
There also seems to be a moe distill lora...
>>
>>107156509
damn, gotta step my game up, i mean imagine a 1girl with smal bobs... it got my creative juices flowing
(and unretarding for a minute: curious how to set up those matrix comparison grids people post every now and then, since those can be scripted, i think?)
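something like this is probably all it takes, just compositing saved gens with PIL; assuming the gens were saved with the axis values in the filename (names below are made up, adjust to your own):

from pathlib import Path
from PIL import Image, ImageDraw

# Minimal XY-grid compositor: rows = samplers, columns = CFG values (example axes).
samplers = ["euler", "dpmpp_2m"]   # assumption: gens saved as <sampler>_<cfg>.png
cfgs = [3.5, 5.0, 7.0]
cell = 512                         # size of each cell in the grid

grid = Image.new("RGB", (cell * len(cfgs), cell * len(samplers)), "white")
draw = ImageDraw.Draw(grid)
for r, sampler in enumerate(samplers):
    for c, cfg in enumerate(cfgs):
        img = Image.open(Path(f"{sampler}_{cfg}.png")).resize((cell, cell))
        grid.paste(img, (c * cell, r * cell))
        draw.text((c * cell + 8, r * cell + 8), f"{sampler} cfg={cfg}", fill="red")
grid.save("xy_grid.png")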
>>
>>107156523
There also seems to be v1030 that got deleted
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_4step_lora_v1030_rank_64_bf16.safetensors
I don't expect a wall of text spoonfeeding me strengths and weaknesses of all but just what are anons here using in their daily gens?
>>
>>107156157
we still don't have DALL-E 3 at home, stop coping
>>
what's a good free software for managing gens? preferably one that shows the metadata like prompts. I'm getting to have too many. bonus points if it does wan too, though idk if it actually has metadata yet. I only just started with that
>>
File: 1747494964850870.png (489 KB, 836x284)
489 KB
489 KB PNG
>>107156736
correct yet again, we have something much better than dalle 3, the possibility to train a lora on anything you want and generate with any parameters you want with no limits, including training a dalle 3 style lora itself like picrel
>>
>>107155204
It upsets me that I can't reproduce this solid vectorized style.
>>
nano banana 2 is too good
its over for local
>>
>>107156761
lora https://civitai.com/models/2093591
>>
File: 1754450011443883.png (1.29 MB, 768x1344)
1.29 MB
1.29 MB PNG
>>
>>107156772
The better the proprietarycuck edit models get, the better the outputs the new qwen image edit model can be trained on. thanks for spending millions so local can snatch it all up for free and train a clothes remover lora within a couple of hours lol
>>
>>107156804
based
>>
>>107156761
it's not about the style, or any specific thing object/concept, retard
that you thought it was tells me all I have to know about your intellectual level, you don't understand what dall-e 3 has that local still has not and you never will understand because you're a moron
>>
>>107156845
>no argument
oof, thanks for conceding
>>
File: ComfyUI_11606.png (3.02 MB, 1280x1600)
3.02 MB
3.02 MB PNG
>>107156462
This. I can't fucking use this in my workflow. I needs my snake oil!
>>
File: 1761864588001962.jpg (476 KB, 1264x1656)
476 KB
476 KB JPG
>>107154826
>not collaging the real braphog
>>
File: nano banana 2 map.png (1.84 MB, 1408x768)
1.84 MB
1.84 MB PNG
>>107156772
It still can't do maps. (Courtesy of some plebbitor.)
But yes the whiteboard math equation stuff is impressive.
>>
>>107156335
>not by a mile
No local model can gen multiscene videos WITH audio at the same time, so yes, nothing local comes close to it currently

The closest thing to it is this Wan fine-tune for multiscene, which has no audio:

https://holo-cine.github.io/


(and I haven't seen any anon use this)

Apparently they will release the weights for an audio component later though, so we'll see (there is a HoloCine-audio in the roadmap as well as an I2V version)
>>
>>107156920
no proprietary model is gonna allow you lora creation for whatever you want nor to tweak every gen parameter, that is the thing that actually matters, everything else can already either be done locally or can be done locally but with more manual work worst case scenario, but proprietarycucks literally CANT do these things and wont ever be able to in any way.
>>
File: 1756195819814295.png (1.21 MB, 896x1152)
1.21 MB
1.21 MB PNG
>a- aunt jemima... is that OK to wear in public?
>>
>>107156854
keep on coping, copeboy
>>
>>107157028
>no argument
already accepted your concession lil bro, keep crashing out
>>
>>107156982
Very nice anon
>>
>>107157052
you do whatever it takes to keep the cope alive
is this you?>>107156940
>everything else can already either be done locally or can be done locally but with more manual work worst case scenario
lol, lmao even
>>
>>107157076
>no argument
this has to be a bot, right? lol
>>
Most important things for new pc if I wanna do decent video gens in a non absurd timeframe?
I don't wanna reply to every webm in here asking for pc specs, but if someone wants to post some with their specs / how long it took, I'd greatly appreciate it
Budget is about 2.5k for new pc
>>
>>107157098
16gb vram is the single most important thing. more than that is better. less than that you're fucked.
>>
>>107157092
of course, anyone who laughs at your lack of intelligence is a bot
the argument is that you're a retard, you give more weight to what can be done locally just to poop on the things local can't do yet, that's moron behavior
>can be done locally but with more manual work worst case scenario
ANYTHING can be done locally but with more manual work, just grab a camera, hire actors, make a set, film it, pay jeets to VFX it and there you have it, no Sora 2 needed
it's a useless statement, you absolute shit for brains baboon
the whole point of AI is to have less manual work, if Sora 2 can do it without the manual work then it is (even if just for now) better
>>
>>107157098
nvidia gpu is the only thing that really matters. 16gb vram+. 24vram is practically required if you want top quality video gens. minimum 64gb ddr5 ram for offloading model cache if needed. cpu isnt important but you'll want something made within the past 10 years at least.
>>
Question to the anons using Wan2.2 text-to-video (not I2V), which lora are you using?
>>
>>107155166
crazy workflow, nice
>>107155364
im so lonely bwos
>>
>>107157280
There was this released two days ago if you're talking about lightx2v
https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-rank64-Seko-V2.0
>>
File: 1757125369853283.png (592 KB, 1572x773)
592 KB
592 KB PNG
>>107157141
>be proprietarycuck

>you cant train a lora to add a style to the model
>you cant train a lora to add a character or a person to the model
>you cant train a lora to add a concept to the model
>you cant train a lora for anything at all
>you cant finetune the model
>no big company can finetune the model like many companies are doing right now with wan
>you cant have anyone research around the model at all to improve its architecture, find optimization avenues, fix issues, change specific layers, text encoders, vaes, learn how to make better models in the future and advancing the entire ai industry itself etc
>you cant generate gore
>you cant generate pornographic material
>you cant generate anything else someone else would deem "problematic", no matter how mundane it might be
>you cant generate anything they at any point in time say you cant generate in the future when they change their mind overnight
>you cant generate anything at all if their servers are overloaded, not online, or broken
>you cant generate anything without it being logged and all your data harvested and sold
>you cant control dozens of generation parameters that would allow you to have precise control over what you generate, no matter how specific
>you cant write nor test out new generation parameters like new specialized samplers and schedulers
>you cant do anything about it if they decide to lobotomize the model you are using or remove it completely overnight, never being able to truly recreate what you once did and liked
>you cant test out new papers coming out with new technologies like completely changing how an entire portion of inference works, like completely changing how cfg works, completely changing how negative prompting works (https://github.com/hako-mikan/sd-webui-negpip) etc etc
As a proprietarycuck you are paying to be in a limited and spied on cuck cage and you lash out when someone calls out your evil corpo master and your pathetic cuck predicament.
>>
>>107156982
how the fuck are you guys, like pancakechad for example, genning animateinanimate like this? fuck this is so good.
man i know my brain is rotted when i find pancake and syrup women hotter than any e-girl kek
>>
>>107157453
very carefully
>>
>>107157470
i asked how you gen them, not how you fuck them!

but true.
>>
>>107157311
No lora I found works well with Holocine (the multiscene fine-tune)
>>
File: 1751074644698500.jpg (73 KB, 735x739)
73 KB
73 KB JPG
WAN 2.2 anons: just bought a 5070ti and I've been playing around all weekend to get a good workflow for keyframing a longer animation
>Generate ~12 separate 'keyframes' in SD for character LORAs
>Inpaint poses/details - create depth masks to quickly delete background in photoshop to keep character in white void for WAN
>send color 'keyframes' 1 + 2, 2 + 3, to FFLF2V to get a crude timeline of 2-3 second clips (turning, raising, pointing, draining a pint glass, etc. )
>i2v Q_8 gguf in the comfy 'workaround' gets jarring "Flashes" on reaching last frame as it quickly tries to compensate for color degradation, but LORAs are made for i2v.
>Inpaint Q_8 gguf seems to go faster and solves the flashes, seems to take the LORAs but i'm still unsure how well it will work long term.

curious how to proceed here:
>finish all the 2-3 second clips in i2v and try to save it in premiere
>keep playing with the inp. to get it to follow styles so I only need to fix the front half in post or re-gens
>Learn how to use VACE and how to use the last and first 8 frames of each clip to preserve the motion
>Take the entire 24 second video with jank coloring and learn VACE v2v to depth mask the entire thing and regen.


>>107157199
>minimum 64gb ddr5 ram for offloading model cache if needed
I have 32 and have been holding off because prices are gay. is it actually super necessary?
>>
>>107157556
vace
>>
>>107157556
>is it actually super necessary?
No, but the excessive swap use you get with 32 gigs slows generation down considerably.
>>
File: 1743026961283844.jpg (19 KB, 409x160)
19 KB
19 KB JPG
Why are the vue nodes so fucking huge? I want to use them, but this is ridiculous.
>>
File: lora_00090_.jpg (325 KB, 891x1336)
325 KB
325 KB JPG
>>107157556
It's much faster with 64gb+ ram
>>
File: ComfyUI_01308_.png (1.12 MB, 1152x896)
1.12 MB
1.12 MB PNG
>>107157453
prompt for the original one:
>professional 4k high resolution hyperrealistic 3d render by Disney Pixar of a beautiful nude curvy woman slime girl who is made entirely out of maple syrup. Her whole body and face are translucent and seethrough syrup. Her hair is made out of melting butter. She sits cross-legged on top of a huge stack of pancakes. Her body melts onto the pancakes. The pancakes are on a modest porcelain plate in a 50s American diner restaraunt.
>raytracing, beautiful lighting.

standard chroma WF
>>
>>107157199
What does offloading model cache mean and what do you mean by 16gb vram + .24vram?
>>
File: lora_00094_.jpg (301 KB, 891x1336)
301 KB
301 KB JPG
>>
Easy Cache, Lazy Cache, Apply First Block Cache, Wan Video Tea Cache, Wan Video Mag Cache, Wan Video Tea Cache Native, Wan Video Easy Cache
Which cope cache node do you use and at what settings?
>>
File: 175498415651458.png (506 KB, 640x610)
506 KB
506 KB PNG
>>107157592
would 96 make any difference or is that just pointless? the price ladder from 64 is a lot narrower than it used to be due to being a weirder size + slower clocks for XMP

>>107157565
>Vace
what's the point of the 3gb "Module" Vace FUNs at https://huggingface.co/Kijai/WanVideo_comfy_GGUF/tree/main/VACE
versus the large models at https://huggingface.co/QuantStack/Wan2.2-VACE-Fun-A14B-GGUF/tree/main/HighNoise?

Do you load the modules in the same chain as the regular i2v (or inp) model to save on disk space while achieving the same result?
>>
>>107157642
For example, Wan2.2-I2V-A14B-LowNoise-Q8_0.gguf is 15.4gb. If you only have 16gb of vram on your gpu, that leaves you with 0.6gb of vram. Keep in mind, the text encoder + loras + vae are also stored in vram. Since all that can't fit on a tiny 16gb card, you can set a specific amount of the model to be swapped to your system ram, e.g. 10gb of the wan model offloaded to system ram. This will allow you to gen without running out of memory. Offloading to ram is much, much slower, but it works.

Optionally, you can use a lower quant version of the model, like Wan2.2-I2V-A14B-LowNoise-Q6_K.gguf which is 12gb, but lower quants = lower quality.
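Rough back-of-the-envelope sketch of that budget (the model size is from the file above, the other numbers are just assumptions; swap in your own file sizes):

# Approximate VRAM budget for the offloading scenario described above.
vram_gb = 16.0          # card's total VRAM
model_gb = 15.4         # Wan2.2-I2V-A14B-LowNoise-Q8_0.gguf
text_encoder_gb = 6.0   # assumption: umt5-xxl fp8, use your own file's size
vae_gb = 0.3            # assumption
loras_gb = 1.0          # assumption
overhead_gb = 2.0       # activations, attention buffers, CUDA context

needed = model_gb + text_encoder_gb + vae_gb + loras_gb + overhead_gb
offload = max(0.0, needed - vram_gb)
print(f"needs ~{needed:.1f} GB, so offload ~{offload:.1f} GB of the model to system RAM")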
>>
>>107157642
He's saying you should aim for 16gb vram minimum but 24 is preferable. Offloading is when you can't fit the entire model into vram so you use your system ram. wan 2.2 q8 is like 15 gigs(?) for one of the models
>>
>>107157685
For Wan2.2, you don't use any of them.
>>
>>107156022
imgchad rein eternal
>>
File: 1653334650116.jpg (46 KB, 750x1086)
46 KB
46 KB JPG
>>107157370
>>
>>107157713
Is there a reason why?
>>
>>107157705
>would 96 make any difference or is that just pointless?
Hard to say really. Depends of the motherboard combo I guess.
>>
>>107157740
Video generation is iterative.
>>
https://civitai.com/models/2114848/2000s-amateur-photography
As requested. Not perfect, but reduces vaginahorror and manfaces.
>>
File: wan2.2_00001.mp4 (3.13 MB, 832x480)
3.13 MB
3.13 MB MP4
I tried Holocine and I could not get the same results as their demo even with 15 seconds lol
I used the same prompt
I obviously had to make some sacrifices like using distillation models with 5bit quants

"b-but local is better than saas, trust me bro!"
"results are shit? It's your fault you are poor and don't own an H100, the pinnacle of LOCAL gpus :^)"
>>
>>107157641
thanks <3
>>
>>107154956
this is AI?
>>
>>107157987
That looks like a zoomer idea of what 2000s photography looks like, and some of the photos in the showcase don't look "amateur" at all. At least search for photos that used popular cameras from that time like Sony Cybershot, Olympus, Canon PowerShot etc, or search for old myspace photos or older photos from Flickr.

t. Millennial
>>
>>107157987
bruh moment, as the kids say. https://civitai.com/models/978314/ultrareal-fine-tune?modelVersionId=1413133
>>
>>107157987
wait regular chroma cant do vageen? wtaf
>>
>>107158042
Dataset is mostly from 2000-2010 era.

>>107158090
It can, but it gets confused.
>>
>cold weather
>gpu 100% to warm room
Ohh shit it is GOON season
>>
But for what shall i goon to?
>>
File: 1741043482920713.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
correct me if im wrong, but is there any reason to make a high-noise version of a character lora for wan? there's no motion involved, so what would be the point?
>>
>>107158114
>Dataset is mostly from 2000-2010 era.
I am a Millennial boomer who lived that era and at least the showcase images don't resemble the amateur pics from that era at all
>>
>>107158147
It's less about "motion" strictly but denoising strength.
You might be able to make do if your character looks like a normal human with just low denoising lora. But for something like say Kirby or Sonic, you probably want for both.
>>
File: ComfyUI__00002_.mp4 (479 KB, 832x640)
479 KB
479 KB MP4
>>107155187
>>
>>107158193
I see, thanks. I've been experimenting with my character lora while using other NSFW loras, and I noticed that using the low-noise half of some loras forces my character (person) to look like whatever person that lora was trained on. How can I avoid that? Increase the strength of my character's LOW lora? remove the NSFW's low model? I've tried both but haven't found anything solid that works. I can't get rid of the low lora for some NSFW loras because wan needs that data to create, for example, a penis or cumshot.

The twerk lora for example, always makes the ass bigger and i don't want that. it's so annoying. lowering the strength of the nsfw lora helps but also reduces the motion
>>
File: ComfyUI__00003_.mp4 (587 KB, 640x832)
587 KB
587 KB MP4
n00n0
>>
>>107158244
nani kore wa yameto my ramenu betta stoppa acting up i'm gonna nækædæshi my ramanu
>>
>>107158224
>How can I avoid that?
I should note that I never trained a WAN lora, but this seems like a generic lora compatibility issue to me. Try lowering the strength of the other lora?
>Increase the strength of my character's LOW lora?
Maybe just a bit if you are desperate.
>remove the NSFW's low model?
Probably not.
>The twerk lora for example, always makes the ass bigger and i don't want that.
This just means the person who trained it trained on big asses.
Train your own with a diverse dataset of asses of all sizes?
>>
File: 1750330283072872.jpg (746 KB, 1536x2688)
746 KB
746 KB JPG
>>
flux/chromosome users, how do you handle your text encoders? do you use specific quants? i'm starting to wonder if my shit gens are a product of what i'm using, but i'm not sure. cumfartui is very confusing as well so that's a variable. the default flux krea workflow is 3 whole seconds slower than an old workflow i was using earlier this year..
>>
File: ComfyUI__00006_.mp4 (514 KB, 640x832)
514 KB
514 KB MP4
>>
>>107158330
keep t5 at fp16 imo.
>>
File: ComfyUI__00007_.mp4 (642 KB, 832x640)
642 KB
642 KB MP4
>>
>>107158330
q8 chroma, fp16 clip, 26-35 steps, euler simple/beta
try "aesthetic 1" in negative
>>
File: ComfyUI__00009_.mp4 (596 KB, 832x640)
596 KB
596 KB MP4
it doesn't understand left/right but far/near seem to work
>>
>>107158350
>>107158378
thanks. i guess i was trying too hard to save on vram by lobotomizing the text models.
>>
>>107158385
Why do text encoders struggle with directions? That's not an isolated incident.
Quick theory:
Is this because right/left can mean both viewer's right/left and character's right/left, which ends up confusing the UNET during training?
>>
>>107157987
thank you for your hard work
>>
>>
>>
>>
File: ComfyUI_temp_xgutr_00011_.png (2.11 MB, 1024x1248)
2.11 MB
2.11 MB PNG
>>
>>
So it sounds like it's worth going down a generation to the 40xx cards if I want 24gb vram at a more reasonable cost
>>
>>107158475
>>107158542
neat
>>
>>
>>
File: peasant girls.jpg (508 KB, 2688x1536)
508 KB
508 KB JPG
i literally gooned for 12 hours today
>>
>>107158554
so a 4090 then? aren't they like 1500 dollars
>>
>>107158162
I guess I could rename it, fair point

>>107158435
npnp
>>
File: 1732200222847408.jpg (960 KB, 768x1344)
960 KB
960 KB JPG
>>107158607
Can you catbox your picrel or a similar gen?
>>
>>
>>
>>107158665
https://files.catbox.moe/0andv6.png
>>
>>
>>107158723
Thanks!
>>
>>
>>107158781
>>107158726
>no large breasts, wide hips
>>
>he doesn't (large breasts, wide hips, thick thighs:1.5)
>>
>>
>>
>we may be getting a $2k stimulus check
and i'm 100% going to use that money to buy a 4090, kek
>>
File: ComfyUI__00010_.mp4 (1.06 MB, 832x640)
1.06 MB
1.06 MB MP4
>>
>>107158919
You should by stocks, dummy. Preferably OpenAI stocks of course lol
>>
>>107159071
but i want faster gens right NOW
>>
Hello, I am from the TouHou AI general on >>>/jp/2huAI/. It is a very nice and good quality general, but it is slow to answer simple questions. I have a question about making a lora. Is this the right place to ask?
>>
>>107159091
you could've just asked the question instead of wasting a post asking for permission to ask a question
>>
ooooh baby SongBloom is cookin up some songs real nice

hear that sizzle & smell dem onions
>>
>>107159161
https://vocaroo.com/1myH3aeJX4hT

Trying again, the tricky part is trying to get lyrics adherence but the right amount of song stealing.
>>
File: ComfyUI__00022_.mp4 (322 KB, 640x832)
322 KB
322 KB MP4
>>
>>107159282
nice, can she SING?
>>
>>107159091
Going to bed now might answer your question hours later if it makes sense, if no one else answered it and if I don't feel too lazy.
>>
File: ComfyUI__00028_.mp4 (1005 KB, 640x832)
1005 KB
1005 KB MP4
>>
File: ComfyUI__00023_.mp4 (267 KB, 640x832)
267 KB
267 KB MP4
>>
>>107159296
nobody cares bro.

bro go to the poop festival, in your dreams.

bro
>>
>>107159294
dunno
>>
>>107159308
>slow motion shit
lightx2 crap
>>
>>107159256
https://vocaroo.com/1cHl7bT8AHk0
>>
> no new better cards
> no new better models
it's so over for local
>>
>>107159481
I'm literally posting SongBloom gens, dearest sir of the African persuasion.
>>
>>107156778
cth-uwu.
>>
>>107159495
do you know how you would prompt this?
https://www.bbc.com/news/articles/c1wl5jp94eno

I genuinely have no idea
>>
Is normal forge still the only UI that uses Gradio 4?
>>
>>107159504
you could try this maybe? it does alright
https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
>>
>>107159519
>https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
>This photograph features a large, shiny, blue, abstract sculpture of a humanoid figure with a rounded, bulbous body and simplified, elongated limbs. The sculpture has a smooth, glossy texture, reflecting the surrounding environment. It stands outdoors on a paved area with a grassy patch behind it. In the background, there are palm trees and a building with a white facade and a red horizontal stripe near the top. The sculpture's head is slightly tilted downward, and its expression is indistinct due to its abstract nature. The bright blue color contrasts with the green grass and the white and red building.

I may try it.
>>
>>107159452
nogen crying
>>
File: 1757137130565758.jpg (174 KB, 1086x386)
174 KB
174 KB JPG
>>107159562
I gen 24/7 but ok
>>
File: ComfyUI__00035_.mp4 (1.18 MB, 720x1280)
1.18 MB
1.18 MB MP4
>>
>>107159618
cry moar kid
>>
>>107159618
win VOMIT dows
>>
local still can't compare to grok imagine
>>
>>107159643
can grok do explicit porn? no? who cares.
>>
>>107159618
WHOA lookout, we got a WINDOWS guy here
>>
Any good alternatives to lightx2v, other than waiting 10 minutes for a gen?
>>
>>107159679
Personally, I think SongBloom is the best replacement, but some may disagree, for very stupid reasons.
>>
>>107154861
lol
>>
>>107159561
Running it
>>
File: 1742645567943191.jpg (467 KB, 1028x1421)
467 KB
467 KB JPG
>>107159652
Yes. I use freeBSD, headless debian and primarily Windows, anon.
>>
>>107154883
thanks

SongBloom is way beyond lol
>>
>>107158919
>4090
Shit, $2k is like 8GB of RAM these days.
>>
File: 1755826040903345.jpg (234 KB, 1011x694)
234 KB
234 KB JPG
>>107159727
>>
>>107159452
>lightx2 crap
let's hope the new 4-step distillation method will make the slo mo shit disappear >>107154918
>>
>>107159737
why not just buy a 5090 at that point
>>
>>107159816
I've said it a billion times but slow-mo isn't the only problem with lightx2. it nukes the liveliness of animations. everything is simply less animated. less things move.
>>
File: waifu located.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>107159702
image & text go together, but cba to make a video, so just play it and stare at the picture. or ask someone to wan it, I guess.
https://vocaroo.com/1otWsnlu7TwZ
>>
>>107159835
That's an extra $700-900.
At that point may as well buy a RTX 6000 Pro. Just a few extra $$
May as well buy an H100
Shit may as well buy an H200
>>
>>107159835
:^)

so, apparently the 5090 sucks for ai, anyway.
>>
So the only recent local developments are speedcrap for turdworld poorfag shitskins? Local really died with wan2.5, what a letdown
>>
>>107159840
5090 is $2k bud
>>
a pickle, for the knowing ones
>>
>>107159308
>Uses ai to create something that basically doesn't exist. An east-asian with giant tits that aren't fake. This is the future of stable diffusion.
>>
File: wan22___0004.png (1.69 MB, 832x1216)
1.69 MB
1.69 MB PNG
>another day, another lora
>>
File: Video_00137.mp4 (1.56 MB, 544x960)
1.56 MB
1.56 MB MP4
>>107159910
>2.2, still slow mo
>>
File: spics fear it.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
https://vocaroo.com/1fh5yWC322DT

used joy caption on the top image from
https://www.artforum.com/features/yuk-hui-daniel-birnbaum-interview-1234733869/

>Photograph of a clear, rectangular ice cube suspended in mid-air against a bright blue sky with scattered white clouds. The ice cube is transparent with visible internal crystal structures and slight surface imperfections. In the background, there are blurred green trees and a tall evergreen tree, indicating an outdoor setting. The image has a sharp focus on the ice cube, with a shallow depth of field that blurs the background. The sunlight illuminates the ice cube from the front, highlighting its transparent and textured surface. The overall composition emphasizes the contrast between the sharp, detailed ice cube and the soft, blurred natural background.

chroma hd and SongBloom
>>
>>107159910
What lora is that
>>
>>107159960
who cares, they're all literally the same.
>>
File: wan22___0009.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
>>107159960
https://civitai.com/models/2063310?modelVersionId=2334783
just published the wan version
>>
>>107157987
how many images for the dataset?
>>
>>107159982
Nice
>>
>>107157370
Unfathomably based
>>
>>107157290
don’t worry anon that’s just your dumb hormones
women are overrated
>>
>>107160004
600, next version has 648
>>
File: 1740414827149879.png (1.5 MB, 1800x606)
1.5 MB
1.5 MB PNG
>>107154918
this is way closer than the lightning method, impressive
>>
File: 1745128418003985.png (2 MB, 1344x1728)
2 MB
2 MB PNG
>gooning to your own gens
isn't this just a more convoluted and expensive way to goon to your own imagination? what's the point?
>>
>>107160054
damn dude, i rarely go over 20 images. have you done any tests with smaller training datasets?
>>
>>107160083
aphantasia
>>
>>107160106
>have you done any tests with smaller training datasets?
Yeah. I prefer larger datasets for more variation
>>
>using supervacetools to make long video
>long pauses between each gen
>swap the "patch sage attention kj" node with "model patch torch settings" node
>no more retarded long pauses in between gens
>near double the speed

fp16 accumulation is pretty dope, wonder if it'll work on wan2.2
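my guess (haven't read the node's code, so this is an assumption) is that it just flips PyTorch's reduced-precision matmul flag; in plain PyTorch that looks like:

import torch

# Assumption: the "model patch torch settings" node effectively enables this flag.
# fp16 matmuls then accumulate in fp16 instead of fp32: faster, slightly less precise.
torch.backends.cuda.matmul.allow_fp16_reduced_precision_reduction = True

a = torch.randn(4096, 4096, device="cuda", dtype=torch.float16)
b = torch.randn(4096, 4096, device="cuda", dtype=torch.float16)
c = a @ b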
>>
>>107159932
is this i2v or t2v? which x2v are you using?
>>
>>107157370
train a lora to make Wan have the prompt adherence of Sora 2
>>
https://github.com/wallen0322/ComfyUI-Wan22FMLF

Improved tech just dropped.
>>
And a qwen edit upscaler.
https://huggingface.co/vafipas663/Qwen-Edit-2509-Upscale-LoRA
>>
>>107160391
>Wan fuck my life

Start, middle, and end frame is a nice addition.
>>
>>107160391
> - Dual MoE conditioning outputs (high-noise and low-noise stages)
> - Multi-motion frames support for dynamic sequences
> - Automatic video chaining with offset mechanism
> - SVI-SHOT mode for infinite video generation with separate conditioning
> - Adjustable constraint strengths for each stage

Interesting.
>>
File: 1737619368908935.png (841 KB, 896x1152)
841 KB
841 KB PNG
SPARK chroma fixed chroma.
>>
>>107160523
Thought svi was for 2.1 and 2.2 5b? Would it properly work with 2.2 14b? Also pretty sure there's either going to be dedicated nodes or comfyui native implementation, soon hopefully
>>
>>107160582
Native comfyUI nodes oftentimes just ride on free ideas from others or are poorly supported, just for market capture. Dedicated nodes from other people tend to get faster updates and work better. More than once I've been disappointed with Comfy's implementation, e.g. inpainting and wan2.2
>>
>>107160565
Show realism, then we talk
>>
>>107160391
I was just asking about something like this a few threads ago. There is a ton of multi image pixiv illustrations as well as my own ai ones that would make for great animation with this.
>>
What's the state of the art for local photorealistic video gen?
>>
PSA: pi-flow combined with loras seems to be slightly last slopped than the normal Qwen-Image experience. To get rid of plastic skin slop and "cinematic" stuff, avoid using words like "a photograph of (...)" or "an image of", and use "Amateur footage of (...)" instead and you will consistently get better photo-realistic results. Recommended model: Lenovo Ultrareal
>>
>>107160622
makes me wish a different UI got all the community attention. anons are too doompilled on comfy since it focuses on saas more than anything nowadays
>>
>>107160622
>>107160804
It's not too late for (You) to contribute to stable-diffusion.cpp
>>
>>107160772
>Lenovo Ultrareal
My favourite LORA
>>
>>107160772
>slightly last
*Slightly less. I am sleepy
>>
>>107160083
it's gooning combined with gambling on the chance that the prompt matches what you had in mind beforehand. twice the degeneracy and an excuse to edge endlessly
>>
>>107156856
now that is just ugly
>>
File: 00017.png (62 KB, 1024x1024)
62 KB
62 KB PNG
>Try to generate a headshot
>The top of the girl's head is always out-of-frame
How can I fix this?
>>
>>107160837
tell the Chinese to switch onto it, it's run by one of their own so I don't get it. it would also gatekeep western companies since they hire Indian slaves to poothon all day. would be hilarious if they changed their minds and cucked america by doing that though
>>
In case you guys want to be disappointed of the state of local: prompt Qwen-Image for centaurs.
>>
>>107160965
She looks like your average Bong woman
>>
File: tmp6mjeba9y.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>107160980
If putting "out of frame, cropped" in negative prompt isn't enough you can always just outpaint.
Another option is to resize and draw in the hair color in an editor then inpaint that to match the rest of the image.
Failing that you could always generate a taller aspect ratio full body shot and crop out what you don't want to use.
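For the outpaint route, the canvas prep is just padding the top and masking the new strip; a minimal PIL sketch (filenames made up), which you'd then feed into whatever inpaint workflow you normally use:

from PIL import Image

# Pad the top of the gen so the inpaint model has room to finish the head.
img = Image.open("headshot.png")     # assumption: your cropped gen
pad = 128                            # pixels of new canvas above the head
canvas = Image.new("RGB", (img.width, img.height + pad), "gray")
canvas.paste(img, (0, pad))

# White = area to inpaint, black = keep.
mask = Image.new("L", canvas.size, 0)
mask.paste(255, (0, 0, img.width, pad))
canvas.save("headshot_padded.png")
mask.save("headshot_mask.png")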
>>
>>107160980
Add more details to the prompt of what you want to see: eyes, hair, etc.
>>
Does anyone know the default wan2.2 settings without light loras? 20 step (10 h + 10 l) and euler + simple, high start step 0 end step 5 / low start step 5 end step 10000?
>>
>>107161034
Best I've got.
>>
>>107160083
>isn't this just a more convoluted and expensive way to goon to your own imagination? what's the point?
For me it's a combination of aphantasia like the other anon said (I can't visualize things) as well as "playing with dolls" (some anon once mentioned that genning 1girls is the same delayed brain development as people who play with dolls and I 100% agree because I independently came to the same realization/conclusion)

>>107161131
"Default" wan 2.2 to me sounds like 50 steps (25 each) on unipc sampler with default CFG and flow shift values
>>
>>107161160
Got it to work with the 20 step but only gen once. I tried genning again and get an error of

CLIPTextEncode

'GGUFModelPatcher' object has no attribute 'named_modules_to_munmap'


I've already updated everything to the latest version. Doesnt seem to want to work without light loras, kek
>>
>>107161034
It can, however, do the reverse (prompt was 'a headless horse', after some seed hopping)
>>
>>107160582
The node doesn't work even without svi lora.
>>
>>107161206
Never seen that error before, you can open your ComfyUI folder in Visual Studio Code and give Copilot the error and see if it can help figure it out.

Since this is a clip error I'm assuming you're doing image2video? Since only wan i2v should be using a clip model (clip vision to read your input images)

Text to video and image to video have different settings and nodes required. The default comfy workflows on the GitHub for wan are fp8 scaled and don't have any lightning enhancements so you can use those to do your full-step gens I guess
>>
>>107156291
on linux?
please share your setup, what distro and kernel version
>>
guys is qwen fp8 better or q8?
>>
>>107161341

Yes, it's i2v. I just switched to the multigpu unet and clip nodes instead and that solved the problem. Yeah, there's always some kind of new error every update
>>
>>107161397
>guys is qwen fp8 better or q8?
Q8 is fp8 with some layers kept unquantized so it should be strictly better
>>
>>107161470
thanks. I don't know what it is with qwen, but it takes so long to gen. Wan videos are so much faster!
>>
>>107161470
>Q8 is fp8 with some layers kept unquantized
No. Q8 is basically FP16. It is much better than FP8. This is common information you can google.
>>
>>107161497
What you said didn't invalidate what I said. Q8 is some of the blocks at int8 and some blocks at f32.

I'm planning on testing t5_xxl with the different fp8 versions versus Q8_0 today
>>
>>107161474
Don't you use a speed-up lora?
>>
>>107161642
theres so much quality loss tho
>>
>>107161648
are you real? finally a real person that agrees lightx2 and speedup loras are fucking dogshit
>>
>>107161666
umm yeah, if u dont think theres quality loss then ur blind af
>>
>>107161666
No one is denying that. But also a genuinely improved t2v version came out recently, and so did an i2v version but it was worse than previous ones so only t2v got an improvement recently
>>
>>107161341
i've been doing i2v without clip vision and it seems just fine, would i get better results if i add it?
>>
>>107161730
I have absolutely no idea to be honest since I hardly do i2v locally

I never used it either but it's supposed to be required. It works on a GGUF workflow without clip vision but broke for me using kijais nodes iirc

But I am also out of the loop of what nodes to use nowadays. My 2.2 workflow is full of deprecated and beta nodes since it just works, it's GGUF and there's no actual benefit I get from remaking it until until there's a new model to run, which will have its own nodes and workflow needed anyways probably
>>
>>107161730
>>107161775

Asked perplexity about clip vision for wan2.2...

>Clip Vision is generally not necessary or beneficial for WAN2.2 workflows, according to user reports and in-depth testing from the image-to-video AI community. WAN2.2 is designed so that it no longer relies on CLIP embeddings; this marks a shift from previous models like WAN2.1, which did have image cross-attention layers that could utilize CLIP vision. When Clip Vision is supplied in a WAN2.2 workflow, it is simply ignored, so it does not improve generation quality or prompt adherence, and may actually slow down video creation times by several minutes.
>>
>>107159091
Don’t mind the mean people
>>
>>107161648
There's a new one that promises to be better, look it up in the thread.
Alternatively, gen in stages, and only use the full model during the critical ones. I don't think I could tolerate genning with Qwen at all, but at less than 10 seconds per preview without optimizations, I'm chugging along merrily. But then, I've split my workflow into so many sampling stages that it's becoming unwieldy by itself.
>>
>>107161882
>and may actually slow down video creation times by several minutes.
Good to know.
>>
>>107161882
Then kijai is more of a vibe coder than I thought lol. Pretty sure it's not a false memory that I needed to download clip_vision_h in order to get one of his nodes to stop complaining, even if that node ended up never using it

I'm bored and my new job starts in a month so I'll spend today making an "opinionated 2.2 t2v guide/recommendation" rentry as well I guess
>>
>>107161926
Probably a hallucination. Clip vision is like a 70mb model so even if it's being loaded and unloaded every generation without doing anything it can't be adding more than a couple of seconds max
>>
>>107156462
>Stanford University, Adobe Research
yeah i'm not going to be installing that adobe research shit on my machine. I really just don't trust them not to sell my data for research purposes using some fuckery inside of their nodes. also no gguf support?

TRASH
>>
>>107161947
This post activated the neurons in my brain that reminded me that Tel Aviv University made a really good text to video model and put out a paper and then never released it. I think this was either before or during the wan 2.1 era
>>
>>107161938
> Clip vision is like a 70mb model
>>
>>107161995
>>107161938
clip_vision_h used for wan21 is like 1gb. still, on an ssd you'd barely notice it. if wan22 doesn't use it then there's no reason to have it.
>>
>>107161995
I was wrong but I also swear I downloaded a tiny clip vision h as a .pt before

>>107162021
>clip vision h for wan 2.1 only
That explains it. Thank God 2.2 got rid of the double text encoder autism that hunyuan introduced. Too bad we got refiner autism instead
>>
do you get better prompt adherence with fp16 clip compared to fp8 scaled?
>>
File: 1743731954230389.jpg (181 KB, 793x598)
181 KB
181 KB JPG
>>107162068
fyi, just ask claude these types of questions. higher precision models will always be better; how much of a difference that makes in quality/prompt adherence will always be subjective and debatable because it entirely depends on the prompt and model.

https://claude.ai/
>>
>>107162109
please go away
>>
>>107162123
you asked a question and got an objectively correct answer. if you're upset that it was ai generated while also posting in a general about generating ai content then you're a fucking retard.
>>
>>107162140
>>107162109
> is model A good?
> according to benchmarks model A is the best...
>>
>>107159646
why don't you just have sex?
>>
new
>>107162296
>>107162296
>>107162296
>>107162296
>>
>>107162259
the kind of sex i want is forbidden.
>>
>>107162300
stop lusting after horses
>>
>>107162300
ask the friendly fbi agents to kindly break all of your limbs
>>
>>107160349
t2v
>>
>>107162606
check the new thread, but is this with the new seko v2.0 version of lightx2v? in my testing slow motion has gotten much better most of the time



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.