/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107227636

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
comfy should be dragged out on the street and shot
>>
cursed spitebake of misery
>>
>>107238019
kys julien
>>
Blessed thread of frenship
>>
>>107237999
>grok video in faggollage
>>
>>107238050
it's a troll bake
real thread:
>>107237888
>>107237888
>>107237888
>>
>>107238080
No one is going to use tranistudio, not now, not ever. Give up.
>>
Comfyui is fucking dog shit
>>
Can someone please explain to me why julien is off his meds? He has been behaving very erratically over the past week. I know he has autism, but usually he knows better than to try and mess with the /ldg/ OP.
>>
so there's this implementation of kandinsky in comfyui
https://github.com/Ada123-a/ComfyUI-Kandinsky/
but then kandinsky team has their own implementation?
https://github.com/kandinskylab/kandinsky-5/tree/main/comfyui

anyone tried either?
>>
File: ComfyUI_00261_.png (1.18 MB, 1280x1120)
>>
>>107237999
put anistudio in the op so the schizo has to spread across all chan diffusion threads. fill the other thread first
>>
File: ComfyUI_00266_.png (1.01 MB, 1280x1120)
>>107238405
the anti anistudio schizo is a netayume poster, I know this cause I mindbroke him by asking for an anti-netayume poll, and he copied my idea for his anti ani schizo polls
>>
>>107238226
The unofficial one seems a lot better. Try that.
>>
File: 1733915560397991.png (241 KB, 1347x959)
>>107238461
I am but it's taking a while. Oh, I just noticed it's 50 steps. Shiiiieeet. Spoiled by lightx2v
>>
MIGRATE TO COMFY THREAD
>>107238591
MIGRATE TO COMFY THREAD
>>107238591
MIGRATE TO COMFY THREAD
>>107238591
...
>>
Again, what set ani off?
>>
File: 1763203829792991.mp4 (1.92 MB, 736x496)
>>107238557
>Prompt executed in 01:08:46
wew. not worth it
>>
>>107239370
Damn that's rough
>>
I fixed a major bug with my Kandinsky implementation, try again
>>
didn't notice because I mostly did short videos for testing, but full-length videos had noise issues
>>
there is still an issue with windows not liking my torch compile stuff, so windows may still throw error messages btw, but noise should be fully fixed
>>
actually there might be one more issue... this is complicated. I blame their own implementation being rough
>>
>>107239370
what's the gen time for a single image?
>>
File: 1746449791802563.mp4 (1.1 MB, 688x448)
>30 steps
>hatsune miku is sitting at a desk typing on a laptop. the laptop faces away from the camera. hatsune miku turns the laptop to face the camera. on the laptop screen is the black text "/ldg/" on a white background. hatsune miku smiles and does a peace sign with her hand
It... it doesn't know migu
>>
ok, the original repo had a bug with tiled vae decoding which caused big noise issues. I had wrongly thought scheduler_scale was the issue, because the repo's documentation there is bad, with defaults and suggestions not matching
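for anyone wondering how a tiled decode bug turns into noise: if tiles aren't blended across their overlap, every seam shows up as blocky junk. A minimal sketch of what correct overlap blending looks like, assuming a generic SD-style vae.decode() with an 8x spatial scale factor; this is illustrative, not the repo's actual code:
[code]
import torch

# Hedged sketch of overlap-blended tiled VAE decoding (not the repo code).
# Assumes vae.decode(), an 8x spatial scale, and a latent >= one tile per axis.
def decode_tiled(vae, latent, tile=64, overlap=16, scale=8):
    b, c, h, w = latent.shape
    out = torch.zeros(b, 3, h * scale, w * scale, device=latent.device)
    weight = torch.zeros_like(out)
    ramp = torch.linspace(0.0, 1.0, overlap * scale, device=latent.device)
    for y in range(0, max(h - overlap, 1), tile - overlap):
        for x in range(0, max(w - overlap, 1), tile - overlap):
            y0, x0 = min(y, h - tile), min(x, w - tile)  # clamp the last tile
            piece = vae.decode(latent[:, :, y0:y0 + tile, x0:x0 + tile]).float()
            mask = torch.ones_like(piece)
            if y0 > 0:  # cross-fade the top edge into the tile above
                mask[:, :, :overlap * scale, :] *= ramp.view(1, 1, -1, 1)
            if x0 > 0:  # cross-fade the left edge into the tile to the left
                mask[:, :, :, :overlap * scale] *= ramp.view(1, 1, 1, -1)
            ys, xs = y0 * scale, x0 * scale
            out[:, :, ys:ys + tile * scale, xs:xs + tile * scale] += piece * mask
            weight[:, :, ys:ys + tile * scale, xs:xs + tile * scale] += mask
    return out / weight.clamp(min=1e-6)  # normalize wherever masks overlap
[/code]
skip the blending (or the normalization) and you get exactly the kind of full-length noise described above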
>>
>>107239934
Is this kandinsky? Looks like actual anime, not like that 3dslop wan produces
>>
File: 1751974069161743.png (96 KB, 604x832)
>>107240058
>Is this kandinsky?
Yeah. Now trying to describe her appearance and see what happens
>>
schizo holocaust when
>>
>>107239934
>it doesn't know migu
Thank goodness! Based model
>>
>>107239934
I believe at this point all major models instruct whatever they are using to tag images not to tag copyrighted characters or real people.
There is no other explanation for why almost everything released since SDXL struggles to do even the most popular characters.
>>107240076
Won't help too much. It's not the same as knowing the character, for example the facial features will be off.
>>
>>107240223
now
>>
File: 1754328866637162.mp4 (2.08 MB, 688x448)
>it really doesn't know miku
it's over...
>>
>>107240390
the thing is, its 2B, so you could super easily train it who that is with just a few images
>>
File: 1747737947788309.png (1.9 MB, 1120x1440)
>>
>>107240408
that was the 20b q4 gguf
>>
XL until the heat death of the universe
>>
btw, still some bugs in the implementation. I should have kept it hidden until it was finished
>>
>>107240517
It's not bleeding edge unless it cuts. Slap a warning on that bad boy and call it a day
>>
File: 1737341319542217.jpg (1.55 MB, 1248x1824)
>>
File: comfyui__00036_.png (1.02 MB, 1024x1024)
>>
We need Alibaba to keep releasing their models to the community.
>>
>>107240791
Alibaba said no.
>>
They should make a computer that runs not on electricity but from you fucking it with your penis and it giggles
>>
>>107240797
[citation needed]
>>
>>107240818
Trust me, insider sources who wish to remain anonymous have told me this.
>>
>>107240791
Putting all your hopes in a single company is mega retarded doodoo head
>>
>>107240969
No it isn't, you stupid idiot. They're the only company that has been continually releasing its video models up to this point, so it's only natural they are the best hope to keep releasing good video models.
>>
>>107240390
>Zero Japanese knowledge

I would say that's impressive, but given they wanted the model to know Russian, that gives a clue as to why it doesn't know Miku. A shame; on to waiting for something good from China.
>>
>>107237999
Been a bit out of the loop: are AMD cards still shit for local AI, and if not, would the R9700 be a good card if you can't go for the top notch stuff and want to be a bit future proof with the 32gb of ram?
>>
>>107241081
>are AMD cards still shit for local AI
yes
>>
>>107241025
Give me the name of a company who delivered more than once and didn't pivot to closed source. There is not a single lab who released something wildly successful and then followed it up with another. Subsequent releases are always either shit or closed source.
It's not controversial to say someone will take Alibaba's local throne. That's just the way it is, the way it has been since the invention of genai.
>>
>>107241081
>are AMD cards still shit for local AI
y
>the R9700 be a good card
n
>want to be a bit future proof with the 32gb of ram
lol, 32gb struggles now, much less future proof
>>
>>107241093
>>107241109
Well, fuck. Thanks for the quick reply though.
>>
>>107241138
If you use Linux then you can make AMD work. Nvidia is said to have better performance of course but I wouldn't know.
>>
>>107241109
How about when you just wanna do ComfyUI and similar stuff?
>>
Is a regularization dataset supposed to be tagged with simply "a photo of a man" etc?
>>
>>107239695
KandinskyImageToVideoLatent has an extra tab before the latent_frames declaration, so it ends up inside the exception handler.
>>
Asked in wrong thread fuck my life. Anyways

I've been using i2v for wan 2.2 a shit ton, I like the 3d blender type of style used in animations. Is there a local gen model that's actually good at that so I can gen my own base images?
Last I used local imagegen illustrious was the meta and that was awful at any 3d
>>
>>107241801
Nothing spectacular but in my experience Flux and Qwen are the least-worst at generating 3d render style images.
Flux produces cartoony-looking people in that style and Qwen has absolutely zero variety in its outputs, so pick your poison.
Also I am definitely no expert but try playing around with various sampler+scheduler combinations, I think somebody said that some combination of (deis, heun) sampler and (beta, linear_quadratic) scheduler gets decent results in that style. Play around and see what you get.
>>
>>107241790
thank you
pushed some other changes too, and added a preview
>>
>>107241081
>are AMD cards still shit for local AI
they're fine on linux.

>and if not would the R9700 be a good card
for LLMs, yes
https://www.phoronix.com/review/amd-radeon-ai-pro-r9700/2

in image gen benchmarks, 7900 XTX appears to be faster, but that could be due to immature R9700 drivers. I haven't seen a really trustworthy benchmark comparing this. I suggest considering a 7900 XTX, since it's cheaper and still has 24GB VRAM.

>>107241093
>>107241109
njudea FUD
>>
File: ipndm_beta_10step_00001_.png (1.33 MB, 768x1280)
>>
>>107241906
if you want to deal with troubleshooting and non-existent support, have fun. if you just want to gen then get an nvidia
>>
>>107241958
It's true AMD still requires a bit more config and research than nvidia for local gen, but it's nothing crazy if you're not low IQ. and this is cutting edge experimental tech, you will have to troubleshoot issues alone no matter what brand you're using. someone who is scared to use an AMD card shouldn't bother with local gen yet anyway, they'll get frustrated and give up the moment they try to work with comfyui.
>>
>>107241892
Thanks for running the vibe to get this going. The i2v for this model is really fast, with comparable outputs to wan so far.
>>
>>107238261
I'm more triggered by the broccoli head
>>
>>107242016
yep, and the biggest deal is that people will be able to do full finetunes on it since it's only 2B. I think the 20B will be an afterthought. 2B should be the new sdxl, small enough for people to actually bother training
>>
>>107241958
FUN
UUN
NNN
>>
>>107242016
>doesn't post the outputs
>>
File: img_00009_.jpg (773 KB, 1264x1656)
>>
File: 1759604701295889.mp4 (3.55 MB, 640x592)
>kandinsky5lite_i2v_5s
Uhh, I guess the input image is just a suggestion
>>
>>107242253
That's literally me when I hide my power level IRL.
>>
>>107242138
nta, but here is a 2B I2V attempt
https://files.catbox.moe/3dgy3a.mp4
>>
>>107241559
no, the model learns to gen these images
>>
>>107242310
So you don't tag them at all?
>>
File: ComfyUI_00034_.mp4 (476 KB, 480x832)
>>
>>107242325
Bazinga!
>>
>>107242283
another attempt
https://files.catbox.moe/qtd3qm.mp4
>>
>>107242318
i don't use reg images
>>
File: img_00018_.jpg (631 KB, 1264x1656)
>>
File: 1762277945847062.mp4 (3.54 MB, 448x784)
>the woman grabs her breasts. the woman massages her breasts. she sticks her tongue out and sneers at the camera
k, a second woman instant-transmissioned into the frame. kandinsky i2v 2b
>>
>>107242466
did you pull latest? a bit ago I fixed a error with I2V >>107241892
>>
>>107242486
yeah I cloned like 15 minutes ago, I got the preview window
>>
File: ComfyUI_08583_.png (1.94 MB, 1152x1152)
>>
>>107242510
huh, I just used it fine a moment ago.

10.0 scheduler scale, 5.0 cfg, 20-50 steps, 768 x 512 res?
>>
>>107242466
>teleports behind u
>>
>>107239695
anon should add it to OP https://github.com/Ada123-a/ComfyUI-Kandinsky/
>>
>>107242528
What the fuck is that seating arrangement.
Their eyes are fucked up.
The trolley for the refreshments is retarded looking.
Terrible
>>
File: 1753805762102054.mp4 (3.02 MB, 496x736)
>>107242533
>10.0 scheduler scale, 5.0 cfg, 20-50 steps, 768 x 512 res
yeah, this is 20 steps
>the woman turns around and types on the computer keyboard. on the computer monitor appears the text "/ldg/" in black font on a white background. the woman looks back at the camera and smiles
don't know how this is comparable to wan desu
>>
>>107242637
what in the world. Something is wrong here. Mine is not doing that and I'm the one who pushed the changes.
Here is an earlier I2V before I fixed the noise
https://files.catbox.moe/la9r93.mp4
>>
I must have missed something, I'm checking
>>
ok, now pull and try it
>>
File: 1752460198317977.mp4 (3.73 MB, 736x496)
>the anime girl gets inside the car and closes the door. the camera follows the car as the car drives off into the distance
wasn't expecting that. it seems to have trouble comprehending stuff already in the frame. it is only 2b though
>>107242757
will do
>>
>>107242325
>>
>>107242767
I was not fully passing the conditioning for I2V, missed one line
visual_cond_input[:1] = visual_cond_typed[:1]
>>
How do people make videos with high consistency that are longer than 5 seconds?

I can output 9 sec gens on wan 2.2 using the workflow from the rentry in the OP, but anything more and my 5090 runs out of vram and the video loses cohesion toward the end
>>
>>107242805
there are multiple ways, but none of them have high consistency except wan animate, and the quality of those videos sucks donkey dick.
>>
also what scheduler scale is best for I2V is still unknown to me, I have not tested it enough. 10 or 5 maybe
>>
File: WAN22_00048.mp4 (795 KB, 512x768)
>>
ok, this is 2B I2V with the fix
https://files.catbox.moe/qq0m6h.mp4
>>
>>107242892
oh, and I used 5.0 scheduler scale for this one, that might be better for I2V
>>
File: wan22_00060.mp4 (2.08 MB, 480x608)
>>
>>107242892
what are you trying to test or prove anon?
>>
gonna try to make a difference lora so the distill can be used as a lora on the I2V model
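for anyone unfamiliar: a difference lora is made by low-rank-factoring the weight delta between the distilled and base checkpoints. A rough sketch, assuming two state dicts with matching keys; the lora_up/lora_down key names are illustrative, not any trainer's real schema:
[code]
import torch

# Hedged sketch of difference-LoRA extraction: factor (distill - base) into a
# low-rank product. Key naming here is illustrative only.
def extract_diff_lora(base_sd, distill_sd, rank=32):
    lora = {}
    for key, w_base in base_sd.items():
        w_dist = distill_sd.get(key)
        if w_dist is None or w_base.ndim != 2:  # only factor plain 2D matrices
            continue
        delta = w_dist.float() - w_base.float()
        U, S, V = torch.svd_lowrank(delta, q=rank)  # delta ~= U @ diag(S) @ V.T
        lora[f"{key}.lora_up"] = (U * S).contiguous()  # [out_dim, rank]
        lora[f"{key}.lora_down"] = V.T.contiguous()    # [rank, in_dim]
    return lora
[/code]
apply it at some strength on the target model and you get (approximately) the distill behavior as a lora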
>>
>>107242923
>>107242767
>>
>>107242931
wrangle it to i2v?
>>
>4 years and 4,000,000 generations later
>still shit and not worth saving
>>
>>107242956
fixing an error that it had
>>
File: ComfyUI_08601_.png (1.76 MB, 1152x1152)
>>107242593
That is very good for 8 steps and first try of pure txt2img on a Flash model. The eyes are a quirk of my prompting. The alternative with other models is slopness or not being able to follow my prompt at all.
>>
File: ComfyUI_08608_.png (1.61 MB, 1152x1152)
>>
File: ComfyUI_08599_.png (1.96 MB, 1152x1152)
>>
File: 1741707031146034.mp4 (3.88 MB, 496x736)
>the green frog takes out a cigar and zippo lighter from his pockets. he puts the cigar in his mouth and lights the end of the cigar with the lighter. he inhales then exhales a puff of smoke and smiles
meh. back to genning myself hugging hot sluts with wan
>>
>>107242805
It's still not there yet. While there's longcat and svi, they still rely too much on daisy chaining (see the sketch below). The best ones that do "1 minute" gens still suffer from janky movement every 81 frames; take this for example: https://www.reddit.com/r/StableDiffusion/comments/1oh4q3w/wan21_svishot_lora_long_video_test_1min/

There's a simpler method where you don't have to fuck around in another application:
https://github.com/princepainter/ComfyUI-PainterLongVideo
I tried two gens and didn't see any color burn, but it still suffers from the noticeable 81-frame jank.

Best is to use woct0rdho's radial attention/sage/sparge/triton (sadly at fixed dimensions), pusa loras, 245+ frames, and pray you don't oom, kek
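for reference, the naive daisy chain everyone means is just this: feed the last frame of each clip back in as the next i2v start image. Only one frame of context survives each boundary, which is exactly where the jank comes from. A sketch, where generate_i2v() is a hypothetical stand-in for whatever workflow/API you actually call and clips are lists of frames:
[code]
# Naive last-frame chaining sketch. generate_i2v() is hypothetical, a
# placeholder for your actual i2v call; it is NOT a real library function.
def chain_clips(first_frame, prompt, segments=4, frames=81):
    video, start = [], first_frame
    for _ in range(segments):
        clip = generate_i2v(start, prompt, num_frames=frames)  # hypothetical
        video.extend(clip[:-1])  # drop the boundary frame so it isn't doubled
        start = clip[-1]         # the single frame of carried-over context
    video.append(start)
    return video
[/code]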
>>
>>107243056
or use vace
https://markdkberry.com/workflows/research/#vace-22---extending-video-clips
>>
i can't even get a small enough output to post here with kandinsky
>>
>>107243071
Yes, vace can be good too. I've often found there are constant color shifts and inconsistencies (changing background items, body or facial feature changes). bbaudio's nodes seem to do a pretty good job with this, and the issues are less obvious:
https://github.com/bbaudio-2025/ComfyUI-SuperUltimateVaceTools/tree/main

There's the wan2.2 vace fun node he recently added, but man, it is slow switching between high and low noise
>>
File: DiscoElysium_00014_.jpg (1018 KB, 1256x1704)
>>107243091
ask chatgpt etc for a python script that converts video to under 4 MB
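something like this does the job, assuming ffmpeg and ffprobe are installed and on PATH; the video bitrate is derived from the size cap divided by the clip duration, and audio is stripped since board mp4s don't have any:
[code]
import subprocess

# Size-targeted re-encode: fit a clip under the 4 MB upload cap.
# Assumes ffmpeg and ffprobe are on PATH.
def encode_under_4mb(src, dst, limit_mb=3.8):  # leave headroom under 4 MB
    dur = float(subprocess.check_output([
        "ffprobe", "-v", "error", "-show_entries", "format=duration",
        "-of", "default=noprint_wrappers=1:nokey=1", src]).decode())
    kbps = int(limit_mb * 8 * 1024 / dur)  # MB cap -> average kilobits/sec
    subprocess.run([
        "ffmpeg", "-y", "-i", src, "-an",  # -an strips audio
        "-c:v", "libx264", "-b:v", f"{kbps}k",
        "-maxrate", f"{kbps}k", "-bufsize", f"{2 * kbps}k", dst],
        check=True)

encode_under_4mb("input.mp4", "output.mp4")
[/code]
two-pass encoding would squeeze closer to the cap, but single-pass with a maxrate is usually enough for a short clip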
>>
nah.. last time i asked chatgpt for help with anything i lost my whole 1tb of sandboxed games
>>
>>107243071
it still accumulates error
>>
>>
File: ComfyUI_08633_.png (1.77 MB, 1152x1152)
>>107243021
That was pure txt2img again. But imagine, if you will, a proper Chroma edit model.
>>
>>107243091
>can figure out video gen but can't figure out how to re-encode a video
>>
>>107243021
prompt?
>>
>>107243237
t2V? workflow? this result is not bad
>>
>>107243272
>Amateur photograph, split view of a young beautiful Japanese idol woman. Her dark hair neatly pulled back with wisps framing her face. She is dressed in traditional Japanese attire, featuring a flowing white top and a vibrant red pleated skirt. The left side shows a selfie of her face, she is smiling and doing okay sign. The background, softly blurred, shows what appears to be a traditional Japanese building with a dark roof and wooden structures, situated on a bright, paved ground. The right side closeup of only her legs, squatting with the skirt lifted, and panties visible

I engineered it to be exactly >>107243032
But it prefers to show her face in most gens anyway. Though I suppose it's a skill issue, because if I keep mentioning it in the prompt then it's more likely to show it.
>>
>>107243299
thx
>>
>>107243286
i2v from the example included in the git repo
>>
>>107243254
bro this sucks, i'm sorry
>>
>>107243379
>nogen
>>
>>
>>107243430
so far, i havent seen anything that would make me want to use this over wan
>>
>>
>>107243399
come on, the shit is blurry and blocky and looks like shit. if you like the aesthetic, good for you, ignore my post, but that is objectively a bad gen: her nails are dogshit, the chain goes nowhere, the pattern on the doors is insane, and what is even hanging there?
>>
Anyone got any chroma loras? Been kinda bored genning
>>
>>107243559
>Anyone got any chroma loras? Been Kinda bored genning
Your turn to train and share, giddy up!
>>
File: 1733913747226673.png (1.81 MB, 1344x1728)
which is more intellectual, lmg vs ldg?
>>
>>
>>
>>107243755
Snu snu
>>
>>107243755
>>107243653
Far worse than wan2.2. Russians are fucking stupid.
>>
>>107243794
it's a 2b model senpai
>>
>>107243794
>Far worse than wan2.2.
It's blurry and not very detailed, but I don't know if that's the model or just too few steps / too low a resolution.
>>
>>107243822
it is very low resolution but also it takes a long ass time for each of these.. over 6 minutes on a 5090.. generally WAN doesn't take that long even with 20+ steps on a much higher resolution
>>
>>107243844
Sounds like a DOA model.
>>
>>107243851
Does she come in a refrigerated case?
>>
>>
File: ComfyUI_00039_.mp4 (2.98 MB, 1024x1024)
same prompt as ^ but wan
>>
>>107243894
6 1/2 minutes

>>107244044
4 1/2 minutes
>>
>>107244053
Damn. Surely there will be optimizations, hehe.
>>
File: ComfyUI_00040_.mp4 (3.27 MB, 720x1280)
>>107244044
5 minutes 21 seconds, better resolution
>>
>>107242861
kek
>>
>>107244058
no one's gonna bother making optimizations if there's no reason to use it over wan
>>
>>107244044
>>107244075
You're prompting Wan at max res though. Try prompting Wan at lower or perhaps unintended res like you are for kandinsky, it's shit too (practically unusable on 3090 due to this). Kandinsky has way better physics knowledge than Wan.

Basically Wan has a total of two res:
720 x 1280 or 480 x 832

But even 480 x 832 is inferior to 720 x 1280.
Everything else looks like shit.

Kandinsky is probably similar.
>>
no, it sucks
24fps was a retarded decision
>>
>>107244075
Dude, still rocking Sabrina lora? I love slicks
>>
>>107244107
Ran baked this thread. What a sad little man.
>>
>>107244217
upgrade to 64 gigs ram
off load vaedecode to cpu
>>
>>107244397
There was a lora of her for hunyuan but I'm surprised there's no wan 2.2 lora.
So I'm gonna guess this is i2v.
>>
>>107244409
comfy is the only decent UI when it comes to perf and stability. but if you insist on the gradio UI, use neo forge or something. a1111 is out of date
>>
>>107244424
Sorry, I'm new.
Which schizo strawman is that?
>>
>>107244436
Because they didn't receive enough hugs as a child.
>>
File: 1761301899575172.png (290 KB, 460x405)
>>107244449
Okay nice talking to you
>>
>>107244451
Are you barfanon from /v/?
>>
>>107244460
Sorry, I'm new.
Which schizo strawman is that?
>>
>>107244471
What does it mean?
>>
>>107244485
Okay, nice talking to you.
>>
>>107244500
You fucked up by using avatar op
>>
>>107244546
Sorry, I'm new.
Which schizo strawman are you referring to?
>>
>>107244580
What level of schizo does it take to not just wait for a thread that's 1/3 complete to finish? Are you really taking this that seriously? We're in the middle of discussing tech stuff and you're derailing by making another thread?
>>
>>107244602
Hey sorry, but I'm actually new and have never baked on this board.
Where's the thread you're referring to and which schizo personality are you conflating me with?
>>
>>107244617
it's obfuscated. you can't tell me what it's sending but I can tell you it's sending data. maybe go fuck yourself and learn op sec
>>
>>107244632
Sorry but I'm confused.
Which schizo fantasy botscript am I reading right now?
>>
Dare I say all the drama is coming from bots?
>>
you must mean 3.5 if "released 9 days ago" is true
what is the prompt, anyways? and like what sampler / scheduler / etc are you using

no one is saying it's like perfect quite yet anyways but it's definitely annoying to see people dismiss the clear advantages of better architectures. That's how we wind up in this endless cycle of "when new thing" -> "new thing comes out" -> "not nearly enough people make any attempt to work on / with creating resources for it or training it more"
>>
File: Untitledgsgsdgsg-1.mp4 (3.66 MB, 1200x674)
Some styles are so crisp.
>>
>>107245010
I'm not using any artist tags, any recommendations? I am but a humble 1girl gooner trying to generate sexy pictures of smug-looking bitches, which is another limitation I'm running into: either it can't understand facial expressions very well or it can't generate facial expressions that differ from how a given character is usually depicted.
>>
>>107245010
Very cool
>>
File: wan2.2_00167.mp4 (814 KB, 848x480)
>>107245071
Suffering from the usual with loops sadly.
>>
any better alternatives to Local Dream for Hexagon NPU on Android? shit's not FOSS
>>
File: wan2.2_00169.mp4 (993 KB, 496x368)
"the video starts with showing an old crt tv which is displaying a news channel about a girl, the camera then quickly pans out and pans to the left showing a wide angle view of a warehouse facility with a group of villain goons and a man dressed as the joker sitting on a pony and they are all laughing while the pony is chewing on dollar bills."
>>
>>107245071
gonna try this out tomorrow, i got three more things to train today
>>
>>107245079
what about treason for chinese gold
>>
>>107245093
I'm using NetaYume Lumina v3 which was released ~9 days ago according to Civitai. I'm aware of SDXL's limitations, believe me.
Maybe it's just this one particular prompt that it's having trouble with, but the problems seem to boil down to a lack of diversity in training data rather than the strengths of the algorithm itself.
>>
>>107245118
huh, Res Multistep Linear Quadratic (this gen) looks way better than Euler Beta (last one) on the same seed
>>
File: wan2.2_00176.mp4 (1.06 MB, 480x528)
Man, ropes are difficult, huh.
>>
>>107245221
please get an Nvidia gpu with higher vram. With 8gb of vram, you will have annoying issues running normal fp16 non-lightning sdxl models when using hires fix and upscaling. make sure you have 32-64gb of either ddr4 or ddr5 ram anon.
>>
File: ComfyUI_00002_.png (117 KB, 512x512)
Repost from previous thread
Is there any way to remove the noise?
I trained it using Illustrious 0.1
>>
>>107245163
I mean the app, not models
>>
File: wan2.2_00178.mp4 (925 KB, 640x480)
>>
File: 12512616151251.png (95 KB, 1108x1116)
>>107245221
"controversial" data like that is something they don't allow you to generate without jailbreaking the model.
>>
>how well does it handle something like a penis?
haven't tried, assuming not well
>>
>>107245316
Anime penis works, real world penis no. This seems to be the case because "porn" is anti-Chinese, thus they have to censor it. So whatever CPC propaganda says can't be done, can't be done with an AI without bypassing security features.
>>
>>107245266
uhmmm whats this non-freedom nonsense??
>>
remember when comfy posted fennec girl with a bag of money after getting $17M in funding and ani was seething uncontrollably
>>
File: wan2.2_00185.mp4 (1.1 MB, 960x720)
"the woman is looking at the sea, to then turn her head slightly as she thinks she hears something, she turns her head fully and gets surprised and sits up straight then gets happy to see the viewer as she starts to wave her hand hello cheerfully to the viewer. the ocean waves crash calmly at the beach rocks.
oil painting style."

My proompt-fu is getting better.

>>107245282
Fair enough.

>>107245380
Free laptop, bro.
>>
>>107245428
actually no I won't contrarian faggot
>>
File: wan2.2_00187-1.mp4 (3.76 MB, 800x628)
"the camera moves up and forward into the distance revealing a lively futuristic cityscape.
abstract and colorful oil painting style."

Damn, haven't done any cityscape stuff before.
>>
I'm from /ldg/ - Landscape Diffusion General. >>107238591
I see our acronyms are the same and people can get confused.

Request for the baker of this general:
Please change the acronym to /odg/ - OSS Diffusion General to avoid confusion.
>>
>>107245489
Julien should hang xirself
>>
>>107245489
kys
>>
>>107245483
Reminds me of Planetside 2
>>
File: wan2.2_00192-1.mp4 (3.75 MB, 1000x752)
>>
>>107245563
Obsessed schizo.
>>
>>107245797
so true xister
when a retarded niggerfaggot starts annoying everyone, one should stay quiet and do nothing, like a good cuck
>>
>immediate pol schizo meltdown
I see.
>>
remember when an anon here, on /ldg/, posted the fast cancel for comfy and some little redditor reposted it, and then it was officially implemented by comfy
>>
File: wan2.2_00194.mp4 (2.58 MB, 720x928)
"the girl tilts her head up towards the viewer, looking at the viewer, she is full of despair. her skin is that of a cracked paint on an oil painting.
she holds a human skull.
colorful rough oil and watercolor painting style."

Shame, the cracked paint doesn't stick on her skin.
>>
File: wan2.2_00196.mp4 (2.35 MB, 960x720)
"the man is in despair seeing the broken wine bottles, he then bends down and crawls over to the broken wine bottles and starts to lick the wine up from the ground.
colorful rough oil and watercolor painting style."
>>
>>107241081
AMD isn't great. However, if you use rocm from TheRock you get much better speeds; the latest build pretty much cut my gen times in half compared to using zluda. So if you are content with subpar speed compared to Nvidia, it's a lot more viable than in the past.
>>
>>107246167
and?
>>
>>107247060
it's the anti-ani schizo
>>
lodestone said he figured out the reason why chroma did not learn artist styles, and it's already learning them quickly. he needed to train at full fp32
>>
>>107247110
And?
Who here isn't anti-ani?
>>
>>107247242
>chroma
*yawn*
>>
>>107247325
it's the best at complex nsfw stuff and at non-ai art styles. It's basically a local midjourney that can do nsfw
>>
>>107247350
but I only care about anime
noob and neta already cover it for me :)
>>
>>107247360
yea, those are specialized models trained specifically for that with half a million dollars worth of compute
>>
>>107247242
Is he making a finetune or what?
>>
>>107247378
hes grifting as usual
>>
>>107247378
he is still training it from what I know, he just had to get ramtorch working in order to train at full precision
>>
logs over the course of a few weeks

okay FP32 is a must when training a model
the difference is at the basin
bf16 struggled so hard at the basin convergence
you can still do bf16 compute
but the accumulator states has to stay in fp32
so that means the master weights, and optimizer states
grad can stay in bf16 because it's a short accumulator
Feffy — 11/9/25, 10:11 PM
so mixed precision then
Lodestone Rock — 11/9/25, 10:11 PM
yes
but the optimizer has to be in fp32
Feffy — 11/9/25, 10:12 PM
stochastic rounding not good enough?
Lodestone Rock — 11/9/25, 10:12 PM
nope
Feffy — 11/9/25, 10:12 PM
even with kahan summation?
Lodestone Rock — 11/9/25, 10:12 PM
nope
at the basin you want to remove as much noise as possible
so any form of compression is intolerable
you can do bulk compute at bf16 first
but at the final say 10% of training do what you must to make sure the precision is as high as possible
do it in fp64 if you have to

fp32 accumulator is important :catree:
Bunzero (hates VLMs)

— 11/11/25, 3:03 AM
I remain skeptical :furry_gigachad:
Lodestone Rock — 11/11/25, 3:03 AM
radiance suddenly learned a lot of artist tags within a day of training in partial fp32
Bunzero (hates VLMs)

— 11/11/25, 3:04 AM
can the universe let me be right at least once :crying_cat:
Lodestone Rock — 11/11/25, 3:05 AM
im going to make it train at full fp32 accumulator state
as soon i fixed the ram sharing issue
you really cant bargain with the accumulator
well at least we have tools to mitigate this issue
Lodestone Rock — 11/11/25, 3:07 AM
on 8x4090
just to rub the salt on the wound even more
cuz 8xh100 couldn't do it because i need to train it on full bf16
cuz there was no ramtorch back then :synth_derp~1:
Bunzero (hates VLMs)

— 11/11/25, 3:08 AM
can't or couldn't
couldn't :synth_derp~1:
Lodestone Rock — 11/11/25, 3:08 AM
engrish
but yeah
guys train your shit in fp32
you cant do it in NVFP4
you cant do jack shit in NVFP4 lol
Bunzero (hates VLMs)

— 11/11/25, 3:11 AM
>>
but how did OAI do it then
Lodestone Rock — 11/11/25, 3:11 AM
they dont
:synth_derp~1:
they have bajilion of b200
so during training
any long running accumulator has to stay in fp32
so that means master weights, and optimizer state
because those things are literally an integrator
and you know yourself that integrator will accumulate error over time
that's literally control theory 101
during the span of training you literally doing integration of model vector in the model vector field where the vector field is the loss landscape itself
Lodestone Rock — 11/11/25, 3:18 AM
so any non white noise error will cause drift
Talan — 11/11/25, 3:19 AM
wait lode, did you added more danbooru and e621 data to chroma radiance training?
Lodestone Rock — 11/11/25, 3:19 AM
no
the data are identical to the previous run
Talan — 11/11/25, 3:19 AM
i vaguely heard someone said something about it
or me schizoing :mpreg_hydra:
Lodestone Rock — 11/11/25, 3:20 AM
i said i'll add it if i managed to fix the ram sharing issues
some of the states are sharing the ram but not others
the master weights are being shared but for some reason grad is not
or at least that's what i believe is happening

guys i just tried overfitting flow model to one example using bf16
it cant overfit to details like at all

int8 vs bf16
it's official
you no longer need nunchaku
this works on any model
no need calibration
2-4x speedups
metal63 — 11/16/25, 10:16 PM
training? or just inference
Lodestone Rock — 11/16/25, 10:19 PM
should be both
but i havent integrated it to ramtorch backward
im making your consumer gpus as powerful as datacenter gpus
Aura — 11/16/25, 10:20 PM
sorry, it's been forever since i've poked my head in here, what's this?
Lodestone Rock — 11/16/25, 10:20 PM
imagine nunchaku
but for any model
and can be used for training
the speedup is about 2-4x
i need to give amd some love too
need to create kernel that works on amd too
because amd tensor layout is different
>>
File: chroma___0001.png (1.78 MB, 832x1216)
anon please stop this nonsense at once
>>
>>107247527
>you no longer need nunchaku
holy snake oil seller
>>
>>107247676
man, have you even tried ramtorch? this man is doing the real work. I don't doubt him
>>
>repost bot spam is back
>>
>>107247242
and what was the reason?
>>
>>107247616
What is she even eating? Roasted seaweed dipped in some sauce? Is she a single celled organism filter feeding? Who the fuck "eats" that
>>
>>107247759
did you read what you responded to? or the log after it? Not training at full precision
>>
>>107247779
oh, but why would training at full precision suddenly fix the tags, given the model wasn't trained on something insane like fp4 or whatever, and given that the model didn't learn absolute shit when it comes to artist tags during its entire long training run?
would more precision really give it that much more capacity for knowledge being packed in within the same sized model?
>>
>>107247808
its all here:
>>107247507
>>107247527
>>
>>107247808
basically he had to train at full bf16 because that was what there was to work with. He didn't realize until later that the accumulator weights need to be fp32, or else noise in the form of rounding keeps the model from learning past a certain point / past a certain level of accuracy. Now he has been working on ramtorch to make it possible to train at mixed precision, and to train models with a fraction of the vram needed without speed loss. And within a single day chroma radiance started learning stuff it refused to learn at only bf16, like artist tags
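in code, the scheme from the log is roughly the classic mixed-precision loop: bf16 forward/backward, but fp32 master weights and fp32 optimizer moments so the long-running accumulators don't lose small updates to rounding. A toy sketch assuming PyTorch; the Linear is a stand-in for the real network, this is not lodestone's actual code:
[code]
import torch

model = torch.nn.Linear(16, 16).bfloat16()  # stand-in for the real net
# fp32 master copy of every parameter; the optimizer (and its moment
# buffers) only ever sees these, never the bf16 weights.
master = {n: p.detach().float().clone().requires_grad_(True)
          for n, p in model.named_parameters()}
opt = torch.optim.AdamW(master.values(), lr=1e-5)

def train_step(x, target):
    loss = torch.nn.functional.mse_loss(model(x), target)  # bf16 compute
    loss.backward()
    with torch.no_grad():
        for n, p in model.named_parameters():
            master[n].grad = p.grad.float()  # grad is a short accumulator,
    opt.step()                               # so bf16 -> fp32 here is fine
    opt.zero_grad()
    model.zero_grad(set_to_none=True)
    with torch.no_grad():
        for n, p in model.named_parameters():
            p.copy_(master[n])               # round back down to bf16
    return loss.item()
[/code]
the point of the log: the update math and moment buffers live in fp32, and only the weights the forward pass sees get rounded to bf16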
>>
>>107247937
also he said that for testing he tried deliberately overfitting a model at just bf16 and it was impossible to do so because of the bad precision, which explains the small detail issue

thing is, no one else, not even the big ai firms with their own code, tried training this way at this scale before him, so he is learning as he goes
>>
>>107247835
>>107247507
>>107247527
if this improves lora training quality too, i think a good idea would be for him to collaborate with ostris, who already partially implemented ramtorch for training loras in ai-toolkit, so they can properly implement something that works well enough to publish something marketable online and get a lot of eyes on this
>>
File: wan2.2_00212.mp4 (2.05 MB, 640x912)
"the woman turns 180 extending her left arm behind her and faces the camera as she extends her arm holding the katana and points the katana towards the viewer with an extreme up-close shot of the katana's tip."
>>
>>107247507
lol, lmao even

I am one of the "anti Chroma schizos" who, literally months ago, posted a breakdown of the many mistakes Chroma was making during training. One of the top things I pointed out was how using pure bf16 and stochastic rounding was fucking retarded and he should just use mixed precision training like everyone else. At least he finally came around, even if it took $150k flushed down the drain first.

Now let's see if he realizes all the other things that are wrong with the Chroma training setup.
>>
File: flux_0137.png (1.47 MB, 832x1216)
>>107248138
noice
>>
>>107248138
"controversial" data like that is something they don't allow you to generate without jailbreaking the model.
>>
>>107248339
huh, Res Multistep Linear Quadratic (this gen) looks way better than Euler Beta (last one) on the same seed
>>
>>107248308
post desu link or lying
>>
File: wan2.2_00217.mp4 (1.48 MB, 480x480)
"the camera zooms in very fast to the end of the hallway while twisting the camera. very fast and intense motion."

>>107248339
I love getting surprised by how good some stuff looks; the reflections are amazing.
>>
>>107248454
I'm using NetaYume Lumina v3 which was released ~9 days ago according to Civitai. I'm aware of SDXL's limitations, believe me.
Maybe it's just this one particular prompt that it's having trouble with, but the problems seem to boil down to a lack of diversity in training data rather than the strengths of the algorithm itself.
>>
>>107248477
have you seen his training data? it's about as diverse as possible; that is not an issue at all there. That shit is already the most style-diverse model there is atm. The issues are small details and it not learning past a certain point, which were apparently due to bf16 rounding errors
>>
>>107248455
What level of schizo does it take to not just wait for a thread that's 1/3 complete to finish? Are you really taking this that seriously? We're in the middle of discussing tech stuff and you're derailing by making another thread?
>>
>>107248454
https://desuarchive.org/g/thread/104885523/#104888771
>>
use already baked thread when done
>>107237888
>>107237888
>>107237888
>>
>>107248512
comfy is the only decent UI when it comes to perf and stability. but if you insist on the gradio UI, use neo forge or something. a1111 is out of date
>>
>>107248531
well you got me, you should have told him lol
>>
>>107248531
Damn. Surely there will be optimizations, hehe.
>>
>>107248537
it is very low resolution but also it takes a long ass time for each of these.. over 6 minutes on a 5090.. generally WAN doesn't take that long even with 20+ steps on a much higher resolution
>>
>>107248551
come on, the shit is blurry and blocky and looks like shit. if you like the aesthetic, good for you, ignore my post, but that is objectively a bad gen: her nails are dogshit, the chain goes nowhere, the pattern on the doors is insane, and what is even hanging there?
>>
>>107248537
No.
>>
>>107248618
>can figure out video gen but can't figure out how to re-encode a video
>>
though it shouldn't be a 'waste'. He can still just resume training with the accumulator at fp32. Only the time spent after he had 'maxed out' the accuracy bf16 could achieve would be a waste
>>
File: wan2.2_00220.mp4 (1.09 MB, 480x672)
"the camera pans in slowly as the cat walks up to the man and leaps onto his head and sits down on his head while the man reacts to the cat while holding a cigarette."

Damn, this was a cool one, first gen too.
>>
So I'm trying to into comfyui and tried this node and workflow here:
https://github.com/regiellis/ComfyUI-EasyIllustrious
Is it typical that it's just midwittery spaghetti json "code" where there are like 12 different pre- and post-processing effects that don't do anything or are even directly in opposition to each other?
Or am I just using the wrong node/workflow? sd next feels so much better out of the box
>>
File: wan2.2_00222.mp4 (598 KB, 704x480)
"the man, adolf hitler, points at the viewer with his hand and finger, then does a thumbs up as he smiles."

Cool, it doesn't warp the face.
>>
>>107248725
why
why the FUCK would you do this
base comfy has all the nodes needed to start out.
>>
>>107248780
Okay thats why I'm asking. Because it seemed retarded to me as I was doing it but I was just following LLM slop.
>>
>>107248823
just check the OP (1girl guide) it has a lot of basic workflows to start out.
>>
File: flux_0193.png (908 KB, 832x1216)
>you know what???? I"M GONAN SPLIT DA THRED
>>
barf
>>
File: meincumpf.png (19 KB, 717x143)
comfy is based
>>
File: wan2.2_00226-1.mp4 (3.76 MB, 1200x772)
"a group of camels walk across the desert as a massive fire and smoke rages behind them in the distance, heavy winds, fast motion."
>>
>>107249418
Based on what
>>
>>107249687
Python
>>
retard here, why are outputs with dmd/lightning lora better? Shouldn't the image get better with more compute?
>>
>>107248531
What do you think about the results from here https://civitai.com/models/2093591 where the lora description says you can use the qwen image edit lightning lora on the basic qwen image model instead, to kinda fix the low seed variety that qwen image has?

Makes the images a little grainy but seems to work. I guess the edit model's lora changes the base model enough to add seed variety, but not so much that it destroys the output, given the two models are similar.
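mechanically the trick amounts to merging the edit model's lora delta into the base model's matching weights at some strength; it only works because the two checkpoints are close cousins. A hedged sketch, with illustrative key names rather than any loader's real schema:
[code]
import torch

# Merge a LoRA into a checkpoint: W <- W + strength * (up @ down).
# The .lora_up/.lora_down key naming is illustrative only.
def merge_lora(model_sd, lora_sd, strength=1.0):
    for key, w in model_sd.items():
        up = lora_sd.get(f"{key}.lora_up")
        down = lora_sd.get(f"{key}.lora_down")
        if up is not None and down is not None:
            delta = up.float() @ down.float()
            model_sd[key] = (w.float() + strength * delta).to(w.dtype)
    return model_sd
[/code]
turning strength down is how you'd trade the graininess against the added seed variety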
>>
>>107249418
what 'native' block swap nodes are they referring to? kija's?
>>
>>107249717
cringe
>>
>noooo you can't use blockswap
> please make more all in one node packs with 90% useless nodes instead
>>
>>107249418
Which blockswap is he talking about?
>>
>>107249751
>why are outputs with dmd/lighting lora better
they are?
>>
I don't understand why anyone needs a block swap node anyway. UnetLoader from MultiGPU already has an option for putting in how much ram you want swapped.
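for anyone unclear on what these options actually do: block swap parks the transformer blocks in system RAM and shuttles each one onto the GPU only for its own forward pass. A naive sketch with PyTorch hooks; real implementations pipeline the copies so transfers overlap compute instead of stalling it:
[code]
import torch

def _upload(module, args):
    module.to("cuda")  # just-in-time upload for this block's forward

def _evict(module, args, output):
    module.to("cpu")   # immediately evict the weights back to system RAM

# blocks: an iterable of torch.nn.Module transformer blocks
def enable_block_swap(blocks):
    for block in blocks:
        block.to("cpu")  # start off-GPU; activations stay on the GPU
        block.register_forward_pre_hook(_upload)
        block.register_forward_hook(_evict)
[/code]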
>>
why would I swap my ram? my ram works fine! I can't afford to swap ram every gen!
>>
File: wan2.2_00232.mp4 (296 KB, 512x480)
"man, adolf hitler, is playing a video game holding a game controller in his hands, he lets go of it with one of his hands and points to the left laughing as he stomps his leg."
>>
>>107250114
INTERPOLATE WITH FILM VFI NIGGEEEEEEEEEEER
>>
>>107250128
no
>>
>>107250128
That takes longer than the gen, I'm just going through folders.
>>
>>107249967
yes at least for XL
>>
>>107250128
*gimm-vfi
>>
>>107250215
no, film vfi has better physics in its interpolation, basically topaz level for 16 to 32 fps interpolation
>>
>>107249418
wtf is this from
>>
>>107250278
sounds like something a nigger would claim
>>
cool it with the racism, buds. take that to X the racist app.
>>
Is a future with a UI that doesn't have 30GB of python dependencies possible?
>>
>>107250469
We are continuing to investigate this issue. In the meantime we recommend you use AniStudio.
>>
>>107250205
you might be the only one here who feels that
>>107250469
maybe in a decade
>>
>>107250469
shhhh don't say it out loud or the comfyorg goons will derail the thread. there is an anon working on it though
>>
File: wan2.2_00237.mp4 (432 KB, 480x480)
"the cartoon man is dancing. the text "IT'S AN ABSTRACT KIND OF FEEL" remains throughout the video."

Why am I the only one posting gens?
>>
>>107250469
Incoming rust port. It's 30GB+ but it's memory safe.
>>
>>107250469
just buy more storage until we get AGI to fix this issue unironically, nothing else can
>>
>>107250501
>Why am I the only one posting gens?
Sorry I'm training right now
>>
>>107250501
i post my gens in the real thread
>>
>>107250511
storage is going up in price, as are memory and vram. the future sucks



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.