/g/ - Technology


Thread archived.
You cannot reply anymore.




Even Comfy Himself Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106988458

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
THREE MORE YEARS OF SDXL
>>
>>106991205
>https://gumgum10.github.io/gumgum.github.io/https://huggingface.co/neta-art/Neta-Lumina
fix this link and separate them
>>
File: ComfyUI_23220_.png (749 KB, 1280x720)
>>
>>106991228
How did you achieve that crappy camcorder look? Is it a lora or just some prompting?
>>
>>106991232
it's this
https://civitai.com/models/1134895/2000s-analog-core
>>
how do I create hyperslop
>>
>>106991238
use a mix or merge created in the last six to nine or so months
>>
File: ComfyUI_00316_.png (1.07 MB, 912x1144)
>>
>>106991233
oh yes I fucking love grainy analog y2k look, slop me more bra
>>
>>106990996
that's just for part of it in the most recent version though, it's probably not a big deal. Mixed NLP / tag captions are generally what you want for this kind of model anyways.

>>106991062
that's not gonna happen lmao, it would take an enormous amount of degradation given the text encoder itself is far superior to CLIP
>>
File: ComfyUI_00301_.png (1.06 MB, 984x1064)
>>
>>106991252
>that's not gonna happen lmao, it would take an enormously huge amount of degradation given the text encoder itself is far superior to CLIP
tbqh i think it's saying something considering the model still retains a lot of its original knowledge even after extensive training on anime
>>
File: ComfyUI_00324_.png (979 KB, 1024x1016)
>>
I wish Qwen was nearly 1/5th as good as Chroma
>>
>>106991261
yeah, the realism that must come from base Lumina isn't degraded much at all, you can bring it back pretty easily with boomer prompts
>>
I wish Chroma wasn’t 1/5 the resolution of Qwen
>>
File: ComfyUI_00311_.png (1.08 MB, 1248x832)
>>
>>106991268
KEEEEEEK chroma really was trained at 512x512 in 2025. embarrassing!
>>
Does anyone else notice Yume suffers from duplications at resolutions higher than ~1400px? Need controlnets ASAP.
>>
>>106991282
Or just a non shit model
>>
>>106991282
Well yeah, clearly it’s not trained above that resolution. Happens with SD1.5 above 768 and SDXL above 1200
>>
>>106991282
not really, I gen at 1536x1536 with it all the time. Even higher every now and then. Could depend on your artist tags though possibly
>>
File: 1750753560885489.mp4 (948 KB, 1280x720)
>>106991228
>>
File: ComfyUI_23225_.png (771 KB, 1280x720)
>>
File: 1756984080829792.png (637 KB, 1758x693)
i'm using a bunch of different face detailers in my workflow, but i think i would be getting way better results if i took the detected area, resized it, inpainted it at a higher resolution, and then downscaled it. is there a clean way to do this? would simply resizing the whole image before and after work well?
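A minimal sketch of that crop-upscale-inpaint-downscale loop in plain Python, assuming a PIL image, an integer bbox from your detector, and an `inpaint_fn` standing in for whatever detailer / img2img pass you actually run:

[code]
from PIL import Image

def detail_region(image, bbox, inpaint_fn, target=1024, padding=64):
    """Crop the detected area with context, inpaint it at a higher
    resolution, then scale it back down and paste it in place."""
    x0, y0, x1, y1 = bbox
    # Expand the box so the sampler sees some surrounding context.
    x0, y0 = max(0, x0 - padding), max(0, y0 - padding)
    x1, y1 = min(image.width, x1 + padding), min(image.height, y1 + padding)
    crop = image.crop((x0, y0, x1, y1))
    # Upscale only the crop, not the whole image, to the model's native res.
    scale = target / max(crop.size)
    big = crop.resize((round(crop.width * scale), round(crop.height * scale)),
                      Image.LANCZOS)
    fixed = inpaint_fn(big)  # placeholder: your img2img / detailer pass
    out = image.copy()
    # Downscale back to the crop's original size and paste over the source.
    out.paste(fixed.resize(crop.size, Image.LANCZOS), (x0, y0))
    return out
[/code]

Resizing only the crop rather than the whole image is cheaper and leaves the original untouched for detection, so it shouldn't disturb your bbox step.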
>>
>>106991297
IDK about Neta Lumina 1.0 but Yume is supposed to have been multi-res trained at between 768 and 1536.
>>
File: AnimateDiff_00001.mp4 (1.49 MB, 480x848)
>try out the double loras for an old gen

Jesus christ, why is it so bad?
>>
>>106991302
>Could depend on your artist tags though possibly
Without a doubt, now that I think about it. Still, I would love cnets so I can at least gen at a lower res and choose which gens I throw through a second pass. Like other models with higher-than-XL native res, it's a cool feature, but I much prefer "highres fix"-ing instead.
>>
How much longer until local reaches midjourney levels?
>>
File: ComfyUI_07801_.png (2.25 MB, 1152x1152)
>>106991191
Chroma really being carried by prompt engineering here. Can't do this at all if I describe
>Lifeless body of a man
And descriptions like that just give me body horror.

But then changing that to
>sleeping man, who lays with his arms spread, eyes closed

Is much closer to what I want even if not perfect (I wanted her to hold the axe, but then it can't depict it unless I have her standing there alone).

Chroma truly is all about prompt engineering and that's why Plebbitors are sleeping on it.

>>106991193
It's down, but prompt was
>Amateur flash photograph capturing a striking and adventurous beautiful young Japanese idol woman, embodying a mix of fierce determination and ethereal beauty, squatting low in a shadowy woodland clearing at night beside the sleeping man, who lays with his arms spread, eyes closed, extended across the leaf-strewn ground with a faint glimmer of crimson catching the camera's harsh light. She grips a katana, its sharp blade prominently displayed, with dried blood on it, and held in a manner both triumphant and solemn, as if to mark a rite of passage in this rugged outdoor expedition. She has long, dark hair with heavy bangs covering her forehead, and seems to be wearing makeup that creates a tired or distressed look, with smudged eyes and possibly pale skin. She gazes directly into the lens with wide, intense eyes enhanced by subtle makeup—her expression a magnetic blend of quiet pride, melancholy, and idol-like poise—while her chilled cheeks flush with the effort. Her attire is as dark as her: A maid dress, with ripped stockings. The backdrop fades into a veil of dense trees and tangled undergrowth barely touched by the abrupt, brilliant flare of the flash, suggesting the vast obscurity of the woods beyond. The overall scene radiates themes of survival instinct, primal empowerment, and the uncanny allure of an idol transformed into a huntress under the stark, unflinching glow of a nighttime capture.
>>
>>106991315
have you tried just using two KSamplers where everything is exactly the same except for the denoise strength, with an upscale model in the middle? Should work fine at like 0.3 - 0.4 strength, that's how I upscale with Neta sometimes. Unless your reason for using controlnet tile was solely to save memory
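The same two-pass idea in diffusers terms, as a rough sketch; the model id, step counts, and the plain resize (standing in for an upscale-model node) are all placeholders:

[code]
import torch
from diffusers import StableDiffusionPipeline, StableDiffusionImg2ImgPipeline

prompt = "whatever you genned the first pass with"

# First pass: ordinary txt2img.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16).to("cuda")
base = pipe(prompt, num_inference_steps=28).images[0]

# Middle step: upscale in pixel space (a plain resize here; an
# ESRGAN / DAT upscale model in practice).
big = base.resize((base.width * 2, base.height * 2))

# Second pass: identical settings, low denoise so it only re-details.
img2img = StableDiffusionImg2ImgPipeline(**pipe.components)
refined = img2img(prompt, image=big, strength=0.35,
                  num_inference_steps=28).images[0]
refined.save("upscaled.png")
[/code]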
>>
>>106991342
>>106991224
>>
>>106991345
You can thank T5 for that. The thing wants extremely literal prompts or else it'll misinterpret them.
>>
>>106991342
It has, MidJourney isn't even close to the top of any benchmark chart that exists anywhere
>>
>>106991355
i think that's the point he's making. local has been shit for the past few years and will be for the next several, while API is constantly raising the bar every week
>>
>>106991355
There are no valid charts for image models
>>
File: ComfyUI_23230_.png (747 KB, 1280x720)
>>
>>106991360
SDXL came out in late June 2023, Flux came out in August 2024.
>>
>>106991354
Just requires intuition about what works and what doesn't. It can depict two people together perfectly, and I have confirmed that thanks to the POV experiments, so after that you just guess what words it wants and where it wants them.
>>
>>106991346
I'm sure it would, but I prefer using latent upscale, which requires a high denoise and thus cnets. Pixelspace upscaling is often not terrible, but latent is superior hands down.
>Unless your reason for using controlnet tile was solely to save memory
No, just because I think latent is much better.
>>
>>106991355
> top of any benchmark chart
like hynuan 3.0?
>>
all midjourney gens rook same same though. sometimes its okay but often it ruins it.
>>
>>106991370
latent upscale is just using traditional dumb algos to increase the size before you move into the next KSampler, it's not superior in any way to using a purpose trained ESRGAN / DAT / etc model to do the exact same thing, really it's worse by all accounts. I frankly don't understand what you mean.
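Concretely, the two paths being argued about, sketched against a diffusers-style AutoencoderKL (`latent` and `vae` are assumed to be in scope; latent scaling-factor bookkeeping is omitted):

[code]
import torch
import torch.nn.functional as F

# `latent` is the (B, 4, H, W) output of the first KSampler.

# (a) "Latent upscale": interpolate the latent itself with a dumb algo.
latent_a = F.interpolate(latent, scale_factor=2, mode="bicubic")

# (b) "Pixel upscale": decode, upscale the image (plain bicubic here,
# an ESRGAN / DAT model in practice), then re-encode through the VAE.
with torch.no_grad():
    image = vae.decode(latent).sample
    image_up = F.interpolate(image, scale_factor=2, mode="bicubic")
    latent_b = vae.encode(image_up).latent_dist.mean
[/code]

Path (a) skips the VAE round trip but interpolates in a space at roughly 1/8 the pixel resolution, which is part of why it needs a higher denoise on the second pass to clean up.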
>>
>>106991388
For one, needing to translate to and from pixel space isn't lossy... so just based on that it's better. Also the use of cnets allows the user more control over how the second pass holds to or departs from the original image. Subjectively, any denoise lower than 0.4 is pointless anyway.
>DAT
Desu the best out of the bunch but still not as good as latent when doing comparisons.
>>
>>106991397
>isn't lossy.
*isn't lossless
>>
Give me one good reason why training with synthetic content is a bad thing
>>
>>106991307
Are you using comfy or some variation of forge/webui/etc? What you described is the inpaint behavior in webui if you have "masked area only" set. It scales the area to whatever resolution you have specified, and you can set a padding to bring in more context around the inpainted region
>>
>>106991397
>needing to translate to and from pixel space isn't lossy
nta but the absolute best upscaling workflow imo would be training DAT but exclusively on VAE degradation. traditional latent upscaling methods aren't great because they're, like the other anon said, using dumb algos like bicubic, nearest, etc. with the very low resolution of the actual latents, this often hurts details more than it helps. the true endgame would be using a model similar to DAT but in latent space, but this would require a much much more powerful arch due to the very low resolution of the latents.
>>
>>106991370
>>106991397
You're relying on intuition. Empirically it's better to upscale the raw image, not the latents. Yes it requires one extra pass through the vae, but that isn't as lossy as you think
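If anyone wants to put a number on that loss instead of eyeballing gens, a quick sketch (again assuming a diffusers-style AutoencoderKL and ignoring the latent scaling factor):

[code]
import torch
import torch.nn.functional as F

@torch.no_grad()
def vae_roundtrip_mse(vae, latent):
    """Cost of the one extra decode/encode cycle a pixel-space
    upscale adds, measured as pixel-space MSE."""
    image = vae.decode(latent).sample           # what you'd upscale
    latent2 = vae.encode(image).latent_dist.mean
    image2 = vae.decode(latent2).sample         # after the round trip
    return F.mse_loss(image2, image).item()
[/code]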
>>
>>106991423
>>106991428
Perhaps. I have done direct 1:1 tests (on XL to be clear) and pixel space has always fucked the outputs. Again, sure, it's often not terrible, but latent's superiority virtually jumps out of the screen at me.
>dumb algos like bicubic, nearest, etc.
I wish Comfy had that aliased latent upscale that whatever Forge fork has.
>but that isn't as lossy as you think
This is especially true with *Lumina models but, again, I have done tests, and the benefits of latent far surpass those of pixelspace.
With Chroma I was surprised at how well it holds an image when doing pixelspace upscale second pass, but even that still falls apart when you push the denoise up to anything close to .7.

>>106991428
What is the downside to cnet support, regardless of mine and your points? I don't see a reason to NOT have them desu.
>>
>>106991345
Hunyuan 3. Yeah, about those benchmark rankings...
>>
File: dbhgzdhbgzdfhbzd.png (11 KB, 336x254)
>>106991469
>I wish Comfy had that aliased latent upscale that whatever Forge fork has.
this?
>>
File: 1752290634499048.png (19 KB, 716x252)
>>106991412
im using comfy right now. resizing the whole image beforehand works, however it seems to mess with bbox detection.
>>
>>106991477
Yes, and that bicubic antialiased.

>>106991388
>>106991423
>traditional dumb algos
Is there a problem with Bislerp? I only use that.
>>
>>106991469
>pixel space has always fucked the outputs
hasn't been the case in my experience. if anything latent upscale often introduced more artifacts for me.
>>
Name one thing chroma does better than other models
>>
>>106991489
I think often the problem lies in one's cnet settings and prompt. It's a bitch to dial in (especially with some models) but once one does, it's like magic.
>>
>>106991484
Use case for latent upscaling?
(I jumped into this conversation just to help jog your memory i have no idea whats going on im running on 2% brainpower but i wanna see where this goes)

i ran some upscales with latent bicubic antialiased and it looks really good
>>
>>106991495
Its the only model that has a built-in noise filter
>>
>>106991508
>>106991397
Even if the loss is minimal, the logical approach is to minimize it as much as possible, as in not translating at all. It is admittedly less now with models like Flux, Lumina, and other modern archs compared to the shit that is XL's. But still, it's there.
For past models, it was most apparent in the colors and high-noise details. Even with a souped-up external VAE.
>>
sell me on using Qwen
>>
File: 00041-962605011.png (2.17 MB, 1792x1024)
>>
>>106991556
You can use the analog lora and pretend its chroma to trick anons into thinking chroma is actually good
>>
>>106991556
it's like chroma but worse in every way
>>
>>106991556
highest param open image model ever released
>>
>>106991572
Post a chroma guitar with 6 strings and 6 pegs
>>
>>106991556
it's like chroma but better in every way*
*very bad seed variety and no nsfw
>>
>>106991579
best i can do is a 1girl
>>
>>106991577
broski, your hunyuan 3 80B?
>>
File: 1733973356023155.jpg (802 KB, 1336x2008)
>>
File: 521545215448.jpg (1.11 MB, 873x873)
>>106991355
>>106991342
Local caught up around Flux. That's when its LoRAs were really up there.

For realism, MJ is currently not that good. Pic rel are four MJ gens made not too long ago. SDXL tier crap (though you could argue it's better than SDXL all you want, it's still not Flux tier).
>>
File: file.png (842 KB, 764x823)
>>106991264
finally... untooned if it was good
>>
>>106991600
do peter griffin
>>
>>106991601
>>>/r/
>>
>>106991607
it wasn't a request
>>
File: 1756524604760972.png (2.69 MB, 3000x1680)
>>106991224
SDXL be like
>>
Trying the latent upscale from an anon from before. It won't work for using the same image as last frame, I guess this is because the low noise now has the upscaled resolution, but am I not feeding it the upscaled resolution?
>>
File: 1732399560434934.png (3.77 MB, 3727x965)
https://noamissachar.github.io/DyPE/
slop in 4k let's goo!
>>
>>106991704
chromaxysters... we won...
>>
>>106991694
Oh, I was talking only about images. No idea for videos.
>>
File: 1733362971554566.png (129 KB, 1625x507)
https://xcancel.com/bdsqlsz/status/1981610051422040067#m
new cope soon(TM)
>>
>Keeps models loaded onto vram even after closing
>Logs prompts and send them for """telemetry"'" purposes
>Will soon be closed source
Tell me again why Comfy is good?
>>
>>106991738
>Logs prompts and send them
me when I lie
>>
>>106991712
Shit. Well I got it to not error by doing pic related. But the genned result is just a static image.
>>
ComfyUI Hijacks your phone and sends your dick pic to Comfyanon himself
>>
>>106991746
but i already do that myself
>>
File: 00062-3075492450.png (1.81 MB, 1792x1024)
>>
>>106991753
>>106991566
interested in recipe
>>
>>106991738
Why is Comfy such a promptlet that he needs to steal other people's prompts?
>>
File: 00071-4070013988.png (2.21 MB, 1792x1024)
>>106991813
i'll try, but share places are getting retarded...
>>
File: 00068-4070013985.png (2.1 MB, 1792x1024)
>>106991813
gettem while they're hot
https://litter.catbox.moe/9alwt7ad0aziq1r9.png
https://litter.catbox.moe/fhji583fokthr8h8.png
>>
>>106991704
Looks insane
>>
Wansisters, long vid 2.2 is here

>State-of-the-art text-to-video models excel at generating isolated clips but fall short of creating coherent, multi-shot narratives—the essence of storytelling. We bridge this "narrative gap" with HoloCine, a framework that generates entire scenes holistically to ensure global consistency from the first shot to the last. Our architecture achieves precise directorial control through a Window Cross-Attention mechanism that localizes text prompts to specific shots, while a Sparse Inter-Shot Self-Attention pattern—dense within shots but sparse between them—ensures the efficiency required for minute-scale generation. Beyond setting a new state-of-the-art in narrative coherence, HoloCine develops remarkable emergent abilities: a persistent memory for characters and scenes, and an intuitive grasp of cinematic techniques. Our work marks a pivotal shift from clip synthesis towards automated cinematic storytelling.

https://github.com/yihao-meng/HoloCine
https://huggingface.co/hlwang06/HoloCine/tree/main/HoloCine_dit/full
https://holo-cine.github.io/
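The "dense within shots, sparse between them" attention the abstract describes is at heart a block mask over the token sequence. A toy illustration of that pattern, not their code (a real implementation would also keep some cross-shot links for the persistent character/scene memory they mention):

[code]
import torch

def intra_shot_mask(shot_ids: torch.Tensor) -> torch.Tensor:
    """Attention mask that is dense within a shot and blocked between
    shots. shot_ids maps each token to its shot index, shape (seq,)."""
    return shot_ids.unsqueeze(0) == shot_ids.unsqueeze(1)  # (seq, seq) bool

# e.g. three shots of four tokens each:
mask = intra_shot_mask(torch.tensor([0] * 4 + [1] * 4 + [2] * 4))
[/code]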
>>
File: 1746197977693273.jpg (727 KB, 1336x2008)
>>
Reminder the next release of Wan is already showing to be better than Sora2
>>
>>106991556
qwen image is boring and has terrible seed rng variety.
>>
>order 96gb of ram because it's the only thing in stock and other stores have no date for restock
>meant to be delivered yesterday
>got delayed till monday
>get an email now that it's out of stock

But searching again led me to a 192gb pack that's completely in stock and arrives monday.

What a blessing.
>>
What is the Jeets recommendation for a good image model?
>>
>>106991871
It needs the new code to work properly, kijai is supposedly working on it at the moment.
Finally 5 sec slop will stop, been waiting for this for like a year.
>>
>>106991871
>no audio+video combined generation.
fuck off with this bullshit.
>57.2gb
dead in the water, even ovi would have better potential community support if properly integrated with comfy, wan2gp and neoforge.
>>
>>106991930
>gen 10 minute video
>prompt completely fails 6 minutes in
5 seconds will always be superior
>>
File: latentupscale_00008.mp4 (3.68 MB, 720x1200)
>>106991742
I can't get around this it seems.
I guess using a last frame doesn't work when low noise is starting from step 0 with less than 1 on denoise?
>>
>>106991945
Can't tell if trolling or legitimately braindead.
>>
File: 1748853342899229.jpg (784 KB, 2000x1336)
>>
File: 00078-97005244.png (1.59 MB, 1792x1024)
https://youtu.be/9HwCNiUtYv4
gn
>>
File: latentupscale_00004.mp4 (1.99 MB, 720x1200)
>>106991953
Compared to just using first frame. Stuff actually happens and the latent upscale is working.
>>
>>106991845
got, tyvm
>>
>>106991976
I went back to the original workflow and hooked up one single thing and it just works..
I shouldn't be doing these things after waking up and desperately needing to take a shit.
>>
File: 00023-4065159245.png (1.89 MB, 1248x1824)
>>
>>106991921
Are you asking in order to avoid it?
>>
Why is there so many gay loras for Chroma?
>>
File: 1755744657747127.jpg (729 KB, 2000x1336)
>>
>>106992093
because the chroma creator is a gay furry (I'm not joking)
>>
File: image_00296_.jpg (353 KB, 1240x1672)
>>
File: 1744848557854756.png (490 KB, 1080x720)
can you do this shit with 16gb vram? i stopped paying attention to new models after flux because i was already pushing the limits of my card
>>
File: 00041-1838771304.png (2.43 MB, 1536x1536)
>>
>>106992190
You can do that on 3GB of vram
>>
>>106992190
That example looks untrustworthy. Qwen-E is good at preserving text style and combining images but a restoration like that seems out of its reach.
>>
>>106992190
>>106992214
Can I use it to make nudes (of adults)?
>>
>>106992235
Ask >>106992209
>>
File: 00046-563496202.png (2.41 MB, 1824x1248)
>>
>>106991883
Of course it is, they upgraded to SaaS for Wan2.5 which is why they were able to compete
>>
>>106992294
retard
>>
>>106992209
i love me some plastic
>>
wan2.5 will be local just like mogao, only two more weeks of waiting!
>>
File: image_00300_.jpg (424 KB, 1240x1672)
>>
>saastech so powerful it let Wan skip over 2.3 and 2.4
It’s no surprise local is so far behind, SaaS must be literal magic
>>
>>106991883
>the next release of Wan is already showing to be better than Sora2
you mean wan 3.0?
>>
>>106992271
I tried qwen image edit but the workflow says it needs more than 16gb vram and indeed it did not work
>>
>>106991736
let's hope it won't be another slopped shit this time, wake the fuck up chinks and stop training your models with synthetic data
>>
File: 00065-1699447175.png (2.51 MB, 1248x1824)
>>
File: file.png (319 KB, 450x532)
>>106991601
the homer one isnt ai its just an old thing someone by the name of pixeloo made, they called it untoons
>>
>>106992386
ltx 2 seems way better anyway and that's confirmed to be open source in november and running on consumer gpus, alibaba can suck it.
>>
>>106992615
the ltx guys always give the distilled shit model though no?
>>
>>106992235
Yes, with the clothes remover lora.
Go back a few threads for a link.
Hopefully it still works.
>>
>>106992615
It’s also a western model, and western models are better quality than chinese slop. It’s just we rarely get weights without bullshit attached
>>
>>106992492
currently running a qwen image edit on my 16gb card, amd at that
so you're on several layers of skill issues here
>>
>>106992638
>It’s just we rarely get weights without bullshit attached
when was the last time we got a non distilled western model? lool
>>
Why is shitjai still paying attention to those absolute dogshit svi loras?
The new holo finetune seems orders of magnitude better, guess he's too dumb to work with different and complex code when he vibecodes with claude.
>>
>>106992649
was sd3 distilled? sd3 also had bullshit attached with the license though. maybe sd cascade or sdxl
>>
>>106992653
You should do it since you have everything figured out
>>
>>106992660
Don't need to till so much autistic finngolian
>>
comfyui and forge should switch names
>>
File: QwenEdit_00222_.png (1.24 MB, 912x1144)
>>106992190
>>106992229
I just ran it through qwen edit no loras because you had me curious.
Prompt:
adjust the color of the image to a realistic photo
>>
File: F73bqw3.png (280 KB, 657x829)
>>106992725
input
>>
>>106991736
he didn't say when it will be released?
>>
File: 00074-2234657624.png (2.81 MB, 1248x1824)
>>
man I love the pornmix plastic sloppa
>>
File: hindenburg.png (2.2 MB, 2144x1000)
not too bad honestly
a bit slopped but eh
>>
>>106991845
>>106992002
Fuck, I missed it at work
Reup please?
>>
when will local reach this level of kino? >>>/wsg/6008898
>>
File: 00092-1659676425.png (2.1 MB, 1248x1824)
>>
File: 1740643033570938.mp4 (616 KB, 832x480)
>>106991871
>HoloCine
16 seconds is hype. I'm staying optimistic until I run this myself
I'm already bored of video without audio now though
>>
File: babby_loicense.png (1.06 MB, 1400x620)
>>106992855
>>106992725
>>
File: 00101-166946628.png (2.98 MB, 1824x1248)
>>
File: old_photo.jpg (34 KB, 436x550)
>>106992190

manual edits with 2GB ram
>>
File: 00103-3793658223.png (2.86 MB, 1248x1824)
>>
>>106991871
looks kino desu
cumfart when????
>>
>>106992977
Kijai seems to be struggling with the implementation at the moment
>>
File: 233595738.jpg (600 KB, 1664x2432)
>>
File: ComfyUI_00094_.png (957 KB, 1168x888)
>>
>>106991694
dude don't bother. the output is slopped and the background is grainy. anon must've been trolling
>>
>>106991738
none of those are true, julien
>>
>>106993062
No it's working for me. The quality is equivalent to going 720p on low noise, but the motion is enhanced.
>>
File: 00110-2218708856.png (2.62 MB, 1248x1824)
>>
>>106993079
I'm looking at your workflow and both samplers are genning at 704p, no?
>>
>>106993118
Oh ignore that one, I wrote in a later post that I went back to the original one and it's working, but it's not quite as good for last frame, weird things happening.
>>
File: 00118-2949417064.png (2.38 MB, 1824x1248)
>>
>>106991736
Oh god, if it's good enough to kill chroma I'm all for it.
>>
>>106991704
LET'S GET DYPED UP DYPER BROTHERS
>>
>>106991736
>trusting this faggot when he said the same about the new wan model
LOL
>>
File: file.png (5 KB, 429x50)
>>106991704
uhmmm sisters??? this doesnt look right
>>
>>106993259
>he got it wrong one time out of 1000 therefore we shouldn't trust him anymore
meh, he still has a great ratio though
>>
>>106993282
the wan rugpull left a deep scar man
>>
>>106993290
When ltx 2 gets out wan is basically dead anyway.
>>
So I take it neta lumina isn't good with text
>>
>>106993299
meh, I saw some ltx 2 videos, the sound is atrocious
>>
>>106993299
doesnt really look better visually, we'll see how it trains and how it holds coherence, but wan 2.2 is pretty good for physics actually and for cartoony art styles already

the 4k 50fps long generations are good on paper but mean little if the videos genned ultimately look like 720p "upscales"

although its obvious ltx was trained very heavily on veo 3, given it copies its voice styles very closely, so at least we will have veo 3 mini at home for ok audio and video gen

and we will also see about speed compared to wan
>>
>>106993299
from the few clips I've seen it looks really slopped, maybe for I2V it will be good though, I want my own I2V grok meme generator at home >>>/wsg/6009078
>>
The based chinks are waiting for someone to btfo them hard before they finally have to pull out the trump card of just saying fuck IP "rights" and training on the entirety of youtube they must have been scraping all this time, like sora 2 did, plus all movies and cartoons ever made, to finally get a huge boost in model quality and knowledge.

They don't want to do it too soon because if they put out a great model that knows all popular media:
1. IP "rights" holder companies will put large pressure on China to shut it down.
2. They will have no more trump cards until they can make their own gpus which wont be for a couple more years and everyone else will be able to train on their models while adding their own advancements, leaving China to follow behind

So by always having this extra aspect of being able to train on copyrighted media, they have a reasonably big leeway to do whatever and always be able to add the extra high quality copyrighted dataset spice to get juust near the top of the list of good gen ai models
>>
When will the based chinkoids finally release a vram monster
I'm rooting for the insects
>>
>>106993371
>The based chinks
I'll call them based the day they really do train their model on the entirety of youtube like OpenAI did
>>
>>106993379
What matters is releasing good models, and wan 2.1 was a huge jump they didn't need to release and that won't be matched any time soon; there was no pressure in that space from anyone else. hunyuan was okish but still very much a toy
>>
So how does bucketing and batch size work together?
I am training with a batch size of 2. I have some buckets with odd number of images.
I have 65 images, no repeats and 10 epochs. This should give me 325 steps, based on batch size 2.
Judging by the fact that I have 360 total steps, I am guessing the training script is doing some steps with batch size 1 to compensate for the odd numbered buckets.
The question is, does this have an adverse effect on the training quality? Should I manually resize or use higher bucket steps like 128?
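The 360 vs 325 gap is exactly that: each bucket is batched separately, so every odd-sized bucket contributes one trailing batch of 1 per epoch. A sketch with hypothetical bucket sizes (your real split will differ):

[code]
import math

batch_size, epochs = 2, 10
buckets = [9] * 7 + [2]  # hypothetical split of the 65 images

steps_per_epoch = sum(math.ceil(n / batch_size) for n in buckets)
print(steps_per_epoch * epochs)  # 360

# Your 325 assumed 65/2 divides evenly; one big bucket would actually
# give ceil(65/2) * 10 = 330 steps. 360 total implies
# (360/10) * 2 - 65 = 7 odd-sized buckets, i.e. 7 batch-1 steps per
# epoch. Those batch-1 steps just weigh their samples slightly more
# per optimizer step; it's usually noise-level, not harmful.
[/code]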
>>
>>106993389
>hunyuan was okish but still very much a toy
Tencent has the balls to put nudity on their models, but Wan has more competent engineers, unfortunately :(
>>
>>106993397
Wasn't the Wan 2.1 chinese project page memed for having coombait women in their literal cherry picked examples at the beginning?

Also its very good at everything NSFW with any NSFW lora.
>>
>>106993412
>Wasn't the Wan 2.1 chinese project page memed for having coombait women in their literal cherry picked examples at the beginning?
I remember that, it was a fake website unfortunately kek
>>
>>106993415
I was here the entire time and don't remember it being exposed as a fake website, i dont feel like going through the archives and my second point still stands to disprove the censorship part
>>
>>106993422
>my second point still stands to disprove the censorship part
your second point destroys your initial argument though >>106993371

you said "you want NSFW, just do a lora bro" but at the same time you want IP shit on the base model, why can't we respond to that "you want IP characters on the model? just do a lora bro"
>>
https://xcancel.com/maxescu/status/1981416100303950309#m
it's so fucking plastic, why do they all train their model on synthetic shit, I'm going craaazzyyy, only OpenAI doesn't do that
>>
>>106993371
>based chinks
first, they're not based at all. second, they can't beat sora 2. or maybe in 5 years kek
>>
File: 1737452916818240.png (224 KB, 487x467)
>>106993467
>https://xcancel.com/maxescu/status/1981416100303950309#m
>flux chin
DOA
>>
>>106993475
>they can't beat sora 2. or maybe in 5 years kek
they'll never beat sora 2 if they keep training their model on synthetic data, one day they must learn that they can't cheap out on the dataset, it'll always be the most important thing on deep learning, period
>>
>>106993443
When mentioning the censorship i meant its not censored against nsfw, given that it so easily learns any nsfw concept in lora training.

With IP characters, it doesn't learn them as fast and they are not the same type of data to expect the model to be able to generate compared to nsfw because there is a difference for a company to train heavily on youtube and when asked about IP rights say "oh well we trained on everything like sora 2 did" versus them training on a huge dataset of literal porn.

And im not saying a model should be limited in the data its trained on even when it comes to porn given the anatomy benefits, its just that training on porn for a model company is not something that we can almost ever really expect to happen, meaning when it comes to the discussion of censorship, what that really means is that as long as the model is not specifically trained against genning nsfw or it gets lobotomized to the tier of sd3 so it cant even generate women, then that is good enough of a sign that the model's core wasnt "censored"/lobotomized.
>>
>>106993498
>there is a difference for a company to train heavily on youtube and when asked about IP rights say "oh well we trained on everything like sora 2 did" versus them training on a huge dataset of literal porn.
it's way more dangerous to train on IP, you can piss off anime artists, celebrities... OpenAI is getting some heat recently because of that, copyright is something serious, really serious
>>
File: UHZht5g.png (486 KB, 1024x576)
>>106991306
Not bad. Can you do NTSC artifacts like cross-color, dot-crawl, dot-hang?
>>
Divine axioms of diffusion:
1: SaaS is years ahead of local
2: China mogs the west
Therefore it’s easy to understand why Wan stopped releasing local models.
>>
>>106993396
>So how does bucketing and batch size work together?
Some say it has an effect, but using gradient checkpointing should negate it, and that's always on for me so I haven't even thought about it. Might be worth testing out.
>>
File: why.jpg (247 KB, 2418x513)
why is comfy ignoring the openpose controlnet?
>>
>>106993529
OpenAI did push the overton window, but I don't believe this was their intention, they just wanted hype by showing their model could do Will Smith playing ping pong against 2pac, ff7 style and shit, they know what people like, so they bait by letting them do copyright shit for like one week and then switch to stay safe, they did this on 4o and dalle3 as well, I'm NOOOTICING the pattern at this point

but hey, everything that pushes the overton window in the right direction is welcome, even if it's not being done intentionally
>>
File: 00144-2446407782.png (2.64 MB, 1248x1824)
>>
>>106993549
>trannymai
good
>>
>>106993549
you need to go back >>106970615
>>
>>106993529
>Although it's all inevitable, thankfully.
there will be a long fight before it's normalized though, I don't believe copyright companies will give up that easily
>>
>>106993412
>Also its very good at everything NSFW with any NSFW lora
lol no. it's ok at best if you stack half a dozen loras and fuck around with strengths
>>
>>106993549
cute
>>
File: 1730797376385699.png (473 KB, 750x1000)
https://github.com/bytedance-fanqie-ai/MoGA
Make OpenSource Great Again!
>>
>>106993798
Either jump on the API train or get run over by it, API is the future.
>>
Do Chroma and Qwen share workflows? Or would I need to set up different nodes for each one? Do they work similarly to Flux Kontext?
>>
>>106993825
just check the default templates, retard.
do you know how to breathe?
>>
>>106993825
>Does Chroma and Qwen share workflows
no, i mean you wouldn't use different nodes but you would use different settings
>>106993816
this is the local diffusion general. fuck off
>>
>>106993798
>more buttdance scraps
There is not a single thing they released that is actually useful. Bytedance literally only releases garbage
>>
>>106993853
>Bytedance literally only releases garbage
to be fair, they seem to only have made failures, Seedream 4.0 is the only successful model they have lol
>>
>>106992993
Where can i read "Mien Comfyui" ?
>>
>>106993837
you can load apis on applications like comfy retard.
>>
File: 00160-2305585121.png (2.58 MB, 1824x1248)
>>
>>106993798
>Make OpenSource Great Again!
not thanks to free poop models of jewdance
>>
>>106994005

I can't generate that video. Try describing another idea. You can also get tips for how to write prompts and review our video policy guidelines.
>>
>>106992993
>>106993965
Nevermind I forgot the mongol is only interested in i2v and control shit, he's still wasting time with the useless svi loras.
Probably won't even work on the holocine implementation, which it's insane considering long gens are what everyone has been waiting forever.
>>
>>106994023
Man I am not even trying to be a "hater" but can you look at your gens for longer than 2 seconds before posting them here?
She has like 8 fingers in her right hand.
>>
>>106993301
Use NetaYume, not the original Neta Lumina, if you aren't already. They can do it decently enough, DPM++ 2S Ancestral Linear Quadratic seems to give the most consistently good results for it. Particularly long text support definitely isn't as strong as in e.g. Flux or Qwen though.
>>
>>106994089
i2v does a lot of the heavy lifting for getting a satisfactory gen, so it's understandable, though not desirable.
>>
File: ComfyUI_00605_.mp4 (1.56 MB, 880x1176)
>>106994090
>he doesn't have 8 (6) fingers on his right hand.
>>
anyone know how i can set up image-to-3d?
>>
The new lightx2 loras from a couple days ago (yesterday?) seem quite good. Just running them at 1 strength. I guess there's still some slowmo.

>>106994264
What do you mean by 3d? Do you want to make a 3d model or do you want to make a 3d video that rotates around the subject?
>>
>>106994275
yes i want a 3d model. i use sparc 3d now, but it takes forever to get a turn
https://huggingface.co/spaces/ilcve21/Sparc3D
>>
>>106994284
Idk about that model specifically but if you have a decent GPU you can just try cloning their repo and running it locally. Lots of those example apps on huggingface can just be cloned and run locally.
>>
File: FLX_0044.png (3.51 MB, 1080x1920)
>>
>>106994295
i have no idea how to set this up. i just set up a text-to-image generator once using A1111
>>
File: FLX_0047.png (3.34 MB, 1080x1920)
>>106994311
Flux sometimes just kills ittttttt
>>
any work on low step Lumina models? 30-50 steps is too much
>>
>>106994131
Using Yume with comfy's default workflow, consistently fucks up on a short phrase
>>
File: 00201-1225490963.png (2.9 MB, 1248x1824)
>>106994090
on sdxl, hands are very difficult to get right, especially when prompting for complex poses with foreshortening and combat involved. adetailer can't fix all the aspects of bad hands and fingers.
>>
File: 1576977637700.jpg (218 KB, 1280x960)
i regret testing sora 2. it's hard to go back to mute videos now. and it isn't like we have the best mute video models anyway
>>
Any tips for prompting WAN2.2 so it doesn't zoom in randomly? I feel I hit this more than slowmo nowadays
>>
>>106994394
prompt in chinese. works 110%
>>
File: FLX_0054.png (3.36 MB, 1080x1920)
>>106994339
>>
File: 00210-1157773276.png (2.12 MB, 1248x1824)
>>
>>106994410
Don't know if you're trolling me or not but will try it out lol
>>
>>106994445
not even kidding. give it a whirl
>>
>>106994443
realism >>>>>>>
>>
>>106994229
is this a good plan for an application?
>>
>>106994467
thats called uncanny valley desu
>>
File: FLX_0064.png (3.47 MB, 1080x1920)
>>106994552
not saying its bad. jus pref.
>>
>>106993798
Hopefully its not another dead project that'll never release their model. Speaking of released models, wonder if Kijai or anyone knows that Rolling Forcing is already out https://huggingface.co/TencentARC/RollingForcing/tree/main/checkpoints
>>
>>106994452
Did not work, trying another gen without any loras to see if there's any weird interaction fucking up.
Or maybe I suck at prompting
>>
File: bgkorit91xwf1.png (13 KB, 846x213)
>>
File: ComfyUI_00139_.mp4 (479 KB, 640x640)
>>106994655
>>
>>106994512
better than cumfart at least
>>
File: i3281w.jpg (1.44 MB, 1600x1600)
>>
File: file.png (17 KB, 916x152)
>>106994655
based ani
>>
>>106994773
wtf i love julien now
>>
>>106994655
What is this platform? Just a slop character interaction ui? Can the avatars change?
>>
>>106994734
>>106985727
>>
>>106994826
there it is ;)
>>
>>106994829
glad i could help o/
>>
>>106994823
civitai pony v7 comment section replies
>>
File: AnimateDiff_00001.mp4 (753 KB, 352x512)
Trying the 2.2 distilled loras, not too shabby.
>>
>>106994253
yes I love 1-2-3girls laughing at me. Wheres the laughing at me gens?????????
>>
I prefer "girls zapping me with magic lightning bolts" personally although that's more of a video prompt.
>>
>>106995080
thanks for the idea kind anon, ill make some kino zapping 1girls!
>>
>>106995092
Looking forward to it
>>
why is chroma full of shitty gay loras, it's sad
>>
What the fuck, I can't open workflows in comfyui anymore, but opened the last one 5 minutes ago. Did not update, just restarted.
Did this shit update by itself silently or what?
> [DEPRECATION WARNING] Detected import of deprecated legacy API
>>
>>106994378
Try the specific sampler / scheduler combo I mentioned
>>
>>106995123
create good straight loras. you can train, right?
>>
>>106995133
(samefag) woops I should have mentioned also, around CFG 4.5 to 5.5 is best.
>>
File: wrwrwrwrrwr.jpg (106 KB, 1024x1024)
>>
File: arf2.jpg (105 KB, 1024x1024)
>>
>>106994951
>>
>>106995173
can you make it of anime girls
>>
>>106995125
>deprecated legacy API
>comfy deprecating all API nodes
based
>>
is 5090 worth it
>>
Giving video gen a shot, I downloaded wan2GP, 32gb system ram + 16gb 5070.
Are 5s/step on the 1.3B t2v model expected or am I doing something wrong?
>>
>>106995229
if you are thinking of a 5090 in terms of "value" then no. Like all high-end hardware (speakers/cameras/headphones/whatever) it isn't about value, it is about how much you enjoy owning high-end shit and seeing the marginal advantages.
>>
Kijai-Sama, please, I can only test so many models

>holocine

https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled/tree/main/T2V/HoloCine
>>
>>106995229
not really. the VRAM helps fit models but the speed isn't much better than a 4090
>>
>>106995276
>t2v
but I want i2v
>>
File: 00009-1000121469.png (2.98 MB, 1152x1440)
>>
>>106995223
it's comfyui manager and rgthree that give these warnings
>>
what is the prompt to consistently remove all people in the scene while keeping the viewpoint unchanged in wan i2v?
>>
Can an anon catbox a decent NetaYume workflow?
>>
File: 1744256748672062.png (1.68 MB, 1280x960)
qwen image, analogcore 2000s lora
>>
>>106995386
I could
>>
>>106995386
he did in previous
>>
>>106995125
Try --disable-api-nodes
>>
>>106995386
here, I dedicate to you my first gen of the day
>>
>>106995386
There is nothing special with Yume workflows desu.
>>
File: QwenImage_Output_6266433.png (2.04 MB, 1584x1056)
>>
>>106995438
there is nothing special with yume
>>
>see a cool lora on civitai
>early access and you need to pay for it to download it
>>
>>106995407
>long dick general
>>
>>106993412
> Also its very good at everything NSFW with any NSFW lora.
as long as nsfw is not genitals or sexual acts
>>
>>106995479
?
https://civitai.com/user/LocalOptima/models
>>
File: AnimateDiff_00001.mp4 (489 KB, 416x480)
>genning funny reaction images
>results are complete shit with cartoon stuff
>find funny baby
>have it act like a footballer witnessing a goal
>turns out great
>continue with other images
>start contemplating in the back of my mind
>realize what people can do with photos of kids
>truly realize
>am aware of the realization

Local needs to be banned.
>>
>>106995407
oh yeah broi gimme the grain and analog oh yeah I love shitty photos that remind me of crappy cameras ohb yeah bro i can feel the soul bro ycamcorder bro yeah bro give it to me bro
>>
>>106994452
apparently it's either the light or the fusion loras that add the movement, without them camera stays static, but quality becomes ASS
>>
>>106993467
The model is probably cucked too, but at least we got a hypothetical Flux video.
>>
>>106995517
the problem bro is that you're non-white bro
>>
tfw my wife will never launch lightning bolts at me
>>
>>106993549
rgb -> bgr
>>
>>106995507
And that's why I dont put personal stuff online anymore.
Bringing back printed family albums
>>
>>106995530
and that's a good thing, i'd hate to be a minority like a nigger
>>
I am not even gonna give a (You) to that fucking redditor
>>
>>106993371
They don't have leeway to do anything. US/ClosedAI could do it, but not them, plus copyright holders would attempt to charge them double the tax.
>>
File: 00013-795533095.jpg (904 KB, 1536x1920)
>>
>>106991495
Porn
and easier lora training
>>
>>106995611
He can't train unfortunately
>>
File: file.png (2.03 MB, 1328x1328)
>>106995530
whiter than you, post hand
>>
>>106995660
kek
>>
>>106995676
>>106995676
>>106995676
>>106995676
>>
>>106993371
Anon, stop being delusional, we don't even have open-source T2I models with dalle3's level of pop culture knowledge, so for video it's a given that it's "never ever" in that regard.
Chinks don't care about having a video model with trillions of parameters that "knows everything", they just want a model that performs well enough on benchmarks while being small enough to run on their gpu-embargoed datacenters
>>
>>106993523
Unsure if specifics like that were trained into the lora, I did attempt to prompt for extra haloing / rainbowing but it didn't do anything.



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.