/g/ - Technology

3 x 80 Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106719267

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106722132
I'd like to speak to the manager.
>>
Blessed thread of frenship
>>
>>106722132
cringe coomer collage
>>
>80
>billion
>>
File: 1739047480245991.png (1.43 MB, 1024x1024)
a massive billboard with this character is visible on a building in Akihabara, Tokyo during the day.

I like the wrapping effect.
>>
File: ComfyUI_07130_.png (1.96 MB, 1152x1152)
Imagine some kind of finetuning breakthrough and then 80B parameter
HunyuanImage Chroma. Haha, that would be insane.
>>
>>106722204
Are you running it locally?
>>
What is CoT in the instruct model?
>>
>>106722132
My gen is the Chroma one, the one in the middle. I feel honored, thank you for supporting my bitten nipple fetish.
>>
File: Wanimate_00042.mp4 (990 KB, 960x544)
>>
Absolute retards thinking a moe model works like dense.
Active is like 13b or 14b, you'll be able to run that in 24gb easily. Even on lower shitty cards with low quants.
You can easily do a dynamic brain damage quant and fit everything else in ram and it won't make much difference in quality or speed.
lmg CHADS rise up.
>>
File: ComfyUI_07145_.png (2.48 MB, 1152x1152)
>>106722211
>3x80GB locally
I don't think anyone can. China has to hurry up with their GPUs. If they're soon looking to shit on Nvidia, I think that might be the appeal of releasing such a large model.
>>
goofs wen
>>
File: ComfyUI_07146_.png (2.28 MB, 1152x1152)
>>
>Before release
>Looks shit, I don't even want it
>released
>OMG GOOFS WHERE I NEEEEED IT.

Fuck off.
inb4 goomba.
>>
Can someone explain to me why https://github.com/FlyMyAI/flymyai-lora-trainer managed to train just fine within my 24GB of VRAM while https://github.com/ostris/ai-toolkit goes over by ~6GB?
Same lora rank, same quant settings (8bit), unloading the text encoder in AIT, same image size (1024). What the fuck is going on there?
Also would I be losing anything by using the former as opposed to the latter? First time training a lora for any model so not quite sure how to go about it. 3090 if that matters.
>>
>>106722244
>If they're soon looking to shit on Nvidia,
lmao dude the stuff they make now is so ass it's not even close.
>>
>>106722256
You are referencing two different kinds of posters
>>
>>106722259
Could be a few things: the rank, batch size, gradient accumulation, or whatever optimizer flymyai uses. That's something you can tweak in the settings.
That being said, I wouldn't recommend diffusion pipe over anything else unless you have more than one GPU, in which case you should absolutely use it.
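If you want to eyeball where the VRAM actually goes, here's some napkin math. Every number is a placeholder assumption (model size, rank, optimizer), so plug in whatever you're actually training:

# rough LoRA-training VRAM breakdown; illustrative guesses, not
# measurements of either trainer
GB = 1024**3

base_params = 12e9      # assumed base model size, change to match yours
lora_params = 50e6      # depends on rank and which layers get adapters

frozen_8bit  = base_params * 1      # frozen weights quantized to 8-bit
frozen_bf16  = base_params * 2      # same weights if a trainer keeps bf16 copies around
lora_weights = lora_params * 2      # bf16 trainable weights
lora_grads   = lora_params * 2      # bf16 gradients
adamw_states = lora_params * 8      # two fp32 moments per trained param

print(f"frozen base (8-bit): {frozen_8bit / GB:.1f} GiB")
print(f"frozen base (bf16) : {frozen_bf16 / GB:.1f} GiB")
print(f"lora + grads + opt : {(lora_weights + lora_grads + adamw_states) / GB:.2f} GiB")
# activations are the wildcard on top of this: batch size, resolution and
# whether gradient checkpointing is on can easily swing several GB

Point being, the trainable part itself is tiny; a ~6GB gap between two trainers with "the same settings" almost always comes from how the frozen weights are held and what activation/checkpointing defaults each trainer uses.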
>>
>>106722256
The only way to test the model is to run it yourself nigga. I don't believe benchmark images one bit
>>
File: ComfyUI_07147_.png (2.03 MB, 1152x1152)
>Absolute retards thinking a moe model works like dense.
Is it a moe model?
Either way, from my understanding you still need enough VRAM to load the entire model even if there's only a few active parameters.
>>
File: ComfyUI_temp_dknmz_00020_.png (2.19 MB, 1152x1728)
>>
>>106722274
I said inb4 goomba.
>>
>>106722281
https://x.com/tencenthunyuan/status/1972130405160833334
READ NIGGER READ
>>
File: ComfyUI_07153_.png (1.94 MB, 1152x1152)
>>106722238
>>106722281
Forgot to quote
>>
>>106722281
In its defense, and I am playing devil's advocate here because I really doubt it will be the case: you can usually offload a MoE with less of a speed hit compared to a dense model.
>>
File: ComfyUI_07159_.png (1.89 MB, 1152x1152)
>>106722299
Yeah but that doesn't really change what you need to inference it anon
>>
>>106722259
AI Toolkit is notoriously bad at optimizing vram use, I don't even know if it supports any offloading at all

Best trainer for speed if you need any offloading (model doesn't fully fit in your vram) is OneTrainer
>>
does nag not work with chroma flash or are my settings just wrong? any time i use a flash model, or the delta weights, or a flash lora, the entire image looks like it's been jpegified
>>
>>106722319
It does, you seem more retarded each time you open your dirty mouth.
With moe you just need to fit the actual active part in vram, the rest can be offloaded and even quant to a different size.
You don't know shit about anything and talk without even reading the sources, the fuck are you even trying to engage here?
>>
File: ComfyUI_07163_.png (2.11 MB, 1152x1152)
>>106722339
Example catbox? Something might be off in it, or maybe it really doesn't work.
>>
tfw found a new artist to train on
>>
File: 2_banner_all.jpg (3.92 MB, 3847x5000)
So this is the power of 80b. A model that looks worse than flux for realism, worse than illustrious for anime and uses gpt generated synthetic data to train its text outputs. Amazing.
>>
File: vibevoice.jpg (50 KB, 1099x153)
>>106721831
i was able to use 1.5B to gen something, but when i tried the large (7B) i'm guessing i ran out of memory, even though i have 24gb vram. the 1.5B doesn't sound that great, but it's able to preserve accents from the sample which is awesome
>>
>>106722191
did they steal sora's content or something? i hope they at least added some nsfw fun
>>
>>106722353
nta, but I think you're being overly optimistic about how feasible this model will be to run at any speed that won't make you want to kill yourself.
>>
>>106722360
>A model that looks worse than flux for realism
No, it clearly doesn't

That said for such a huge model it does look unimpressive
>>
File: ComfyUI_07164_.png (1.84 MB, 1152x1152)
>>106722360
I agree, they would never give us for free what is strictly from a hand curated dataset. Though it does appear to have some slight hints of using real photos as reference, at least for cinematic stuff (so it's comparable to Seedream 4). The one area that is clearly superior is prompt following and text, given it's autoregressive. An image edit model based on this would probably be pretty good, and there's a good chance they'd open source it.
>>
File: Wanimate_00043.mp4 (827 KB, 960x544)
>>
>>106722394
Nah, he's right, it's good for realism, but it's not as good as Krea or Chroma.
>>
>>106722290
You didn't say it you typed it
>>
>>106722362
7B works fine on 24GB, I use it all the time. Make sure the sample voice file is 3 minutes max, a bit over 3 minutes is when it'll OOM on 24GB.
>https://pastebin.com/raw/f2ibMSGf
Simple single speaker wf.
Use https://github.com/diodiogod/TTS-Audio-Suite, the other extensions for VibeVoice are no good.
>>
>>106722414
looks like shit
>>
>>106722417
>3 minutes max
Is there a benefit to using samples that long?
>>
>>106722419
But enough about you.
>>
Comfy give me smea dy ++ plox
>>
File: 1748599270941883.jpg (1.44 MB, 2000x2599)
>>106722360
imagine paying 10000 dollars for a gpu to get this shit lmao
>>
>>106722441
Yes, the cloned voice will better resemble the original and the cadence of the speech will be more accurate. Make a 30 second sample and a 2 minute sample, then compare the outputs. The 2 minute will usually (if not always) sound better.
It's also better to split up your samples by emotion. Don't mix whispers with neutral speaking, or angry/yelling with upbeat/happy. Have characterX_angry.wav and characterX_neutral.wav for example, then use them accordingly for best results.
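If you want to bulk-trim raw recordings down to that ~3 minute ceiling, here's a stdlib-only sketch (the *_raw.wav filenames are just placeholders, and it only handles plain PCM wavs):

import wave

MAX_SECONDS = 3 * 60  # the ~3 minute ceiling before 24GB OOMs, per above

def trim_sample(src, dst, max_seconds=MAX_SECONDS):
    # copy src to dst, truncated to max_seconds
    with wave.open(src, "rb") as r:
        params = r.getparams()
        keep = min(r.getnframes(), int(r.getframerate() * max_seconds))
        frames = r.readframes(keep)
    with wave.open(dst, "wb") as w:
        w.setparams(params)       # frame count gets corrected on close
        w.writeframes(frames)

# one clip per emotion, picked per-gen as described above
trim_sample("characterX_neutral_raw.wav", "characterX_neutral.wav")
trim_sample("characterX_angry_raw.wav", "characterX_angry.wav")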
>>
uh oh localkeks are seething! meanwhile the rest of us can enjoy hunyuan 3.0 uncensored through API nodes
>>
>>106722456
Just take another mortgage on your house, totally worth it
>>
>>106722463
>uncensored
post oneeshota futa pegging or gtfo
>>
>>106722115
They've had well over a year to make it better though
>>
>>106722463
>uncensored
API
kek
>>
>Silveroxides removed all his speed loras that were better than Flash
Why
>>
File: 1748285656514546.png (64 KB, 336x150)
>>106722191
80 billions parameters!
>>
>>106722484
moved into a new folder
>>
>>
>>106722489
Where? His Chroma Loras repo only has the flash loras.
>>
>>106722496
https://huggingface.co/silveroxides/Chroma-LoRAs/tree/main/flash-heun
>>
I think people expecting this model to be trimmed down to a manageable size and still be worth using are huffing obscene amounts of copium.
>>
>>106722504
Are you retarded or just illiterate?
>>
>>106722487
Not even a gorillion
>>
What's the best sampler/scheduler/steps combo for Chroma HD?
>>
>>106722512
lcm/karras 75 steps
>>
>>106722512
res_multistep, beta, 50
>>
File: 1741184122881217.png (97 KB, 1437x778)
>>106722191
>240 gb of vram
kek, you literally need 10x3090 cards to run this shit
>>
>>106722506
i heard chodestone will de-distill it and retrain it at 256x256
>>
>>106722524
That's the 16bit model, right?
>>
>>106722528
and his version will take twice the time as the original to gen 1 image
>>
>>106722529
yes, bf16, that means you'll need 120gb of vram to run Q8
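Weights-only napkin math, before activations, the text encoder and everything else pile on top:

# weight memory for an 80B-param model at different precisions
params = 80e9
for name, bits in (("bf16", 16), ("Q8", 8), ("Q4", 4)):
    print(name, round(params * bits / 8 / 1024**3), "GiB")
# bf16 ~149 GiB, Q8 ~75 GiB, Q4 ~37 GiB; real usage lands well above
# the weights line once the rest of the pipeline is loaded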
>>
>>106722516
Blurry as fuck
>>106722517
That actually looks great
>>
>>106722417
thanks ill try this out, did an install and its saying some things are missing ill figure it out tomorrow
>>
https://xcancel.com/TencentHunyuan/status/1972130405160833334#m
>noo you don't get it anon, we need 80b parameters to get decent images
those engineers are living on another planet or something?
>>
I'm glad every new model is getting larger and larger while also getting worse and worse
>>
localbrowns seething at hunyuan for pushing the tech forward instead of seething at nvidia for keeping hardware behind. there is nothing wrong with 80b, an H200 can run this fine. SOTA models like Seedream and GPT are equally as big. what do you expect, endless 12b flux clones that all look like shit? they tried that with hunyuan 2.1 and nobody cared.
>>
File: 1732763641637071.png (48 KB, 622x115)
>>106722550
they trained this shit on 5 billion images, damnnnnnn
>>
>>106722548
Needs sage installed, since it speeds up gens. You can set attention_mode to auto and it'll still gen, only slower. Only extensions needed are tts suite and rgthree.
>>
>>106722524
Can any richfags test the pretrained version just to check if that version is unslopped?
>>
>>106722560
>5 billion openai generated synthetic image-text pairs
Yeah, it shows.
>>
>>106722550
>those engineers are living on another planet or something?
For the last four years anyone working in AI has basically been able to say "We need X H100s to do Y" and someone will write them a blank cheque to achieve it. So yeah, they do live on another planet. A planet where someone will just provide the compute for them no matter the cost. As long as that is the case, no further optimizations will ever be made to make smaller models more effective, because the people who make the models have no upper limit to their resources.

This is why I hope for an AI crash soon. So engineers are forced to work with limited resources and provide solutions that aren't just "more compute"
>>
gonna smoke some weed brb
>>
File: 00205-4204754562.png (2.73 MB, 1248x1848)
>>
>>106722568
OpenAI got some nice money with those chinese fuck spamming their server lmao
>>
cozy bread
>>
>>106722573
Yeah, half if not more of their traffic has to be the Chinese slopping out synthsets
>>
Is there a way to make comfyui stop capping floats to the first digit?
>>
>>106722204
desu I'm sure you can remove some of the fat and end up with a 20b model that is like 95% as good
>>
File: RA_NBCM_00016.jpg (879 KB, 2736x1872)
>>
Where's the tool that lets you convert Flux loras to Chroma loras?
>>
>>106722569
yeah, they haven't improved shit on the architecture or on the training process, it's just "stack moar layers bro" and they call it a day, fucking lazy fucks
>>
>>106722557
But you don't understand, there is a miracle tune method out there that will make a model smaller than XL and beat the best of the best!
What "not even trillion dollar companies have figured such a method out yet?" Fuck off with that shit, they just dumb, I obviously know best. #JustOneMoreEpoch!!
>>
1000 more years of sdxl btw
>>
File: Wanimate_00045.mp4 (1.82 MB, 960x544)
>>
>>106722602
It's called retraining.
>>
>>106722607
this
>>
>>106722557
>nvidia for keeping hardware behind
this meme must die, why do you insist on Nvidia having to be the only gpu provider for humanity? it's not Nvidia's fault that its rivals suck ass, I'm more angry at AMD not even trying to be competitive
>>
>>106722609
No, there was a specific script that was posted here that you could use to convert shit like the hyper-flux loras into Chroma loras.
>>
>>106722601
y repoast
>>
File: 1729312036509967.png (1.11 MB, 1006x953)
>>106722524
Tencent be like:
>>
>>106722608
Gandalf was truly a master of disguise
>>
>>106722557
>what do you expect, endless 12b flux clones that all look like shit? they tried that with hunyuan 2.1 and nobody cared.
nobody cared because they train on synthetic slop, 12b or 80b it doesn't matter, the result will end up slopped as fuck, no one will care until they finally make some effort on having a decent dataset (like they did on Midjourney or Seedream)
>>
>>106722608
>tits accurately swinging with the motion of the cart
What a time to be alive
>>
>>106722631
>Midjourney or Seedream
Fuck no, those model gens suck ass
>>
Downloading the Hunyuan 3 now
I don't have 3x H100, but I have an RTX 6000 and 192GB system memory so hopefully it offloads easily and I can at least do some basic tests to see if it's worth pursuing any further.
>>
why couldn't that furry faggot just release a normal goddamn speed lora instead of one that fucks everything up? and why did the other furry faggot remove his collection of speed loras that worked better than the official one? furries are a fucking blight and need to be banned from contributing to ai development
>>
>>106722577
Tencent weren't doing that as much back in HunyuanDiT days. Had they continued iterating on that, we might have had a good contender to Flux.
>>
File: 1731796850524359.png (311 KB, 1356x940)
he wasn't joking when he called it "the super parameter" 2 weeks ago, this baby is huge
>>
>>106722642
>furries are a fucking blight and need to be banned from contributing to ai development
They're better than trannies, and also unlike trannies they have money to fund training
>>
>>106722607
I'm ok with this. It's tried and true.
>>
>>106722649
furries and trannies are often one and the same
>>
>>106722649
>They're better than trannies
a lot of furries are trannies though
>>
>>106722607
all you need
>>
>>106722647
>anon says something *crickets*
>this random fuck on twitter who was wrong about Wan 2.5 says something *50 different screenshots including news we all know by now*

Posting him should be a bannable offense.
>>
>>106722660
>wrong about Wan 2.5
>she doesn't know
>>
>>106722660
>he was wrong on one prediction out of 10 gozillions, BAN HIM
what kind of mental illness is this?
>>
i'm going to make my own ui and it'll just be photoshop plus sdxl. i'm going to leave this general and i'm never going to return. no more noodles, no more fluxchromaqwenhunyuanwan shit. everything after sdxl has been disappointment after disappointment
>>
>>106722668
But what about wan support.
>>
see you 2morrow
>>
>>106722672
okay so maybe wan is fine. but nothing else.
>>
>>106722666
Because he's just the marketing arm of Chinese SaaS companies now.
>>
File: ComfyUI__00270.png (3.34 MB, 1280x1920)
yes 1girl is very nice, but have you considered ... 2girl?
>>
File: output.webm (3.85 MB, 832x1248)
>>106722494

>>106722674
I shoulda gone to bed 2 hours ago.
>>
>>106722668
trvke: r/StableDiffusion is a better source of local gen information than this general
>>
>>106722668
we already have this technology it's called the krita ai diffusion plugin
>>
>>106720291
>Troon posting is no different from shitting up the board with scat. It should be a bannable offense.
>>106722660
>Posting him should be a bannable offense.
this troon thinks he's on reddit or something? lmao
>>
>>106722698
>krita
sucks fucking cock
>krita ai diffusion plugin
sucks mega fucking cock
>>
I don't want to sound mean (Because I have shit gens myself) but why are all video poster gens always shit or interchangeable?
>>
File: cube.mp4 (1 MB, 512x512)
>>106722706
I've only ever seen one soulful video gen from this place and it was posted when 2.1 first came out
>>
>>106722695
Lmao
this is great
>>
>>106722685
>have you considered ... 2girl?
as a Tencent employee, I always prefer my AI images with 80 billions girls
https://youtu.be/VU2d_Pld3w8?t=60
>>
>>106722706
Because most video posters are just fucking around and video gens take a long time to make. So they're more of an expression of "hey, look at this" rather than "hey, I made this for your entertainment."
>>
>>106722707
Fair. I like some, but we got walkinganon, rocketanon, and one other. I don't have the tech to do something, or else I'd do something fun besides Makoto pics.
>>
>>106722695
on the other hand once you notice how locked in space her hand and phone are it's kind of distracting
>>
>>106722700
>sucks mega fucking cock
my cock
>>
>>106722716
>I don't have the tech to do something or else
Then maybe you should just shut the fuck up and consume what you are given like a good little vramlet instead of complaining.
>>
File: 1751538243755537.jpg (328 KB, 1248x1824)
>>106722685
2girl is illegal in some regions
>>
>>106722719
Idiot retard double dipshit. That's how phones in mirrors work. You wouldn't know because you're too ugly to be worth taking photos of yourself.
>>
>>106722721
this guy owns a 5090 and slops
>>
File: buzz lol.png (4 KB, 151x58)
>>106722715
Why don't they use online gens at this point? With CivitAI they could rack up buzz and do video gens.
>>
>>106722724
you know im right and you're coping
>>
>>106722685
I consider 2girl an unhealthy amount
>>
What's stopping me from training my own speed loras
>>
>>106722725
I doubt it lol
>>
>>106722726
People with disposable income do not care about buzz.
>>
>>106722721
Like I said: I don't want to be mean. I enjoy them, but they get old after a while. If I had some god shit, I'd be fucking with everything lol
>>
>>106722517
Can be okay for the initial gen, bad for upscaling. It gives the edges of pixels this ragged/aliased look. Okay aesthetic for certain gens, maybe.
res_2s | beta57 | 20-35 steps for initial gen, 15 for upscaling passes @ x2 scale (max 4 tiles) looks best on chroma hd imo, but you'd want a decent GPU given how slow 2s is.
>>
>>106722722
>teenage mutant ninja turtle hands
illustrious does this a lot more than i remember
>>
>>106722719
kek, people like you always make me laugh, always looking out for those little details in ai gens
>>
>>106722724
Bait aside, believe it or not, turn the camera slightly in any direction and it won't be in the same spot in the mirror anymore. On the other hand, if you started with a larger frame, stabilization can be applied by cropping.
>>
>>106722736
That doesn't answer what I asked. I got all that from liking gens and claiming daily buzz.
>>
API NODES SAVE US ONEGAI
>>
>>106722755
six fingers
>>
>>106722755
details matter
>>
>>106722695
thats fucking awesome, did you prompt the devil guy or did you use some reference image?
>>
>>106722661
They never fully confirmed the weights for Wan 2.5 would be released. They implied they would "consider" releasing it after the API preview phase, but this is not a confirmation and it sounds like it's not up to the devs but the higher ups.

But I personally believe they will ultimately release it (Kling and even some other API models mogs it)
>>
>>106722792
>They implied they would "consider"
Anyone familiar with Chinese or asian culture for that matter knows this means no.
>>
>>106722782
only turbo autists get fixated on looking out for small details like that
>>
File: 1758906075660251.jpg (359 KB, 2048x2048)
>>106722752
It's not every day you'll come across someone as bad at configuring this stuff as me, so there's that too
>>
>>106722792
>I personally believe they will ultimately release it
same, because this model isn't close to veo 3, they'll probably reach google's level with wan 3.0 and wan 2.5 will be useless, so might as well share this scrap with the localkeks
>>
>>106722746
This is the same "If I was a billionaire I'd..." cope, but you wouldn't; you're just as fallible and predictable as everyone else and would end up producing the same slop.

If you were a billionaire, you would be a tight ass.
If you were living in Nazi Germany, you would have been a hard core Nazi.
If you had a 5090, you'd gen slop.
>>
>>106722808
where is the 'make 2girl kiss' guy when you need him?
>>
>>106722801
Their model is not even SOTA. I think the higher ups got too confident/cocky they would be on top this time around and decided to become API-only, but from the side by side comparisons I've been seeing on X, if they lock this behind API it will be just another video model not even normies will bother using
I do think they would permanently stop flirting with the idea of releasing weights the second they become at least top2 or when they realize they have something truly useful industry-wise
>>
File: Untitled.jpg (3.99 MB, 7034x9142)
>>106722191
who the fuck even is their target audience that they really thought this collection of sample pics was reasonable justification for the 80B params? Like NOTHING seen there is any different from the cherry picked shit every recently released Chinese model has done, it's not impressive in any way.
>>
>>106722847
can you post this image 12 more times please
>>
>>106722649
>money to fund training
It would be more effective to pour gasoline on that money and just burn it than the experimental bullshit training the furries do. At least the former would be entertaining for a couple of minutes
>>
>>106722820
I only have a 4060. If I had some good shit, I'd train LoRAs 'n shit with help from you guys.
>>
why has the hunyuan collage been posted three times in this thread
>>
>>106722847
isnt it a multimodal llm?
>>
localbrowns continue seething at being unable to run the top open weight model in the world. absolute sour grapes
>>
>>106722857
The dude has been mindbroken by chinese dick. sad really.
>>
>>106722846
>I think the higher ups got too confident/cocky
I think they did too. But it gives you a glimpse into their mindset. They want to start getting their money back for this investment. That kind of action shifts your entire mindset. Releasing 2.5 might deter paying customers from forking out for 3.0 when it drops, etc. It's flawed reasoning, but it's hard not to see them tightening the noose.
>>
>>106722847
Those latest chinese models (except the Bytedance ones) are proof that these guys, while being smart and technical, have no real sense of aesthetics or taste to realize their models produce slop.
The fact that a fucking 80b model from them still produces slop is the final proof of that.
>>
>>106722856
You can train chroma loras on 8gb
>>
>>106722889
>You can train chroma loras on 8gb
This has the same energy as a homeless person telling you to stay out of school.
>>
>>106722889
I know, I'm just sayin; I would if I could lol
>>
>>106722855
anti-Chroma schizo, get a life
>>
>>106722572
Very nice. Catbox?
>>
>>106722901
Different guy, newly converted into hating chroma. Hate seeing someone with so much potential waste his effort on worthless bullshit
>>
>>106722900
>I would if I could lol
Wait... how much vram do you actually have?
>>
>>106722913
lel, sure Jan
>>
i don't even hate chroma i just want a better speed lora
>>
>>106722916
8
>>
File: 16539[1].jpg (21 KB, 640x432)
>>106722922
vramlet
>>
QWEN was too good to be true..
>changes image composition
>zooms in
>doesn't follow instructions
>doesn't remove objects
>>
>>106722934
don't use lightning
>>
>>106722484
https://huggingface.co/clover-supply/Chroma-loras/blob/main/chroma-unlocked-v4x-hyper-turbo-flash-r64-fp32.safetensors
https://huggingface.co/clover-supply/Chroma-loras/blob/main/Hyper-Chroma-low-step-LoRA.safetensors
>>
>>106722929
You're right lol
I wish I had more. I have 48GB RAM so I think I'm okay.
>>
Spending 4k on a new sff pc build with a 5090 dedicated to AI gooning. Is it worth it if i save up over the next year? Will that 32gb vram take me far for what i need? 100% offline & local.

What would you guys suggest i add to my build. I was thinking
64gb ram
4tb nvme or even 8 if there's any reputable brands out there
9800x3d
>>
>>106722955
>64gb ram
128+
>>
>32gb vram
vramlet
>>
>>106722947
i love you forever
>>
>>
>>106722956
I've seen some people suggest that, any reason why? Vram would be getting hit for the most part. I have 32gb ram right now and there are times where i get OOM errors, but 128?
>>106722961
I've toyed with getting a A6000, maybe i will
>>
If I want to know more about how these models are trained should I start reading papers and textbooks?
>>
>>106722970
>any reason why
Futureproofing, training.
>>
File: 12.jpg (3.83 MB, 4614x2000)
>>106722854
yw
>>
File: WanVideo2_2_I2V_00455.webm (340 KB, 1248x704)
>>106722414
The quality of animate is kind of shit, so I had the schizo idea of feeding the first frame through i2v after the generation was done to see if it could clean up some of the shittiness. idk.
>>
>>106722994
>use 180p source video
>it matches that quality when inserting your custom character
>durrrr da quawity of animate is shiiiiiiiiiit
>>
>>106723000
kys faggot. I'm talking about the warping and blurring of the objects in the image.
>>
>>106722955
You can usually have more than 1 nvme.
>>
>>106722847
We're fully in the Deepseek era, but for image/video diffusion. Remember when open source LLMs were at max 70B while proprietary was playing with 100-200B, and then suddenly we had 671B parameters with Deepseek? Same thing is happening with image models here: we went from 12B parameters, with API models probably in the 20-30B range, and now we have 80B open sourced from Tencent.
>>
>>106722994
her fingers longer than most anon penis
>>
>>106723007
This is their game. They can play the open-source-on-paper card while releasing models nobody could feasibly hope to run, so people end up forking out for their API anyway.
>>
>>106723004
Use a higher resolution input video, you fucking retard.
>>
Is there some place or maybe a youtube channel where i can go to learn how AI works. I'm barely understanding what 12/20/80B means but i'd like to know some of the nerdier stuff
>>
File: 00000-277433315.png (2.71 MB, 1248x1848)
man I love alcohol and not caring about the quality of my posts in a thread that will disappear. What a nice feeling.
>>
>>106722955
get 192 or 256gb of ram. ditch the sff so you can have 4 dimms, even at a slightly lower speed.

system ram can make a huge difference for models with different subcomponents that are sequentially swapped out of vram. there are also a lot of badly programmed python inference scripts that rely on system ram cache that are easier to throw the ram at than to spend time fixing. and of course, system ram helps run moe llms, even if it's slow dual channel.
>>
>>106723017
The resolution of the input is the resolution I genned at for wan animate. I can only upscale that image and video and hope it works. If you're talking about the actual source of the video itself, I'm genning it now.
>>
>>106722979
>>106723019
Yeah lurk here and read papers
>>
>>106723011
>projection
>>
>>106723007
deepseek is actually good though, better than the closed competition at the time in many areas.
hunyuan 3 tbd but not looking good based on >>106722991
>>
>>106723024
What are you using for interpolation? If you say rife, I'm going to punch you in the face.
>>
>>106723011
not here in long dick general
>>
>>106722979
What is even the point of pursuing that when you don't have access to the resources required to train them?

I'd bet any midwit here would be able to train top tier stuff if they had tons of free compute thrown at them. People would just replicate architecture and training pipelines from existing papers and use properly curated data and still get great results. Anyone with a good eye determining what is "slop" and what is not would still produce better models than any 130 IQ ML engineer has been putting out.
>>
File: 1735755570014342.png (1.42 MB, 1248x832)
>>106723020
i like them
>>
>>106722847
The only thing that actually looks impressive is that it knows how to write and position correctly. The Chinese characters don't look positioned correctly and none of them look like they are impossible characters or hallucinated from my limited Japanese knowledge. I am guessing it will probably be good at maybe making manga or comics oneshot without doing any editing and manual placement. But other than that, I really don't know.
>>
>>106723036
I'm not looking to make top-tier SOTA GPT-killer-tier stuff, I just want to know enough to be able to fuck around to the extent that I want.
>>
>>106723052
Unless you have some H100s laying around that extent is effectively 0.
>>
>>106723061
That sucks. I hate that I just hallucinated this entire hobby and the websites full of models and their associated tooling, which all do not exist because the people involved don't have H100s laying around.
>>
>>106723052
I have IRL friends that like training small ml models for "fun and experience", but I never really saw the point of that considering they never had any ambition to work at FAGMAN or join an AI-related startup with VC money, where they would truly put their knowledge into practice.
It's a big waste of time, and I wouldn't even spin it as an "intellectual" activity since it doesn't even display true intelligence
>>
>>106723061
Let a nigga be curious ffs
>>
>>106723051
Being able to output coherent text isn't impressive at all. It's an autoregressive model, coherent text is a stock standard feature of the architecture (modeling sequential/long range dependencies). Literally every other aspect of it looks like complete ass, doubly so considering the size.
>>
File: 00438-2801304206.png (2.54 MB, 1280x1920)
>>106723038
now that is a really nice image
>>
>>106722847
Tbf, in China, where 4o is probably banned, this is probably ground breaking stuff. Which is why they distill in the first place.
>>
>>106722695
Can you post this again but without the text.
>>
>>106723051
Think you mean they do look positioned correctly.
>>106723074
English text no. Chinese text though, even ChatGPT messes up at times, the 4 and 12 characters are wrong. But I dunno if you need a model possibly 4x what ChatGPT is running to get perfect Chinese text.
>>
>>106723111
I mean, you have to figure a Chinese model is going to be better at Chinese than a Westoid model, right? OpenAI's market is mostly English speaking regions, and their product is banned in China.
>>
Attempting to generate the Hunyuan 3.0 reference image
>>
>>106723133
Oof. What hardware are you using and how much are you offloading?
>>
File: 1751845357020963.jpg (692 KB, 2120x1416)
>>106723086
second passes look shit but pretty much what the real thing looks like kek https://x.com/MorinagaJunko/media
>>
>>106722877
>have no real sense of aesthetics or taste to realize their models produce slop.

Some of their researchers don't seem to. Same thing happened at ClosedAI. First they had Dalle, which showed some competent artistic direction, but then they tossed all of that out the window with 4o, with a widely suspected fingerprint/censorship step causing the yellow tint. Chinese companies have simply gotten lazy, no more effort into their models, which is why we're now getting low effort models with trashy aesthetics too. Bytedance has offset some of that crap a bit with Seedream, but it's still not quite there yet. As much as I hate it, if I were to choose an API model based on aesthetics alone, MJ is still the winner, closely followed by NAI.
>>
>>106723142
damn that shit is raw, nice find
>>
File: 4545484421845.png (59 KB, 885x574)
>>106723145
>MJ
And that's the sad part of all this. There was a time when Tencent legit cared and thought MJ was aesthetic and SOTA. By testing their model against it, they were doing honest research back then which is how we got HunyuanDiT. It's so sad that we're talking about the past, and what is essentially now just an archived and forgotten about repo.
>>
File: RA_NBCM_00025.jpg (1.48 MB, 1872x2736)
>>
>>106723140
RTX 6000 BBW with 192GB of RAM
About half of the system RAM is being used along with all the VRAM
>>
>>106722608
https://files.catbox.moe/3h88fr.webm

Here's another attempt and i2ving the animate output to clean it up. Unfortunately her nipple kind of popped out so I gotta catbox it.
>>
>>106723145
dall-e 3 was the peak of image models. insane amounts of characters and styles, and did a really nice job at creative gens. too bad it was nerfed to shit and never released. 4o is a complete downgrade
>>
sooo hunyuan v3 is a moe. an 80b moe with A13B.
is it unironically over for us vramlets?
>>
>>106723145
>>106723180

True, dalle3 was magical. To this day it's the only base model that can produce good niche aesthetics like polaroid-like or vhs-like pics, was really good at some painting styles like impressionist art, could make some "trippy" retro anime illustrations that I haven't seen any other model pull off, and overall produced some really authentic 2D artworks. The "normal" outputs were slop, but with some prompting it produced some very interesting results. Whoever worked on post-training it likely doesn't work at openai anymore, lol, the 4o outputs look pretty generic in comparison
>>
>>106723193
No, because SDXL is still the best for anime and I say that without a hint of irony. This model looks slopped.
>>
I never thought De3 looked aesthetic desu the humans especially were slopped
>>
>>106723193
no it means you have a chance of running it with enough ram... if lmao.cpp ever implements it (never ever)
>>
File: image.png (1.53 MB, 1216x832)
>>106723175
The first output of Hunyuan 3.0 has arrived.
The total generation time was 13:20 (50 steps, all default settings, sample inference code).
The default prompt is
>prompt = "A brown and white dog is running on the grass"
Output resolution is chosen automatically by default.
>>
>>106723210
I always thought they looked impressive at first, but once you know and get used to what they look like they become overbearingly slopped.
>>
>>106723219
Agreed
>>
>>106723180
They're really tight lipped about the architecture too. DALLE 3 was the only model to this day that truly understood my prompts. I think they used a smaller version of GPT4V as the text encoder or something which would explain a lot.
>>
>>106723216
Jesus. It looks like liquid shit spat out through a straw.
>>
>>106723210
People liked how it defaulted to making women look like hot bimbos, which is quite something considering the company
>>
>>106723216
I guess it's a nice looking dog. Not the best dog I've ever seen though. Certainly not for 13 minutes.
>>
>>106723216
isn't the new hunyuan a multimodal llm? can you talk to it?
>>
>>106723216
Reminds me of Chroma during its early epochs, ie the lack of resolution on the grass, the way it frazzles fine details on hair, etc
>>
>>106723216
>The total generation time was 13:20
It's like they want people to keep using sdxl
>>
>>106723235
Yeah, I think it's supposed to be. The basic inference script simply accepts a text prompt and outputs an image. I'll try to use the chatbot functionality after I generate a couple more test pictures.
>>
>>106723197
Its art was very good. Dalle 3 threads from back then were pure sovl. Realism was also very good, just not as good as what we have now (and Chroma has essentially closed the gap in prompt following). Still, Dalle 3 was probably single handedly the most coherent realism even if it had to be jailbroken to unlock its capabilities. I still get iffy multiple subject gens on Chroma (even Chroma HD Flash) that I know Dalle would nail in one shot.
>>
>>106723251
>It's like they want people to keep using sdxl
If all you people prompt is "1girl" and you criticize "boomer prompting" (aka, prompting like a sane non-autistic member of society would), why even bother with other models at all?
>>
>>106723145
We get it you like MJ, like holy shit man. I think it looks like ass so hey.
>NAI
Lol their current model looks like absolute ass, V3 was great, I'll give it that.
>>
>qwen 3 vl multimodal
"Ooh, I want to try and make it translate and colorize douji-"
>235b
>>
>>106723213
>if lmao.cpp ever implements it
im still waiting for them to implement qwen3max, so hopes are low
>>
>>106723275
The qwen vl series only accepts image input anyway, so it won't do any coloring for you even with a B200 farm.
>>
>>106723264
I don't mind boomer prompting but in 13 minutes I can gen enough with seed gacha to get the same result on other models.
>>
>>106723216
Not bad, maybe not 80B good, but it's something.
>>
File: 1748287166931509.mp4 (1.36 MB, 720x720)
>>106722808
>>106722823
>>
File: 1745076622417154.jpg (56 KB, 897x882)
>>106723307
>>
>>106722353
No, krugertard, with MoE, for every step basically all of the experts are going to actually be used, it's just that at any given time only a set amount is active, but you are still reading the entire model every time. So you don't "just place the active parts in vram", since they swap all the time are are all used for each step, which would make you hit the pcie bottleneck quickly.

The point of MoE is that you get faster speed but need to train a bigger model to get the same IQ as the dense model, which is a fine tradeoff: given the faster speed and RAM cheapness, you can load it into RAM.

There are also some caveats here, like some architectures having experts that are more commonly used and thus always locked into vram, but that's beside the point of how MoE models generally work right now.
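Napkin math on why the bus ends up being the problem; the numbers below are round assumptions (Q8 experts parked in system RAM, ~25 GB/s effective over a PCIe 4.0 x16 link), just to show the shape of it:

GB = 1024**3
active_params   = 13e9   # claimed active parameter count per forward pass
bytes_per_param = 1      # assume the offloaded experts are Q8
pcie_gbps       = 25     # rough effective PCIe 4.0 x16 throughput

traffic = active_params * bytes_per_param   # worst case: every routed expert fetched
print(f"{traffic / GB:.1f} GiB over the bus per forward pass")
print(f"~{traffic / (pcie_gbps * GB):.2f} s of pure transfer time per pass")
# ~12 GiB and ~0.5 s per pass before any compute happens, and an image is
# a lot of passes -- that's the wall the "just keep the active part in
# vram" crowd runs into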
>>
>>106723349
>are are
and are
>>
>>106723262
Chroma with loras is the closest thing that exists that can replicate the "dalle3 aesthetics" in an open model, but even then it's very disappointing how mangled Chroma is overall with anatomy and simple things like characters holding swords
>>
>>106723349
we don't even know if this uses shared experts (it probably does, like all moes), but still, even with the shared ones on vram and the rest in ram, it would be painfully slow for imagegen.
I'm not even sure if comfy has an implementation that lets you select how many layers you want on cpu/gpu for moes, or select layers by name (like using -ot with llama), or do tensor splitting across multiple gpus, for that matter.
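Outside comfy, the transformers/accelerate route already lets you do llama.cpp-style "-ot" placement via a device_map dict. The module names below are guesses, not the actual Hunyuan layout, so check model.named_modules() first:

import re

def build_device_map(module_names, gpu=0):
    # pin everything to the GPU except modules that look like expert FFNs,
    # which get pushed to CPU RAM; the ".experts." pattern is an assumption
    expert_pat = re.compile(r"\.experts\.")
    return {name: ("cpu" if expert_pat.search(name) else gpu)
            for name in module_names}

# hypothetical names, just to show the shape of the dict you'd pass as
# device_map= when loading the model with transformers/accelerate
names = [
    "model.layers.0.self_attn",
    "model.layers.0.mlp.shared_expert",
    "model.layers.0.mlp.experts.7",
]
print(build_device_map(names))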
>>
File: makotoburrito.jpg (2.93 MB, 2304x2304)
We're just hangin' out lol
>>
Well the noob guys got their GPT equivalent that they wanted, you think they will come and finetune this? kek
>>
File: WanVideo2_2_I2V_00458.webm (1.72 MB, 1248x704)
Here's another wan animate cleaned up through i2v. This time I used seedvr2 to upscale video before running it.
>>
>>106723307
awoo
>>
>>106723390
gave him anime eyes
>>
>>106723414
You don't understand, Gandalf is a type of gollum you see.
>>
File: image2.png (2.05 MB, 1024x1024)
After 29:32, the second output of Hunyuan 3.0 is ready. It's the mandatory 1girl test. Yes, this 1024x1024 image took half an hour.

>>106723213
You don't need to wait. The stock inference code offloads properly. I would aim for over 220GB total memory though.
>>
>>106723426
It's detailed
>>
>>106723426
nice earth. looks like dall-e
>>
>>106723426
The earth looks pretty good desu.
>>
>>106723426
are you running it with FA and flashinfer?
>>
>>106723426
interesting part of the globe it chose to display
>>
>>106723426
If you could, can you test the model's world knowledge? Things that current local models need a lora for? I tried the official website but there is a rewrite engine behind the scenes, so it keeps modifying my prompts for slopped outputs basically.
>>
i kneel
>>
>>106723445
it's showing the only part of the world that matters, china and friends
>>
>>106723307
very nice
>>
>>106723458
brown
>>
File: 1752098074223273.png (1.02 MB, 1248x832)
>>106723426
>>
>>106723469
friend*
>>
>>106723458
And India. I think we'll see Indian AI being the future of open source in the coming weeks.
>>
>>106723441
No, it didn't work despite everything nominally being configured, so I gave up for now to try actually generating some things.

>>106723447
What do you want to test specifically? The model itself is an LLM hybrid so I'm not sure if "creative interpretation" is built in.
>>
File: 1753933781942115.jpg (991 KB, 1248x1824)
>handcel seeth
wake me up when it can do hands
>>
>>106723474
Model?
>>
>just realized I can split up elements of a scene each into their own clip encoder to avoid shitting up the prompt
wew
>>
>>106723496
>VRAMlet detected
you caught me...
app.reve.com
>>
>>106723508
buy an ad
>>
>>106723517
>I did it
>I said the line!
buy me a GPU, faggot
>>
>>106723525
this is the LOCAL diffusion general, go shit up /sdg/ saastard.
>>
>https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main/Wan2.2-T2V-A14B-4steps-lora-250928

Looks like we've got fixed lightning loras for Wan2.2 releasing.
>>
>>106723426
half an hour for that is brutal
>>
>>106723540
>T2V
wake me up I2V drops.
>>
>>106723540
>files are not named properly

These fuckers need to hang.

"Lets release this brand new car! But sir, what do we name it? That's the thing, we don't!"
>>
File: 1733539143304046.png (1.04 MB, 832x1248)
>>
>>106723497
How?
>>
>>106723133
>seedvr2 to upscale video before running it.
Why? The movie is available in 4k, there is no reason to do an upscale on a clip of said movie, you can downscale from that resolution. And you introduced artifacts in said upscale that caused the end result to be worse.
>>
>>106723589
Attention retard. ATTENTION RETARD.
The clip needs to be lowered in resolution in order to run wan animate. I can't throw a 4k clip into wan animate and not oom. Nobody can.
>>
>>106723587
you just make a bunch of them and then use conditioning (combine) nodes to chain them together by 2s until you have a final positive conditioning output
I actually don't know if this works, I'm still waiting to find out lol
>>
File: 1739542274361803.png (3.56 MB, 1416x2120)
>>
File: makotodrunk4.jpg (2.71 MB, 2304x2304)
I gotta put her to bed, but I don't think she will.
>>
>>106723426
YOU ARE THE RETARD. I explicitly said
>you can downscale from that resolution
Why you'd even have a clip at a lower resolution than wan animate needs, forcing an upscale in the first place, was my question. If you don't have the actual raw movie, then it's your fault for needing to upscale from your shit source data.
>>
>>106723607
Mean to quote >>106723596
>>
>>106723602
ok it doesn't really work great
>>
>>106723607
Hey shit dick. Not everyone just has a 4k rip of the lord of the rings on their computer. It's an unreasonable expectation.
>>
>>106723604
>an actual sexy gen for once

nice
>>
>>106723624
>>106723624
>>106723624
>>106723624
>>106723624
>>
>>106723370
Yeah, Chroma messes up if a prompt with multiple subjects is too complicated, and I guess Qwen is pretty good at that stuff nowadays too (though it's censored).

>>106723426
This is pretty good. Looks like the model knows some rough manga lines, similar to HunyuanDiT. I wonder if it knows mangaka too. If only it were smaller, local would be saved. God damnit.
>>
>>106723622
>not hoarding media in the year of our lord 2025
Your loss.
>>
>>106723621
>>106723602
actually it kinda does, I just need to figure out how to break it up best
>>
>>106723216
imagine spending 10000 dollars on a gpu and waiting 15 minutes for this lmaooo
>>
>>106722602
https://github.com/EnragedAntelope/Flux-ChromaLoraConversion
>>
Sage Attention 3 out, thoughts?


