/g/ - Technology



Specialized Models Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106758695

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
localcope hour
>>
Cursed thread of Mental Illness
>>
Blessed thread of frenship
>>
File: rfhnrfdxrfhrf.png (294 KB, 1453x791)
trying to figure out why this upscaling method is causing the output to become desaturated. pls help.
https://files.catbox.moe/m0fwcu.json
>>
>instant seethe
How does the existence of localchads piss them off so much?
>>
>>106760374
>localchads
I wish I was a chad, but compared to API our models are fucking toys
>>
>>106760374
because they can only generate sam altman. no other real people.
>>
File: 00138-1668900267.png (2.51 MB, 1240x1240)
>>106760374
Has nothing to do with local; he does this whenever an API model gets released. He can't afford a new GPU and is stuck with a 3000 series card, which is still solid but can't play with the bleeding edge
>>
>>106760397
>because they can only generate sam altman. no other real people.
Wan can only do Trump
>but muhh i2v
Sora can do that too
>>
File: 00002-2571889917.png (1.54 MB, 1024x1280)
>>
>instant cope
How does the existence of superior saas models trigger the localpoors so easily?
>>
>>106760409
>Sora can do that too
prove it with an image of any celeb, i'll wait :]
>>
so much cope already
>>
The brain isn’t a monolith model, it’s a federation of specialized modules. Visual cortex, auditory cortex, motor cortex, language centers, etc. all evolved for domain-specific processing. Coordination doesn’t erase specialization; it depends on it. And by the way, the brain has trillions of synapses: orders of magnitude beyond any AI model. If you think that comparison justifies wasting parameters on unfocused multimodal models, you’re proving my point: specialization is what actually makes the system efficient. The irony is our brain is more akin to a MoE model with specialized domains all of which filter and prepare inputs for the "generalist" model. Do you think your brain processes the raw auditory data? Do you think what you see in your brain is what your eyes actually see?
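if the MoE comparison sounds hand-wavy btw, the routing mechanism really is this simple; a toy sketch (illustrative PyTorch, not any real model's code):

import torch
from torch import nn

class TinyMoE(nn.Module):
    # a router picks k specialist experts per token, so only a fraction
    # of the total parameters ever runs for a given input
    def __init__(self, dim=64, n_experts=8, k=2):
        super().__init__()
        self.router = nn.Linear(dim, n_experts)
        self.experts = nn.ModuleList(nn.Linear(dim, dim) for _ in range(n_experts))
        self.k = k

    def forward(self, x):  # x: [tokens, dim]
        weights, idx = self.router(x).softmax(dim=-1).topk(self.k, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():  # route each token only through its chosen experts
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

print(TinyMoE()(torch.randn(10, 64)).shape)  # torch.Size([10, 64])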
>>
>>106760414
>How does the existence of superior saas models trigger the localpoors so easily?
jealousy, that's all, and instead of wanting to reach that level, they prefer to pretend that their localkek models are good enough, those guys have 0 ambition it's sad
>>
>>106760414
Sam Killed local
https://files.catbox.moe/q5yjbn.mp4
>>
File: 1737378911147787.png (1.27 MB, 992x1048)
view the scene from the side 90 degrees

neat, works
>>
>no u
Woah, is this a rerun?
>>
File: FluxKrea_Output_2726927.png (2.45 MB, 1024x1496)
>>
>>106760442
Could you do this, but instead of Sam Altman it is a 1girl with gigantic jiggling boobies?
>>
File: 1741458254746239.jpg (204 KB, 1440x1517)
>>106760455
original:
>>
>>106760462
too unsafe saar please do not
>>
>>106760458
So far, only /ldg/ is seething about Sora 2, never seen /DE3/ seething about Wan 2.2 lol
>>
>>106760419
Assuming you are responding to me, yes, all that is correct. I am not advocating for the "throw more parameters and compute at the problem" procedure, I am saying that there is no one solution to the problem because the problem is ill-defined and depends on context, thus, you need all sorts of different tools, some general and some specialized.
>>
still waiting for that celeb i2v btw
>>
File: reeeee.png (59 KB, 259x194)
>>106760492
>I am not advocating for the "throw more parameters and compute at the problem" procedure
Tenslop: "HOW DARE YOU"
>>
>every few gens all models unload completely

Anyone else having this issue? They take forever to load back up
>>
>>106760483
I bet if DE3 got 50 or so links to videos posted in a row they'd be annoyed as well
>>
>>106760483
>seething
yes i'm so jealous you are paying hundreds a month to prompt censored slop. money you could use to buy a gpu or cpu or build a computer!
>>
>>106760528
/ldg/ started to seethe immediately after the first video, try not to rewrite history, they are completely allergic to API news, they don't want to know the world is advancing without them, it makes them sad you know
>>
File: 00142-3000492623.png (2.62 MB, 1240x1240)
>>
File: 1738420508849752.png (1.28 MB, 992x1048)
give the woman black lingerie and anime cat ears.
>>
what's the obsession with celebrities anyways?
>>
File: 1747645830781770.png (217 KB, 1060x805)
>>106760540
>hundreds a month
how about 20 a month, and you had to pay thousands to daddy Nvidia to run your localkek slop you know
>>
>>106760557
>what's the obsession with celebrities anyways?
Ikr, before Sora 2 appeared, they were pretending they didn't need celebrities and that it was fine that local models can only render Miku and Trump, and now they care, curious
>>
>>106760560
>$200 a month
>for slightly higher res and no watermarks
the utter state of SAAS fags, you probably pay $1000 a year for games you don't own too.
>>
>>106760557
The part of our brains that used to care about polytheistic deities/fairies etc. got hijacked by the propaganda industry during the 1920s and advent of mass visual media, thus, when some people think "Venus" they instead think "Christina Hendricks" or (Allah forbid) cosplayers.
>>
>>106760568
>for slightly higher res
that's why it's a better deal to go for 20 a month, the difference between 720p and 1080p isn't big
>>
Come up with something new lilbro. I know you're restricted with your API calls but that doesn't mean you're restricted with your trolling.
>>
File: 1746575212901509.png (1.14 MB, 992x1048)
>>106760550
give the woman a black business suit, with a white bra and cleavage. she is smoking a cigar.
>>
have local models learned what blitzball is yet?
>>
ask openAI to generate a video of Sam Altman being beheaded for defending his Israel masters.

see what happens. Oh? Censorship? That's sure worth paying for.
>>
I don't care what anyone says, this is the most kino AI video I've ever seen in my life.
https://files.catbox.moe/uj9981.mp4
>>
>>106760541
Do you think the newfags bought this reply?
>>
>>106760599
yeah why look at miku hatsune when you could see sam altman's ugly face in every video.
>>
>>106760596
can you do beheading on local? they don't know how to do violence, Sora 2 is better at this
https://files.catbox.moe/nr3fk0.mp4
>>
>>106760599
yeah low iq gooners just don't understand progress
>>
File: 1739640448275619.png (1.3 MB, 992x1048)
show me a sora2 video with an asian girl with big tits like this. go ahead. surely the paid service has an i2v option. Use this image!
>>
>>106760409
on behalf of anon i humbly accept your concession btw
>>
File: ComfyUI_temp_yjmco_00028_.jpg (849 KB, 1536x1152)
stop replying to it, fucking retards. what happened to "don't feed the trolls"
>>
>>106760492
For neural networks trained with gradient descent, there really is only one path to efficiency: clean datasets with minimal noise and specialized modalities. Everything else just increases variance, slows convergence, and wastes compute. Until machine learning models diverge from being specialist autocomplete models, this will not change. Current ML models (transformers, diffusion, etc.) are fundamentally specialist autocomplete machines. They learn distributions and predict the next most likely output.
>>
>>106760621
In anon's defence, the troll will simply reply to himself if he gets no (you)s. That being said you're right.
>>
>>106760621
they will never learn
>>106753726
>>
File: 1757377837801470.jpg (11 KB, 225x225)
qwen edit 2509 is more impressive than generic slop videos with sam altman.
>>
>>106760609
>yeah why look at miku hatsune
Sora 2 can do miku though, it can do a shit ton of characters
https://files.catbox.moe/1c3h2s.mp4
https://files.catbox.moe/6tmyun.mp4
https://files.catbox.moe/1seqwp.mp4
>>
>>106760621
>don't feed the trolls
a near impossible ask when most of the thread consists of troll posts
it'll die down soon thoever
>>
>>106760644
okay now do a japanese gravure model with big tits.
>>
>>106760657
>but muuhh coom
that's all /ldg/ cares about huh?
>>
File: 1757944334874321.png (1.09 MB, 888x1176)
stop arguing and play Starfield(tm)
>>
>>106760669
Now do anything that is remotely PG-13.
>>
File: 00149-240615931.png (2.59 MB, 1240x1240)
>>
File: 1756173605584358.png (1.06 MB, 888x1176)
>>106760677
>>
File: ComfyUI_05856_.png (1.45 MB, 992x1048)
>>106760583
Give the woman a severe case of leprosy
>>
>>106760689
>Now do anything that is remotely PG-13.
an Epstein Joke is good to you?
https://files.catbox.moe/p8zyu7.mp4
>>
>>106760621
You vastly underestimate the dedication this guy puts into sliding this long dick general
>>
>>106760720
>this guy
we all know it's debo though, everything bad that happens to earth is the fault of debo after all
>>
>>106760707
Nothing in that besides the meta knowledge makes it PG-13.
>>
File: 1756794538403652.png (992 KB, 888x1176)
THE FANS DEMANDED IT
>>
>>106760725
i'm debo#3 myself
>>
>proving my point for me
kek
>>
localkeks are NOT ready for Midjourney V8
>>
>>106760727
how about Hitler having his head blown out by a gun? is that also for all ages? >>106760610
>>
>>106760743
>brain blown out
>blood splatter
Okay, that might get PG-13. But I hope the cuts aren't a sign of a quality model.
>>
>>106760060
People who have never really used 4o or sora for still images wouldn't know, but from my use (and I am a local enjoyer, don't get me wrong, saas won't make my big titty 1girls taking dicks in their ass), OpenAI's models seem to have the best prompt recognition and understanding. That's not to say they have the best output, but I think anons in the thread are conflating output quality with prompt recognition. Which, fair, if the quality is too poor who cares if it adheres well, but I don't think it's that bad.
>>
File: 00010-2405399559.png (1.42 MB, 1024x1280)
>>
https://files.catbox.moe/obg1y4.mp4
>>
>>106760775
kek, this thread is fun with those meme Sora models, it's a good way to pass the time before talking about some somethingburger happening in the local space
>>
>>106760775
Absolutely schizo cinematography and framing.
>>
>>106760707
wan2.2 will throw cum-covered ass all over a horsecock. sora2 is cool but different jobs require different tools
>>
File: 621037~01.jpg (23 KB, 303x362)
What's the default CFG/steps for Wan without the light loras?
>>
>>106760790
>Absolutely schizo cinematography and framing.
I agree, their model is impressive but there are just too many cuts
>>
>>
>>106760816
it's not the cuts, the last cut is unusable and a shot no one would ever do, why would you frame his back and the empty back wall?
>>
File: 00488-950026451.png (2.66 MB, 1248x1848)
>goin fishin huh?
>make sure to get some good bait
>>
>>106760849
yeah, that too. since there are so many cuts, it has more chances of making mistakes; it's way easier for a model to do just one continuous shot. what they did is ambitious, but it's not accurate enough yet
>>
>>106760845
>tylenol
Kekd
>>
>>106760864
But maybe the next $1 pull will make a better video :^)
>>
File: 00163-2666148618.jpg (1.1 MB, 2480x2480)
>>
File: 00013-3042618728.png (1.39 MB, 1280x1024)
>>
>>106760644
I have a single question, aren't they scared of being copyright raped? Why are they allowing so many characters to be rendered lol
>>
why is he so easy to bait? he'll be melting down for hours
>>
so forge neo seems pretty good, all you have to do is git clone it then move your forge/reforge stuff to the same folders.

faster for loading/switching models too.
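the "move your stuff" step is basically just this; a rough sketch (paths are made up, and symlinking saves you from duplicating checkpoints):

import os

old = "/path/to/forge/models"      # existing forge/reforge install (illustrative path)
new = "/path/to/forge-neo/models"  # fresh git clone (illustrative path)

for sub in ("Stable-diffusion", "Lora", "VAE", "ESRGAN"):
    src, dst = os.path.join(old, sub), os.path.join(new, sub)
    if not os.path.isdir(src) or os.path.islink(dst):
        continue  # nothing to link, or already linked
    if os.path.isdir(dst) and not os.listdir(dst):
        os.rmdir(dst)  # drop the empty placeholder folder from the clone
    if not os.path.exists(dst):
        os.symlink(src, dst)  # point neo at the old model folders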
>>
>OpenAI's servers right now
https://files.catbox.moe/zu9sl2.mp4
>>
>>106760908
>git clone
this alone filters 70% of users
>>
>>106760908
are you trolling? it's completely unusable for chroma, qwen and wan.

and who cares about sdxl? if you insist on using sdxl reforge is so far ahead of neo forge it's not even funny
>>
File: 00167-3122639363.jpg (585 KB, 2480x2480)
>>106760930
I'm using chroma just fine
>>
chroma is dogshit trained at 1/4 the resolution of SDXL yet takes 5x as long to generate. any post that seriously suggests it can be discarded as tasteless coomerboomer babble
>>
>>106760948
He's out of line but he's right!
>>
File: file.png (910 KB, 832x1280)
>>106760940
do you have 64gb ram or a 5090?
neo forge insists on reloading chroma every single gen, so you're either lying (for whatever reason) or just have enough ram where the memory management doesn't shit itself like a retard

>>106760948
haha yeah i know right? your gens look so much better! oh wait.
>>
>>106760948
>tasteless coomerboomer babble
So SDXL?
>>
>>106760958
>So SDXL?
everything but Seedream and Midjourney
>>
File: 00188-1162602840.png (3.34 MB, 3072x768)
>>106760908
does it gen faster
>>
>>106760499
>Nobody giving a fuck about hunyuan image 3.0
>>106760633
>nobody gives a shit about it because its a piece of trash, if it could simulate reality and take 32845698 b200 everyone would still scramble to be the first api piggie to try it
chat is it true?
>>
>>106760957
I have both and nothing is taxed, I'm using fedora
>>
This is how Sam scraped Studio Ghibli's images btw
https://files.catbox.moe/qb7d5t.mp4
>>
>>106760990
He's right that no goofs really put a damper on things. But even the one anon who busted out his 96GB VRAM build thought it was ass.
>>
>>106760990
yes obviously. jeetykeks jump through hoops to run wan 2.2 on 8gb because it's good, but nobody cares about hunyuan because it's 1024x shit (still better than chroma btw)
>>
>>106760930
I use forge for my illustrious/noobai stuff. adetailer also works with a checkbox and is easy to configure and is not very comfy in comfyui.

I moved stuff over cause reforge isn't getting updated any more.
>>
>>106761015
>jeetykeks jump through hoops to run wan 2.2 on 8gb because it's good
this, if HunyuanImage 3.0 was an unslopped model that could do a shit ton of characters and celebrities, you bet I would find a way to stuff this shit on my poor 3090
>>
>>106761022
>adetailer also works with a checkbox and is easy to configure and is not very comfy in comfyui.
im UI agnostic but check out https://github.com/chrisgoringe/cg-controller
>>
>>106760930
reforge still better overall? okay, i'll have to compare both
>>
>>106760990
Yes, people would be renting clusters if Hunyuan was actually the best model ever made, but it doesn't seem even 10% better than QIE.
>>
File: 1750781475977569.png (2.46 MB, 1536x1536)
yeah reforge seems a bit better plus the img2img and inpaint UI is cleaner
>>
local image model ranking best to worst (post-sdxl):
qwen
wan single-frame
flux pixelwave
chroma
flux dev
hunyuan 3 80b
hidream
sd3.5m
sana
>>
>>106761132
for tranime or realism?
>>
https://xcancel.com/Adyseku/status/1973352752714752430#m
wtf lol
>>
File: 1752881996931518.png (1.23 MB, 1024x1024)
>>106761127
made that monitor gen so I can test qwen edit:

remove the text on the screen of the CRT monitor. add White ascii text that says "LDG general" on the monitor.
>>
https://files.catbox.moe/xipsho.mp4
Is this the first celebrity Sora can do that isn't Sam?
>>
Invalid workflow against zod schema:
Validation error: Required at "definitions.subgraphs[1].nodes[8].inputs[0].type"; Required at "definitions.subgraphs[1].nodes[8].inputs[1].type"; Required at "definitions.subgraphs[1].nodes[8].outputs[0].type"; Required at "definitions.subgraphs[1].nodes[9].inputs[0].type"; Required at "definitions.subgraphs[1].nodes[9].inputs[1].type"; Required at "definitions.subgraphs[1].nodes[9].outputs[0].type"


How do I use this information to debug the workflow? Obviously it has something to do with subgraphs, but what else?
>>
>>106761201
it's clear openai models are trained on absolutely everything, the most powerful models in the world. but they get censored and censored to the point where everyone forgets what they're capable of as they're neutered until they produce nothing more than sanitized slop
>>
Wow, it truly is over isn't it?
https://files.catbox.moe/nyn13v.mp4
>>
>>106761260
I'm not being ironic when I say this is as slopped as SDXL booru gacha. If you stop clapping like a retard at the flashing colors and start actually considering what you're seeing you'd see the details are nonsense. We're still in slop meme gacha territory, just like 4o before it.
>>
>>106761260
You can literally use this thing to make HD versions/continue manga adaptations where it left off etc...
Crazy
>>
File: remember SD3? lol.png (1.34 MB, 2047x524)
>>106761260
>APIchads are having their fun playing around with cool anime characters while localkeks can only boast that they managed to put a red sphere on top of a blue square... yay...
>>
File: 00060-2659787539.png (1.71 MB, 896x1152)
>>
File: 1728304467251327.png (1.17 MB, 992x1048)
>>106761281
>censored
>PG at most
yawn.
>>
>>106761278
k, good luck with that
>>
>>106761275
>We're still in slop meme gacha territory, just like 4o before it.
no one said it's perfect, but it's way better than what local can produce, and yet you're a fan of local right? curiously you're less harsh when it comes to judging the capabilities of Wan, and god knows this model can make some horror shit
>>
>>106760700
Unironically hotter

t. ghoulish enjoyer

>>106760815
Think it's like 5 CFG and 20-25 steps? I forgor, been using light and other slopped models for so long
>>
>>106761293
It's not just "not perfect", it's actually useless beyond generating short meme slop.
>>
File: 1748694875762694.gif (2.28 MB, 450x360)
>>106761303
>short meme
a.k.a the sovl of the golden age of the internet
>>
>>106761290
If it were local it'd just need controlnet and some other simple scene control. Add to that the possibility of a video edit model... It's 95%+ of the way there. Fan animations would actually be feasible for the first time.
>>
>>106761319
Yes, which is the value of $0. I'm not saying Sora isn't interesting, but it's so overhyped and you people don't realize they're going to rugpull you on the memes again just like they did with 4o.
>>
>>106761260
that looks like some YTP scenes of animes, that's funni
>>
File: 1756037045255772.png (1.23 MB, 1024x1024)
it doesn't matter if sora could do 4k HD videos, it is censored by default which is bad. and it's not free.

any fun prompt will be rejected by the openAI overlords.
>>
>>106761337
>Yes, which is the value of $0.
That's true, because having a good time and laughing out loud at these videos is priceless.
>>
https://files.catbox.moe/bvrz47.mp4
wan 2.5...
>>
>>106761356
Yeah just like those 4o memes you people share today right?
It's funny because the only real video model that has lasted is Wan because it can do uncensored.
>>
>>106761363
already better than sora
>>
>>106761363
>furry coom
lodestone is that you?
>>
>>106761363
Way too unsafe to be OpenAI for sure.
>>
>>106761366
to be fair this is a local thread, from time to time we talk about the news of an API model, but that's pretty much it, it shouldn't overstay its welcome, it should remain a place to talk about local in the majority of the time
>>
>>106761363
Is this legit wan 2.5?, cause gross furry shit aside that's legit good.
>>
>>106761363
you guys are pathetic, all you care about is coom, that's why everyone makes fun of localkeks btw
>>
>>106761390
You don't see Sora fucking anywhere. And 4o only gets posted in the OpenAI general which literally looks like only two people post in.
>>
>>106761363
This has no reason to go so hard
>>
File: hunyuan 3 examples.png (1.65 MB, 1109x974)
>>106761281
the problem with local labs is they fail to understand that these party tricks were meant to be a demonstration, not an end-goal. now we have models like hunyuan 3 emadmaxxing textslop while being incapable of producing anything pleasing. instead of going
>being able to produce text on signs indicates the model was trained on a wide range of well-captioned data, including text
they go
>we need to generate longer text than everyone else, quick generate some flux images of a man holding a blank sign and then paste 3 paragraphs of arial font gptslop into it! now automate this for 10000 more samples
>>
>>106761366
Yeah this model looks fun in short bursts but the schizo editing will get annoying quick. Already getting to me ngl
>>
>>106761363
Damn, that's actually repulsive.
>>
>>106761399
Well no shit, you have two options:
- low quality meme slop worth $0
- specialized slop for titillation
>>
>>106761407
>You don't see Sora fucking anywhere.
me when I lie, I couldn't escape Sora today and yesterday, whether it's tiktok, youtube, reddit, twitter...
>>
>>106761394
https://www.reddit.com/r/aivideo/comments/1nv72c4/
i'm assuming it is but it just says wan in the title

>>106761399
https://www.reddit.com/r/aivideo/comments/1nv8bh1/
i was looking for sora vids, this one has some good blood going at 3:10
>>
>wan waited to release 2.5 to make sora 2 look like shit
I kneel China
>>
>>106761424
>- specialized slop for titillation
also worth $0
>>
>>106761412
What gets to me is the fact that these text showcases always look like absolute shit.
Worse than a 2 minute photoshop job. It literally looks like someone just used MS Paint to plop some text on an image. No blending, no natural strokes. Just machine text on another layer from the background.
It fucking sucks ass.
>>
File: 00061-1303723015.png (2.22 MB, 896x1152)
>>
>>106761428
Sora 2 is being spammed just like 4o on release. Sora 1 is posted nowhere. And when they crack down on the memes just like they did with 4o, it'll die too, because the only things you're going to see spammed with Sora 2 are copyrighted characters. But have fun with your FotW fun model, not like we've not done this dance the last four major API video model releases.
>>
>>106761363
Is the song AI generated too? That's the most impressive part of the video.
>>
>>106761464
>Sora 1 is posted no where.
well duh, it was a bad model, Sora 2 is giving good shit if you test it out by yourself
>>
>>106761446
No it's definitely worth more than $0 because people spend money and time making those LoRAs. Anon, it's okay to admit there's no functional value to Sora 2.
>>
Needs more disabo
>>
File: whol.jpg (2.65 MB, 3456x1440)
>>
>>106761474
>Is the song AI generated too?
not a chance, it sounds better than Udio and Suno, it's probably a real song
>>
>>106761452
If I were to guess that's exactly how they assembled their dataset. Take blank_page.jpg and then dynamically generate text on it and then have the prompt "A white paper on a table with text that reads: generated_text".
>>
>>106761484
>people spend money and time making those LoRAs
and people spend money and time making anime character loras, your point?
>Anon, it's okay to admit there's no functional value to Sora 2.
memes have value, sorry if your only goal in life is to coom and not to laugh
>>
>>106761497
Yes, but there's ElevenLabs Music now which I think could be higher in quality.
>>
>>106761505
Post your last best gen.
>>
>>106761502
That sounds dumb enough to be entirely true, seeing how they really love their synthetic training data.
>>
ComfyUI custom node question:

Does anyone know the correct way to create a node that feeds an image filename into a Load Image node? It doesn't accept a string, even though that's what's being actually put out by the primitive node that feeds into it.

The input info looks like this:

@classmethod
def INPUT_TYPES(s):
. . input_dir = folder_paths.get_input_directory()
. . files = [f for f in os.listdir(input_dir) if os.path.isfile(os.path.join(input_dir, f))]
. . return {"required":
. . . . {
. . . . . . "image": (sorted(files), {"image_upload": True}),
etc.

So it's a list of filenames in the input directory.

Listing the return type as [ ] on a node allows you connect it to this input, so it's doing some kind of check to make sure it's a list, but when you actually try to run the workflow it finds the lists don't match. I'm not sure why that matters because the only value actually being returned by e.g. a primitive node connected to this input is a single string filename. I've tried to have the exact list recreated as a return type (lol) but it still finds they don't match.

deleted and reposted because I fucked up copying the code section lol
>>
after 2 days of running comfyui i came to the conclusion that you only need the epic realism checkpoint. prove me wrong
>>
File: 1744540823349298.png (222 KB, 1407x861)
>>106761517
It's sad that the chinks are so obsessed with mememarks, when will they understand that this is not what we care about??? and when we try to make them understand, they cry and piss themselves
>>
File: comf0.jpg (1.51 MB, 1536x2560)
>>
>>106761517
It actually would work quite well if you took care to make sure it didn't look computer generated, but since they're Chinese, much like Jeet, they don't go the extra mile.

Synthetic ultimately is the only way to scale to millions of dataset items with 100% accuracy. But realistically you'd do things like procedural generation with a game engine to create different images in different styles.
>>
>>106761522
Isn't there already a custom node like that? Also try asking Claude
>>
>>106761434
Man I hope those chinks keep to their promise and actually release it.
>sora
Pretty cool, motion is pretty good but man those cuts are cancer...
>>
>>106761522
ah shit, I'm retarded. The error was because I forgot to sort the list.
>>
>custom nodes
meh
>API nodes
now we're talking
>>
https://files.catbox.moe/hs4dbx.mp4
To give some credit to Chroma, it can do Animal Crossing so...
>>
>>106761559
Ok I'm still getting errors, just different errors. Something about how I'm doing it is wrong.
>>
File: ChromaRobocop_00010_.jpg (721 KB, 1912x1648)
>>
>>106761434
>>106761581
every time it cuts, there's some weird pixel blurring for a few frames, it quickly becomes a blurry mess when the cuts are frequent, like on that boxing fight
>>
>>106760364
either your initial output is deepfried as fuck and the upscale is simply fixing that, or you're using a bad vae
>>
Soracucks immediately go silent when wan 2.5 chads anally destroy them
>>
>>106761616
>you defeat them with furry coom
not everyone is a degenrate like you lodestone
>>
File: 1746210888878086.png (1.24 MB, 1024x1024)
>>106761354
remove the crt monitor from the image. the anime girl has both her hands on the desk.
>>
>>106761631
can sora have Sam anally raped by a fur god? I didn't think so
>>
>>106761522
You probably need to rewrite it to be a string for the image input so that the string primitive can hook to it. The LoadImage node is so shit.
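something like this is the idea, a minimal sketch (untested, the class and input names are invented) where the input is declared as a plain STRING instead of the combo-box file list that LoadImage validates against:

import numpy as np
import torch
from PIL import Image, ImageOps

class LoadImageFromPath:
    @classmethod
    def INPUT_TYPES(s):
        # plain STRING input, so a string primitive can connect without list validation
        return {"required": {"path": ("STRING", {"default": ""})}}

    RETURN_TYPES = ("IMAGE",)
    FUNCTION = "load"
    CATEGORY = "image"

    def load(self, path):
        img = ImageOps.exif_transpose(Image.open(path)).convert("RGB")
        arr = np.array(img).astype(np.float32) / 255.0
        # ComfyUI IMAGE tensors are batched [B, H, W, C] floats in 0-1
        return (torch.from_numpy(arr)[None, ...],)

register it in NODE_CLASS_MAPPINGS like any other custom node and it sidesteps the filename-list check entirely.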
>>
https://files.catbox.moe/6p1rbn.mp4
SAAR Altman
>>106761661
you can have Sam being a furry though >>106761181
>>
File: 1754190377498679.mp4 (3.17 MB, 1280x720)
>>106761631
How about a wan meme
>>
>>106761399
Outside of generating my own hentai and porno, what is the genuine use case for image and video generation? I mean really? I do manual labour and my hobbies don’t involve media. You look down your nose at it but would anyone even be using this shit nearly as much otherwise?
>>
Don't fret anon, open source will always be here for you.
>>
>>106761605
It almost feels like their magic sauce is that it generates multiple 3-second clips but uses the previous frames as input; it's a similar artifact to Wan with first frame insertion, where the first frames of the gen are funky.
>>
File: Fun not allowed.png (172 KB, 460x460)
>>106761680
>use case for memes?
>>
>>106761680
>I
>I
>my
that's your problem, people might have hobbies you don't like, and you don't seem to understand that simple concept, narcissistic behavior
>>
checkpoints folder, or diffusion models folder?
call it friendo.
>>
>>106761711
Why does other people’s use case matter for mine?
>muh narcissism
Yeah sure but how about an actual answer.
>>
File: that's right.png (89 KB, 618x640)
>>106761679
kek, that's cool, I love memes, whether it's made by a local model or an API model I don't give a shit, if it makes me laugh I welcome it
>>
So what version of chroma should I be using? And does anyone have a workflow for it?
How does it compare to WAN txt2img workflow?
>>
>>106761679
heh, not bad! something with focus lampooning current pol in this quality would get some reposts.
>>
>>106761723
And why does your use case matter for mine?
>>
>>106761697
I guess I somehow just assumed those were implicit, fair call out.
>>
>>106761738
It doesn’t, but I don’t sit here shitting on you for it like you (or whoever the REEEEE COOOMERS guy was). It’s as silly as the dudes who mald about anime on here, an anime website lol
>>
>>106761672
no thanks, it has to have acts of sexual deviancy
>>
Remember hypernetworks?
>>
>>106761724
That's the thing with memes, they get old fast. Though if you're having fun now, you do you, but I can't see this model lasting more than a month before the magic wears off
>>
Reminder to flush your vram with an empty prompt to stop buildup of loose bits or you get ramrot.
>>
>>106761733
i recommend chroma 1 base,

the current radiance snapshot is not for everyone (arguably you could try it too, since that's the one we can currently help improve in some ways)
>>
>>106761555
>Man I hope those chinks keep to their promise and actually release it.
same here, there's a few more wan 2.5 vids i saw on there but they were a lot more slopped than the furry vid

https://www.reddit.com/r/aivideo/comments/1nlucn1/
i usually hate these types of vids but some of the lyrics were p funny, makes me want to try suno
>>
>>106761755
remember embeddings? doras?
>>
File: 1752412229041615.mp4 (3.66 MB, 736x736)
>>106761763
>Though if you are having fun now, you do you but I can't see this model lasting more than a month before the magic wears off
lol, I'm still having fun with Wan + I2V, it's a great combo for infinite memes
>>
>>106761780
I still use an embedding just to save me the few seconds of typing out "masterpiece best quality" and bla bla for my booru models.
>>
>>106761834
>magic prompt keyword embedding
Yikes
>>
quick question, is hunyuan image 3 our monkey paw wish for gpt 4o at home? or are we still not there yet
>>
https://files.catbox.moe/pt8abc.mp4
it's impressive how well it's able to reproduce 80's styles videos
>>
File: 1735072024342886.png (8 KB, 405x71)
I've only been using these 2 upscalers in 2025, is there something newer or better I should look into? I kind of like upscalers that add some grain.
>>
>>106761260
>2:08
SOVL
>>
>>106761856
>is hunyuan image 3 our monkey paw wish for gpt 4o at home?
https://www.youtube.com/watch?v=H47ow4_Cmk0
https://www.reddit.com/r/StableDiffusion/comments/1nt22sm/hunyuanimage_30_t2i_example
>>
>>106761669
I don't understand why it doesn't work for me. I can make a node that the load image node primitive can hook into and the data it receives is just a string. I can make a node that hooks into the load image input without error and gives it a string but now I get
>Prompt outputs failed validation:
>LoadImage:
>- Exception when validating inner node: 'NoneType' object has no attribute 'endswith'
>>
>>106761733
workflow-wise, you could just load the models (chroma 1 base, flux ae vae, t5xxl) on your preferred sdxl workflow

i can share a workflow otherwise but IDK if you want the same custom nodes, they're not "what's needed to get chroma working", i just habitually like some nodes
>>
>>106761856
Hunyuan Image 3 is a retarded meme, as if we need 80B parameters. It's like building a 30,000 sq/ft single family home; they've completely missed the plot when they could've done something more practical, like a 24B image model and maybe an external, optional vlm guidance module.
>>
>>106761856
Sora 2 is the new benchmark. Until we get something like this opened locally, we will never be there. BFL is capable, but they bend the knee to safety.
>>
>>106761856
it's just generic chinese crap, there are no amazing gens locked behind the steep requirements. it's available for free on hunyuan's api and the outputs there are equally garbage. the output quality absolutely does not match the parameter count, and in almost all cases it somehow winds up looking worse than qwen/wan
>>
File: 1749580249303490.png (2.5 MB, 1536x1536)
wai v15 is pretty good, no lora

masterpiece, best quality, amazing quality, hatsune miku, takamaki anne cosplay, school uniform, classroom, smile, persona 5, sitting, desk

the first 2 are default, then you just add prompts. this extension is amazing for booru based models btw:

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete
>>
>>106761755
kek, yeah those were fun for sd1.5, they were super strong. wonder if there's a way to do chroma out there somewhere
>>
it's crazy someone could post a bare pussy right now and get nuked out of orbit within a minute but all the spamming wouldn't even be touched
>>
>>106761893
well that's kind of a relief to hear then
>>
>>106761901
based mods
>>
>>106761890
>Sora 2 is the new benchmark.
you can tell OpenAI could destroy the competition on image models as well, I wonder why they prefer to keep the piss filter instead
>>
I wish anon trained more LoRAs.
>>
PEAK KINO
https://files.catbox.moe/55wi6g.mp4
>>
>>106761866
damn that is p good
>>
File: Screenshot.jpg (26 KB, 505x234)
>>106760815
>>
>>106761848
Shitty slop models still say they need 'em so who am I to argue. Whatever it takes for my 1girls.
>>
File: 1750413587300693.mp4 (1.48 MB, 640x640)
>>106761895
the teal hair anime girl Miku Hatsune shakes hands with the blonde girl on the right, in the classroom.

welcome to class anon!
>>
File: ChromaRobocop_00017_.jpg (571 KB, 992x1456)
>>106761919
Hear hear! This goes back to the oven for 4k more steps
>>
>>106761895
Have you used v14 to compare? I’d upgrade but these rando mixes on Civitai have shown multiple times that whoever is cooking them is just throwing shit at a wall hoping it sticks and newer versions aren’t always better.
>>
>>106761866
I find it hard to believe that this isn't like a giant 200b model, it's capable of grasping all the subtleties of these styles. It's probably the first model that manages to really understand the world around it.
>>
>>106761866
They definitely trained on the 80s video marathons they have on Youtube.
>>
>>106761890
>Sora 2 is the new benchmark
Aren't they talking about an image model? What does Sora 2 have to do with it? While it's fun as video, I wouldn't call Sora vids high image quality. Can you even gen higher res on it?
>>
>>106761961
They're all sameslop anyway just use base.
>>
>>106761876
Where do you even find upscaling models? I'm a functional retard and have basically only found everything through civitai and there aren't many upscalers on there. I use one called "remacri", but is there something noticeably better out there?
>>
>>106761895
If you want plastic just use qwen.
>>
>>106761957
>4k steps
Holy fuck are you for real? How many steps does it usually take with Chroma?
>>
File: 1729247725345413.mp4 (1.26 MB, 640x640)
sora has no upskirts

wan does, without prompts. eat SHIT, saas users.
>>
>>106761966
>t. Hunyuan's marketing team
Why do you think you need 15 times the parameters of Wan? It's not even twice as good as Wan, and Wan can approximate most styles out of the box. Also 200B is infeasible as a business model; it'd cost $2 in electricity to generate a video.
>>
>>106761966
>I find it hard to believe that this isn't like a giant 200b model
don't cope anon, this model is too big for your 3060 anyways lol
>>
>>106761961
yeah, it's not radically different but it has more data and characters, the aesthetic isn't too different but it seems good so far to me. CFG 7, 20-30 steps works well with Euler a.
>>
>>106761997
>Also 200B is infeasible as a business model, it'd cost $2 in electricity to generate a video.
look at deepseek, it's cheap as fuck and it's a 671b model, they can make big models as long as it's MoE
>>
>>106761991
Not that guy, but I use the default setting on 100 epochs and it ends up being around 1500-2000 steps
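(for anyone wondering where numbers like that come from, steps are just dataset size times epochs over effective batch; the settings below are made up purely to show the arithmetic, they're not my actual config:)

images, epochs = 385, 100  # 385 borrowed from the dataset size mentioned further down
batch, grad_accum = 2, 10  # invented settings, just for illustration
steps = images * epochs // (batch * grad_accum)
print(steps)  # 1925, i.e. in that 1500-2000 range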
>>
every. one of my problems with wan 2.2 lately. have been because of bad FP8 quants across the board.
im gonna fucking. not even throw a fit. find my zen and just fap to some massive titties.
so, at this point now i can just tweak the lightning loras a bit to bring back some quality, lowering their strengths should do that right? tits and motion are looking a bit crusty and stiff.
>>
>>106761997
>Wan can approximate most styles out of the box
absolute bullshit, you're tripping my nigga
>>
Is there already technology that can take a photo of a person as a parameter and create a new image with that person in a different position, expression, clothes, or situation, without that person having been used to train an algorithm?

Example: photo of my neighbor
Prompt: girl dressed as a bunny in a cabaret
Result: the face generated on the "character" is that of my neighbor
>>
>>106761991
This is different because I've got a large dataset, so I use gradient accumulation to speedrun epochs
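the trick, roughly (an illustrative sketch, not any trainer's actual code): gradients pile up over N micro-batches and the optimizer steps once, emulating batch_size * N without the extra VRAM:

import torch
from torch import nn

model = nn.Linear(8, 1)
opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
accum_steps = 2
data = [(torch.randn(4, 8), torch.randn(4, 1)) for _ in range(8)]  # dummy micro-batches

opt.zero_grad()
for i, (x, y) in enumerate(data):
    loss = nn.functional.mse_loss(model(x), y) / accum_steps  # scale so grads average
    loss.backward()   # .grad buffers accumulate across micro-batches
    if (i + 1) % accum_steps == 0:
        opt.step()    # one real update per accum_steps micro-batches
        opt.zero_grad()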
>>
>>106762010
Wan is 16B. There is nothing that Sora 2 does that wouldn't even be feasible in 24B or 32B. Deepseek is also MoE, so it's only ~37B active during generation.
>>
>be me
>doing wan2.2 t2v
>they told me to use lightx lora cause it's fun
>getting drifting movements at 1280x720
>just fine at 720x480

what do? What is the recommended resolution?
>>
>>106762005
>more data and characters
That’s good a lot of the mixes seem to be stuck in the same dataset the original noobai used from 2024. Well disk space is cheap may as well try it thanks for the heads up
>>
>>106762026
>photo of my neighbor
rethink your life choices
>>
>>106762032
>large dataset
How large?
>gradient accumulation
I'd ask for a TLDR but I think that's something I can manage to read about on my own desu
>>
>>106762034
>Wan is 16B
it's 14b
>There is nothing that Sora 2 does that wouldn't even be feasible in 24B or 32B.
I hope you're right anon, deep down I wish it were true as well
>>
File: 1754712177309000.jpg (1.73 MB, 1664x2432)
>>106761980
This website here. It's great.
https://openmodeldb.info/
>>
>>106762026
Obvious Qwen Edit 2509 is obvious

never tried myself though
>>
>>106762035
take a peek at the wan guide in the lazy getting started guide in OP
>>
>>106762041
I used my neighbor as an example because she didn't have her image trained in an algorithm like celebrities do.
She is 70 years old.
>>
>>106762051
385 images
>>
File: 1746532314756956.mp4 (1.85 MB, 640x640)
make Miku bend over with Sora 2. I will wait, SAAS anon.
>>
>>106762056
she ain't got no nose, jim
>>
>>106762067
Do you also run just adam + 2 batch size or do you have some unholy mixture of settings?
>>
https://files.catbox.moe/8hsubg.mp4
desu if it kills sloptubers I'm all for it
>>
>>106762086
michael got her nose hee hee
>>
>>106761895
This one's good enough to go in my slop folder
>>
>>106762052
It is true, I don't know why people think parameters are everything and the reality is the only people who want more parameters are the people who want to make sure you don't run anything locally. It's funny because you can literally see the benefits and the diminishing returns of parameters as you compare models.

From HDM 300m to Pixart 600m to Lumina 2.6B to Flux 12B, you can see that performance diminishes per parameter, even pruned Flux (8B) still maintains 99% of base Flux's text legibility and prompt adherence.

So realistically you're adding parameters from Wan's 14B to add knowledge capacity, not necessarily for video realism.
>>
>>106762087
adamw8bit 2 batch + 2 gradient + snakeoil settings I'm testing. Still wrangling concept bleed
>>
File: 1738360631795970.jpg (570 KB, 832x1216)
>>106762086
Yes, I had to do (no nose:1.1) just to make sure of that.
>>
File: 1729528762452747.mp4 (1.32 MB, 640x640)
Miku laughing at Scam Altman:

also, getting better wan 2.2 movement with a combo lora setup another anon suggested:

Wan 2.2 lora setup for ideal movement: on the high-noise pass, the 2.2 lora at strength 1 chained into the 2.1 lora at strength 3; on the low-noise pass, the 2.2 lora at strength 1 chained into the 2.1 lora at strength 0.25.
>>
>so desperate for attention he will spend most of his time in a thread that doesn't welcome him
>only post in his containment zone as a means to cope with the humiliation from his defeat
>still can't make a good gen after all these years
Grim
>>
>>106762114
>I don't know why people think parameters are everything
look at wan 2.2 5b and wan 2.2 14b, one is a useless piece of shit and the other one is kinda cool, there's some threshold you need to surpass if you want to get closer to perfection, of course at some point you have diminishing returns, but I don't think it starts at 14b, it has to be higher
>>
>>106762135
20b? who knows.
>>
File: 1740589033337976.png (118 KB, 1457x547)
>>106762132
*there are two separate lora paths in the workflow, it's like this:
>>
>>106762135
Dumb comparison, you don't know how much they even trained the 5B model when it's clear the flagship model was always the 14B.

But way to ignore CogX which is also 5B and the one people were proudly showing Tom and Jerry cartoons from last thread.
>>
>>106762132
Miku left you for Sora 2, sorry anon
https://files.catbox.moe/1c3h2s.mp4
>>
how detailed are you supposed to be when describing things for wan? can i just say "she nervously looks around", or do you have to say "she has an expression of fear and panic on her face, she quickly looks to the left then quickly looks to the right. she bends her knees slightly" etc
>>
>>106762151
>But way to ignore CogX which is also 5B
this is even worse than Wan 2.2 5b, what are you smoking anon?
>>
>>106762173
But even with Wan you can curve between 1.3B -> 5B and 14B and realize there's clearly a diminishing returns curve. So anyone thinking even going to 24B is going to result in anything fantastic is crazy.
>>
>>106762190
Idk, is 14b enough to have Sora 2's knowledge? in the llm space, if it's under 100b it doesn't remember some less mainstream details and trivia
>>
>>106762200
I'm suggesting that even going from 14B to say 18B focusing on knowledge capacity would likely be more than enough.
>>
Dataset dataset dataset. It's all about the dataset.
>>
>>106762214
And yes, to even have Wan do 80s late night talk shows someone has to put that in the dataset and properly caption it.
>>
>>106762214
This, OpenAI likely spent millions of dollars and years manually annotating a huge dataset, which is difficult to replicate.
>>
File: 1741583119201902.mp4 (1.93 MB, 640x640)
guys I think the monitor is defective...
>>
>>106762228
It's more likely a SOTA caption model. We can only dream of a hand captioned model.
>>
There is no way Sora 2 fits within anything under 50b. Especially not with audio. The fact that it can copy a variety of video sources (spongebob, bob ross, english anime dubs, 90s commercials) while maintaining proper voice/audio theme is quite impressive. You might get something from China that can do audio/video to the same level, but it won't have the wide range of knowledge that Sora 2 has just like how Flux had better comprehension than Dall-E 3 but only a fraction of the knowledge.
>>
>>106762214
...which is all about the VLM
>>
>>106762241
Crazy idea:
- 2B audio generator
- 20B video generator
>>
>>106762235
>>106762245
but you need to train a caption model to be able to recognize all those characters and styles too, so... at some point you need manually annotated data to do that
>>
>>106762241
>There is no way Sora 2 fits within anything under 50b. Especially not with audio.
I'd say it's a 100b MoE model with 15b active parameters, it's fast enough so it's cheap, but big enough to memorize all the concepts
>>
Is there an equivalent of image2video but for image2image? Not the kind that generates an image similar to the input, but one that takes, for example, a picture of my teacher and returns my teacher naked smoking a cigarette, according to the prompt.
Thank you.
>>
File: Michelle T 4576.mp4 (3.46 MB, 1056x768)
Whoever created this masterpiece is a true god. I encourage you to continue with this most righteous endeavor. :D
Godspeed >
>>
>>106762264
Obviously posted by an >>>/b/AI+parody chad
>>
>>106762264
imagine
>>
the tiddimigu scared me
>>
>>106762271
No, it was one of your fellow gurus here in /g/ldg, I saved it from here about two weeks ago.
>>
>check other ai generals from different boards
>almost all of them are eerily civil

goddamn, get it together /ldg/
>>
>>106762285
it's civil at the moment, and we're talking about that API model, I like it, it should be like /lmg/, it's all right to speculate about SOTA models and see what makes them so special
>>
It's annoying that you have to scrape danbooru if you want a proper dataset, as everything on HF has no tags/metadata.
>>
>>106759949
>>106760002
>>106760064
>>106760098
>>106760126
>>106760148
>>106760237
The results are quite amazing, are you using the default comfy workflow?
Also how does it handle LoRAs with concepts it doesn't understand?
>>
>>106762299
open source always wins. qwen edit v2 > nano banana. also because no censorship, but it's arguably even more versatile.
>>
>>106762308
>qwen edit v2 > nano banana
I want to know what you're smoking m8
>>
>>106762309
>she is nude
>>
Needless to say but i'll say it anyway since we're all probably on this page by now

parameters are a moot point, it's the fucking giganigger fuckhuge captioned dataset that gives Sora its power level.
Question at this point should be, are the chinese bold enough to play dataset chicken with wan 3.0 or other potentially new models?
>>
>>106762303
yes, default workflow with Qwen-Image-Lightning-8steps-V2.0.safetensors at 8 steps. it actually works better than qwen edit v1, but you can try both.

it works 100% fine with loras and also with multiple image inputs, just reference them as "image2" or "image3".

ie: the anime girl is wearing the outfit in image2 (image2 being an outfit, on someone or cropped).
>>
New
>>106762321
>>
>>106762318
>Question at this point should be, are the chinese bold enough to play dataset chicken with wan 3.0 or other potentially new models?
it'll be an API model like wan 2.5 so I don't really care about what they will be doing
>>
>>106762318
I think the Chinese are incapable of paying that much attention to detail. They're all about being flashy as a culture and putting corncobs in the concrete. I don't think you can expect them to be that serious about the dataset outside of raw volume unless one of their researchers gets personally invested in accuracy.
>>
any good noob/Illustrious checkpoint/lora for classic fantasy characters and creatures? I wanted to see if I could do paper minis for dnd but my setup makes the goblins too uguu kawaii.
>>
>>106762302
>>106762341
>>
>>106760610
>completely unfazed by a member of the normandy appearing via portal
what a chad


