With the following core settings and a 640x360 video dataset, it is possible to train a Wan 2.2 14B i2v LoRA with musubi-tuner without hitting OOM:
--task i2v-A14B --sdpa --mixed_precision fp16 --fp8_base \
--optimizer_type adamw8bit --learning_rate 2e-4 --gradient_checkpointing --gradient_accumulation_steps 1 \
--max_data_loader_n_workers 2 --persistent_data_loader_workers --offload_inactive_dit \
--network_module networks.lora_wan --network_dim 32 \
--timestep_sampling shift --timestep_boundary 900 --min_timestep 0 --max_timestep 1000 --discrete_flow_shift 3.0 \
--max_train_epochs 16 --save_every_n_epochs 1 --seed 23571113 \
--save_state
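These flags are only the core settings, not a complete command. As a rough sketch of how they would be wrapped in a full invocation, the lines below assume the standard musubi-tuner Wan training entry point; the script path, model filenames, output names, and the --dit_high_noise flag for the A14B high-noise expert are assumptions and may differ depending on your musubi-tuner version and where your models live:

accelerate launch --num_cpu_threads_per_process 1 --mixed_precision fp16 wan_train_network.py \
  --dataset_config dataset.toml \
  --dit /path/to/wan2.2_i2v_low_noise_14B.safetensors \
  --dit_high_noise /path/to/wan2.2_i2v_high_noise_14B.safetensors \
  --vae /path/to/wan_vae.safetensors \
  --t5 /path/to/umt5-xxl-enc.pth \
  --output_dir /path/to/output --output_name wan22_i2v_lora \
  [core settings above]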
836x480 was close: it almost fit in 48GB, but there would be occasional peak memory usage spikes that caused an OOM, and it would happen before a checkpoint could be written.
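Note that the training resolution (640x360 here, or 836x480 if you want to push it) is set in the dataset config TOML rather than on the command line. A minimal sketch of such a config is below; the key names and paths follow the typical musubi-tuner dataset config format but are assumptions, so check them against your version's dataset documentation:

# general settings shared by all datasets
[general]
resolution = [640, 360]
caption_extension = ".txt"
batch_size = 1
enable_bucket = true
bucket_no_upscale = false

# one video dataset entry
[[datasets]]
video_directory = "/path/to/videos"
cache_directory = "/path/to/cache"
target_frames = [1, 25, 45]
frame_extraction = "head"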