/g/ - Technology
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106661715

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Cookies & Cream
>>
>>106666599
pretty cool collage
>>
File: 1751448759653009.png (194 KB, 2610x1210)
For those who missed it, maybe we'll get something better than Wan
https://byteaigc.github.io/Lynx/
>>
File: WanVideo2_2_I2V_00047.mp4 (543 KB, 480x512)
Thanks for the help with the multiple-sampler method to split up the LoRA strengths for each step. Sadly, it doesn't seem to work.
I'm trying to fix the color shift with the other wrapper wf.
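For reference, the split idea is just arithmetic over the step schedule; a sketch in plain Python (function and parameter names are illustrative, not ComfyUI node APIs):

```python
# Sketch of the two-sampler split: run one advanced sampler for the early
# (high-noise) steps with one LoRA strength, then a second sampler for the
# remaining (low-noise) steps with a different strength. The step boundary
# is the only real parameter; everything else here is illustrative.

def split_schedule(total_steps: int, boundary: float = 0.5):
    """Return (start, end) step ranges for the high- and low-noise passes."""
    cut = round(total_steps * boundary)
    high = (0, cut)           # first sampler: start_at_step=0, end_at_step=cut
    low = (cut, total_steps)  # second sampler: start_at_step=cut, add_noise off
    return high, low

def lora_strength_for_step(step: int, total_steps: int, boundary: float,
                           high_strength: float, low_strength: float) -> float:
    """Which LoRA strength applies at a given step under the split."""
    cut = round(total_steps * boundary)
    return high_strength if step < cut else low_strength

high, low = split_schedule(20, 0.5)
# high == (0, 10); low == (10, 20)
strengths = [lora_strength_for_step(s, 20, 0.5, 3.0, 1.0) for s in range(20)]
# first 10 steps use strength 3.0, last 10 use 1.0
```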
>>
>>106666683
I downloaded one video and it's 16fps, it's probably a finetune of Wan 2.2
>>
>>106666683
i love how those numbers just do not mean anything at all.
>>
>>106666755
> Built on an open-source Diffusion Transformer (DiT) foundation model
it's 100% wan2.2. but the somewhat interesting part is their ID adapter. that aside i see nothing else of value.
>>
File: 1745929321836140.mp4 (850 KB, 672x480)
>>
>>106666800
what's with the celery
>>
>>106666878
you don't know Hatsune Miku's leek? how young are you? lol
https://www.youtube.com/watch?v=6ZWwqTnqxdk
>>
>>106666890
I don't speak japanese, do you?
>>
>>106666912
>do you?
I do know how to search for the translated lyrics on the internet, yes.
>>
Blessed thread of frenship
>>
>>106666890
>Hatsune Miku's leek
Leekspin was orihime from bleach mikunigger
>>
File: FluxKrea_Output_36262.jpg (3.57 MB, 1664x2496)
>>
>>106666683
>>106666755

While the demos do look pretty good, it seems to still have the same 5-second cap... sigh...
>>
If I've got the hardware, are the fp32 options worth it over fp16 precision in the comfyui nodes? I can't tell much difference.
>>
>>106666979
https://www.youtube.com/watch?v=ekdKIKfY6Ng
damn I miss that era of youtube, all the recommendations were as kino as this
>>
File: 1745321222999795.png (2.08 MB, 1024x1552)
cute kote
>>
>>106667005
likely a finetune of Wan so yeah, with all the drawbacks associated with it
>>
>>106667011
isn't bf16 better than fp32?
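For context on the fp32-vs-bf16 question: bf16 keeps fp32's 8-bit exponent (same dynamic range) but only 7 mantissa bits, so big activations stay overflow-safe while precision gets coarse. A quick sketch in plain Python (simple bit truncation; real hardware conversion uses round-to-nearest):

```python
import struct

def to_bf16_bits(x: float) -> int:
    """Truncate an fp32 value to bfloat16 by keeping the top 16 bits."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return bits >> 16

def bf16_to_float(bits16: int) -> float:
    """Expand bfloat16 bits back to a Python float (via fp32)."""
    (x,) = struct.unpack("<f", struct.pack("<I", bits16 << 16))
    return x

# Same 8-bit exponent as fp32, so huge values that would overflow fp16
# (max ~65504) survive the round-trip:
big = bf16_to_float(to_bf16_bits(3.0e38))    # still finite, still ~3e38
# ...but only 7 mantissa bits, so fine detail is lost:
approx = bf16_to_float(to_bf16_bits(1.001))  # truncates to 1.0
```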
>>
nunchaku qwen image edit plus when???????????
>>
>>106667003
corpo slop you'd see hanging in your office to remind you not to think for yourself and to waste your life on the company while they rape you.

10/10 would kms again
>>
Is the state of unpozzed video gen good enough that there's a thread someplace with horrible degenerate images of heterosexual coupling (and not just "female with liquid overflowing")? Where? Is there a place to try my hand at it for free?
>>
File: 1747958829558856.png (534 KB, 1024x1024)
make a plastic anime figure of the cyan hair anime girl on a round pedestal. (qwen edit)
>>
>>106667130
> free
yes, simply buy a 5090 and you can gen it at home for free.
>>
File: 1737544147093371.png (553 KB, 1024x1024)
>>106667152
oops, didnt set upscale on the compare image nodes to lanczos. now it's better:
>>
>>106667153
dont need a 5090 for wan. 12 or 16gb is enough.
>>
File: 1746273254494760.png (2.12 MB, 1500x1500)
>>106667084
>again
what are you a fucking cat?
>>
>>106667189
sure, if you want to wait forever just for 5 seconds.
>>
>>106667203
with a 4080 I can get a clip in like 100-120 seconds with lightx2v loras.
>>
>>106666591
>having good success with the 2.1 lora at 3 strength for high, and 2.2 lora at 1 for low. only 2.2 high seems to affect motion in a bad way
nigga we been known this
>>
File: 1746702751519132.png (978 KB, 1072x968)
>>106667175
with both:
>>
>>106665018
Catbox?
>>
File: 1730104063321332.png (1.01 MB, 1072x968)
put the image on a coffee cup, that is placed on a table in a coffee shop.

the edit models are so neat. and you can use this stuff with wan or whatever. you can inpaint and do stuff like that but it'd be very hard to do this type of stuff without qwen edit/kontext.
>>
>>106667279
Woah that's crazy I had no idea this is entirely new information that no one's mentioned before thank you for sharing anon
>>
>>106667288
yes my bad for discussing diffusion models in the diffusion general.

point remains, these are really good tools cause any amount of denoise % or controlnets would not be able to do what these can do.
>>
>>106667003
I would use that in a powerpoint.
>>
How many GB vram do you need to train an SDXL LoRA?
>>
>>106667320
Much like the memes you choose to edit, the discussion you bring is old and stale kek
>>
>>106667250
Can it make a mamama apimiku style mimukawa miku?
>>
File: 1756649195842063.png (1.07 MB, 1136x912)
>>
>>106667394
kek
>>
File: 1752173970525730.png (1.58 MB, 1368x760)
replace the anime girl with Miku Hatsune. Change the text "DOROTHY: SERENDIPITY" to "MIKU: HATSUNE".

nice
>>
>>106667445
kys
>>
>>106667455
no ty sir
>>
>>106667445
pretty good, but hatsune miku isn't the hardest to change, most models know her well
>>
>>106667481
youre expecting anon to do something interesting? he would never
>>
>>106667481
you could also remove the model entirely and then swap them in with photoshop. what's neat about the edit models is they can edit or remove stuff but respect layers. can't do that with inpainting at high denoise levels.
>>
https://huggingface.co/Qwen/Qwen-Image-Edit-2509
it's up
>>
Incredible groundbreaking developments from the miku poster
>>
>>106667506
Enhanced Single-image Consistency: For single-image inputs, Qwen-Image-Edit-2509 significantly improves editing consistency, specifically in the following areas:
Improved Person Editing Consistency: Better preservation of facial identity, supporting various portrait styles and pose transformations;
Improved Product Editing Consistency: Better preservation of product identity, supporting product poster editing;
Improved Text Editing Consistency: In addition to modifying text content, it also supports editing text fonts, colors, and materials

sounds good
>>
>>106667506
>This September, we are pleased to introduce Qwen-Image-Edit-2509
>Multi-image Editing Support: For multi-image inputs, Qwen-Image-Edit-2509 builds upon the Qwen-Image-Edit architecture and is further trained via image concatenation to enable multi-image editing. It supports various combinations such as "person + person," "person + product," and "person + scene." Optimal performance is currently achieved with 1 to 3 input images.
let's fucking go dude, no more image stitching cope anymore
>>
File: 1734482800694463.png (424 KB, 405x720)
>>106667506
thank you chinks, you really are our saviors
>>
>>106667506
any fp8/q8 yet?
>>
>>106667506
https://huggingface.co/spaces/Qwen/Qwen-Image-Edit-2509
https://huggingface.co/spaces/akhaliq/Qwen-Image-Edit-2509
there's demos in here
>>
>>106667537
it's been up less than an hour, unlikely
>>
>>106667506
already?
lmao, qwen is actually speedrunning to agi in 3 years...
>>
>>106667445
>>106667279
>>106667152
Can you try with the updated model >>106667506
>>
>>106667506
>and it's not over
we'll get wan 2.5 in a few days, Alibaba is better than Santa Claus lmao
>>
File: 1728428382115506.png (2.28 MB, 3072x1638)
>>106667546
>https://huggingface.co/spaces/akhaliq/Qwen-Image-Edit-2509
wtf is this shit ;_;
>>
>>106667574
there's no quant or fp8 download yet; I assume it has to be converted from this batch of files.
>>
> This September, we are pleased to introduce Qwen-Image-Edit-2509, the monthly iteration of Qwen-Image-Edit.

what? so they are updating it monthly or what?
>>
>>106667601
China does what Scam Altman dont
>>
>>106667506
sadly it does look like just an incremental improvement
>>
>>106667601
>what? so they are updating it monthly or what?
I think they've already done that with LLMs
>>
at what point will you wake up and stop getting excited for chinese slop
>>
>>106667506
NUNCHAKU NEXT TARGET
>>
>>106667611
sure, but is this just a monthly further finetune or the actual "edit plus" they were talking about?
>>
>>106667622
>NUNCHAKU
give up bro lol
>>
>>106667619
slop? wan is better than any western video model and is open source. qwen/qwen edit are free. OpenAI want you to pay $1000 a month for 5 prompts a day.
>>
>>106667506
if that one doesn't zoom in randomly, all we have to do is SRPO this shit and we'll be back
>>
that's nice and all but let me know when they are brave enough to include aroused genitals in the dataset
>>
File: file.png (15 KB, 450x115)
ok yeah sure
>>
So are the lightning loras supposed to be 2 high 2 low or 4 high 4 low?
>>
>>106667619
never, we can save it!
>>
>>106667611
I still would welcome a leak of dalle 3. It has a weird vibe no other model gets right.
>>
File: 1744949353100508.png (1.28 MB, 1280x720)
>>106667506
>The girl from Image 2 is sunbathing on the lounge chair in Image 1
>The girl from Image 2 is drinking coffee on the sofa in Image 1
excellent, that's exactly what I wanted from an edit model
>>
>>106667678
scale is way off in the coffee shop image
>>
>>106667640
wan is the only non-slopped decent model from xi, it's an outlier. need i remind you of the dozen failed image models
>>
>>106667692
it was a bad idea to use the ratio of the 2nd image; it should've been the ratio of the 1st image
>>
>>106667506
>60GB
do I need 60gb vram to run this?
>>
>>106667619
Qwen Image is less slopped than Flux and has the apache 2.0 license; if only it wasn't so big it would be finetuned by another suspicious richfag
>>106667704
it's the same size as Qwen Image, so a 24gb vram card will suffice (Q8 QIE + Q8 text encoder)
>>
>>106667704
where'd you get that number? I'm seeing 40, and that's fp16 so fp8 will be 20
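That 40 → 20 arithmetic generalizes to the quants people are waiting on; a sketch, where the ~20B parameter count for the Qwen-Image-class model is an assumption carried over from the thread and the bits-per-weight figures are nominal GGUF values:

```python
def model_size_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Rough weight-file size: params x bits / 8. Real files run a bit
    larger (embeddings kept at higher precision, quant scale metadata)."""
    return n_params_billion * 1e9 * bits_per_weight / 8 / 1e9

fp16 = model_size_gb(20, 16)   # 40.0 GB, matching the number above
fp8  = model_size_gb(20, 8)    # 20.0 GB
q8_0 = model_size_gb(20, 8.5)  # GGUF Q8_0 adds a scale per 32-weight block
q4_k = model_size_gb(20, 4.5)  # ~11 GB, why low quants fit smaller cards
```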
>>
>>106667712
I use q8 image/edit on 16gb (4080) without issue, not even using multigpu node.
>>
It's shit. You will cope for a week saying how it's better than nano banana until you too finally admit it's shit.
>>
can you actually run the bf16 qie with blockswap on a 24gb card?
>>
>>106667745
who legit cares about whether it's better than nano banana? this shit is free and wildly uncensored compared to any paid non-local model.

you fat cunt.
>>
File: retiring.jpg (148 KB, 1376x768)
148 KB
148 KB JPG
>>106667546
>>
>>106667745
>You will cope for a week saying how it's better than nano banana
no one will say that lol, I don't expect QIE to beat nano banana anytime soon
>>
Is Wan 4:3 only, or can it do 9:16 (i.e. iphone), 1:1, etc.? I mostly gen between 4:5 and 9:16, so 4:3 doesn't work for me...
>>
>>106667760
the eyes are sus for poor Ryan, but I like the skin texture though
>>
>>106667767
this is a dumb question.
>>
>>106667767
just try it?
>>
>>106667767
you can do any size, smaller is good for fast gens (ie: 640x640 vs 832, etc)
>>
the model has been out one hour now

where quants
>>
why aren't all loras migrated to the new model already?
>>
>>106667767
If only there was perhaps a guide written that included this very information.
>>
File: 1727804101294440.png (2.95 MB, 3562x1664)
>>106667506
not bad at all
>>
File: 1741028704148360.png (574 KB, 1920x1080)
Qwen Image Edit PSA

Always add:
"without changing anything else about the image"
at the end of your prompts if you want to preserve anything at all from the original image

Also here's a great workflow for the old Qwen Image Edit model
https://files.catbox.moe/6wcz4m.png
>>
>>106667839
your advice is deprecated anon kek >>106667506
>>
>>106667821
Can you catbox those two images so I can try it with the old model? And paste the prompt too if you can
>>
>>106667856
I found it on reddit so I can't help you with that
>>
>>106667853
No, I'm giving it specifically because the new version also needs that same sentence appended to preserve things properly, and because I see people using bad workflows for the old model and concluding it's bad, when the new one is just an incremental improvement.

The new model still doesn't keep the exact same resolution, and it obviously still has the same VAE quality loss, as it's still not pixel-space.
>>
File: Chroma_Output_24252.png (1.41 MB, 1024x1496)
>>
>>106667872
once lodestones finishes his radiance/pixelspace model we will likely see more models adopt it.
all pissy trolling aside, it really keeps images nicer not having to run them through a vae. this will be important for edit models, where you simply can't iterate because the quality gets raped by the vae.
>>
do people still use guidance that makes generation take 2x as long if you use negatives? haven't done SD in a while
>>
File: 1749081402833794.png (727 KB, 1176x880)
quants where?
>>
File: 1750371447094879.mp4 (3.77 MB, 1920x1080)
3.77 MB
>if that one doesn't zoom in randomly,
it does, look at 33 sec
https://xcancel.com/Ali_TongyiLab/status/1970194603161854214#m
>>
>>106667821
looks kinda bad, you can see how it completely rejects the blue jacket's texture when outpainting. looks like a 512x512 crop pasted on top
>>
>>106667642
spro is a meme lil bro, let it go
>>
>>106667906
oh yeah nice catch
>>
not uploaded yet but seems like the first quants are here https://huggingface.co/calcuis/qwen-image-edit-plus-gguf
>>
>>106667916
we'll still have to wait for comfy to implement the multi image process too
>>
no qwen image edit plus nsfw finetune yet?
>>
>>106667885
>once lodestones finishes his radiance/pixelspace model we will likely see more models adopt it.
yep, for edit models it'll be mandatory to go pixel-space; maybe QIE will be the first to do it, who knows
>>
>>106667932
This is gonna be brutally heavy on vram and ram tho
>>
File: 1749166461153434.png (1.94 MB, 1505x1466)
>>106667506
now that can be interesting to experiment with
>>
>>106667938
I still think 20b is overkill, if they manage to keep the quality with 13-14b + pixel space we could manage to run this shit
>>
>>106667886
Any CFG above 1 needs a second, unconditional forward pass per step (that's where the negative prompt goes), so yes, it roughly doubles gen time; distilled/lightning models run CFG 1 to skip it.
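To make the cost concrete, a minimal sketch of why CFG > 1 (i.e. an active negative prompt) roughly doubles per-image time; this is generic sampler arithmetic, not any particular UI's code:

```python
def forward_passes(steps: int, cfg: float) -> int:
    """Model evaluations for one gen: classifier-free guidance needs a
    conditional AND an unconditional pass per step whenever cfg > 1."""
    per_step = 2 if cfg > 1.0 else 1
    return steps * per_step

# 20 steps at cfg 5.0 -> 40 passes; a distilled model at cfg 1.0 -> 20,
# which is the "2x as long with negatives" the question refers to.
```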
>>
>>106667782
it takes me like 10 minutes every time I "just try" something with video, so I'd rather not waste hours rediscovering the handful of things everyone else already knows. Besides, I was hoping someone might know how it was trained and whether it was intended to support those resolutions.
>>
>>106667506
it says it supports ControlNet, holy shit
>>
File: 1748998248144044.png (2.86 MB, 1752x1590)
Here's the new vs old Qwen Image Edit models for comparison with the Will Smith example posted above.

We need SRPO and the no-VAE quality loss that the Chroma Radiance pixel-space research has; this is just an incremental improvement that isn't much different, as the old model is already pretty good depending on the prompt and workflow.
>>
>>106668015
I like the improvement, the face is more accurate and the skin texture is not as slopped as before
>>
>>106668015
Old version workflow: https://files.catbox.moe/r0kyif.png
Same workflow as posted at >>106667839

>>106667821
>>
>>106668015
old version looks so much better ahahah.
can you do some more comparisons? i'm out of gpu time on hf
>>
>>106667601
>what? so they are updating it monthly or what?
lmao, are they really gonna upload a new version of QIE each month? sounds crazy, I guess they realized the training wasn't over and the loss curve hadn't flattened yet
>>
File: 1730766689667884.png (1.22 MB, 1280x768)
>>106668015
>>106668028

Also keep in mind that the deployment parameters of models matter a lot, so we need to wait for the best workflow to be created for a more like-for-like comparison.

For example, with this comparison and the generated image you see there: on the old model I added
"Don't change anything about their heads at all, keeping their faces and heads exactly as they are."
to the prompt and still got the same image as in picrel without that sentence. The old model can't copy the original images like-for-like the way the new one can, even if the source images are low quality themselves; it may be a showcase of the new model following the prompt better, which is important.
>>
https://huggingface.co/calcuis/qwen-image-edit-plus-gguf/blob/main/qwen2.5-vl-7b-test-q4_0.gguf

what's this
>>
I just bought a 5060 Ti (16 GB) instead of a 5070 Ti.
Not worth 2x the price; still a massive upgrade from my 3060
>>
>>106668134
the text encoder? it's probably the same text encoder as the previous Qwen Image model
>>
>>106668137
Waste of money. Should've waited until you had more and bought a 4090 or 5090. I can't imagine doing video gens on 16GB.
>>
well I guess q8/other models will be up later today some time.
>>
File: elf hugger_00435_.png (3.56 MB, 1080x1920)
>>
>>106668151
wan q8 works absolutely fine on a 4080 (16gb). the only thing you have to consider is not making the dimensions *too* large cause that needs more vram.
>>
>>106668134
>>106668149
if that's the same text encoder he's wasting his time, there already have gguf of this
>>
>>106668137
nice!
>>
>>106667786
imo there are specific resolutions that wan works best with, and i will continue to stick to the wan 2.1 resolutions: 1280x720 for high res and 480x832 for low res.
>>
>>106668137
Should've waited for the super cards. A 3090 is faster than a 5060 ti and you're now stuck with 16gb.
>>
>Sarrs... a second model is released this week.

Wtf, I didn't get to fuck around with Wan animate completely yet. We're eating too good.
>>
>>106668161
you can't do 720p, and future wan models may not do the whole high/low split thing again. if they don't, you'll be forced to use a lower quant like with wan2.1
>>
>>106668181
what the hell even is wan animate? is it like vace?
>>
>>106668151
waste of money, by getting +70% faster gens? not really. Not very interested in video

>>106668168
yeah! looking forward to it.

>>106668175
I considered waiting for them, but when they're coming is not confirmed - and they're hardly going to be anywhere close to MSRP anyway
>>
>>106667506
>Multi-image Editing Support: For multi-image inputs, Qwen-Image-Edit-2509 builds upon the Qwen-Image-Edit architecture and is further trained via image concatenation
what makes it different from the image concatenation cope we used to do with the previous QIE?
>>
>>106668216
>waste of money
it's literally not. vram is absolute king.

>Not very interested in video
it's beneficial for training loras too and future proofing for the latest models, but you do you. 100% wasted.
>>
>>106668212

https://humanaigc.github.io/wan-animate/

>tl:dr

Character replacer for videogen.
>>
>>106668236
whos the girl anyway?
>>
>>106668234
>vram is absolute king
would you take a 96GB GTX 680 over a 16GB RTX 4080?
>>
>>106668236
Seems VACE is a component within Wan Animate, so essentially it's doing the same thing except better I guess.
>>
File: ComfyUI_01086_.jpg (329 KB, 864x1152)
>>
>>106668258
No because it's GDDR5 vram not GDDR7.
>>
>>106668234
I have had no issue training LoRAs with hundreds of images. Not wasted for me.
>>
>>106668286
...would you take the GTX if it was GDDR7?
>>
>>106668236
that face seems grafted on her, scary
>>
>>106668234
>it's literally not. vram is absolute king.
I've seen no difference in my gen speeds (well, 5%, aka nothing) while sending parts of the model to ram.
Compute >> vram once you have enough vram for the compute and the ram can host the rest of the model.
>>
File: output_image_edit_plus.png (1.36 MB, 896x1152)
I will run a couple tests of the new QIE with the stock inference code if you post image pairs and a prompt.

I tried it a few times and the results weren't too good. It retained random bits of the source. It's also unclear how to refer specifically to image 1 and image 2 in the prompt.
>>
>>106668243

Meme Dance from some vtuber, real dancer is Yaorenmao

https://www.youtube.com/watch?v=db8FRxYM97Y

gacha character replacement as a test

https://nikke-goddess-of-victory-international.fandom.com/wiki/Liter#Cute_Sunflower
>>
File: awsgvawgvawsgvbawgvbw.png (9 KB, 476x131)
>>106667506
aint no gaht damn way kek guess i'm not running this shit locally?
>>
>>106668286
>>106668295
the silence says everything
>>
>>106668355
what were you trying to combine?
>>
>>106668386
It looks so much better in the original.
>>
>>106666599
>>106663722
Is bottom-left Chroma Radiant?
>>
>>106668423

Up to (you)s to mess around and find out what works. Wan animate is only out for couple days. Took a while before Wan2.1/2.2 to get good too.
>>
File: ComfyUI_18315.png (3.44 MB, 1728x1152)
3.44 MB
3.44 MB PNG
So, Flux SRPO and Chroma share the same grubby output, how do you Chroma guys help mitigate the problem? Upping the resolution certainly helps, but detail is still pretty poor the further into the image you look.

Also, these Flux lines are killing me... I forgot how bad they were when you can't get them to go away.
>>
File: input2.png (458 KB, 600x767)
>>106668422
A basic miku and this space marine
>"the girl is wearing the space marine's armor. she is not wearing the helmet."
>>
>>106668452
it has stealth metadata which says
Model hash: ea349eeae8, Model: noobaiXLNAIXL_vPred10Version, Hashes: {"LORA:noob-ft-1536x-extract.safetensors": "62ab3e5fbe", "LORA:96_chadmix_vpred10_1a.safetensors": "3e4c2efab7", "LORA:96_chiaroscuro_vpred10_1a-000022.safetensors": "29061f3419", "model": "ea349eeae8"}
>>
>>106668236
Still looks like shit
>>
>>106668468
Nice, thanks
>>
>>106668467
go for, "the anime girl from image 1 wears the armor from the image 2"
>>
>>106667250
>childlike innocence
>>
>>106668466
lots of steps and second pass
>>
File: output_image_edit_plus6.png (1.25 MB, 896x1152)
>>106668493
It's definitely a better result with that prompt, but it's also trying too hard to preserve the original
>>
Is there a Comfy native workflow for Wan Animate yet?
>>
>>106668634
It's open source, so there's a delay; there's a lot of SaaS stuff for the comfy man to implement first
>>
File: 1728412965403171.png (427 KB, 981x1483)
https://xcancel.com/LodestoneE621/status/1968687032605065528#m
still no comfyui nodes for this?
>>
>>106668654
Is this possible for text models
>>
>>106668654
I'll believe it when i see it i tell's ya.
>>
>https://github.com/comfyanonymous/ComfyUI/pull/9979

Someone from the LTX team fixed the ComfyUI memory leaks but the devs are probably getting mangled by their VCs and cba to push it to main.
>>
File: srghbarhgbareharh.png (106 KB, 521x546)
>>106668685
I will spare this fella from my daily jew jokes henceforth, holy fuck cumfy stop fucking around and push this shit. what the fuck are their priorities?
>>
>>106668685
based, it's important to have this shit when using wan 2.2 (with all the unloading/reloading shit)
>>
>>106668706
That guy is almost cartoon level jew stereotype lol. Based af though if his implementation holds up, I will kneel to him
>>
>>106668706
just copy/paste the file yourself
>>
File: 00024-801614018.png (3.86 MB, 1536x1536)
>>
>>106668742
>That guy is almost cartoon level jew stereotype lol.
I thought he was arab lol
>>
>>106668737
bro i literally cannot run wan 2.2 at all on any quant because of this issue; it'll get through the first pass then rape my computer because it's not cleaning its memory properly. meanwhile i've been steadily perfecting my 2.1 workflow and i can keep running another 3 or so gens before i get an OOM.

>>106668742
someone in here lurking probably has a better eye for all those desert phenotypes, he's probably not like "pure jew" but still incredibly based. >>106668753 this is what i mean kek
>>
https://huggingface.co/calcuis/qwen-image-edit-plus-gguf/blob/main/qwen-image-edit-plus-q2_k_s.gguf

this guy is uploading q2 first...
>>
>>106668773
even if we have the weights we can't really test the multiple image shit, comfy has to implement this
>>
>>106668773
haha. aahh. i'm not even gonna try this model.
>>
File: 1.jpg (6 KB, 294x18)
worth it
>>
File: grizzlychips.webm (1.26 MB, 576x720)
>>
>>106667506
https://imgsli.com/NDE3MjYy
still has the zoom issue, fuck
>>
>>106668793
What was it?
>>
god I fucking love China FUCK
>>
>>106668851
1280x720 80 step porn clip. came out perfect
>>
File: nero what.gif (3.71 MB, 500x281)
>>106668872
>80 step
>>
>>106668753
both are semite so yeah
>>
are you serious? https://github.com/comfyanonymous/ComfyUI/pull/9979/files
just 3 lines of code were needed to fix the retarded memory leaks?
fucking useless.
>>
>>106668849
OH COME ON
>>
>>106668793
>>106668872
SIX HOURS?
And I was already thinking I was crazy back then when my gens were like 2h each before any optimizations.
You've got to share it, catbox!
>>
> 6 hours for 5 seconds

the absolute state of video gen'ers
>>
>>106668973
would you have found it?
>>
https://github.com/comfyanonymous/ComfyUI/pull/9986
damn this motherfucker is fast, now I'll wait for the gguf quants
>>
>>106668973
>nocoder
>>
>>106669012
haha funny how fast they can just implement new SOTA models meanwhile >>106668973

funny that innit bruv
>>
>>106669018
dude, sometimes the solution is in the details; there are millions of lines of code in that project, good luck finding the one that caused the issue
>>
File: 1755012436492595.jpg (468 KB, 1184x1046)
chat, i'm going to manually apply the mem fixes
>>
File: 1644982565793.gif (3.58 MB, 200x183)
>>106669023
if loaded_model.model.is_clone(current_loaded_models[i].model):
    to_unload = [i] + to_unload
for i in to_unload:
-   current_loaded_models.pop(i).model.detach(unpatch_all=False)
+   model_to_unload = current_loaded_models.pop(i)
+   model_to_unload.model.detach(unpatch_all=False)
+   model_to_unload.model_finalizer.detach()

total_memory_required = {}
for loaded_model in models_to_load:

you faggots were just crying every thread that there WEREN'T memory leak issues, now it's "w-wwell aschkully muh six million lines of code"
>>
never forget the six gorillion lines
>>
File: that's right.png (380 KB, 401x612)
>>106669039
and yet you weren't the one who proposed a solution to this as a PR, brother
>>
>>106669039
comfy fanboys are pretty cancer unfortunately. Why the fuck does a UI even have fanboys lol
>>
File: randy savage glasses 1.png (419 KB, 640x480)
>>106669055
ooooo i'm taking your hulk hogan slam jam no-u-aroo and no-u-aroo'ing you right back brother, why didn't you come up with the wrestlemania 1989 solution to this PR hogan? yeeaaahhh that's what i thought YEAH.
>>
>>106669039
>you faggots were just crying every thread that there WEREN'T memory leak issues
no one said that wtf
>>
lodestonesissies, /lmg/ is making fun of our project :(
>>106669029
>>106669087
>>
>>106669082
Pretty much look through all the github issue page, the reddit page, and heck even previous threads while back. Anybody who said about the memory leak are met with
>works on my machine lol
>>
gib q8 qwen edit update
>>
https://www.reddit.com/r/StableDiffusion/comments/1nnyur4/mushroomian_psycho_wan22_animate/
kek, wan animate seems fun to play around with
>>
>>106668973
Acktually it was just one "model_to_unload.model_finalizer.detach()"
>>
>>106669106
It does work on my machine tho. Idk if you niggas are trying to run it with 16GB ram and cry when it crashes.
>>
>>106669113
this
>>
What's the alternative to cumfy?
>gradio
>ani
>>
>>106669097
> siss
> our
piss off
>>
File: oh-yeah randy.gif (359 KB, 220x145)
>>106669082
>>106669113
>blames KJ's nodes
>blames wrappers in general
>blames "its your workflow bro"
>blames "its your ram bro lol"

>now there's a pr for it
>well we never said any of those things

it's enough to BRING ME TO THE BOILIN' POINT!
>>
>>106669131
We got a new schizo in town, but it's a WWE schizo so I'll let it pass, that's funni
>>
>>106669139
>pointing out observable thread behavior (and even things outside 4chan) is schizo behavior
y'know what buddy, just for that, i'm done with the WWE bit. no fun allowed.
>>
RIP WWE Shizo
2025 - 2025
>>
https://huggingface.co/calcuis/qwen-image-edit-plus-gguf/blob/main/qwen-image-edit-plus-q3_k_l.gguf

q3! they are slowly getting to q8...
>>
>>106669170
why did this retard start with the small quants? reeeeee
>>
just ask grok to find the leak it's got all the context window
>>
UPDATE

https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF/tree/main

not up yet but I use this repo for ggufs all the time, should be there soon.
>>
>>106669113
same. never had any issues with memory leaks using wan2.2. have done 100+ gens in a single session. zero problems.

i do however have an obvious memory leak/lag with my SDXL workflow for some odd reason. I have to refresh the page after 30 or so gens because the ui becomes unresponsive; i can't figure out why it's happening.
>>
>>106669173
indian website sir
>>
>>106669177
just like that kid with the joker profile pic who asked chatgpt to make triton 4x faster and somehow tricked everyone on reddit into downloading his slop then disappeared after someone called him out
>>
>>106669178
Retard here, 2509 is the same as plus, right?
>>
>>106669194
pretty sure yes.
>>
>>106669170
>>106669178
>>106669194
>>106669201
>>
>>106669193
the fact most redditors just upvoted the shit out of it and praised it proves that there are too many retards that are dumb.
this is why grifters thrive. too many people with absolutely zero braincells who throw money at shit.
>>
>>106669210
Did he earn any money from that?
>>
>>106669210
>too many retards that are dumb
cool tautology, guess i'm one of them.
>>
claude is awesome for making general purpose custom nodes though
>>
Every time I hear something about memory issues it's some poorfag retard trying to blockswap with kijai
>>
>>106669212
no, i don't think there was a patreon linked etc.
>>
File: file.png (1.18 MB, 990x901)
Hello any idea how this was made?

https://www.youtube.com/watch?v=yi8ffoNrj9k&t=6s

especially Maria, I think, looks great; if this was done without any LoRAs, it's a step forward in simplified workflows
>>
>>106669210
or the fact that almost no one actually tested it themselves to see if it even offered a performance boost.
>>
>>106669237
lol what
>>
>>106669237
>take a sprite
>use Qwen Image Edit -> make the image realistic
>I2V with Wan
>>
>>106669237
Feed the 2D into qwen image edit to get the 3d image, then feed the 3d image into a wan image to video pipeline. No character loras needed but maybe use a style lora and good prompting
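The recipe in these replies is a two-stage composition; a hypothetical sketch, where both stage functions are stand-ins for real model calls (Qwen-Image-Edit for the 2D-to-realistic step, Wan I2V for animation) and the filenames are placeholders:

```python
# Hypothetical sketch of the pipeline described above. The stage
# functions are stubs for real model calls, not a real API.

def edit_to_realistic(sprite_png: str) -> str:
    """Stage 1: would call an image-edit model with a prompt like
    'make the image realistic' and return the new still's path."""
    return sprite_png.replace(".png", "_real.png")

def i2v(still_png: str, motion_prompt: str) -> str:
    """Stage 2: would feed the realistic still to an image-to-video
    model with a motion prompt and return the rendered clip's path."""
    return still_png.replace(".png", ".mp4")

def sprite_to_clip(sprite_png: str, motion_prompt: str) -> str:
    realistic = edit_to_realistic(sprite_png)  # 2D sprite -> photoreal still
    return i2v(realistic, motion_prompt)       # still -> short video clip

clip = sprite_to_clip("maria_sprite.png", "the woman dances")
# clip == "maria_sprite_real.mp4"
```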
>>



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.