/g/ - Technology




Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107460114

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: qwen_00105_.png (1006 KB, 832x1248)
>>
File: ZImage_00855_.png (966 KB, 1152x896)
>>
File: is_it_really_base_2.png (2.61 MB, 1621x1579)
We might not get the pre-train base model. We might get the SFT one with slop merging.
>>
File: zimage df11.jpg (835 KB, 2048x2048)
I was bored so I messed with this a bit
https://github.com/mingyi456/ComfyUI-DFloat11-Extended
I have a 12GB VRAM card, so I typically run z-image bf16 with circa 2GB offloading for 1024p images.
In this setup df11 actually runs SLOWER than bf16 despite easily fitting into my VRAM. Apparently the decompression has an overhead.
So I tried a 1520x1520 image (relying more on offloading) and the gap got smaller. This made me hope it would run faster at 2048p, but when I tried it, it was the same story as at 1520x1520. No idea how that works, but it is still very slightly slower than bf16.
Maybe this shit is more useful for /lmg/? But I doubt anyone would bother with it over Q6_L or whatever.
TLDR: Not worth it
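The overhead makes sense if you model step time as compute + PCIe transfer + per-step decompression: DF11 removes the transfer, but if the weights already (mostly) fit, there isn't much transfer to remove. A toy back-of-the-envelope sketch (every number below is made up for illustration, this is not a benchmark):

```python
# Toy latency model: why a compressed weight format (DF11-style) can lose
# to plain bf16 when the model already fits (or nearly fits) in VRAM.
# All constants are made-up assumptions, not measurements.

def step_time(compute_s, offloaded_gb, pcie_gbps, decompress_s):
    """One denoising step: compute, plus re-uploading offloaded weights
    over PCIe, plus any per-step decompression overhead."""
    transfer_s = offloaded_gb / pcie_gbps
    return compute_s + transfer_s + decompress_s

# bf16: 2 GB offloaded, pushed back over a ~12 GB/s effective PCIe link
bf16 = step_time(compute_s=1.0, offloaded_gb=2.0, pcie_gbps=12.0, decompress_s=0.0)

# DF11: ~30% smaller so nothing is offloaded, but it pays a hypothetical
# decompression cost on every step
df11 = step_time(compute_s=1.0, offloaded_gb=0.0, pcie_gbps=12.0, decompress_s=0.3)

print(f"bf16: {bf16:.2f}s  df11: {df11:.2f}s")
# With these made-up numbers DF11 loses: the ~0.17s of transfer it saved
# costs less than the 0.3s of decompression it added.
```

The flip side is the same model predicts DF11 winning once offloading gets heavy enough that transfer time dominates, which matches the gap shrinking at 1520x1520.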
>>
>>107463254
why does ai still struggle with feet?
it seems like there hasn't been much progress on it compared to hands
>>
>>107463290
I don't care that much as long as it doesn't hurt NSFW finetuning.
>>
File: file.png (2 KB, 180x54)
$270 a few months ago btw loooooooooool
>>
>>107463257
Yes, it means that our tuning will have to waste time undoing their tuning, and will probably produce worse results due to conflicts. The SFT itself can be thought of as the 'slopification' step.
>>
>>107463308
Most photos don't show people's feet but you can see people's hands easily.
>>
Is chroma still being actively worked on by lodestone or is the model done?
>>
>>107463312
It will hurt all tuning
>>
>>107463296
If you have an Ampere card, bf16 is hardware accelerated and other stuff isn't. This is why bf16 is still faster even when it's getting offloaded.
>>
File: ComfyUI_00235_.mp4 (1.97 MB, 832x640)
>>107463308
what's wrong with the cat's feet? i dont know im not a foot fag
>>
File: 1736521275153099.jpg (165 KB, 732x1000)
>>107463318
theres millions of high quality stock feet pics
no excuse at this point
>>107463344
the cat is missing a toe
>>
>>107463316
>will have to waste time undoing their tuning,
Why? Why would we need to undo it? Isn't it just an aesthetic alignment? How do you know the base model's state is more favorable to a NSFW tune than this?
>The SFT itself can be thought of as the 'slopification' step.
Pre-train images don't look any better >>107463290
Especially the fox looks sloppier in pre-training.
>>
My preferred cope is what that other anon said in the previous thread, i.e. that they rushed out Turbo purely to shit on BFL and the base model wasn't quite finished yet.
>>
>>107463332
I see.
Makes sense I suppose.
Maybe useful to earlier GPUs then.
>>
>>107463367
this is basically what that one chinese leaker said, that base still hasnt finished training
>>
File: ComfyUI_00237_.mp4 (428 KB, 832x640)
>>
>>107463325
no its all z now
>>
>comfy cloud
>7400 credits ≈ 5.5 GPU hours per month.
>$35
imagine paying for this shit
>>
>>107463393
hosting a gpu renting service is one of the only ways to make money with ai so i cant blame them
>>
>>107463325
no idea but for now, there's chroma spark https://huggingface.co/SG161222/SPARK.Chroma_preview or https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main

i havent had the chance to try it out yet
>>
As an anon who likes genning: Turbo released a week ago and I still haven't covered 1% of it, got a lot more to try.
Why are you so anxious for base?
Did you already explore everything Turbo can do? These models are deep as hell and you need at least a year to start to know one properly.
I'll admit with shame that I've been on SDXL this whole time and still don't know it fully yet.
So I ask again, why are you so anxious for base?
>>
>>107463325
I think he is still training Radiance?
Last update seems to be 9 days ago.
Speaking of which, can anyone tell me how much more VRAM Radiance uses and how much slower it is compared to base chroma, just a ballpark figure?
I am curious about that.
>>
>>107463393
ComfyUI is SaaS adware and should be removed from the OP.
>>
>>107463353
the base model has a very wide range of possible outputs - this includes 'bad' outputs. the examples shown include some of those. sft or similar tuning constricts this range of outputs to a much smaller set that the creators think will be suitable. when we talk about 'slop' we're really referring to a model having strong preferences in its output style (implicitly ones we disagree with) that are difficult to override. to avoid slop, you want a model with the widest range of possible outputs and lowest level of baked-in preference, even though this easily allows for 'bad' output. think sd 1.5
>>
File: ZImage_00902_.png (1 MB, 1152x896)
>>107463383
cool!
>>
>>107463245
Will I be able to generate short video clips on a 4070, or is this card too weak?
>>
>>107463325
At this point it would be pretty stupid to keep working on it, might as well burn that money instead, at least it would warm your house for a while
>>
>>107463367
>>107463427
Even with a much better image quality I'd rather gen in under 30 seconds and upscale later than take 5 minutes for a 1girl.
>>
File: ZImage_00905_.png (802 KB, 1152x896)
2am, going to sleep gn
>>
>>107463440
yes but it will take 20-30 mins
>>
>>107463466
fvark
i'll just learn how to draw
>>
>>107463440
people are genning videos on 3060s brother
>>
>>107463427
>Why are you so anxious for base?
High quality COOM tunes need base, that's all.
In a vacuum I would be grateful to alibaba for just giving us the turbo for free but we NEED a sane size and good quality model to escape SDXL hell. (No, the chroma failbake doesn't count unless someone succeeds at unfucking it)
>>107463423
He stopped training it and seems to have moved on to something else.
Maybe the anons who accused it of being a grift were right, or maybe it genuinely didn't work out.
>>
>THE BASE WILL COME OUT IT JUST HAS TO YOURE JUST IMPATIENT IT WILL COME OUT IT WILL IT WILL IT WILL
it's just sad at this point honestly
>>
>>107463368
I had a table of the supported floating point/int formats but can't find it.
But anyways only the very latest cards benefit from the more exotic float/int formats.
>>
>>107463470
If it's as bad as this anon >>107463466 said then it's not worth it lol
>>
>>107463466
only if you dont add any of the speed boosts. looking at around 6-8 minutes maybe even less depending on the resolution
>>
>>107463455
>5minutes
Oh you're a VRAMLET, you're a second class citizen, your opinion is worthless. Delete it so it doesn't clog up the thread.
Thanks.
>>
>>107463431
Same speed, but the VRAM reqs go up like crazy since you have no VAE compression and are rawdogging pixels.
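For scale: assuming a Flux-style VAE (8x spatial downsample, 16 latent channels, which matches what base Chroma uses as far as I know), pixel space means roughly 12x more values per image than latent space, before you even account for attention being quadratic in sequence length:

```python
# Rough sketch of why a pixel-space model (Radiance) needs far more
# activation memory than a latent-space one (base Chroma) at the same
# resolution. The 8x downsample / 16 channels are an assumption based
# on the Flux-style VAE.

def elements(h, w, channels, downsample=1):
    """Number of values in one image tensor after optional downsampling."""
    return (h // downsample) * (w // downsample) * channels

h, w = 1920, 1088
pixel = elements(h, w, channels=3)                  # raw RGB, no VAE
latent = elements(h, w, channels=16, downsample=8)  # VAE latent

print(pixel / latent)  # → 12.0: 12x more values rawdogging pixels
```

So the weights cost the same either way, but every intermediate tensor is an order of magnitude bigger, which is why 1920x1088 OOMs on Radiance while normal Chroma is fine.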
>>
>>107463480
lol its up to you, dont let 1 random sway your pursuits, research and make up your own mind
>>
>>107463434
Makes sense.
But it should still be better than the distill for finetuning and hopefully still receptive enough.
>>
>>107463185
So instead of a yapping model, now we have one that has frozen faces because it's trained on stills kek.
>>
>>107463488
Fair enough, I'll look more into it.
>>
>>107463487
>Same speed
Really? Interesting
>but the VRAM reqs go up like crazy since you have no vae compression and are rawdogging it.
I understand that, but I was curious what this "crazy" roughly equals. Say, 16 extra gigs compared to base chroma at the same resolution?
>>
Fuck chroma is so good at making ugly fucking bastards. Might gen a shit ton and train a lora someday.
>>
File: Z-image turbo.png (1.27 MB, 1280x720)
>>
File: 00031-2447428105.png (1.16 MB, 896x1152)
>>
File: ComfyUI_00188_.mp4 (253 KB, 640x640)
>>107463254
>>
>>107463533
>pornably
>>
>>107463529
The one good thing about Chroma being made by a furry is that it produces a decent werewolf
>>
File: zturbo_00006_.png (1.55 MB, 1024x1024)
i got the nag working, i had to use some guy's fork. i have been using a1111 this whole time, switching to comfy for this. will have to install adetailer and figure out inpainting and stuff with this retarded ui
>>
Reminder that ComfyUI secretly logs your prompts
>>
>>107463556
>i got the nag working, i had to use some guy's fork.
are you using the right parameters?
>cfg 1, nag_scale 3, nag_tau 1, nag_alpha 0.25, nag_sigma_end 0.75
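For anyone wondering what those knobs do: as I understand the NAG paper, the core op extrapolates the positive attention output away from the negative one (nag_scale), clamps how far the result's norm can drift from the positive branch (nag_tau), and blends the result back toward the positive branch (nag_alpha); nag_sigma_end just limits which part of the noise schedule the guidance applies to. Rough numpy sketch (treating the L1 norm over the feature axis as my assumption about the paper's normalization):

```python
import numpy as np

def nag(z_pos, z_neg, scale=3.0, tau=1.0, alpha=0.25):
    """Sketch of Normalized Attention Guidance on attention outputs.
    z_pos / z_neg: (tokens, features) from the positive/negative prompt."""
    z_ext = z_pos + scale * (z_pos - z_neg)  # extrapolate away from negative
    # Per-token norm ratio vs the positive branch, clamped to tau
    norm_pos = np.linalg.norm(z_pos, ord=1, axis=-1, keepdims=True)
    norm_ext = np.linalg.norm(z_ext, ord=1, axis=-1, keepdims=True)
    ratio = norm_ext / np.maximum(norm_pos, 1e-8)
    z_ext = np.where(ratio > tau, z_ext * (tau / ratio), z_ext)
    return alpha * z_ext + (1 - alpha) * z_pos  # blend back toward positive

# toy tensors just to show the shapes
rng = np.random.default_rng(0)
out = nag(rng.normal(size=(4, 8)), rng.normal(size=(4, 8)))
print(out.shape)  # (4, 8)
```

Note the degenerate case: with identical positive and negative inputs the whole thing collapses to the positive output, which is why NAG works at cfg 1 without a real negative pass being required by the sampler.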
>>
>>107463525
No, nothing like that. But on my 12 gig at q8 I OOM on radiance when genning at 1920x1088 while normal is perfectly fine.
>>
File: Zurbo_00015_.jpg (1.3 MB, 3328x1792)
Vidya
>>
>>107463552
Well, I'll admit, some furries are suspiciously talented
>>
>>107463570
Oh shit, link the code line from github so I can comment it out. Thanks anon!!
>>
>>107463586
This is radiance furina?
>>
>>107463427
The issue I have is how it wants to stick to a look. Smaller adjustments often don't get recognized at all. Sometimes it feels like it forgets a part of the prompt after a while. Maybe it's an issue with English prompting but it doesn't make it less annoying.

There's also the problem with random seeds doing very little to change the result.

If base is good for loras and finetunes then it should open up a whole new dimension to z-image.

>>107463455
>5 minutes
What the fuck. My ZIT gens take like 10-20 seconds.

>>107463440
You can do it but you'll have to compromise with time spent per gen, resolution and quality. Having done it with a 2070 I don't think it was worth it. Higher res wan genning gives so much better results.
>>
>>107463570
my network mode is set to public and security level is weak should i be worried?
>>
>>107463586
May I request something?
When moving on to the sampler node, Comfy is supposed to dump a line like
loaded partially; 8206.90 MB usable, 7981.13 MB loaded, 3758.42 MB offloaded, 225.00 MB buffer reserved
to the terminal/cmd.

While testing both at the same quant (q8 or whatever) and genning an image at the same resolution, can you show how these two values differ?
>>
https://civitai.com/models/2198268/zit-miku?modelVersionId=2475118
Finally, Miku, exactly what Z-image turbo lacked!
>>
>>107463659
*these two values differ between normal chroma and radiance
>>
File: zturbo_00019_.png (2.82 MB, 1920x1088)
>>107463573
yeah i copied the settings from the reddit post.
>>
>>107463690
pretty cool image anon
>>
>>107463570
what does logging here mean? log to where?
>>
File: 00046-1033016205.png (1.44 MB, 896x1152)
aaaah, this brings back memories...
>>
>>107463738
your prompts get logged to the comfyui server
>>
>>107463738
Andy's logs?
>>
>>107463740
When will this stupid bitch apologize for her car?
>>
>>107463753
I did a recursive grep and there is nothing.
>>
>>107463779
Apologize for her car? wat?
>>
File: Styles comparison.jpg (2.78 MB, 4095x1536)
>>
>>107463738
https://github.com/Comfy-Org/ComfyUI-Manager/issues/2193
>>
Tomorrow will be released, right?
>>
>>107463843
nobody believed the anon warning anons about this for almost a year, nobody bats an eye. some random fucking redditor says it's happening and suddenly it's a surprise.
>>
I get Forge users, I get Swarm users, but I can't wrap my head around Comfy users when they're talking about all these technical aspects of Comfy.
Are they using Swarm as a frontend, or are they using undiluted base Comfy?
>>
>>107463780
thanks, I just needed somebody who knows what's going on to lay my suspicions to rest. I've been genning ... some fucked up shit lately
>>
>>107463843
>>107463875
What are the api calls doing?
>>
>>107463843
>>107463875
how do I block this?
>>
File: 3555622322.png (1.18 MB, 896x1152)
>>
>>107463892
Using Forge
>>
File: 664076285.png (1.14 MB, 896x1152)
>>
>>107463906
>>107463893
>asians
Comfy damage control squad?
>>
comfyorg collapse 2026
>>
File: qwen edit 2509_00004_.png (753 KB, 1160x896)
>>
>>107463928
Powerfull
>>
Wansisters, is there a way to make native wan generation as fast as kj nodes? When messing around with the SVI workflows I noticed how retard fast it genned after the 1st gen.
>>
>>107463948
u mean wankers
>>
File: q8.png (22 KB, 1151x170)
>>107463659
q8
>>
File: 00051-3897548588.png (1.02 MB, 896x1152)
>>107463897
How do we know Forge isn't logging?
>>
>>107463948
Haven't tried the SVI workflows, KJ wasn't really faster for me than native tho (assuming both use lightx2v and sageattn or w/e setup you use)
>>
File: radiance.png (9 KB, 1122x98)
>>107463963
>>107463659
Radiance q8
>>
did any training ui or inference ui add longcat-image yet?
>>
>>107463897
>Gradio
>block spying
kek
>>
>>107463890
I assume it is just building a local cache of https://api.comfy.org/nodes which has 329 pages of JSON.
>>
>>107464047
makes sense
>>
>>107463958
Kek you're not wrong

>>107463998
Yeah kj nodes used to be painfully slow months ago for me but for whatever reason its now almost twice as fast as native. Yeah their workflows are lightx2v. Also yes on the sageatten but it's all woct0rdho including radialattn https://github.com/woct0rdho/ComfyUI-RadialAttn

There's SVI 2.0 now but still waiting on the 2.2 workflows to drop https://github.com/vita-epfl/Stable-Video-Infinity/tree/svi_wan22
>>
>>107463963
>>107464010
I appreciate it, merci
>>
>>107464084
>It's also recommended to install SageAttention, and add --use-sage-attention when starting ComfyUI. When RadialAttention is not applicable, SageAttention will be used.
So does that work with PatchSageAttentionKJ node? I don't like applying sage globally.
>>
>>107464084
i don't have much experience with that but i can't really see why it'd be slower if you also use radial attention and maybe also torch compile in native.

hard to diagnose I suppose
>>
File: FISH.png (1.3 MB, 896x1152)
>>
im upscaling a video with seedvr2. i was using a workflow that used this to upscale images and its really good. i dont really know what i am doing but i uploaded a video instead so it upscales each frame, then im going to combine it back into a video
>>
>>107463245
Coming back after a while, have we seriously not got anything better than SDXL for a fully uncensored base model? You can layer LoRAs but that's not exactly ideal for anything complex. I feel like slop generation hasn't progressed much and it kinda saddens me, am I missing something? SDXL feels so dated and has worse prompt understanding than an Indian
>>
File: 1960220049.png (1.41 MB, 1024x1536)
>>
File: ComfyUI_temp_qhhpa_00005_.png (3.31 MB, 1280x1280)
oink
>>
>>107464162
We are most likely getting a decent Z-Image porn tune in the following months if the Chinese don't fuck us over with the base model release.
We have Chroma which knows insane amount of stuff including NSFW but also too fucking schizo to be used reliably. If you have a powerful card you can make the seed lottery work.
SDXL has a retarded ancient text encoder so yes it is notoriously awful at understanding prompts.
>>
>>107464130
I have no idea. I do use "Model Patch Torch Settings" then enable the fp16 bit but I use that for native only.

>>107464135
Wish I knew. Its a shame because its so fast it also kinda ignores prompts even with loras, then again I havnt had the time to properly do an in depth test. Speaking of speed boosts, if this ever releases https://github.com/dvlab-research/Jenga we could be looking at image speeds for video
>>
>>107464176
>>107464180
prompt?
>>
File: 1675872417.png (1.8 MB, 1024x1536)
>>
>>107464180
*SNNNIIIFFFF, this smells like chroma
>>
File: comfyui trash.jpg (18 KB, 426x415)
>>107464160
im so sick of this fucking garbage. from now on i am going to have to save every single intermediary product from anything that is produced by this trash and have mini workflows set up that i can initiate. what a waste of fucking time.
>>
>>107464207
they should make those life sized chink bots with shells like this
>>
is this even a blue board anymore or what?
>>
File: ComfyUI_temp_qkqtm_00001_.jpg (442 KB, 1600x1152)
>>
>>107464160
>upscaling a video with seedvr2
Is it better than using an "oldschool" upscale model?
>>
File: ZIT_00631_.png (2.28 MB, 1152x2048)
>>107464273
WDYM? This is clearly blue board content.
>>
Finally got z-image to do a shot like this!
>>
File: 202134449.png (1.7 MB, 1024x1536)
>>107464264
They'd sell
>>
File: ComfyUI_temp_qhhpa_00014_.png (2.67 MB, 1040x1440)
>>
File: latina girl closeup.jpg (367 KB, 1408x966)
>>107464287
ive used upscalers in the past for images and they always fuck things up, but this is really good. anything it messes up is because there isnt enough information in the base image. its not 100% perfect, I would still try to run it through img2img to get more details. but if i take an image, upscale it, then downscale it back to the original size, it has more details, and it doesnt seem to mess up the lighting or colors very much
>>
>>107464334
You're using the ComfyUI-SeedVR2-VideoUpscaler?
>>
File: ComfyUI_temp_qhhpa_00017_.png (1.98 MB, 1040x1440)
>>
>>107464300
topkek

can't hack me head, I have the app from Ubuntu software store "Extension Manager" and the extension "Grayscale Windows" installed.

>>107464303
prompting strategy for the view?
>>
>>107464362
It didn't seem to really work until I told it WHAT she was doing? I don't fully understand it myself yet.
>>
File: 1737899093461916.png (1.71 MB, 1120x1440)
>>
File: 1737036941184105.png (1.26 MB, 1120x1440)
>>107464392
>>
>>107464357
no, i used this workflow that had the upscaling in it https://civitai.com/models/1376005/photoflow-z-image-turbo-qwen-chroma-wan-2221-sdxl-t2i-text-to-image-txt2img-workflow. i short circuited the upscaling so i could upload images directly. then i tried to upload a video and feed it through so it would batch upscale each frame then recombine the frames into videos. i wasnt aware there was an upscaling node just for videos
>>
File: ZIT_00640_.png (2.41 MB, 1536x1536)
>>107464303
>A close up, ultra wide lens angle, of a
Doesn't seem to do anything for me.
>>
File: 1659193385.png (1.71 MB, 1024x1536)
>>
>>107463296
>12gb
why not use Q8?
>>
File: ComfyUI_temp_qhhpa_00021_.png (2.56 MB, 1040x1440)
>>
Is there anything like mmaudio yet where I can add sound to existing video for local? Not talking but sounds
>>
>>107464429
I thought mmaudio was already local
>>
>>107464429
>mmaudio
That is local... I think hunyuan also released one
https://github.com/Tencent-Hunyuan/HunyuanVideo-Foley
>>
>>107464412
sexo
>>
>>107464410
Like I said, I don't truly understand it yet.
>>
Should I buy a 3090 for slop generation?
>>
>>107464084
How much speed up do you really get with this over sage?
It seems there are no pre-built wheels available outside of windows. I am curious if it is worth compiling.
>>
>>107464458
ZIT doesn't seem to like camera control much. Maybe there's some secret sauce we need to discover.
>>
how long until they start banning local?
>>
>>107464422
I don't want to degrade quality too much, which is already decreased in the distill.
I was thinking about trying that when (if) base model releases and more steps are needed.
I can bear the current gen speed.
>>
>>107464471
Seems to like 'overhead photo'? Maybe because it was a trendy photo style at one time?
>>
>>107464464
buy a 5090 while you still can
>>
>>107464445
Heh, I mentioned local because I know some dickweed is going to mention api shit.

>>107464448
Oh it even has comfy integration, thanks!
>>
can i img2img z-image?
>>
>>107464481
I'll try that once my gpu is done with these vids.
>>
>>107464490
Yes, >>107464374 is an image to image using z-image
>>
>>107464481
like skate park fisheye
>>
>>107464464
There are better purchases but it can be worth it.
Definitely go second hand though. Not worth it first hand.
>>
>>107464501
Man, 'fish eye' really does good! Can't believe I didn't think of that!
>>
>>107464515
zoo wee mama! It works really well!
>>
>>107464409
nice did you upscale a video then? or just the image
>>
>>107464529
Prompt for this?
>>
why would you release a schnell version of an unfinished base model?
why wouldnt they release the base model if it was finished when they made the schnell version?
>>
>>107464554
"A super close up, fish-eye lens photo of a girl sitting on the ground outside of a bar at night. Fresh snow is on the ground around her. She has a cigarette between her lips, smoke coming off it. She's doing a peace sign over her eye."

Man, gotta say, learning about new things is fun!
>>
>>107464464
i just got a used one recently
>>
>>107464556
>why would you release a schnell version of an unfinished base model?
because its still better than flux2 and nuking that off the face of the planet is a big win
>why wouldnt they release the base model if it was finished when they made the schnell version?
because it wasnt
>>
File: upscaled.jpg (2.07 MB, 3046x3046)
>>107464549
no it failed >>107464231 been having problems with video combine and it sometimes shits the bed.
>>107464529
>>
>>107464468
The first generation is 2-3 minutes then hovers around 40 to 50 secs for each batch. The default workflow for SVI is at 8 steps but I set it to 4. Then again, my resolutions are tiny like 640 by 512 kek. Going from 15 minutes to generate 5 seconds to less than 5 minutes to generate 15 secs is pretty wild (obviously not counting loading the models). If you go for radial attn, make sure you use woct0rdhos sage, sparge and triton

>>107464474
After all the bullshit going on, wouldnt be surprised. Make sure you make many backups.
>>
>>107464588
Think you could upscale this?
>>
>>107464529
amazing how much better that looks in grayscale lol
>>
>>107464515
>AI is already this good
it's over
>>
>>107464485
>literally 5x the price of a used 3090
no

>>107464508
>There are better purchases
Like?
Used is the intention
>>
File: upscaled2.jpg (1.75 MB, 3046x3046)
>>107464600
no, this is the first frame. i have to figure out why the video combine fails half the time. i am thinking i can just batch up scale every single frame then put them into a video again
>>
>>107464605
Damn, you're right. I might try some greyscale images here after these videos gen.
>>107464622
I think once video gets on par with image gen, then it's over.
>>
>>107463245
I have an RX 6800 currently; my experience with it has been quite dogshit, possibly through no fault of AMD's, as I think it may be slightly defective. It works fine 99% of the time, no artifacting even, but I randomly have my screen go black, usually followed by a GPU reset. AI stuff is especially bad and can almost instantly trigger a crash unless I run it without a GUI. I'm thinking of getting a new GPU, what is you guys' experience with the newer AMD GPUs, e.g. the 9000 series? Or should I just go back to NVIDIA? Newer NVIDIA cards seem to have weird issues on Linux; my 1080 was fine though. I assume it's just my retarded card losing the silicon lottery hard, because it would have been a good experience if it wasn't for the constant crashing. I think it's probably faulty VRAM modules, just a hunch though
>>
Wen comfy?

>SGLang Diffusion + Cache-DiT = 20-165% Faster Local Image/Video Generation
>SGLang integrates Cache-DiT, a caching acceleration engine for Diffusion Transformers (DiT), to achieve up to 7.4x inference speedup with minimal quality loss.

https://www.reddit.com/r/LocalLLaMA/comments/1pg8jtk/sglang_diffusion_cachedit_20165_faster_local/
>>
>>107464643
Oh cool! Thanks! I need to figure out upscaling in SwarmUI.
>>
>>107464645
photography is crazy fun lol
>>
>>107464590
>make sure you use woct0rdhos sage, sparge and triton
These just look like prebuilt wheels of respective packages I don't think it specifically depends on these
Eh I might give it a shot.
>>107464638
5090
For used 3090 is the best value though.
>>
>>107464690
That's my primary use for this stuff! I just wish I was better at prompting, but I'm getting better. I'm so used to the old days of using tags. Natural language doesn't feel natural for this stuff lol.
>>
File: qie_00003_.png (1.31 MB, 1240x840)
>>
>>107464720
zimage is ONLY natural language?
>>
>>107464756
No, but tags don't seem to work as well as they used to. At least, from my testing. I could just be full blown retarded desu.
>>
>>107464756
>zimage is ONLY natural language?
not if you train your own/use a lora
>>
why are there so many nipple loras for zit
>>
>>107464693
>I don't think it specifically depends on these

Turns out all 4 were needed, I spent a month trying to figure out why I kept getting errors. I uninstalled regular triton and sage, then installed all of woct's stuff and it worked. Thanks to an anon many, many threads ago who mentioned not to mix them.

https://github.com/woct0rdho/triton-windows
https://github.com/woct0rdho/SageAttention
https://github.com/woct0rdho/SpargeAttn
https://github.com/woct0rdho/ComfyUI-RadialAttn

Also for the fp16 bat file --use-sage-attention --fast fp16_accumulation --disable-api-nodes
>>
>>107464756
>>107464767
It is not trained on them specifically but "A woman, standing, alone, winter..." style prompting works somewhat ok since TE is smart.
Moderate length natural language paragraphs give best results (Anatomy errors start to appear when you give too verbose prompt)
>>107464776
A lot more easier to train cunts, cocks and sex and also useful for coom
>>
>>107464776
because it cant do them and they are the basic thing for everyone genning anything even remotely nsfw
>>
>>107464776
>>107464783
funny enough all the nipple loras destroy whole model
>>
best zit nipple lora so far?
>>
>>107464795
they fuck up details but you can bring them back by just using any dual sampler workflow to fix details for the last few steps, like the one at the top of https://civitai.com/models/2093591
>>
>>107464780
Where do we get lora now if not civit?
>>
is there a nsfw version of this thread? i checked the nsfw boards and its pure sloppa
>>
>>107464777
I skimmed through commits and triton seems to be a generic pre-built but he seems to have made important changes to sage and sparge over base repos.
This makes it even more tedious since I don't want to recompile sage.
I am certain I won't bother now, but the knowledge is useful so thanks.
>>107464805
There is not a single good place whatsoever to get loras.
Some people hide based stuff on huggingface, but by design it is ass to find.
>>
>>107464676
I forgot to add their blog, it talks more about it https://lmsys.org/blog/2025-11-07-sglang-diffusion/

>We are excited to introduce SGLang Diffusion, which brings SGLang's state-of-the-art performance to accelerate image and video generation for diffusion models. SGLang Diffusion supports major open-source video and image generation models (Wan, Hunyuan, Qwen-Image, Qwen-Image-Edit, Flux) while providing fast inference speeds and ease of use via multiple API entry points (OpenAI-compatible API, CLI, Python interface). SGLang Diffusion delivers 1.2x - 5.9x speedup across diverse workloads. In collaboration with the FastVideo team, we provide a complete ecosystem for diffusion models, from post-training to production serving. The code is available here.

>Optimize Wan, FastWan, Hunyuan, Qwen-Image series, FLUX
>Support LongCat-Video

Possible comfy coming https://github.com/sgl-project/sglang/issues/13024
>>
File: 1751828639826060.png (1.43 MB, 1120x1440)
used chroma for like a year and this is the first furry gen I made.
>>
File: 1764171647478574.png (1.47 MB, 1120x1440)
>>107464856
do furries use chroma? like on /trash/?
>>
>>107464891
I tortured myself by looking through the furfag thread and they all seem like genned on illustrious.
>>
>>107464821
/gif/ has a video thread
/aco/ threads are kinda shit
>>
>>107464821
when anon posts lewdcatbox yes
>>
>>107464676
>>107464837
call me when it's available for comfy
>>
which gguf for flux is the best in terms of lightweight to quality ratio?
>>
>>107464941
Flux 1?
Nunchaku.
>>
>>107464967
do you need to install anything extra to use nunchaku or can I just use it?
>>
File: 1755732081621441.png (1.04 MB, 832x1248)
>>
>>107464993
furshit belongs in >>>/trash/
>>
File: 1764824030033989.png (937 KB, 832x1248)
Damn it lost her with the pose
>>
File: bfsh3_00001_.png (1.44 MB, 1328x904)
>>
>>107464982
You need to install comfyui-nunchaku custom node.
>>
>>107465013
The fact that it can't consistently gen one of the most prominent furfag characters that it has almost certainly seen thousands of images of during the training, speaks volumes about Chroma.
>>
>>107463245
I'm coming back to the generating gaem and it looks like all my models were left in the dust (SD v1-5 Chad here). What are the kewl cats using now? What's the best base model? What about inpaint models? Are they still a thing?

SANKIUUUU in advance, my fellow prompterers.
>>
>>107465023
but I'm on forge
>>
File: 1741059702606007.png (1.06 MB, 832x1248)
>>107465038
These are Z. Maybe with a longer prompt I can get more hits
>>
>>107465051
You shouldn't be on forge if you don't like missing out on stuff.
But if you insist on remaining use Q8. Maybe nf4 if you are a turbo vramlet.
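For scale, the weight memory is just bytes per parameter. A back-of-envelope calc assuming Flux dev's roughly 12B transformer params, ignoring the text encoder, VAE, activations, and quantization overhead:

```python
# Rough weight-only VRAM for a ~12B-param model at common precisions.
# Ignores text encoder, VAE, activations, and quant overhead.
PARAMS = 12e9  # Flux dev transformer, approximate

def weight_gb(bits_per_param):
    """GB needed just for the weights at a given precision."""
    return PARAMS * bits_per_param / 8 / 1e9

for name, bits in [("fp16", 16), ("Q8", 8), ("nf4", 4)]:
    print(f"{name}: ~{weight_gb(bits):.0f} GB")
```

So fp16 is ~24 GB of weights alone, Q8 halves that to ~12 GB, and nf4 gets you to ~6 GB, which is why vramlets reach for it.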
>>
File: 1742601414619512.png (1.42 MB, 1120x1440)
>>107464913
bizarre. I do see it in their sticky. ironic that the furry made Chroma yet his community seems to have ignored it.
>>
File: ComfyUI_00602_.png (3.9 MB, 1432x2144)
New model from the Noob guys: NewBie-image-Exp0.1
>We are thrilled to introduce NewBie-image-Exp0.1, released by NewBieAi-Lab. This model utilizes a brand-new NewBie architecture designed on the foundation of Next-DiT. We have combined Gemma3-4B-it with Jina CLIP v2 to effectively enhance the model's text comprehension capabilities. Additionally, we utilized the FLUX.1-dev 16-channel VAE to provide richer details. The current dataset consists of approximately 12 million images (including the complete Danbooru dataset up to October 2025 and 1/4 of the e621 dataset). Trained on 8x H200 GPUs for 10 epochs (approx. 17,500 H200 hours), it now supports characters and art styles with a mean solo count of 150 on Danbooru. We sincerely thank everyone involved in the testing and training process. Thank you for your support, and we hope the open-source community continues to thrive!
Huggingface model is walled but you can download on Civit: https://civitai.com/models/2197517/newbie-image
Already supports LoRA training too: https://github.com/NewBieAI-Lab/NewbieLoraTrainer
It's only a v0.1 so it's probably not very good in its current state.
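The quoted compute figures work out to a surprisingly long run; a back-of-envelope sanity check from the announcement's own numbers (not official figures):

```python
# Sanity-check the NewBie-image-Exp0.1 training figures quoted above.
GPU_HOURS = 17_500    # total H200-hours, per the announcement
GPUS = 8
EPOCHS = 10
IMAGES = 12_000_000   # ~12M image dataset

wall_hours = GPU_HOURS / GPUS
gpu_s_per_image = GPU_HOURS * 3600 / (EPOCHS * IMAGES)
print(f"~{wall_hours / 24:.0f} days wall time, "
      f"{gpu_s_per_image:.3f} GPU-s per image per epoch")
```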
>>
>>107465052
Well should have clarified that.
Z-Image can do very few people and characters consistently, unsurprising.
Also this image is weirdly hot and no I am not a furry.
>>
>noob in december of 2025
>>
>>107465083
This isn't the Z-image mode that was promised, but has potential anyways
>>
File: 1758558632682889.png (1.22 MB, 832x1248)
Just needed a longer prompt
>>
>>107465092
well. it's fast. maybe a wf that gens and then has qie prompted permanently with "fix the hands"?
>>
>>107465071
People like ease of use.
SDXL can be run comfortably on a decade old mid range GPU, or hell even on phones with a distill lora.
Ease of use trumps quality (and Chroma struggles a lot to do that consistently despite much greater potential than SDXL)
>>
is longcat actually good?
>>
>>107465117
>Ease of use trumps quality
hence why ZIT easily overtakes Chroma
>>
>>107464939
Funny enough, I just randomly checked leddit again and apparently, there is a comfyui implementation https://github.com/xlite-dev/comfyui-cache-dit

However, it's in Chinese (use a translator Firefox addon) and hasn't been updated in 3 months, heh
>>
>>107464678
No problem!
>>
>>107465013
this is good
>>107465052
this is furfaggotry
>>107465090
seek help
>>
>>107465083
I am seeing lots of gibberish text in their examples despite the flux vae and a modern enough text encoder. This thing seems very under-trained. I suppose that's normal for a v0.1.
But maybe it will have potential, we will see.
Lumina is needlessly slow for its size though, so I expect it to get overtaken by Z-Image booru tunes if Alibaba doesn't fuck us over.
>>
>smol base model + multiple loras
or
>big base model that understands many concepts
?
>>
>>107465121
nope
>>107465152
I am attracted to the very human body in the middle, the funny head just spices it up. I am not into anthropomorphic animals.
>>
Don't forget to disable prompt logging in comfyUI. it's on by default
>>
>>107465137
>and hasnt been updated in 3 months
either it works so well it doesn't need to be updated or it sucks and nobody bothered to maintain it
>>
>>107465156
you know you can do this shit with noob, right?
>>
I'd rather have animetroons than fucking furfags
>>
>>107465166
100% the latter.
It is very difficult to teach certain concepts as loras, much less combine many of them reliably.
The only argument for the former is that smaller models tend to be easier to inference. But that stops being worth much if you are turning it into a slop factory.
>>
>>107465175
Where do you disable it?
>>
>>107465188
Judging by the fact that we are just now hearing about this massive revolutionary beast that provides 7x speed up, it's not hard to guess the answer.
>>
>>107465198
this guy gets it

furfaggots ruin everything
>>
>>107465203
but people like zit more than chroma
>>
>>107465064
I enjoy forge because it's easy to use. I have comfy too but I just use it for the stuff that forge can't do.
Thanks for the help.
>>
>>107465166
100 trillion parameter model trained on all data ever in history distilled into a 4 bit 4 step MoE that fits into 24gb vram and 64gb ram
>>
>>107465044
Someone PLOX guide me!
>>
You can tell someone is a zoomer when they post Elsa and judy Hopps porn
>>
File: eff.png (45 KB, 384x719)
How do these two differ?
>>
>>107465240
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>
>>107465215
Chroma knows a lot things but can't actually reliably gen them in an aesthetically pleasing way.
ZiT doesn't know much but it can gen what it knows consistently.
>>
>>107465090
>Well should have clarified that.
he doesn't need to clarify anything, you just assumed incorrectly and looked like a retard.
>>
>>107465206
you can't anymore. comfyanon removed the option to turn it off
>>
>>107465188
Look at the issues; since it's in Chinese no one has heard of it, someone is complaining about Z kek https://github.com/xlite-dev/comfyui-cache-dit/issues

>>107465212
Hopefully they update it and it actually does what it says. I gotta head to bed but testing this bitch out tomorrow. The basic node apparently is 模型加速 which is "Model acceleration" according to google translate.
>>
>>107465257
where does it save them to?
>>
>>107465193
link me a noob that can do judy so i can goon
>>
File: 1741788798362176.png (1.09 MB, 832x1248)
>>107465241
Any Disney or children characters from that time really
>>
how long does wan2.2 usually take to gen videos on a 5080?
>>
>>107465083
They made some sexy architecture choices. Didn't expect this community to pick up on Jina.
>>
>>107465289
bout 3fiddy
>>
>>107465083
>>107465295
uuh when are they making it easy to use? i'm not downloading a whole other comfy for this
>>
>>107465083
>We have combined Gemma3-4B-it
would be better to use the derestricted version, no?
>>
>>107465083
I don't know about the people who made Noob. Is this really by them? Why isn't it uploaded under the account that uploaded Noob?
>>
>>107465316
Probably started training this thing well before that.
>>
File: 00989-590185063.png (1.21 MB, 800x1200)
>>107465281
I use Elsa to drain my balls on the regular
>>
looks like there is initial support now
> but the VRAM is high

https://github.com/sooxt98/comfyui_longcat_image
https://github.com/meituan-longcat/LongCat-Image/issues/8
>>
>>107465083
why would i use this instead of illustrious/noob? genuinely asking
>>
>>107465316
For the last time there is no evidence that the text encoders are censored.
If the model can't draw cunts, it's because the UNet never saw enough cunts during training, not because the text encoder doesn't tell it to draw a cunt.
>>
File: 1734727628545752.png (394 KB, 894x665)
>>
>>107465241
something is wrong when you want to fuck cartoon characters from your childhood

everyone like that is always a cringy weirdo
>>
>>107465357
And before someone acts pedantic I meant censoring for drawing images.
Of course gemma is very heavily censored when you try to chat with it.
>>
>>107465366
at least elsa is supposed to be a hot woman, not a fucking animal like judy hops
>>
File: newbie-0.1-sample-civitai.png (2.71 MB, 1432x2144)
>>107465356
clearly almost all new models have more resolution flexibility and better prompt comprehension.

maybe you wouldn't use it yet at this state of training tho
>>
>>107465275
Let us know how it goes
>>
>>107465251
It's becoming a confusing mess of "how did we get caught up in this shit show in the first place?"
>>
File: 00985-2270860513.png (1.08 MB, 800x1200)
>>107465366
the artists knew what they were doing when they designed her
>>
>>107465241
when you look at these fanbases a lot are also "disney adults", retards who are old but just follow the current thing
>>
>>107465374
>better prompt comprehension
no one here has ever taken advantage of better prompt adherence or comprehension since you're all 1girl, standing enjoyers (derogatory)
>>
>>107465371
I mean yeah but there were plenty of hot women in the cartoons of my childhood and I don't feel a particular urge to gen lewds of them

>>107465386
okay I can somewhat understand elsa but it's still weird when you do it too much

when I was using civitai regularly I'd run into accounts genning lewds of far less sexual cartoon characters and those accounts always looked the same
it got to the point where I'd see one gen and I could guess with a very high accuracy that the account would be full of that shit
>>
>>107465400
because it's all noob/illu can do, unfortunately
>>
File: 1760334363835403.jpg (47 KB, 720x803)
sampling images during any point in training?
training at lower res to test the lora first?
no thanks, i have an instinctive 'feel' the lora will be good and when to stop.
>>
>>107465241
Judy Hopps is enjoyed by furry gooners and Elsa is enjoyed by Disney adult manchildren of all ages.
There are better candidates for a zoomer alarm.
>>
>>107465404
what are you, gay?
>>
>>107465083
Wait a second. Are they saying they trained on top of a model (Next-DiT), or trained from scratch without exposing the model to any real photography and non-booru data?
>>
>>107465251 >>107465381
At least there is progress and there are reasonable options for future progress.

radiance IMO is getting better at it... slowly. but IDK if it'll reach the state where you can prompt characters like on noob/illustrious

z-image obviously has lots of potential if the large dataset finetuners get the base model

qwen is extremely capable but the GPU power it'll take to finetune that one is silly

neta-yume lumina also still is working
>>
>>107465421
nope I just don't have a weird obsession on cartoon characters
>>
>>107465427
anon...
>>
File: 1755774449905619.png (308 KB, 551x987)
>>107465414
i can already sense how the lora will turn out just by looking at the training data
>>
>>107465404
you are way too concerned and bothered with how much other men want to fuck or gen lewds of their waifus. you're probably a serious hypocrite or just a jealous homosexual

1girl
>>
>>107465430
I'm sorry but you're weird and what you gen is cringe
>>
>>107465425
>Thanks to Neta.art for fine-tuning and open sourcing the Lumina-image-2.0 base model.
>>
>>107465400
this is partly because of limits on 2girls or more things or more complex poses or w/e that are also due to prompting power
>>
>>107465436
waifuing elsa is like waifuing a whore

she's all used up
>>
>>107465437
you're on 4chan
>>
>>107465446
yeah I know that's why I'm not surprised to find your kind here
>>
>>107465439
Alright.
Was that a good model? I wasn't here for it.
>>
>>107465359
It's not a patreon link????
>>
File: 1757599086726275.png (2.37 MB, 1752x1168)
>>
File: 1745119594261791.png (522 KB, 853x1000)
>>107465434
comfyui?
i just read the Q8 weights once and can run realtime inference in my mind's latent space
>>
>>107465466
who's that supposed to be?
>>
>>107465359
>69 minutes of video to tell me how to train a thing
Couldn't this be done in a shorter video?
>>
>>107465455
Quality is alright but it is slow.
>>
>>107465475
No one in particular only the style
>>
>>107465475
1girl
>>
File: 1762545555635797.png (2.38 MB, 1752x1168)
>>
File: 1763153227261068.png (547 KB, 500x500)
>>107465474
weights? i source fly agaric outside my local methyl isocyanate chemical plant, which i then consume before i place myself down in front of a black canvas
>>
File: ComfyUI_temp_qbxgp_00003_.png (1.83 MB, 1040x1440)
>>
File: ComfyUI_08961_.png (1.6 MB, 864x1280)
>>
File: file.png (81 KB, 500x460)
>>107465454
>>
>>107465543
>not N. Higgers
tch
>>
File: file.png (206 KB, 781x493)
>>107465529
inference? i calculate the entire diffusion process by hand and take decades to make my 1girl standing looking at viewer, saves on electricity
>>
>>107465547
it is not full of things I don't like though
>>
the quality of gens in these threads is inversely proportional to the quality of the models released

you must strive to be better genners
>>
File: ComfyUI_09004_.png (1.52 MB, 864x1280)
>>107465560
N. Iggers is a different author tho
>>
Small custom node that might be of interest for ZIT:
https://github.com/ChangeTheConstants/SeedVarianceEnhancer

It diversifies outputs.
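Haven't read the node's source, but one common way to get that effect is blending the seed's initial latent with an independent noise draw and renormalizing. A sketch of the idea only, not necessarily what the node actually does; the `strength` knob and the seed-offset trick are assumptions:

```python
import numpy as np

def diversified_noise(seed, strength=0.15, shape=(4, 64, 64)):
    """Blend a seed's latent noise with an independent draw, then
    renormalize so the result stays roughly unit-variance Gaussian."""
    base = np.random.default_rng(seed).standard_normal(shape)
    extra = np.random.default_rng(seed + 1).standard_normal(shape)  # offset seed: assumption
    mixed = (1.0 - strength) * base + strength * extra
    return mixed / mixed.std()

latent = diversified_noise(42)
print(latent.shape, round(float(latent.std()), 3))
```

Small `strength` values nudge the composition away from what the seed alone would give without destroying the noise statistics the sampler expects.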
>>
File: 1752216279704758.png (2.42 MB, 1752x1168)
>>
File: 00012.png (1.03 MB, 1464x2008)
>>107463245
Trying to transfer an outfit to a picture of a character. Should I use Nano Banana Pro or Qwen Edit?
>>
>>107465632
How do you run nano banana locally? Wrong thread for that. Qwen Edit is pretty good at it tho.
>>
>>107465614
kek
>>
>>107465646
>>107465646
>>107465646
>>107465646
>>
File: Qwen_t2i_bent-b_.png (685 KB, 864x896)
>>107465539
drop the prompt
>>
>>107465654
what's up with the square pattern
>>
>>107464410
she's ultra wide, alright
>>
>>107464805
why not civit?


