/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/ldg/ - Local Diffusion Genera(...) 01/20/26(Tue)22:52:27 No.107926791

File: collage.jpg (953 KB, 3264x1702)

953 KB JPG

/ldg/ - Local Diffusion General Anonymous 01/20/26(Tue)22:52:27 No.107926791

Discussion of Free and Open Source Diffusion Models

Prev: >>107925157

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
01/20/26(Tue)22:54:32 No.107926805

Anonymous 01/20/26(Tue)22:54:32 No.107926805

repoast:
>>107926593
>pixel-layer watermarking, and C2PA metadata to ensure content provenance and safety.
Oh right, I forgot about this. Can you spot the watermark by playing with levels in photoshop?

Anonymous
01/20/26(Tue)22:55:06 No.107926808

Anonymous 01/20/26(Tue)22:55:06 No.107926808

>>107926805
I guess so, if you play with the saturation and shit you'll be able to see the dots, like on NBP

Anonymous
01/20/26(Tue)22:56:30 No.107926824

Anonymous 01/20/26(Tue)22:56:30 No.107926824

>>107926808
Trivial to get rid of.

Anonymous
01/20/26(Tue)22:56:55 No.107926827

Anonymous 01/20/26(Tue)22:56:55 No.107926827

File: 1762042773121626.png (887 KB, 1049x1200)

887 KB PNG

>>107926798
https://huggingface.co/black-forest-labs/FLUX.2-klein-base-9B
>DESU I feel like the wall of text is specifically because they failed at making it safe.
maybe they're just pretending so that the European Union won't nuke their ass with more (((regulations)))

Anonymous
01/20/26(Tue)22:58:34 No.107926835

Anonymous 01/20/26(Tue)22:58:34 No.107926835

unborn death maggot

Anonymous
01/20/26(Tue)22:59:27 No.107926841

Anonymous 01/20/26(Tue)22:59:27 No.107926841

>>107926835
triggered?

Anonymous
01/20/26(Tue)23:00:45 No.107926848

Anonymous 01/20/26(Tue)23:00:45 No.107926848

File: latest-4120322386.jpg (43 KB, 533x355)

43 KB JPG

>>107926827
The only thing in that image that actually works IRL is the bottle cap

Anonymous
01/20/26(Tue)23:01:48 No.107926853

Anonymous 01/20/26(Tue)23:01:48 No.107926853

>>107926848
It doesn't tho. It's so poorly designed that everyone just rips it off.

Anonymous
01/20/26(Tue)23:02:12 No.107926858

Anonymous 01/20/26(Tue)23:02:12 No.107926858

>>107926848
I live in europe and I hate this shit, I always have to remove that with a knife or something, it's a fucking paainnnnnn

Anonymous
01/20/26(Tue)23:05:44 No.107926872

Anonymous 01/20/26(Tue)23:05:44 No.107926872

>>107926805
When editing with 9b there'll be zero compression artifacts on the edited portions of the image. Wasn't a visible noise pattern that I could see from my cursory glance though.

Anonymous
01/20/26(Tue)23:06:01 No.107926875

Anonymous 01/20/26(Tue)23:06:01 No.107926875

File: Logos.jpg (3.71 MB, 4096x2048)

3.71 MB JPG

>>107926554
top four are 4B Distilled, bottom four are 8B Distilled, 8 steps, Euler / Flux.2 Scheduler
```flaming retro logo text that reads "COMMIT SUICIDE" against a solid black background```

Anonymous
01/20/26(Tue)23:06:40 No.107926878

Anonymous 01/20/26(Tue)23:06:40 No.107926878

>>107926872
>Wasn't a visible noise pattern that I could see from my cursory glance though.
there's a slight shift in colors, but I think it's just a VAE's problem, not a watermark

Anonymous
01/20/26(Tue)23:06:48 No.107926880

Anonymous 01/20/26(Tue)23:06:48 No.107926880

File: 1768219577088406.png (1.26 MB, 1200x675)

1.26 MB PNG

Anonymous
01/20/26(Tue)23:07:37 No.107926888

Anonymous 01/20/26(Tue)23:07:37 No.107926888

>>107926880
Fucking kek

Anonymous
01/20/26(Tue)23:08:01 No.107926890

Anonymous 01/20/26(Tue)23:08:01 No.107926890

File: that's right.png (431 KB, 800x582)

431 KB PNG

>>107926805
that's the reason I want Z-image edit to win, at least the chinks don't annoy us with safety and watermarks

Anonymous
01/20/26(Tue)23:08:07 No.107926891

Anonymous 01/20/26(Tue)23:08:07 No.107926891

>>107926880
>>>/b/945055399
you'll fit right in

Anonymous
01/20/26(Tue)23:09:59 No.107926905

Anonymous 01/20/26(Tue)23:09:59 No.107926905

>>107926875
yeah works here too, I suspected it was just a matter of reframing the prompt or whatever, try without the caps, its so odd how consistently it writes Suiside with "Commit suicide" on prompt

Anonymous
01/20/26(Tue)23:10:07 No.107926911

Anonymous 01/20/26(Tue)23:10:07 No.107926911

>>107926890
the idea of them censoring text output is laughable, and I just proved easily the other anon simply had giga skill issues

Anonymous
01/20/26(Tue)23:10:50 No.107926918

Anonymous 01/20/26(Tue)23:10:50 No.107926918

>>107926905
a lot of models fuck up the text way more if it's not in caps for some reason

Anonymous
01/20/26(Tue)23:10:58 No.107926920

Anonymous 01/20/26(Tue)23:10:58 No.107926920

Absolute kino
https://files.catbox.moe/4vusph.mp4

Anonymous
01/20/26(Tue)23:16:23 No.107926960

Anonymous 01/20/26(Tue)23:16:23 No.107926960

File: 66.jpg (677 KB, 2048x1024)

677 KB JPG

>>107926918
I'll keep that in mind

Commit suicide ahead! vs COMMIT SUICIDE AHEAD!

Anonymous
01/20/26(Tue)23:19:06 No.107926976

Anonymous 01/20/26(Tue)23:19:06 No.107926976

>>107926918
I think it's because there's less variation in fonts when it comes to caps

Anonymous
01/20/26(Tue)23:20:57 No.107926985

Anonymous 01/20/26(Tue)23:20:57 No.107926985

>>107926920
great. too bad you choose a meme slop

Anonymous
01/20/26(Tue)23:22:02 No.107926989

Anonymous 01/20/26(Tue)23:22:02 No.107926989

>>107926985
he worked very hard on it give him his deserved updoots >:(

Anonymous
01/20/26(Tue)23:23:06 No.107926992

Anonymous 01/20/26(Tue)23:23:06 No.107926992

i don't really know "where" gamergate happened. i was here shitposting the whole time
https://x.com/UnburntWitch/status/916106617493495808?s=20

Anonymous
01/20/26(Tue)23:23:51 No.107926997

Anonymous 01/20/26(Tue)23:23:51 No.107926997

>>107926805
what could the watermark possibly include? that the image was made with klein?

Anonymous
01/20/26(Tue)23:23:53 No.107926998

Anonymous 01/20/26(Tue)23:23:53 No.107926998

>>107926827
so that's why klein's anatomy is so shit

Anonymous
01/20/26(Tue)23:24:04 No.107927000

Anonymous 01/20/26(Tue)23:24:04 No.107927000

>>107926992
wrong thread anon

Anonymous
01/20/26(Tue)23:25:20 No.107927006

Anonymous 01/20/26(Tue)23:25:20 No.107927006

>>107926997
>that the image was made with klein?
I guess, that's it's an AI image made with klein

Anonymous
01/20/26(Tue)23:26:16 No.107927012

Anonymous 01/20/26(Tue)23:26:16 No.107927012

>>107926997
Who knows how much they are willing to go in the name of safety

Anonymous
01/20/26(Tue)23:27:35 No.107927021

Anonymous 01/20/26(Tue)23:27:35 No.107927021

File: FluxKlein9BDistilled_Outp(...).png (3.75 MB, 1536x1536)

3.75 MB PNG

Anonymous
01/20/26(Tue)23:27:57 No.107927022

Anonymous 01/20/26(Tue)23:27:57 No.107927022

>>107927012
to be fair, they seem to have calmed down on "safety", Klein is way less uncensored than Kontext for example, Kontext used to not modify your image at all if it had its censorship layers triggered or some shit, never happened on Klein so far

Anonymous
01/20/26(Tue)23:29:37 No.107927030

Anonymous 01/20/26(Tue)23:29:37 No.107927030

>>107926805
This is from memory but "the sample inference code" applies watermarking. As in, the model weights do not perform any watermarking, the Python code around them performs it. You can identify that code and change it.

Anonymous
01/20/26(Tue)23:29:45 No.107927033

Anonymous 01/20/26(Tue)23:29:45 No.107927033

>>107926998
they undershot the recommended step count to make the distilled models look faster if you ask me. It's way better at 8 steps. Censorship wasn't even the the problem with SD3, SD3 was just technically fucked in multiple ways.

Anonymous
01/20/26(Tue)23:30:28 No.107927039

Anonymous 01/20/26(Tue)23:30:28 No.107927039

>>107927030
>, the Python code around them performs it.
I doubt Comfy has implemented that shit, he's too north american for that

Anonymous
01/20/26(Tue)23:30:30 No.107927040

Anonymous 01/20/26(Tue)23:30:30 No.107927040

>>107926848
>>107926853
>>107926858
The bottle cap has two plastic strands, just break one of them and it gets the cap away from your mouth. And now you don't have to babysit a cap in your hand. Anyone complaining about it is not smarter than a bottle cap.

Anonymous
01/20/26(Tue)23:31:52 No.107927047

Anonymous 01/20/26(Tue)23:31:52 No.107927047

>>107927033
>they undershot the recommended step count to make the distilled models look faster if you ask me. It's way better at 8 steps.
that's my guess too, they went for too low, if they distilled it at 8 steps instead of 4 it would've been closer to Z-image turbo in terms of realism and anatomy, unironically

Anonymous
01/20/26(Tue)23:33:21 No.107927052

Anonymous 01/20/26(Tue)23:33:21 No.107927052

File: fk9b_00063.png (1.94 MB, 960x1440)

1.94 MB PNG

sure anon, you generating anime conversions and getting weird outputs is how you catch BFL cheaping out on training

Anonymous
01/20/26(Tue)23:35:12 No.107927059

Anonymous 01/20/26(Tue)23:35:12 No.107927059

File: 1750443513734476.png (87 KB, 360x360)

87 KB PNG

>>107927052
>fk9b
aktually, it's f2k9b

Anonymous
01/20/26(Tue)23:36:19 No.107927068

Anonymous 01/20/26(Tue)23:36:19 No.107927068

>>107927047
depends what you're doing. some of my gens look cooked at 8 steps.

Anonymous
01/20/26(Tue)23:37:42 No.107927080

Anonymous 01/20/26(Tue)23:37:42 No.107927080

>>107927068
no, I meant that BFL should've distilled the model at 8 steps instead of 4, and yeah, going for 8 inference steps for a 4-steps distilled model is probably not the right idea (I get less slopped shit personally so I'm ok with it)

Anonymous
01/20/26(Tue)23:39:49 No.107927088

Anonymous 01/20/26(Tue)23:39:49 No.107927088

What the fuck does the crying emoji mean on civitai?

Anonymous
01/20/26(Tue)23:40:01 No.107927093

Anonymous 01/20/26(Tue)23:40:01 No.107927093

File: Screenshot 2026-01-20 at (...).png (612 KB, 870x470)

612 KB PNG

how do you get rid of aliasing artifacts with LTX-2? The repeating squares.

Anonymous
01/20/26(Tue)23:41:20 No.107927098

Anonymous 01/20/26(Tue)23:41:20 No.107927098

>>107927093
are you using vae decode (tiled)?

Anonymous
01/20/26(Tue)23:41:56 No.107927103

Anonymous 01/20/26(Tue)23:41:56 No.107927103

>>107927098
yes. guess I'm fucked.

Anonymous
01/20/26(Tue)23:42:37 No.107927108

Anonymous 01/20/26(Tue)23:42:37 No.107927108

>>107927088
It's dislike button #2. Crying laughing emoji is dislike #1

Anonymous
01/20/26(Tue)23:42:47 No.107927110

Anonymous 01/20/26(Tue)23:42:47 No.107927110

File: f2k9b_00002.png (2.15 MB, 960x1440)

2.15 MB PNG

>>107927059
good looking out croski

>>107927088
we truly may never know

Anonymous
01/20/26(Tue)23:43:53 No.107927116

Anonymous 01/20/26(Tue)23:43:53 No.107927116

File: 1752777407964525.png (40 KB, 949x390)

40 KB PNG

>>107927103
nah, it can work, what are your settings? I'm getting ok results with those

Anonymous
01/20/26(Tue)23:44:24 No.107927119

Anonymous 01/20/26(Tue)23:44:24 No.107927119

>>107927116
these*

Anonymous
01/20/26(Tue)23:45:22 No.107927124

Anonymous 01/20/26(Tue)23:45:22 No.107927124

>>107927116
>>107927098
I tried with the normal vae decode and it still gives the same results.

So it's probably not the VAE?

Anonymous
01/20/26(Tue)23:46:02 No.107927128

Anonymous 01/20/26(Tue)23:46:02 No.107927128

>>107927124
show a screen of your workflow, something must be wrong

Anonymous
01/20/26(Tue)23:48:07 No.107927137

Anonymous 01/20/26(Tue)23:48:07 No.107927137

Alright. I installed Comfy UI.
Now what?
Do I need to install Stable Diffusion XL, Z-Image, or Z-Image Turbo?
Is Z-Image even safe? It's from the Chinese.
I have 1050 Ti (4GB VRAM) + 16 GB DDR3 RAM, if that matters.

Anonymous
01/20/26(Tue)23:49:09 No.107927142

Anonymous 01/20/26(Tue)23:49:09 No.107927142

File: 2026-01-20-234853_1224x88(...).png (277 KB, 1224x887)

277 KB PNG

>>107927128
part 1

Anonymous
01/20/26(Tue)23:50:10 No.107927152

Anonymous 01/20/26(Tue)23:50:10 No.107927152

File: 2026-01-20-234934_1636x74(...).png (241 KB, 1636x749)

241 KB PNG

>>107927142
part 2

Anonymous
01/20/26(Tue)23:51:30 No.107927155

Anonymous 01/20/26(Tue)23:51:30 No.107927155

File: Klein 9b distill.png (2.64 MB, 1984x1040)

2.64 MB PNG

https://www.youtube.com/watch?v=KFzhe7PKWdw

Anonymous
01/20/26(Tue)23:52:31 No.107927162

Anonymous 01/20/26(Tue)23:52:31 No.107927162

>>107927152
>euler ancestral
try regular euler

Anonymous
01/20/26(Tue)23:53:48 No.107927172

Anonymous 01/20/26(Tue)23:53:48 No.107927172

>>107927162
the upscaler is bypassed.

Anonymous
01/20/26(Tue)23:55:05 No.107927182

Anonymous 01/20/26(Tue)23:55:05 No.107927182

>>107927142
>gemma 3 fp4
.. it's probably that, text encoders are ultra sensitive to quants

Anonymous
01/20/26(Tue)23:56:35 No.107927188

Anonymous 01/20/26(Tue)23:56:35 No.107927188

File: 1756922818258551.jpg (232 KB, 1280x720)

232 KB JPG

Anonymous
01/20/26(Tue)23:57:07 No.107927195

Anonymous 01/20/26(Tue)23:57:07 No.107927195

>>107927137
download a quant of z-image
comfy has guides if you're not sure how to install things

Anonymous
01/20/26(Tue)23:57:57 No.107927199

Anonymous 01/20/26(Tue)23:57:57 No.107927199

File: 1767016860549490.png (328 KB, 798x644)

328 KB PNG

>>107927137
>I have 1050 Ti (4GB VRAM)
bruh

Anonymous
01/20/26(Tue)23:58:37 No.107927201

Anonymous 01/20/26(Tue)23:58:37 No.107927201

>>107927182
Thanks, will try fp8.

Anonymous
01/20/26(Tue)23:59:04 No.107927204

Anonymous 01/20/26(Tue)23:59:04 No.107927204

>>107927199
kek
he might as well run 1.5

Anonymous
01/20/26(Tue)23:59:40 No.107927206

Anonymous 01/20/26(Tue)23:59:40 No.107927206

>>107927201
go for Q8 instead, it's the same size and the quality is way closer to bf16

Anonymous
01/21/26(Wed)00:02:37 No.107927219

Anonymous 01/21/26(Wed)00:02:37 No.107927219

>>107927206
I'll see if fp8 makes a difference first.

Anonymous
01/21/26(Wed)00:04:48 No.107927235

Anonymous 01/21/26(Wed)00:04:48 No.107927235

>want to try training LTX loras
>realize I don't have enough videos to make anything worthwhile
Am I going to have to scrape leaked OnlyFans archives or something?

Anonymous
01/21/26(Wed)00:04:54 No.107927236

Anonymous 01/21/26(Wed)00:04:54 No.107927236

>>107927219
fp8 is shit, stop using it

Anonymous
01/21/26(Wed)00:06:31 No.107927243

Anonymous 01/21/26(Wed)00:06:31 No.107927243

>>107927235
It can be a bit of a fucking nightmare if your concept is niche enough.

I had to make animations in blender to train the concept I wanted and it barely worked. From there I had to cherry pick the best outputs from the initial scuffed LoRA to get a dataset that was more stable.

Anonymous
01/21/26(Wed)00:07:45 No.107927249

Anonymous 01/21/26(Wed)00:07:45 No.107927249

zit takes as qwen3 4b. Why can't it use the 8b?

Anonymous
01/21/26(Wed)00:12:54 No.107927275

Anonymous 01/21/26(Wed)00:12:54 No.107927275

>>107927249
Because to the model the output looks like gobbledygook if it wasn't trained on that specific text encoder.

Anonymous
01/21/26(Wed)00:14:31 No.107927279

Anonymous 01/21/26(Wed)00:14:31 No.107927279

>>107927249
Not the same model.

Anonymous
01/21/26(Wed)00:14:37 No.107927281

Anonymous 01/21/26(Wed)00:14:37 No.107927281

>>107927249
>>107927275
I think they deemed qwen 3 8b to be too powerful to be Apache 2.0, so they nerfed it

Anonymous
01/21/26(Wed)00:18:49 No.107927293

Anonymous 01/21/26(Wed)00:18:49 No.107927293

File: ComfyUI_temp_dqhuu_00012_.jpg (571 KB, 1950x1510)

571 KB JPG

cfg hacking, this is the same seed.

Anonymous
01/21/26(Wed)00:19:51 No.107927297

Anonymous 01/21/26(Wed)00:19:51 No.107927297

>>107927047
I think it's as realistic or more than Z DESU, with the right prompt. The new VAE is really good.

Anonymous
01/21/26(Wed)00:20:55 No.107927300

Anonymous 01/21/26(Wed)00:20:55 No.107927300

File: Klein 9b distill.png (2 MB, 2720x768)

2 MB PNG

looks like this model is easy to train, loras look good on it
https://civitai.com/models/2188187/old-school-runescape-style-lora-klein-and-zit?modelVersionId=2615834

Anonymous
01/21/26(Wed)00:21:54 No.107927307

Anonymous 01/21/26(Wed)00:21:54 No.107927307

>>107927300
kek

Anonymous
01/21/26(Wed)00:21:56 No.107927308

Anonymous 01/21/26(Wed)00:21:56 No.107927308

>>107927293
this is Klein?
>cfg hacking
how? it looks pretty good anon, you're onto something really interesting

Anonymous
01/21/26(Wed)00:22:38 No.107927311

Anonymous 01/21/26(Wed)00:22:38 No.107927311

>>107926565
>>107927304

Anonymous
01/21/26(Wed)00:23:36 No.107927316

Anonymous 01/21/26(Wed)00:23:36 No.107927316

When training wan video loras, will the style of the dataset matter or can I make it just focus on the motion?

Anonymous
01/21/26(Wed)00:27:09 No.107927334

Anonymous 01/21/26(Wed)00:27:09 No.107927334

>>107927297
>The new VAE is really good.
yeah it's definitely an improvement over flux 1's vae, now the Z-image series look a bit outdated if they keep using the previous version, deep down I hope they used those 2 months to switch VAEs but I'm coping way too hard now lol

Anonymous
01/21/26(Wed)00:31:08 No.107927353

Anonymous 01/21/26(Wed)00:31:08 No.107927353

File: in the morning.png (3.74 MB, 1536x2048)

3.74 MB PNG

Anonymous
01/21/26(Wed)00:31:24 No.107927354

Anonymous 01/21/26(Wed)00:31:24 No.107927354

File: ComfyUI_temp_dqhuu_00017_.jpg (552 KB, 1950x1510)

552 KB JPG

>>107927308
i can't tell if one is better or just different is the issue, basically i'm attempting to offset the cfg to skip the first step

Anonymous
01/21/26(Wed)00:31:43 No.107927356

Anonymous 01/21/26(Wed)00:31:43 No.107927356

>>107927297
>The new VAE is really good.
what new vae? i was just seething about the compression artifacts in my lonesome

Anonymous
01/21/26(Wed)00:32:28 No.107927361

Anonymous 01/21/26(Wed)00:32:28 No.107927361

listen, I'm gonna need something with the prompt adherence and video quality of wan2.2 combined with the audio, video length and generation speed of ltx2 right now

Anonymous
01/21/26(Wed)00:32:44 No.107927365

Anonymous 01/21/26(Wed)00:32:44 No.107927365

File: Klein 9b distill.png (2.81 MB, 2720x768)

2.81 MB PNG

>>107927300
lul
https://civitai.com/models/2280663/basedjak?modelVersionId=2609867

Anonymous
01/21/26(Wed)00:32:46 No.107927366

Anonymous 01/21/26(Wed)00:32:46 No.107927366

>>107927354
take you eyes off the slut and look at the background. one is clearly better

Anonymous
01/21/26(Wed)00:33:45 No.107927371

Anonymous 01/21/26(Wed)00:33:45 No.107927371

>>107927354
I like the one on the right it doesn't have that fucking bokeh

Anonymous
01/21/26(Wed)00:34:50 No.107927376

Anonymous 01/21/26(Wed)00:34:50 No.107927376

>>107927356
>what new vae?
Flux 2 Klein uses Flux 2's vae, it's an improvement over Kontext that was using Flux 1's vae, and Z-image turbo also uses Flux 1's vae

Anonymous
01/21/26(Wed)00:36:19 No.107927387

Anonymous 01/21/26(Wed)00:36:19 No.107927387

File: Flux2-Klein_00135_.png (1.66 MB, 1024x1024)

1.66 MB PNG

Anonymous
01/21/26(Wed)00:40:06 No.107927407

Anonymous 01/21/26(Wed)00:40:06 No.107927407

File: 1757429347546178.png (1021 KB, 1168x880)

1021 KB PNG

>>107927365
kek

Anonymous
01/21/26(Wed)00:40:16 No.107927408

Anonymous 01/21/26(Wed)00:40:16 No.107927408

>>107927361
Video quality I get for ltx is shit but I really don't get the ltx prompt adherence is bad, you can time stamp the prompt and it will follow a good 80-90% of it, you time stamp prompt in wan it will follow whatever the first action and take the entire 5 seconds doing that.

Anonymous
01/21/26(Wed)00:42:58 No.107927426

Anonymous 01/21/26(Wed)00:42:58 No.107927426

File: ComfyUI_temp_dqhuu_00029_.jpg (585 KB, 1950x1510)

585 KB JPG

>>107927371
it's weird that if i don't mention bokeh, it can go either way but i prompt they both respect it.

Anonymous
01/21/26(Wed)00:44:06 No.107927434

Anonymous 01/21/26(Wed)00:44:06 No.107927434

So why did the mentally ill moron spam the last thread?

Anonymous
01/21/26(Wed)00:45:16 No.107927444

Anonymous 01/21/26(Wed)00:45:16 No.107927444

>>107927408
I haven't tried timestamps with ltx but when the character moves around it seems to go to complete warbled shit

Anonymous
01/21/26(Wed)00:45:23 No.107927445

Anonymous 01/21/26(Wed)00:45:23 No.107927445

>>107927426
it definitely looks better on the right, look at the light on her hair it's way more natural, reminds me of Z-image turbo a bit, what's your method anon? you made something really cool

Anonymous
01/21/26(Wed)00:47:52 No.107927456

Anonymous 01/21/26(Wed)00:47:52 No.107927456

>>107927236
ok? u got a link to the q8 for comfy?

Anonymous
01/21/26(Wed)00:49:00 No.107927463

Anonymous 01/21/26(Wed)00:49:00 No.107927463

>>107927456
https://huggingface.co/Qwen/Qwen3-8B-GGUF

Anonymous
01/21/26(Wed)00:51:07 No.107927473

Anonymous 01/21/26(Wed)00:51:07 No.107927473

>>107927463
Maybe you should have paid attention to the whole conversation before showing everyone how schizo you are.

Anonymous
01/21/26(Wed)00:53:45 No.107927491

Anonymous 01/21/26(Wed)00:53:45 No.107927491

>>107927473
What do you mean?

Anonymous
01/21/26(Wed)00:53:58 No.107927493

Anonymous 01/21/26(Wed)00:53:58 No.107927493

What is the current state of voice and soundeffects diffusion?
I tried MMAudio for my wan gens and it was shit.
Are there still no good options for generating sound from a video input? and what about generating dialogue for specific characters?

Anonymous
01/21/26(Wed)00:55:31 No.107927502

Anonymous 01/21/26(Wed)00:55:31 No.107927502

>>107927444
Yeah I think that's more of a fault of how compressed the latents are rather than prompt, it will try to do the prompt but since it's so compressed (I believe it's double what Wan does) the model screws up. Now whether the fix for this more time to bake or maybe just a inherent issue I guess time will tell.

Anonymous
01/21/26(Wed)00:55:31 No.107927503

Anonymous 01/21/26(Wed)00:55:31 No.107927503

>>107927456
https://huggingface.co/unsloth/gemma-3-12b-it-GGUF

Anonymous
01/21/26(Wed)00:56:30 No.107927508

Anonymous 01/21/26(Wed)00:56:30 No.107927508

File: ComfyUI_temp_dqhuu_00040_.jpg (501 KB, 1950x1510)

501 KB JPG

>>107927445
this may just be a way to amplify lora effectiveness. it still has a tendency to add more anatomy issues. i am messing with the "cfg zero/zero init" node. i mentioned this here before klein dropped. but kjnodes has a beta node that works. just using it with the default zero init at zero steps.

Anonymous
01/21/26(Wed)00:59:17 No.107927514

Anonymous 01/21/26(Wed)00:59:17 No.107927514

>>107927182
>>107927206
>>107927219
So took longer because my docker in WSL2 decided to shit itself (probably ran out of disk space).

fp8 gives no improvements.

Anonymous
01/21/26(Wed)00:59:33 No.107927520

Anonymous 01/21/26(Wed)00:59:33 No.107927520

File: ComfyUI_temp_dqhuu_00043_.jpg (828 KB, 1950x1510)

828 KB JPG

>>107927508
tldr; distilled models like to set up the structure very early in steps, if you skip the initial steps it is way more creative, but also fucks up really easy.

Anonymous
01/21/26(Wed)01:02:26 No.107927539

Anonymous 01/21/26(Wed)01:02:26 No.107927539

File: 1751224322363611.png (2.27 MB, 1711x976)

2.27 MB PNG

oof, it compressed the image hard on that one, VAEs on edit models was a mistake
https://www.youtube.com/watch?v=rWyRxQoNHJU

Anonymous
01/21/26(Wed)01:03:30 No.107927544

Anonymous 01/21/26(Wed)01:03:30 No.107927544

File: 1759722030303206.png (4 KB, 63x58)

4 KB PNG

>>107927539
>VAEs on edit models was a mistake
apologize

Anonymous
01/21/26(Wed)01:04:18 No.107927552

Anonymous 01/21/26(Wed)01:04:18 No.107927552

>>107927539
>>107927544
Once someone makes a pixel edit model to prove it, I'll call bullshit

Anonymous
01/21/26(Wed)01:04:28 No.107927554

Anonymous 01/21/26(Wed)01:04:28 No.107927554

File: klein_00288_.png (1.82 MB, 1040x1520)

1.82 MB PNG

Anonymous
01/21/26(Wed)01:05:09 No.107927560

Anonymous 01/21/26(Wed)01:05:09 No.107927560

File: I believe.png (235 KB, 500x489)

235 KB PNG

https://github.com/Tongyi-MAI/Z-Image/issues/126#issuecomment-3769946123
>In reality, the base version has diverged significantly from the initial plan. The original roadmap featured only three variants: base, turbo, and edit. The edit model was developed through additional training and supervised fine-tuning specifically for editing tasks on top of the base version. However, the base version has now evolved into omni base, which inherently incorporates editing capabilities. This signifies that the Edit dataset was incorporated during the initial low-resolution pre-training phase, necessitating extensive retraining. The Chinese community currently anticipates Omni Base's release around the Chinese New Year period.

Anonymous
01/21/26(Wed)01:05:19 No.107927561

Anonymous 01/21/26(Wed)01:05:19 No.107927561

>>107926791
>https://rentry.org/debo
>https://rentry.org/animanon
can anyone please explain why does some troon keep adding this off-topic shit to the op? we have some proper threads and then the schizo reappears and invades the op like a troon in a girls' bathroom. disgusting and annoying

Anonymous
01/21/26(Wed)01:05:35 No.107927563

Anonymous 01/21/26(Wed)01:05:35 No.107927563

File: ComfyUI_00001_.png (368 KB, 512x512)

368 KB PNG

My first render!
What should I try next?

Anonymous
01/21/26(Wed)01:06:04 No.107927566

Anonymous 01/21/26(Wed)01:06:04 No.107927566

>another no u
mark it down

Anonymous
01/21/26(Wed)01:06:22 No.107927567

Anonymous 01/21/26(Wed)01:06:22 No.107927567

>>107927544
I was always in favor of VAEless models, and I hope lodestone will make Klein or Z-image edit VAEless as well

Anonymous
01/21/26(Wed)01:07:22 No.107927576

Anonymous 01/21/26(Wed)01:07:22 No.107927576

>>107927563
1girl, large breasts, masterpiece, style_cluster948332

Anonymous
01/21/26(Wed)01:09:28 No.107927586

Anonymous 01/21/26(Wed)01:09:28 No.107927586

>>107927567
>I was always in favor of VAEless models
okay
>and I hope lodestone will make Klein or Z-image edit VAEless as well

You need to stop relying on this do-nothing furfag with an attention span shorter than the average ipad kid to solve your issues.

Anonymous
01/21/26(Wed)01:10:34 No.107927592

Anonymous 01/21/26(Wed)01:10:34 No.107927592

>>107927563
1girl, fennec fox, standing

Anonymous
01/21/26(Wed)01:11:45 No.107927597

Anonymous 01/21/26(Wed)01:11:45 No.107927597

>>107927586
who should I rely on then? you? come on anon, you can do it

Anonymous
01/21/26(Wed)01:11:57 No.107927598

Anonymous 01/21/26(Wed)01:11:57 No.107927598

File: file.png (563 KB, 1858x3934)

563 KB PNG

this time they'll believe i'm just a random anon..!

Anonymous
01/21/26(Wed)01:12:26 No.107927600

Anonymous 01/21/26(Wed)01:12:26 No.107927600

>>107927356
Flux.2 VAE, Klein uses it also.
https://bfl.ai/research/representation-comparison

Anonymous
01/21/26(Wed)01:14:18 No.107927607

Anonymous 01/21/26(Wed)01:14:18 No.107927607

When will LTX-2 be capable of nsfw audio?

Anonymous
01/21/26(Wed)01:14:38 No.107927608

Anonymous 01/21/26(Wed)01:14:38 No.107927608

File: ComfyUI_temp_dqhuu_00057_.jpg (421 KB, 1566x1222)

421 KB JPG

Anonymous
01/21/26(Wed)01:14:44 No.107927610

Anonymous 01/21/26(Wed)01:14:44 No.107927610

>>107927600
>https://bfl.ai/research/representation-comparison
>Stay tuned for FLUX.3 - coming soon ™.
lul, I'm looking forward to it, they made a good Klein model so they're not completly useless after all

Anonymous
01/21/26(Wed)01:15:04 No.107927612

Anonymous 01/21/26(Wed)01:15:04 No.107927612

>>107927607
check civtai. it can.

Anonymous
01/21/26(Wed)01:15:19 No.107927614

Anonymous 01/21/26(Wed)01:15:19 No.107927614

>>107927598
Didn't you just spend your whole day spamming a thread? Get a life you sad freak.

Anonymous
01/21/26(Wed)01:15:31 No.107927615

Anonymous 01/21/26(Wed)01:15:31 No.107927615

>>107927597
>who should I rely on then?
Nobody. Just stop giving that nobody the deference he doesn't deserve.

Anonymous
01/21/26(Wed)01:17:40 No.107927626

Anonymous 01/21/26(Wed)01:17:40 No.107927626

>>107927615
he was known for fluffyrock before he ever did Chroma though, that thing was in a ton of SD 1.5 merges that didn't even have anything to do with furry stuff

Anonymous
01/21/26(Wed)01:18:14 No.107927631

Anonymous 01/21/26(Wed)01:18:14 No.107927631

>>107927614
who do you think i am lol? you think it's only one person you stole gens from?

Anonymous
01/21/26(Wed)01:18:31 No.107927634

Anonymous 01/21/26(Wed)01:18:31 No.107927634

File: temp1.png (1018 KB, 1023x512)

1018 KB PNG

>>107927576
>>107927592
LOL
First took 37 seconds, second took 27 seconds.
1050 Ti (4 GB VRAM)

Anonymous
01/21/26(Wed)01:18:32 No.107927635

Anonymous 01/21/26(Wed)01:18:32 No.107927635

>>107927614
>>107927627
I bet there's at least ONE newfren believes you. But only one.

Anonymous
01/21/26(Wed)01:18:40 No.107927636

Anonymous 01/21/26(Wed)01:18:40 No.107927636

File: Screenshot 2026-01-21 at (...).png (765 KB, 515x870)

765 KB PNG

Maybe my model is cooked?

Anonymous
01/21/26(Wed)01:19:48 No.107927639

Anonymous 01/21/26(Wed)01:19:48 No.107927639

>>107927634
>First took 37 seconds,
Man. this brings me back to my 1080 days...
Cherish this moment anon.

Anonymous
01/21/26(Wed)01:19:57 No.107927640

Anonymous 01/21/26(Wed)01:19:57 No.107927640

>>107927634
based
>1050 Ti (4 GB VRAM)
ouch :/

Anonymous
01/21/26(Wed)01:20:26 No.107927641

Anonymous 01/21/26(Wed)01:20:26 No.107927641

>>107927612
Are you talking about the furfag lora?

Anonymous
01/21/26(Wed)01:21:30 No.107927644

Anonymous 01/21/26(Wed)01:21:30 No.107927644

>>107927641
nta but it can do people too
https://civitai.com/images/117716335

Anonymous
01/21/26(Wed)01:21:32 No.107927645

Anonymous 01/21/26(Wed)01:21:32 No.107927645

File: 4445445458.jpg (510 KB, 1310x873)

510 KB JPG

This confirms my suspicion, ACEStep 1.5 is already on par if not better than Suno v5 sound quality wise.

Anonymous
01/21/26(Wed)01:22:03 No.107927647

Anonymous 01/21/26(Wed)01:22:03 No.107927647

Kill ani

Anonymous
01/21/26(Wed)01:23:03 No.107927651

Anonymous 01/21/26(Wed)01:23:03 No.107927651

File: ComfyUI_temp_dqhuu_00062_.png (3.89 MB, 1566x1222)

3.89 MB PNG

Anonymous
01/21/26(Wed)01:23:18 No.107927652

Anonymous 01/21/26(Wed)01:23:18 No.107927652

>>107927635
after seeing all that tran has done to ldg, i dont think there's anyone left who believes their lies. her power lies solely in her boyfriend faggot mod who bans everyone who tries to bring the truth

Anonymous
01/21/26(Wed)01:23:26 No.107927653

Anonymous 01/21/26(Wed)01:23:26 No.107927653

>>107927645
only udio is worth comparing to, suno is not the best music model

Anonymous
01/21/26(Wed)01:24:13 No.107927658

Anonymous 01/21/26(Wed)01:24:13 No.107927658

>>107927645
I just dont buy it. Even though it has clearly massively improved recently

Anonymous
01/21/26(Wed)01:24:19 No.107927659

Anonymous 01/21/26(Wed)01:24:19 No.107927659

https://civitai.com/models/2322631/klein-pp-uncut-flaccid-penis

This dick lora is a good example of the Flux.2 VAE being noticeably better I think

Anonymous
01/21/26(Wed)01:25:13 No.107927667

Anonymous 01/21/26(Wed)01:25:13 No.107927667

>>107927653
nta, but I tried Udio yesterday and it was trash compared to suno. Like I was actually shocked.

Anonymous
01/21/26(Wed)01:26:32 No.107927672

Anonymous 01/21/26(Wed)01:26:32 No.107927672

>>107927667
yeah, udio ain't what they used to, but there was a time when udio was actually amazing
https://www.udio.com/songs/wwRF2Bs6fQgbvqchqU6kAe
https://www.udio.com/songs/cnnJ166HGBKhTeHGkxgCtq

Anonymous
01/21/26(Wed)01:26:43 No.107927674

Anonymous 01/21/26(Wed)01:26:43 No.107927674

>>107927652
its sad that he tried so hard to get anon to use his wrapper but it failed because it has no features. and now he just anon posts in the third person. very sad

Anonymous
01/21/26(Wed)01:31:15 No.107927692

Anonymous 01/21/26(Wed)01:31:15 No.107927692

>>107927544
he did a great job with radiance given the means he had, but it's not an edit model?

Anonymous
01/21/26(Wed)01:33:36 No.107927700

Anonymous 01/21/26(Wed)01:33:36 No.107927700

File: ComfyUI_temp_xhamt_00001_.png (2.6 MB, 1280x1024)

2.6 MB PNG

So what would be an "allstar" version of comfy?
>torch 2.8
>monkypatched model support
>gui from half a year ago
Anyone tried to stitch this shit together?

Anonymous
01/21/26(Wed)01:34:48 No.107927707

Anonymous 01/21/26(Wed)01:34:48 No.107927707

>>107927700
>monkypatched model support
what's that?

Anonymous
01/21/26(Wed)01:36:36 No.107927716

Anonymous 01/21/26(Wed)01:36:36 No.107927716

>>107927700
>torch 2.8
Why 2.8 in particular? TensorRT is one thing I can think of that broke down with 2.9.
>monkypatched
Que?

Anonymous
01/21/26(Wed)01:36:39 No.107927717

Anonymous 01/21/26(Wed)01:36:39 No.107927717

>>107927707
You've never run a patch that exists in a single unmerged pr?

Anonymous
01/21/26(Wed)01:50:09 No.107927780

Anonymous 01/21/26(Wed)01:50:09 No.107927780

Can anyone share an LTX-2 workflow that makes use of Kijai's .gguf?
The ComfyUI template workflow uses ckpt_name for it's nodes and it's a pain to adjust.

Anonymous
01/21/26(Wed)01:51:34 No.107927793

Anonymous 01/21/26(Wed)01:51:34 No.107927793

>>107927780
Just install this node and use the loader from it
https://github.com/city96/ComfyUI-GGUF

Anonymous
01/21/26(Wed)01:51:41 No.107927796

Anonymous 01/21/26(Wed)01:51:41 No.107927796

>>107927780
try growing a brain cell?

Anonymous
01/21/26(Wed)01:53:44 No.107927812

Anonymous 01/21/26(Wed)01:53:44 No.107927812

File: Untitled.png (93 KB, 658x723)

93 KB PNG

>>107927780
Just delete the model loader nodes and replace them with these (you will need KJ nodes and city96 gguf nodes)

If that is too difficult for you, I don't know what to tell you.

Anonymous
01/21/26(Wed)01:55:06 No.107927819

Anonymous 01/21/26(Wed)01:55:06 No.107927819

>>107927812
iirc the vae loading has been fixed also in mainline, so no need for kj nodes anymore there

Anonymous
01/21/26(Wed)01:59:15 No.107927843

Anonymous 01/21/26(Wed)01:59:15 No.107927843

>>107927812
what is the lora you are using there? and why is it set to -.20?

Anonymous
01/21/26(Wed)02:00:28 No.107927848

Anonymous 01/21/26(Wed)02:00:28 No.107927848

>>107927843
That's the distill LoRA. I am using it with the distill model but setting it to -20 because I want less distill in my distill model.

It's the same as using the undistilled weights with the distill LoRA set at a strength of .8

Anonymous
01/21/26(Wed)02:01:42 No.107927854

Anonymous 01/21/26(Wed)02:01:42 No.107927854

File: Screenshot 2026-01-21 180011.png (18 KB, 595x398)

18 KB PNG

>>107927812
I already tried all that, and get this error. Using the exact same nodes.

Anonymous
01/21/26(Wed)02:03:21 No.107927864

Anonymous 01/21/26(Wed)02:03:21 No.107927864

>>107927854
Post your nodes so I can see them.

Anonymous
01/21/26(Wed)02:09:50 No.107927916

Anonymous 01/21/26(Wed)02:09:50 No.107927916

File: 74545848874.png (35 KB, 1112x187)

35 KB PNG

>>107927672
Udio at is peak is absolutely great, and what's great about it is its composition ability

https://www.udio.com/songs/nfdtmJRUC7niZfhseaHdNk

https://www.udio.com/songs/7zrLreMnwCYrdBqQkGtEXM

https://www.udio.com/songs/hoCg4BmayTYXcJfjo4jvbT

Specially its insane adherence to lyrics, ACEStep 1.5 still has nothing on it, but there's pic rel that can bridge the gap. He has posted examples on Discord and it will absolutely sound insane for fixing up existing songs. There's also "extend" feature. Sound quality wise, Udio is noticeably worse on many songs, you can clearly hear this issue with a good pair of headphones (I use HD 600/bookshelf speakers so I know what I mean). You can really notice the compression on Udio songs when you turn up the volume. That's either largely because Udio compresses their quality since they don't want plebs like you using the best of the best (voice quality is noticeably superior to everything else though), or perhaps their model just isn't as focused on that as Suno/ACEStep 1.5.

Composition wise, ACEStep 1.5 is almost there. You be the judge, but if on a good seed it's Udio tier, that means v2 is going to surpass Udio.

Regular anime/romantic stuff:
https://files.catbox.moe/2t4h82.mp3

8bit mix:
https://files.catbox.moe/7pqlbx.mp3

Glitched out synth music:
https://files.catbox.moe/klw8a6.mp3

Anyways, just be glad local is finally gonna be eating good.

Anonymous
01/21/26(Wed)02:10:23 No.107927920

Anonymous 01/21/26(Wed)02:10:23 No.107927920

Fuck me. all I had to do was decode the "denoised output."

Anonymous
01/21/26(Wed)02:11:24 No.107927930

Anonymous 01/21/26(Wed)02:11:24 No.107927930

>>107927916
>v2
If I go by the discord and trust what they say (I do, they seem reasonable and are upfront about their plans) version 2 will be open source as well.

They did say the that if they got a model more powerful than suno they would API it though.

Anonymous
01/21/26(Wed)02:13:16 No.107927937

Anonymous 01/21/26(Wed)02:13:16 No.107927937

File: comfyui_fluxklein9b_temp_(...).jpg (210 KB, 1216x832)

210 KB JPG

>>107927561
>>107927614
>>107927652
uh oh, melty!

Anonymous
01/21/26(Wed)02:13:45 No.107927943

Anonymous 01/21/26(Wed)02:13:45 No.107927943

File: Screenshot 2026-01-21 181228.png (20 KB, 662x398)

20 KB PNG

>>107927812
>>107927864
Forget the vae error (found the obvious problem).
Now it's pic related.

Anonymous
01/21/26(Wed)02:15:03 No.107927946

Anonymous 01/21/26(Wed)02:15:03 No.107927946

>>107927937
lmao

Anonymous
01/21/26(Wed)02:15:18 No.107927948

Anonymous 01/21/26(Wed)02:15:18 No.107927948

>>107927943
turn off preview, I think

Anonymous
01/21/26(Wed)02:15:22 No.107927949

Anonymous 01/21/26(Wed)02:15:22 No.107927949

>>107927943
POST A FUCKING SCREENSHOT OF YOUR MODEL LOADER NODES OR I CANNOT FIGURE OUT WHAT YOU'VE DONE WRONG.

Anonymous
01/21/26(Wed)02:16:17 No.107927958

Anonymous 01/21/26(Wed)02:16:17 No.107927958

i have yet to see a single good character lora for f2k
its only good at styles

Anonymous
01/21/26(Wed)02:16:53 No.107927960

Anonymous 01/21/26(Wed)02:16:53 No.107927960

>>107927943
how can we help you if you don't show your workflow anon?

Anonymous
01/21/26(Wed)02:20:17 No.107927974

Anonymous 01/21/26(Wed)02:20:17 No.107927974

File: v6.png (299 KB, 1123x1336)

299 KB PNG

>>107927949
Calm your autism, sperg. I already posted them before deleting after 5 minutes when I noticed the first problem, I just assumed it would've been seen.

>>107927960
https://litter.catbox.moe/6epmp5nvjes21xvf.json

Anonymous
01/21/26(Wed)02:21:19 No.107927977

Anonymous 01/21/26(Wed)02:21:19 No.107927977

>>107927974
sup wanschizo, why have you not been active recently?

Anonymous
01/21/26(Wed)02:21:48 No.107927982

Anonymous 01/21/26(Wed)02:21:48 No.107927982

>>107927946
yeah, haven't seen troonjak melt this hard over her boogeyman scapegoat since yesterday kek

Anonymous
01/21/26(Wed)02:22:02 No.107927983

Anonymous 01/21/26(Wed)02:22:02 No.107927983

>>107927930
The development goal for v2 is to surpass Suno/Udio, but I don't think they're going commercial, at least not fully, it's possible they will just have a proprietary license though. 1.5 will definitely wake up some companies and give them competition (E.G. Alibaba), so them going fully closed is not a real concern.

Anonymous
01/21/26(Wed)02:22:31 No.107927984

Anonymous 01/21/26(Wed)02:22:31 No.107927984

>>107927977
>>107927974
wait, isn't that the dude that was pretending to have errors on his workflow so that he can troll everyone with it?

Anonymous
01/21/26(Wed)02:23:37 No.107927994

Anonymous 01/21/26(Wed)02:23:37 No.107927994

>>107927974
Your clip loader doesn't have the connector, but people seem to think you're a schizo so...

Anonymous
01/21/26(Wed)02:24:04 No.107927997

Anonymous 01/21/26(Wed)02:24:04 No.107927997

can anyone explain this mental illness to me?: >>107927977 >>107927984
I really don't get it.

Anonymous
01/21/26(Wed)02:25:46 No.107928003

Anonymous 01/21/26(Wed)02:25:46 No.107928003

>>107927997
We have a consistent cabal of Comfy shills plaguing this general. They pretend like it's impossible to have any issues with Comfy and that anyone posting about them is a troll.

Anonymous
01/21/26(Wed)02:26:10 No.107928005

Anonymous 01/21/26(Wed)02:26:10 No.107928005

>>107927997
ranfaggot is trying to throw off the scent after showing her hand so that the topic of which schizo is spamming the thread isn't her

Anonymous
01/21/26(Wed)02:26:26 No.107928008

Anonymous 01/21/26(Wed)02:26:26 No.107928008

>>107927997
lol come on dude i'm just asking how you're doing, are you still playing those mind games? suit yourself lol

Anonymous
01/21/26(Wed)02:28:02 No.107928014

Anonymous 01/21/26(Wed)02:28:02 No.107928014

>>107927994
What clip loader? DualClipLoader is connected to the Clip Text Encode just like the previous screencap, and it's the only connector clip node connection in the ltx-2 distill template.

>but people seem to think you're a schizo so...
by all means keep trying to fit in with the mentally ill.

Anonymous
01/21/26(Wed)02:29:25 No.107928017

Anonymous 01/21/26(Wed)02:29:25 No.107928017

File: 882633.png (403 KB, 1476x1094)

403 KB PNG

>>107928014
Just for the record, here is the unedited default template for ltx-2.

Anonymous
01/21/26(Wed)02:29:56 No.107928020

Anonymous 01/21/26(Wed)02:29:56 No.107928020

>>107928003
I had a problem with comfy yesterday, a python thing. Everyone called me a retard. They were right. I learned how to pin python versions and fixed it in 15 minutes and now I'm genning faster than ever

Anonymous
01/21/26(Wed)02:30:08 No.107928023

Anonymous 01/21/26(Wed)02:30:08 No.107928023

File: Untitled.png (36 KB, 962x387)

36 KB PNG

>>107928014
Do you not have something like this in your text encoder folder? As far as I can tell, you just have two gemma models loaded into the dual clip loader.

Anonymous
01/21/26(Wed)02:30:50 No.107928028

Anonymous 01/21/26(Wed)02:30:50 No.107928028

File: 1741489675336664.png (183 KB, 590x449)

183 KB PNG

>>107928020
based self learner, don't depend on others to improve on your stuff

Anonymous
01/21/26(Wed)02:32:29 No.107928041

Anonymous 01/21/26(Wed)02:32:29 No.107928041

>>107928020
this so much. python and comfy are a great thing, we should be thankful for them existing. every problem is a user problem, just apply those 15000 bandaids to make it usable bro

Anonymous
01/21/26(Wed)02:32:40 No.107928044

Anonymous 01/21/26(Wed)02:32:40 No.107928044

>samefag ancient meme pass holder schizo

Anonymous
01/21/26(Wed)02:34:06 No.107928052

Anonymous 01/21/26(Wed)02:34:06 No.107928052

>>107928023
Oh, you're right. I was confused cause this screencap >>107927812 named it "other connector" and the string node in the template said nothing about it.

Anonymous
01/21/26(Wed)02:39:51 No.107928082

Anonymous 01/21/26(Wed)02:39:51 No.107928082

https://github.com/Comfy-Org/ComfyUI/commit/e755268e7b7843695f52b87595afcb09c1e9fd87
>Config for Qwen 3 0.6B model.
what model uses Qwen 3 0.6b as a text encoder?

Anonymous
01/21/26(Wed)02:40:33 No.107928085

Anonymous 01/21/26(Wed)02:40:33 No.107928085

>>107928082
Zit you buffoon

Anonymous
01/21/26(Wed)02:41:42 No.107928090

Anonymous 01/21/26(Wed)02:41:42 No.107928090

>>107928044
How many schizos we got up in this shit?

Anonymous
01/21/26(Wed)02:42:48 No.107928093

Anonymous 01/21/26(Wed)02:42:48 No.107928093

>>107928085
ZiT uses qwen 3 4b, are you retarded or something?

Anonymous
01/21/26(Wed)02:43:58 No.107928096

Anonymous 01/21/26(Wed)02:43:58 No.107928096

>>107928085
>>107928093
Everybody calm the fuck down!!

Anonymous
01/21/26(Wed)02:44:55 No.107928100

Anonymous 01/21/26(Wed)02:44:55 No.107928100

File: Flux2-Klein_00451_.png (674 KB, 1024x1024)

674 KB PNG

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.