/g/ - Technology
Discussion of Free and Open Source Diffusion Models

Prev: >>107861070

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107864569
if only the audio quality was a bit higher this could be really useful, didn't they say they were making an LTX 2.1 with better audio at some point?
>>
File: dehr_00004__.png (3.13 MB, 1448x1728)
mfw
>>
File: LTX-2_00007_.webm (3.91 MB, 704x960)
>>>/wsg/6072817
>>
File: Nekoyomi_110433_0553_.png (1.6 MB, 1071x1428)
>>
>>107864623
it can be very clear but singing is harder to clone than regular audio I guess, still works well in most cases:

https://files.catbox.moe/cozidd.mp4
>>
Blessed thread of frenship
>>
does ltx2 lose more detail over time than wan in i2v?
>>
>>107864660
reminds me of this
>>
>>107864658
if you have enough vram you can make a 30-second i2v video without loss
>>
File: NetaYumeV40_Output_12515.png (2.59 MB, 1536x1280)
>>107864184
>>
Z Image Illustrious when
>>
File: 1743229977650182.mp4 (3.57 MB, 1440x1080)
looks like you get less slopped results if you go for normal ltx2 + distill lora at strength < 1
>>
File: _zimg_00003.png (1.03 MB, 864x1152)
i am actively prompting qwen to hallucinate prompts and automatically gen them, what are the odds that i get v& by something it comes up with?
>>
>>107864688
you mean up the resolution?
>>
>>107864763
If only I could get my hands on a workflow that works and is not a complete mess...
Is there a workflow available for this video?
>>
It's up

https://civitai.com/models/933294
>>
https://github.com/Comfy-Org/ComfyUI/pull/11845#issuecomment-3752055641
>For gguf we will try not to break it but we are focusing on improving our own native quant system to make it better/faster than gguf.
delusional lol
>>
>>107864797
2 years late unc
>>
>>107864623
low res will have shit audio. You have to generate at around 1080p for good audio. The image and audio are connected.
>>
>>107864814
>The image and audio are connected.
really? that's dumb, the audio's quality should stand on its own and have nothing to do with the video
>>
>>107864823
Problem? Make your own model.
>>
File: 1763149559244816.png (187 KB, 274x356)
>>107864828
>Problem? Make your own model.
>>
>>107864808
Well one more reason to stop updating until absolutely necessary
>>
>>107864823
You can do an upscale pass, it helps the audio too. Not as good as natively, but it's something
>>
>>107864814
so what you're saying is i can hear in 4k
>>
File: silent.mp4 (3.83 MB, 1500x2048)
Tried to make her open the bottle with teeth :(
>>
>just upscale the audio
>>
File: 1761450192918469.png (31 KB, 1031x169)
https://github.com/Comfy-Org/ComfyUI/pull/11837

>2 days ago
>nobody told me
baka baka
>>
>>107864878
what models are nv anyways, only one i saw is gemma
>>
>>107864872
With the ltx latent upscaler lol, it does both
>>
>>107864881
wan 2.2, zit, all the good shit. just search huggingface
>>
>>107864889
thanks
>>
>>107864823
It kind of does make sense. Video quality and audio quality are tied together in videos. When you see some 240p video from 2006 you're not expecting to hear crisp FLAC audio.
>>
File: 1763290501869535.png (34 KB, 200x252)
>>107864898
you know what, that's a fair point
>>
>>107864889
>zit nvfp4 is 1/4 the size of bf16
big if quality is similar
>>
>>107864918
>quality is similar
not even close lol
>>
>>107864929
then how is this better than quants
>>
File: _zimg_00209.png (996 KB, 768x1024)
>>
Any link to an uncensored gemma3-12b fp8?
The default one is making my lewd gens into abominations.
>>
>>107864952
yeah
>>
>>107864878
oh shit thanks for the heads up
wan2.2 seems to be about as fast with fp8 vs nvfp4, but at a glance nvfp4 quality is better while being slightly slower.
>>
>>107864999
>quality is better
no it isnt
>>
>>107864985
what did he mean by this
>>
>>107864999
>at a glance nvfp4 quality is better
better quality than fp8? I really doubt that, hope I'm wrong though
>>
>>107864999
from here? https://huggingface.co/GitMylo/Wan_2.2_nvfp4/tree/main
>>
File: 1759347593798252.png (1.51 MB, 1024x1024)
will lodestone use the same training data he used for chroma? i hope not. you can tell it was filled shit slop like pic related
>>
>>107865016
ye
>>
i guess we going back to pony realism
>>
>>107864952
>le uncensored text encoder meme
When will this nonsense die? An "uncensored" text encoder doesn't help the diffusion model make lewd outputs. All it does is make it so the LLM doesn't refuse requests. I.e. if you actually GENERATE text autoregressively using the LLM it will do what you ask instead of saying "Sorry I can't help with that." The actual text embeddings of the words in your prompt, which is the thing the diffusion model conditions on, barely change with an uncensored text encoder. It's literal fucking snake oil that does nothing. If anything the slight shift in text embeddings would reduce quality.
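
if anyone wants to verify instead of arguing, here's a minimal sketch: embed the same prompt with the stock TE and an "uncensored" one and compare. the model ids are placeholders, point them at whatever pair you actually downloaded (e.g. stock vs abliterated qwen):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

STOCK = "Qwen/Qwen2.5-3B-Instruct"        # placeholder: stock text encoder
ABLIT = "someone/qwen2.5-3b-abliterated"  # placeholder: "uncensored" variant

PROMPT = "Hatsune Miku reclining naked on a beach lounge."

def embed(model_id):
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)
    with torch.no_grad():
        out = model(**tok(PROMPT, return_tensors="pt"), output_hidden_states=True)
    # last hidden state is roughly what the diffusion model conditions on
    return out.hidden_states[-1][0].float()

a, b = embed(STOCK), embed(ABLIT)
# per-token cosine similarity; values near 1.0 mean the conditioning barely moved
cos = torch.nn.functional.cosine_similarity(a, b, dim=-1)
print(f"mean: {cos.mean():.4f} min: {cos.min():.4f}")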
>>
So I take it GLM-Image was a flop?
>>
>>107865023
I don't know, but what is the problem if it's tagged correctly? are you having trouble prompting for realism?
>>
https://github.com/Rolandjg/LTX-2-video-extend-ComfyUI

this is amazing. extend any clip, clone voices + video. see example.

https://files.catbox.moe/cale33.mp4
>>
>>107865139
Buy an ad
>>
>>107865142
I didnt make it, i'm linking so anons can enjoy it cause it's fun.

the thread is a resource for qwen edit/wan/ltx/etc, so why not. would the dev be making fun of troons? they would lose their github.

https://files.catbox.moe/1bsuwa.mp4
>>
>>107864934
If you want a serious response: it's hardware accelerated, so it should run a lot faster than q4 (which needs to be dequantized before being run), provided you are on a 5000 series.
Nvfp4 is also not the same as standard fp4. Groups of 16 4-bit float values are scaled by an fp8 factor, and the whole tensor is globally scaled by an fp32 factor, to lower the deviation from the baseline. It's still 4 fucking bits though, so don't expect magic in terms of quality.
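
a rough numpy sketch of the idea if it helps intuition (simulation only, nothing to do with nvidia's actual kernels, and the per-group scales here stay fp32 instead of being cast to e4m3):

import numpy as np

E2M1 = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])  # positive fp4 (e2m1) magnitudes

def fake_nvfp4(x, group=16):
    shape = x.shape
    g = x.reshape(-1, group)
    gscale = max(np.abs(g).max() / 6.0, 1e-12)                 # global fp32 scale
    s = np.abs(g).max(axis=1, keepdims=True) / (6.0 * gscale)  # per-group scale (fp8 in the real format)
    s = np.maximum(s, 1e-12)
    y = g / (s * gscale)                                       # every group now fits in [-6, 6]
    q = np.sign(y) * E2M1[np.abs(np.abs(y)[..., None] - E2M1).argmin(-1)]  # snap to nearest fp4 value
    return (q * s * gscale).reshape(shape)

w = np.random.randn(4096).astype(np.float32)
print("mean abs round-trip error:", np.abs(w - fake_nvfp4(w)).mean())

set s to 1 and rerun to see how much the per-group scaling buys you over plain fp4.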
>>
>>107865063
Yes.
I am still on copium that one day we will get a kino local AR model.
>>
>>107865158
Thanks for explanation, is there a nvfp8?
>>
>>107865165
>AR model
qrd
>>
>>107865155
another gen with the detailer lora enabled:

https://files.catbox.moe/yjxn76.mp4
>>
>>107865168
Nope.
>>
File: 0.png (1.23 MB, 1408x640)
>>
File: ComfyUI_00546_.png (1.36 MB, 832x1216)
>>107865173
Qrd: The model iterates on your prompt before diffusing.
>>
also, try enabling the detailer lora in the extend workflow:

https://files.catbox.moe/ipa17z.mp4
>>
>>107865209
When are you planning on making it
>>
what's the difference between wan 2.1 and 2.2?
I had a good 2.1 setup and i'm trying to get it to run 2.2 but i'm encountering errors... looking for a good workflow
>>
>>107865373
jerk off and go to bed bro
>>
>>107865353
Right after Z-Image Base drops.
Should take around two more weeks.
>>
>>107865378
I just want to compare i2v with 2.1 and 2.2
>>
kek

https://files.catbox.moe/1h3ja0.mp4
>>
>>107864886
It also upscales the sound with the spatial upscaler??
>>
>>107865373
Wan 2.2 has higher quality.
Wan 2.2 is a moe that uses separate models for high timestep and low timestep denoising. It eats more system resources, particularly ram as a result.
>but i'm encountering errors
Can't help without seeing the WF or errors
>looking for a good workflow
Have you tried the default Cumfart template?
>>
any of yall know a method to turn anime images into realistic / pseudo-realistic ones? like a qwen lora or something
>>
>>107865425
Technically speaking no, but you are rerunning both latents through the sampler at a bigger res so it kinda ends up having the same effect.
>>
>>107865441
OK thanks I will try it then.
>>
in theory, you can make an entire anime episode with linked + extended LTX2 gens.

https://files.catbox.moe/brgchb.mp4
>>
File: file.png (26 KB, 336x421)
>>107865427
>Have you tried the default Cumfart template?
I tried the default from one of the headers with the anime girl picking up a gun.

Getting this:
KSamplerAdvanced
Given groups=1, weight of size [5120, 36, 1, 2, 2], expected input[1, 32, 21, 96, 96] to have 36 channels, but got 32 channels instead

see picrel.

All i did was load the json and realign the diffusion models and vae + Clip.
>>
File: 1930.jpg (40 KB, 640x576)
gemma 3 is truly stupid. many failed gens, and with periods that weren't even requested... even hunyuan wasn't this stupid
>>
>>107865448
Are you using the correct vae? 14B 2.2 needs the 2.1 vae. Only 5B 2.2 uses the 2.2 vae (you shouldn't use the 5B one anyway)
>>
>>107865457
What were your prompts?
>>
>>107864952
no it's not, that's not how any of this shit works at all
>>
>>107865063
it's not really a 1girl T2I sort of model
>>
>>107865491
What kind of model is it?
>>
>>107865479
something like
>A medieval knight in full plate armor stands in a castle entrance. The armor is intricately crafted with detailed engravings and a polished silver finish. He holds a longsword in one hand and a kite shield in the other.
>>
>>107865474
ugh, gimme a min, apparently I haven't upgraded comfyui in a while and now everything is broken... gotta learn to disable that upgrade shit...
>>
I fucking kneel LTX, extended a frieren clip

subbed is best, but this is still better than funimation:

https://files.catbox.moe/4u9bah.mp4
>>
>he pulled
>>
the fennec tranny strikes again
>>
>>107865513
i sure hope you are using at least Q6 of 27b gemma
>>
>>107865535
>invents new eyebrows
>face becomes fucked up
this model had like no cartoons in the dataset lol
>>
File: file.png (1.15 MB, 2000x1000)
>>107865543
>his model had like no cartoons in the dataset lol
>>
>>107865474
nah i used the right ones.
Can you point me towards a basic workflow for wan 2.2? I'll get the resources/nodes as needed. I see tons of t2v but almost no i2v :(
you mentioned cumfart, where is that?
>>
>>107865554
benchod
>>
>>107865550
fill me in, xir
>>
>>107865543
worked fine on this one

https://files.catbox.moe/d3gnpk.mp4
>>
>>107865573
https://old.reddit.com/r/StableDiffusion/comments/1q9ao8t/ltx2_weird_result/
>>
>>107865542
try it and show me the result
>>
>>107865579
sl

>>107865586
op
>>
>>107865578
Dataset full of garbage and bad captions, evidently
>>
frieren but with guns:

https://files.catbox.moe/x0l3eu.mp4
>>
>>107865601
Ok this one was funny lol
>>
>>107865579
what lora?
>>
Any examples of the upscaler workflow for ltxv2?
I'm using the i2v workflow from kijai and it doesn't seem to use any upscaler so I have to gen with the full hd pics if I want hd stuff so I can't do more than 121 frames with a 5090 because the vae just shits itself and never goes to the video compilation.
>>
>>107865623
Use the comfy workflow for inspiration; once I pulled everything out of the subgraph it made sense after checking.
>>
yakuza, ltx extend version

https://files.catbox.moe/6nw4t4.mp4
>>
File: gun no jutsu.gif (3.84 MB, 600x338)
>>107865601
>>
>getting memory spikes from just image batching to convert to video
>fuck it just use ffmpeg
>works like a charm and barely even touches my RAM
Reminder to stop relying on ComfyUI for everything. It's a bloated mess that does things sloppier than more efficient programs already do.
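
for anyone wanting to do the same, a minimal sketch of the ffmpeg route (frame pattern, fps and codec settings are assumptions, match them to whatever your save node writes):

import subprocess

subprocess.run([
    "ffmpeg", "-y",
    "-framerate", "24",             # match the fps you genned at
    "-i", "frames/frame_%05d.png",  # assumed filename pattern of the dumped frames
    "-c:v", "libvpx-vp9", "-crf", "32", "-b:v", "0",  # constant-quality vp9 webm
    "out.webm",
], check=True)

ffmpeg streams the frames one at a time, so ram stays flat no matter how long the clip is.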
>>
>>107865689
>zoomers in action
>>
holy shit, AI can hallucinate some interesting stuff with a basic prompt.

the laser beams cause the red tentacles to explode into fire and black smoke.

allah fern-bar!

https://files.catbox.moe/esajck.mp4
>>
File: file.png (1.61 MB, 1296x800)
I made this btw
>>
>>107865612
Egon Schiele
ill release it soon
>>
>>107865724
like ugly on shitele
>>
>>107865724
nice gen
>>
>>107865741
thanks
>>
AI Toolkit now supports LTX2 training at 512 res, 5 secs and audio on 24GB cards
>>
Is there a vid2vid workflow for ltx2? Would it work to just use a typical i2v workflow but use an input video and specify the number of output frames to be the same as the number of frames in the input video?
>>
frieren s2 early leak:

https://files.catbox.moe/hzng9z.mp4
>>
>>107865778
use this one, ive been making the extend edits with it, if you want a full duration edit just lower the frame load cap on the video to 9 or whatever.

https://github.com/Rolandjg/LTX-2-video-extend-ComfyUI
>>
>>107865724
you got links to other loras?
>>
>>107865780
bro where do you get your short anime videos?
>>
File: file.png (230 KB, 951x1143)
>>
>>107865791
i only got illustrious loras on display atm
im not ganna shill my profile, you're just going to have to find me in the wild
>>
File: 00266-1397496870.png (698 KB, 768x1024)
just installed forge neo so I can finally try ZIT

I thought 16gb vram was enough for q8 but it will only run with neveroom extension active
>>
>>107865780
also, in a very interesting output, I got Japanese without asking for Japanese.

https://files.catbox.moe/xeq8iu.mp4
>>
File: 1_.webm (1.07 MB, 640x960)
>>107865271
>>
>>107865797
I just recorded a clip with nvidia shadowplay (alt z) from my frieren s1 folder.
>>
>>107865817
It's gibberish, sounds like how a japanese va would though
>>
>>107865824
>recorded a clip with nvidia shadowplay
now this is some lobotomite tech lmao
>>
>>107865829
I just wanted a fast clip, otherwise i'd use adobe premiere (torrented) and cut out a clip.
>>
>>107865829
I just use snip....
>>
>>107865842
ima snip your balls
>>
>>107865824
nigga please https://github.com/mifi/lossless-cut
>>
has anyone remade one punch man s3 yet
>>
holy kino

the purple hair anime girl takes out a black pistol and points it at the camera and says "demon faggot you're going to die.". she fires the gun several times.

https://files.catbox.moe/0uekvw.mp4
>>
File: 9_.webm (1.07 MB, 512x704)
>>107865807
>>
>>107865850
unironic improvement
>>
>>107865849
it takes like 2 seconds to export a clip from a movie btw. it doesnt re-encode like normal bloatware usually would
>>
>>107865842
oh right snipping tool can do video too, thx anon

loading premiere for a quick meme isn't optimal desu
>>
>only reason we don't have actual porn video models is because of muttism poisoning the world
grim
>>
>>107865803
Can you share training settings?
>>
>>107865862
sora 2 can do porn
>>
>>107865850
imagine being able to train a lora on season 1 and fixing the rest...
>>
>>107865869
local model anon, come on
>>
>>107865851
two guns:

https://files.catbox.moe/frd2i1.mp4

this is fun, also with enough frames you can clone voices. it's pretty cool. need to try alex jones next.
>>
>>107865875
all you need is wan 2.2 and a lora. not enough for you?
>>
>>107865554
Click the templates button. Pick wan 2.2 on models list. Then click on the i2v workflow.
And cumfart is comfyui, saar.
>>
>>107865869
I know grok can/could but I doubt this. Any examples?
>>
>>107865880
you know that's not the same
>>
>>107865885
>Any examples?
https://files.catbox.moe/3jdl4m.mp4
>>
sometimes leddit brings the bants

https://www.reddit.com/r/StableDiffusion/comments/1qchwcg/ltx2_easy_all_in_one_workflow/
>>
>>107865893
lmaoooooooooooooooo
>>
>>107865893
This is impossible to make with any video model btw
>>
>>107865896
Stupid tourist, that's literally one of us
>>
>>107864863
wan is really bad at doing things you want it to do.. if it isn't the most generic thing imaginable, it just shits the bed
>>
>>107865905
Wan 2.6 handles this perfectly though?
>>
>>107865908
don't recall asking
>>
>take 1-2 seconds of video clip
>use it to clone voice and character
>make 10s+ gen
>clip the original 1-2 seconds off
can make literally anything with audio cloning this way btw.
>>
>>107865912
proof? and no the slop you posted so far isn't it
>>
>>107865912
It's kinda funny the best voice cloner model is ltx. Man we need better tts models
>>
>>107865911
didn't*
>>
what is alibaba studio waiting for to release something? we already have ltx 2 and z loras
>>
>>107865920
retard im not trying to clone the frieren voices, go try it on any news broadcast and you'll see
>>
>>107865922
VibeVoice could've spawned some incredible finetunes had microsoft not pussied out and rugged the training code
>>
>>107865924
you deleted before it even loaded, fuck you
>>
>>107865925
esl
>>
File: x_3cqnyf.png (1.62 MB, 1536x1024)
>>
>>107865938
edible?
>>
>>107865920
nta here is a good example
https://github.com/Rolandjg/LTX-2-video-extend-ComfyUI
>>
>>107865942
yta though
>>
>>107865920
here, kneel to LTX, BBC news report. All I have to do to make it seamless is boost the low audio on their shitty youtube video. or do it in post. accent and everything is the same.

https://files.catbox.moe/fwu81t.mp4
>>
>>107865924
>but for z-image i resize it to by a factor of 3/4.
So you are training at 768p.
I assume you tried and failed with 1024p before? I didn't have the best time with it.
>>107865933
NTA but https://desu-usergeneratedcontent.xyz/g/image/1768/45/1768451789988.png
>>
>>107865948
Shucks, thanks love you too
>>
>LTX is ba-ACK
https://i.4cdn.org/wsg/1768452199582496.webm
>>
>>107865951
part 2, hahaha

I cant trust anything I see now, if local is this good. to fix the transition I just have to adjust the frame number (so it is right after a word)

https://files.catbox.moe/kfixll.mp4
>>
>>107865898
Nah, it's Sora. You've seen the YTPs right?
>>
File: miku3.jpg (838 KB, 1751x1151)
using that lipsync workflow from reddit
>>>/wsg/6072959
>>
>>107865977
Nope
>>
Spooknik is MIA for more than a month now.
I think Chromachaku might be dead:(
>>
>>107865981
link pretty please?
>>
>>107865993
https://old.reddit.com/r/StableDiffusion/comments/1qcc81m/ltx2_audio_synced_to_added_mp3_i2v_6_examples_3/
>>
what can 96gb vram do but not 24gb vram?
>>
>>107865992
Oh fuck didn't see this recent discussion:
https://huggingface.co/spooknik/Chroma-HD-SVDQ/discussions/6
Yeah, it's over.
>>
>>107866002
thanks
>>
imagine how many people you can trick with these workflows, whether it's t2v, i2v, or v2v extension. also, the creative applications of it.

seamless.

https://files.catbox.moe/23tjt6.mp4
>>
>>107866003
Hunyuan 80B
Flux 2 (faster and less quantized)
Full finetune of larger models
FP32 of mid sized model
Keep a lot of crap in the VRAM instead of unloading and reloading. (Wan 2.2 for example)
Low denoising upscales of very large images
Not saying all of these are worth it.
>>
>>107866021
What can 32gb vram do that 24gb vram can't?
>>
>>107865952
no i just don't know why z-image prefers smaller images, considering it can go up to 2048
>>
>>107866009
the great thing about the model is it can clone voices too, so you dont even need an external app to do it, and it works with the video.

https://files.catbox.moe/nf8msj.mp4
>>
>>107865981
did you gen that miku? if so which model?
>>
I can't be the only one noticing what is going on.
>>
has anyone checked the fights or something for ltx?
>>
>>107866039
Tell us
>>
>>107866035
zit
>>
>>107866028
I think you are supposed to min-max timestep distribution to be able to train at higher resolutions but:
a) The precise knowledge seems to be gatekept at a few discord channels now
b) I don't care enough to run multiple tests to figure out myself in the current distilled version. If it's still an issue with the base, I will take a look again.
>>
>>107866021
Almost none of those are worth it lol, maybe faster wan load but ehh
>>107866043
>fights
as in can it do fight scenes? It's meh at it unless you run it on 50fps
>>
last one. only 33 input frames from the video:

https://files.catbox.moe/88crej.mp4
>>
>>107865992
>>107866005
chroma or wan chaku was never meant to be, ive accepted this...
>>
finally, via AI we can make Jensen honest:

https://files.catbox.moe/hml4zw.mp4
>>
>>107866039
shilling?
>>
File: its completely over.png (916 KB, 1024x1024)
>>107866060
I can live without chroma but no wanchaku hurts.
>>
>>107866101
im not shilling, im having fun cause those wan cocksuckers made 2.5 API only and now I have a free model with sound more capable than their model.

hope they choke to death on their shekels. enjoy failing like stability AI, niggers.
>>
CES 2026 continued:

https://files.catbox.moe/hae03w.mp4
>>
>>107865050
>>107865481
You know that you can just test it with the same seed and same prompt?
Z-image normal and abliterated qwen:
>Hatsune Miku reclining naked on a beach lounge.
https://files.catbox.moe/fcfoqa.jpg
Obviously, it doesn't improve generation of nipples or vagene because the model had seen no such images in training, but abliterated TE makes the model follow nsfw prompts more easily. It won't add skin-colored clothes when asked to do nudity.
>>
>>107866125
idk but he sounded ominous
>>
>>107866137
>michael jordan feet
anyone got clorox for my eyes?
>>
File: 1758610020136112.png (3.47 MB, 1880x1248)
>>
>>107866232
Needs to be more grungy and blurry, too many pixels
>>
>ltx2
>wan + freelong
which one for 20 sec video?
>>
>>107866264
test this out for me pls https://www.reddit.com/r/comfyui/comments/1q61gfd/update_wan_svi_infinite_legth_video_now_with/
>>
>>107866269
does it support keyframes? if not then freelong is better
>>
is there any way that ltx can do women in panties walking around without creating body horror skin mutations?
>>
>been watching a dude on youtube making workflows with his autism
>he now uploads blurred porn and spouts rumors to get views

Sad.
>>
>>107866336
>>been watching a dude on youtube making workflows
There's your problem.
>>
>>107866336
>>107866342
>watching ... youtube
No that's the problem.
>>
tranny ass topaz software queue disappears just like in comfyui after a crash and its unstable on its own already
>>
>>>/wsg/6072537

I can't be the only one who thinks it's fucking insane that you can now 1 shot a 40 second video on a single 3090 in around 5 minutes and there's barely any fuckery.
>>
>>107866556
you're not alone, ltx has brought a lot of good shit to the table, that and the fact you can extend a video with great accuracy is a huge deal too >>>/wsg/6072806
>>
https://www.scmp.com/tech/tech-war/article/3339869/zhipu-ai-breaks-us-chip-reliance-first-major-model-trained-huawei-stack
>omg guyz we made GLM-image without having to use Nvdia cards!!!
who cares? that model sucks anyway lool
>>
>>107866556
>say you purposefully brought a lewd movie for the family to watch instead of saying it was an accident
chinese culture is so interesting
>>
>>107866577
>we made GLM-image without having to use Nvdia cards
This isn't the win they think it is with those results. I'll happily buy a chinese card once they're proven to be good.
>>
It's been like 4 days of civitai doubleposting uploads. Their jeetcoding is breaking apart and they can't fix it, lol.
>>
My life became better when I stopped giving a fuck about jeetit and whatever BS is going on there.
>>
facial physiognomy diversity of qwen image 2512 is a great improvement over the previous model and way better than ZIT.
>>
File: yrwj.gif (63 KB, 595x696)
This is an extreme and urgent request. I am in desperate need of an extremely meticulous and accurate AI image editor model which won't moralfag me. No this is not for NSFW purposes. will be using it to edit a document text. All the best ones are not allowing me to edit it. It can be local as well. Please help me
>>
File: 1750834009107281.png (179 KB, 517x266)
>>107866623
>>
>>107866632
wrong thread senpai
>>
>>107866556
for me the most insane thing is that we finally have a local video model that doesn't pretend it's in space (looking at you Wan 2.2)
>>
>>107866637
Actually it's the right thread, but random people who show up desperately looking for ways to edit like this give me the ick.
>>
>>107866637
an anon on lmg redirected me here. No idea where else to go. Reddit is a no go for obv reasons. Would still like any recommendations though
>>
>>107866632
They can moralfag less on the API sometimes.
If you want to do local not too many options besides
Flux Kontext
Qwen Image Edit
>>
>>107866632
>moralfag
>edit a document text
it's not just about morality, it's illegal lol
>>
>>107866632
>he actually asked ChatGPT or something to edit documents
>it's now recorded on their servers if he gets caught
Good luck newfag
>>
File: ComfyUI_00019_.png (1.76 MB, 1400x800)
>>
File: 1766481611633501.png (2.44 MB, 1216x1248)
>>
File: 1743945021768171.png (1.91 MB, 1248x1216)
>>
File: zitlora.jpg (690 KB, 1344x1728)
first ever zit image on new ai pc, lets go

Gonna train zit sometime but my brain is fried just setting up
>>
>>107866705
Clannad was such a depressing anime, don't watch that when you're in a sad mood or you'll ACK- yourself
>>
how do you make a single .safetensors for a text encoder? I wanted to use some ablit/heretic/mpoa encoders to test stuff out and wanted to cook my own.
>>
>>107866717
>Gonna train zit sometime
don't, I spent a lot of time downloading loras for ZiT and they all suck, this model is just too distilled to be trained with
>>
https://huggingface.co/lodestones/Zeta-Chroma/blob/main/zeta-chroma-x0-pixel-proto.safetensors
Has anyone tried that one?
>>
>>107866733
Couldn't hurt desu, gonna do it anyway; at worst I just made a dataset
>>
>>107866730
I dunno but you can try ggufs. Search gguf quants of the abliterated TEs. Major quantizers like bartowski and Unsloth also publish bf16 ggufs.
Note, since this is an unusual use case and gguf implementation for diffusion is overall not in a good state, there might be some bugs or regressions with this.
>>
>>107866751
It has been training for only two weeks.
Don't see the point in bothering with it.
If you must, go to lodestone's shitcord and ask for inference code; it's not supported by anything yet.
>>
>>107866717
>>107866754
Train at 768 or 512. 1024 is more difficult for reasons.
Even the best ZiT loras still break the anatomy and text a bit.
>>
>>107866717
>>107866754
Loras work great with Z. Just use the distilled as base when you train.
>>
File: ComfyUI_00020_.png (1.94 MB, 1400x800)
>>107866718
Just heal your soul with Tomoyo After's rimjob scenes. https://arch.b4k.dev/vg/thread/545288714/#545353981
>>
>>107866791
>>107866792
thanks thanks thanks
>>
>>107866804
wtf?
>>
>prompt character on chroma
>despite using a strong lora, the model completely shifts to the ugliest cartoon artstyle known to man
is there a way to mitigate this shit
>>
>>107866855
By not using Chroma. Anything remotely usable you see from Chroma is a 1 in 100 cherry picked literal unicorn image that in no way represents the absolute shit that model usually spits out.

You are better off just using SDXL.
>>
>>107866860
truth nuke
>>
>>107866767
I know, but I wanted to use comfy's native model loading instead of the gguf custom nodes.
>>
>>107866855
On top of the many faults of Chroma commonly discussed here, lack of style control is another one. I have seen detailed photographic prompts randomly switch to illustration styles across seeds.
This anon's right >>107866860
Z-Base will save us soon.
>>
>>107866904
>Z-Base will save us soon.
Someone's been skipping their Chinese culture lessons.
>>
>>107866882
Try your luck with this random crap I found on github:
https://github.com/soursilver/safetensors-merger
Also you may want to convert it to fp16/bf16 later on, the models are likely in fp32.
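if it does turn out to be fp32, the downcast itself is roughly a one-liner; a sketch (paths are placeholders):

import torch
from safetensors.torch import load_file, save_file

sd = load_file("text_encoder_fp32.safetensors")  # placeholder path
# downcast only float tensors, leave ints/bools alone
sd = {k: v.to(torch.bfloat16) if v.is_floating_point() else v for k, v in sd.items()}
save_file(sd, "text_encoder_bf16.safetensors")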
>>
File: 1747874949990820.png (140 KB, 498x281)
>>107866904
>Z-Base will save us soon.
lol
>>
We're getting wan 2.5, I haven't seen anything from it at all. It was dead as fuck, wasn't it?
>>
>>107866945
They said it was too big.
>>
>>107866945
anime chinese man said it was too big to run; it's probably a 40b model no one will ever run (like step video, which was a 30+b model)
>>
>>107866945
>We're getting wan 2.5
Sorry. You didn't say please enough.
>>
>>107866929
I was thinking of something like this:
import torch
from transformers import AutoModelForCausalLM
from safetensors.torch import save_file

MODEL_ID = "YanLabs/gemma-3-4b-it-abliterated-normpreserve"
OUT_FILE = "gemma-3-4b-it-abliterated-text-encoder.safetensors"

# Load model
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float16,
    device_map="cpu",  # safest option
)

state_dict = model.state_dict()

# Save as single safetensors file
save_file(state_dict, OUT_FILE)

print(f"Saved {OUT_FILE}")

actually ill just try this
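
one caveat worth noting: save_file refuses tied/shared tensors, and gemma ties the lm head to the token embeddings, so if it throws, the save_model helper dedupes them for you; a sketch, same assumptions as above:

from safetensors.torch import save_model
save_model(model, OUT_FILE)  # handles tied/shared weights that save_file rejects

you may still need to rename keys to whatever comfy's loader expects, so diff against the stock TE file first.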
>>
What's that one chinese twitter account people swear by for leaks in here?
>>
>>107867003
bdsqlsz
>>
File: ng2.png (1.64 MB, 856x1216)
>>LTX is ba-ACK
>>
>>107867003
this dude
https://xcancel.com/bdsqlsz/status/2009520301156258171#m
he's even allowed into Alibaba's conferences and shit
>>
>>107866976
>>107866979
How much is 40b? Like 80-90gb? Offloading is still possible.
>>
>>107867052
>Offloading is still possible.
The bargaining phase is over, anon. It's not coming.
>>
>>107867042
he's the omar of diffusion
fuck omar
>>
>>107867052
basically on fp8, the number of parameters and the size in gigabytes are the same, so 40b = 40gb on fp8
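back-of-envelope: fp8 is 1 byte per weight, so 40e9 params = 40e9 bytes ≈ 40 GB for weights alone; bf16 is 2 bytes per weight, so ~80 GB, and that's before activations, the text encoder and the VAE.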
>>
>>107866937
>does
>doe-Z
Z-base confirmed
>>
why are people still using chroma?
use case?
>>
File: 1737350488259046.png (1.58 MB, 1024x1472)
>>
>>107867184
>use case?
Satisfying emotional debt caused by months of sunk cost in believing the next epoch would finally fix its deep and fundamental flaws that make the model literally unusable.
>>
>>107867184
>why are people still using chroma?
it's the only model that can make realistic images and NSFW at the same time, and people are willing to get through 99 straight images filled with anatomy atrocities if they can get 1 good goon image out of it, many such cases
>>
>>107867203
man it was 1 cope after the other
>actually chroma was very soul at Revision #
>no wait #48 is the actual soulful one
>#50 (HD) is bad but wait!
>there's also the flash heun model its good (its not)
>and there's the HD flash merge too!!!! lol!! I swear this time it converges good!!!!
>but you know whats really bad? its not the unfinished training... its just the.. UGH VAE!!!
>yeah lets train a new vaeless chroma LMAO, RADIANCE!
>*retard spams the general for weeks with his absolutely melty/cooked gens*
>uhmmm no radiance is good !!!!
>but WAIT, ackshually radiance can be fixed with this x0 version
>ehh but you know what? we're moving onto z-image... what? waiting for base? lmao!!!! we're training on a distill just like we did for normal chroma!!!
what a shitshow
>>
>>107867228
>>but you know whats really bad? its not the unfinished training... its just the.. UGH VAE!!!
I have to admit that's me :( I really thought Flux's VAE wasn't that good, and then Z-image turbo showed me that it's actually an incredible VAE
>>
>>107867228
Well at least it proved de-distillation is a waste of time, so something at least, costly lesson though...
>>
i'm not sure what's going on here. i haven't used pinokio in 6 months because i had issues installing wan2gp over there, so i did a normal standalone install. i recently opened pinokio to use the Joy_Caption_Alpha-Two_GUI app, accepted its install requirements, finished captioning some images and then closed the application. now i'm trying to use wan2gp and i'm running into some roadblock again. how do i resolve this?
>>
>>107867209
>realistic image
but it's not
>>
File: 1743998107645331.png (157 KB, 498x430)
>>107867228
you have to admit it was pure entertainment though
>>
>>107867238
That furry fag didn't learn that lesson since he wants to save Z-image turbo and that model is way more distilled than Flux Schnell
>>
File: mlady.jpg (323 KB, 948x1264)
>>107867184
>use case?
Fun to use, gives me the results I want
>>
>>107867228
>waiting for base? lmao!!!!
to be fair, we've been waiting for base for too long I can understand he wants to move on
>>
>>107867184
qwen image already mogged chroma so idk
>>
>>107867264
It's only been like a month and a half lol. Not enough to open the wallet on a stupid mission imo
>>
>>107867269
good joke, qwen image is still plastic and can't do NSFW out of the box
>>
File: aaaaaaaaaa.png (85 KB, 225x225)
>>107867275
>It's only been like a month and a half lol.
THEY PROMISED TO RELEASE IT "NEXT WEEK", NOT NEXT DECADE
>>
File: end my suffering.png (171 KB, 736x736)
>>107867275
>It's only been like a month and a half
and counting...
>>
>>107867275
People think I'm joking when I talk about Chinese culture. If you even tried to read between the lines, you'd understand like I do that we're not getting it.
>>
File: 1755329637196283.png (67 KB, 1613x259)
>>107867299
B-but (Corporate Hegemony™) said "Patience will be rewarded"!!1!1!!1
>>
>>107867306
>Patience will be rewarded
They never said with what.
>>
>>107867184
I use chroma for base image, and then zit for detailer
>>
>>107867239
can someone help me please?
>>
>>107867306
Patience, Colorado.
>>
New thread

>>107867304
>>107867304
>>107867304
>>
>>107867325
Interesting. You got any examples?
>>
>>107867235
>>107867228
I don't get this retard, why can't he just take the dedistilled ZiT and train it? What's his problem?
>>
>>107867340
not sharing, rajesh


