Grid edition
Discussion of Free and Open Source Diffusion Models

Prev: >>6067150

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
Are jump cuts just a prompt issue? Sometimes ltx just randomly ignores the input image altogether
>>6069556
I don't think it's that simple, but the prompt does matter. I noticed that if I dumb down my prompt, sometimes I can make slideshows work properly
with reference audio
>>6069565
real?
what the fuck
This also fixes powerpoint
https://www.reddit.com/r/StableDiffusion/comments/1q94nlk/wow_i_accidentally_discovered_that_the_native/
>>6069639
I ran into the slideshow issue too and honestly just wrote more in the prompt. I described specific movements and other animated details and that fixed it
>>6069549
>dat sound
kek. im sure you could feed the bakery userscript into your favorite LOCAL llm and have it enable audio, so that it looks nicer
>>6069639
This makes wan still useful as a starting concept and motion tool. And it works very well, since ltx can detail whatever you feed in, so you only need some low Q4 (or maybe even lower) wan 2.2 models. Quickly take a z image gen of your subjects on a white background, then feed it into the wan SVI section of the workflow and use a cut-scene prompt to place it over the background you need, after using the resize-and-pad-with-white comfyui native node (or any other node that lets you place all the subjects into the correct video resolution).

"The scene immediately cuts to the ufo hovering above a pine forest in twilight. no text appears in the video."

It will fill the white space for you in around 30-40 frames. Then the second set of wan 2.2 svi samplers:

"A ufo hovers above a pine forest in twilight, a single dim red light flashes once every 1 seconds, an extremely narrow ray of light or energy beam fires instantly down forming a narrow ray beam for 3 seconds as the ufo begins to move away from the camera very slowly forwards creating a slight disturbance and bending of the light around it. mysterious, spooky, ufo sighting, video from mobile phone. no text appears in the video."

Then use that second wan gen as the basis for Ltx2 to guide its initial motion and concept.
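if you don't have a pad node handy, the pad-with-white step is easy to replicate outside comfy. a minimal pillow sketch, assuming a 1280x720 target; the filenames and resolution are placeholders, not from any actual workflow in the thread:

```python
# Center a z image gen on a white canvas at the target video resolution.
# Hypothetical filenames/target; adjust to your own gen and workflow.
from PIL import Image

def pad_to_resolution(src, dst, target=(1280, 720)):
    img = Image.open(src).convert("RGB")
    # Downscale so the subject fits inside the frame, keeping aspect ratio.
    scale = min(target[0] / img.width, target[1] / img.height)
    resized = img.resize((int(img.width * scale), int(img.height * scale)))
    # Paste centered on white; the wan cut-scene prompt fills in the rest.
    canvas = Image.new("RGB", target, (255, 255, 255))
    canvas.paste(resized, ((target[0] - resized.width) // 2,
                           (target[1] - resized.height) // 2))
    canvas.save(dst)

pad_to_resolution("zimage_subjects.png", "padded_input.png")
```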
basically we have everything now to create a full studio for film creation.
>>6069769
Transition from z image to video
>>6069777
continue to create the start video for ltx
>>6069779
Audio is bad, i know why: it's only 720x720. it was done at 30 steps though, only a 1 stage sampler. I need to rent a server.
I'm happy with this new topic on this board, but I hope it doesn't flood the catalog, seeing how fast the previous thread reached the bump limit.
>>6069812
your faith in ltx is awe-inspiring
I've been trying to make it gen audio for NSFW Wan vids but results have been lackluster so far. I like how it can properly sync up a sound with each penis thrust, however.
>>6069821
you could try using a real video with real audio and appending your silent wan video to the end, so it uses the real video as a reference.
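something like this gets you the combined input file. a minimal sketch, assuming moviepy 1.x; the filenames are placeholders:

```python
# Prepend a real clip (with audio) to a silent wan gen so ltx has a real
# soundtrack to use as reference. Filenames here are hypothetical.
from moviepy.editor import VideoFileClip, concatenate_videoclips

ref = VideoFileClip("real_with_audio.mp4")   # reference video + real audio
silent = VideoFileClip("wan_silent.mp4")     # your silent wan output
# method="compose" handles mismatched resolutions between the two clips.
combined = concatenate_videoclips([ref, silent], method="compose")
combined.write_videofile("ltx_audio_reference.mp4")
```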
not lewd
>>6069860
Not a bad idea. Will test it out later. This model is becoming more useful than I thought it would be
>>6069821
skill issue, coomer brain. think outside the box. it is not trained on porn, so think about what sounds similar: slapping sounds, sucking sounds, and what creates them. it can do it, but not perfectly.
>>6069821
Unironically should be something that's easy to train into the model.
thanks for the blur, it lets me know that negative prompts don't do shit
>>6069893
I guess it's orc night
Anyone ever get black outputs on longer videos?
>>6069987
Are you using a distilled model or a distilled lora? They're run at cfg 1, which ignores the negative prompt altogether
>>6070010
nvm, figured it out. It was sage attention
>>6070026
Absolute slopKINO
how can i increase the audio resolution?
>>6070150
I noticed both the audio and video quality increase when you double the framerate to 48 fps
Pythagoras
Anyone have a workflow to add audio to a video?
>>6070210
https://files.catbox.moe/f9fvjr.json
>>6070211
thanks king
>>6069551
eek!
>>6070212
np. you will need kijai's ComfyUI-MelBandRoFormer node, but i'm not sure how much it's actually needed to isolate the vocals.
for example, the pythagoras + chop suey video above: i didn't use it, because when you isolate the vocals it generates much less movement, since the heavy metal backdrop is gone. but if you don't isolate, you get slightly worse lip sync
>>6069549
please give the webm of that elf
>>6070212
ah, i think i misread you; the workflow i posted is to add video to audio. to add audio to video it's this one, but i haven't tried it:
https://pastebin.com/4w4g3fQE
>>6070219
this real?
>always the same images for i2v
even my memes folder has more images than here...
>>6070224
gib image
>>6070217
used the wrong prompt :(
Does anyone have a video-to-video workflow that extends the video and maintains the same quality as the input video? Every setting I've tried just results in the original video looking degraded compared to the text-to-video results, which actually look nice. The spatial upscaler seems to do a lot of the work making text-to-video look nice.
>>6070250
LTX is probably too new for such a thing, but for wan 2.2 there's Stable Video Infinity:
https://github.com/vita-epfl/Stable-Video-Infinity/tree/svi_wan22