/wsg/ - Worksafe GIF

File: grid.mp4 (5.61 MB, 1930x2048)
Grid edition

Discussion of Free and Open Source Diffusion Models

Prev: >>6067150

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
>>
File: LTX_2.0_i2v_00193_.mp4 (1.01 MB, 640x1024)
>>
Are jump cuts just a prompt issue? Sometimes LTX just randomly ignores the input image altogether.
>>
File: LTX_2.0_i2v_00198_.mp4 (1.02 MB, 1024x640)
>>6069556
I don't think it's that simple, but the prompt does matter. I noticed that if I dumb down my prompt, I can sometimes make slideshows work properly.
>>
File: LTX_2.0_i2v_00201_.mp4 (1.32 MB, 704x512)
>>
File: 1738875720792770.mp4 (1.45 MB, 1280x704)
with reference audio
>>
>>6069565
real?
>>
File: LTX_2.0_i2v_00196_.mp4 (1.08 MB, 704x896)
what the fuck
>>
File: 1737720237271889.mp4 (2.15 MB, 704x704)
>>
File: LTX-2_00057_.mp4 (1.98 MB, 832x448)
>>
This also fixes powerpoint
https://www.reddit.com/r/StableDiffusion/comments/1q94nlk/wow_i_accidentally_discovered_that_the_native/
>>
>>6069639
I ran into the slideshow issue and honestly just write more in the prompt. I described specific movements and other animated details and it got fixed
>>
>>6069549
>dat sound
kek
I'm sure you could feed the bakery userscript into your favorite LOCAL LLM and have it enable audio, so that it looks nicer.
>>
>>6069639
This keeps Wan useful as a starting concept and motion tool. It works very well because LTX can detail whatever you feed in, so you only need a low Q4 (or maybe even lower) Wan 2.2 quant to quickly take a ZIT (Z Image Turbo) image as input.

Gen subjects onto white backgrounds, then resize and pad them with white (the native ComfyUI node, or any other node that lets you place all the subjects into the correct video resolution). Feed the result into the Wan SVI section of the workflow and use a cut-scene prompt to place it over the background you require.

"The scene immediately cuts to the ufo hovering above a pine forest in twilight. no text appears in the video."

It will fill the white space for you in around 30-40 frames.
Then run a second set of Wan 2.2 SVI samplers:

"A ufo hovers above a pine forest in twilight, a single dim red light flashes once every 1 seconds, an extremely narrow ray of light or energy beam fires instantly down forming a narrow ray beam for 3 seconds as the ufo begins to move away from the camera very slowly forwards creating a slight disturbance and bending of the light around it. mysterious, spooky, ufo sighting, video from mobile phone. no text appears in the video."

Then use that second wan gen as the basis for Ltx2 to guide its initial motion and concept.
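
For the resize-and-pad step above, here is a rough sketch of the arithmetic only. It is not the code of the ComfyUI node; `fit_and_pad` is a hypothetical helper, and it assumes the subject is scaled to fit inside the target frame with the aspect ratio kept and then centered, so the leftover area is what gets filled with white.

```python
def fit_and_pad(src_w, src_h, dst_w, dst_h):
    """Scale (src_w, src_h) to fit inside (dst_w, dst_h), keeping aspect
    ratio, and return the scaled size plus the white padding per side.

    Hypothetical helper for illustration; centering is an assumption.
    """
    # Use the smaller scale factor so the image fits in both dimensions.
    scale = min(dst_w / src_w, dst_h / src_h)
    new_w = min(dst_w, max(1, round(src_w * scale)))
    new_h = min(dst_h, max(1, round(src_h * scale)))

    # Split the leftover space between the two sides; any odd pixel
    # goes to the right/bottom edge.
    left = (dst_w - new_w) // 2
    top = (dst_h - new_h) // 2
    return {
        "size": (new_w, new_h),
        "left": left,
        "right": dst_w - new_w - left,
        "top": top,
        "bottom": dst_h - new_h - top,
    }


# A square 1024x1024 gen placed into a 1280x704 video frame.
square_in_wide = fit_and_pad(1024, 1024, 1280, 704)
# A 640x1024 portrait gen placed into a 720x720 frame.
portrait_in_square = fit_and_pad(640, 1024, 720, 720)
print(square_in_wide)
print(portrait_in_square)
```

The padded side bars are the white space the cut-scene prompt is then asked to fill.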
>>
basically we have everything now to create a full studio for film creation.
>>
File: start_00001.mp4 (582 KB, 720x720)
>>6069769
Transition from z image to video
>>
File: ufo_00001.mp4 (862 KB, 720x720)
>>6069777
continue to create the start video for ltx
>>
File: ufo_00002-audio.mp4 (1.45 MB, 736x736)
>>6069779
Audio is bad and I know why: it's only 720x720. It was done at 30 steps though, with only a 1-stage sampler.

I need to rent a server.
>>
I'm happy with this new topic on this board but I hope it doesn't flood the catalog seeing how fast the previous thread reached bump limit.
>>
>>6069812
your faith in ltx is awe inspiring
>>
I've been trying to make it gen audio for NSFW Wan vids but results have been lackluster so far. I like how it can properly sync up a sound with each penis thrust, however.
>>
>>6069821
you could try using a real video with real audio and append your silent wan video to the end so it uses the real video as reference.
>>
not lewd
>>
>>6069860
Not a bad idea. Will test it out later. This model is becoming more useful than I thought it would be
>>
File: LTX_2.0_i2v_00248_.mp4 (4.41 MB, 1280x704)
>>
>>6069821
skill issue coomer brain. think outside the box. it is not trained on porn so think about what sounds similar. slapping sounds, sucking sounds and what creates them.

it can do it but not perfectly.
>>
>>6069821
Unironically should be something that's easy to train into the model.
>>
thanks for blur, it lets me know that negative prompts don't do shit
>>
File: LTX-2_00040-audio.mp4 (2.54 MB, 864x1152)
>>6069893
I guess it's orc night
>>
Anyone ever get black outputs on longer videos?
>>
>>6069987
Are you using the distilled model or the distilled LoRA? They're run at CFG 1, which ignores the negative prompt altogether.
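
Why CFG 1 drops the negative prompt: with the standard classifier-free guidance formula the output is uncond + cfg * (cond - uncond), and at cfg = 1 the uncond (negative prompt) term cancels out. A toy sketch with made-up numbers standing in for model predictions; `cfg_combine` is a hypothetical name, not a function from any UI:

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance mix, element by element:
    uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


# Made-up values standing in for model predictions.
cond = [0.5, -1.0, 2.0]     # positive prompt
neg_a = [0.25, 0.75, -0.5]  # one negative prompt
neg_b = [8.0, -4.0, 5.5]    # a completely different negative prompt

# At scale 1 the negative prediction cancels: the result is just cond.
at_one_a = cfg_combine(cond, neg_a, 1.0)
at_one_b = cfg_combine(cond, neg_b, 1.0)

# At a higher scale the negative prompt changes the result.
at_three_a = cfg_combine(cond, neg_a, 3.0)
at_three_b = cfg_combine(cond, neg_b, 3.0)

print(at_one_a, at_one_b)
print(at_three_a, at_three_b)
```

Since the result at scale 1 does not depend on the negative prediction at all, a sampler can skip computing it, which is also why those runs are faster per step.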
>>
>>6070010
nvm figured it out. It was sage attention
>>
File: LTX_2.0_i2v_00285_.mp4 (3.67 MB, 576x384)
>>
>>6070026
Absolute slopKINO
>>
File: file.mp4 (4.29 MB, 480x832)
>>
File: 1750103387482981.webm (3.87 MB, 960x1280)
>>
how can i increase the audio resolution?
>>
File: LTX_2.0_i2v_00197_.mp4 (761 KB, 448x448)
>>
>>6070150
I noticed both the audio and video quality increase when you double the framerate to 48 fps.
>>
File: LTX-2_00005-audio.mp4 (1.64 MB, 800x416)
Pythagoras
>>
Does anyone have a workflow to add audio to a video?
>>
>>6070210
https://files.catbox.moe/f9fvjr.json
>>
>>6070211
thanks king
>>
>>6069551
eek!
>>
>>6070212
np. You will need kijai's ComfyUI-MelBandRoFormer node, but I'm not sure how much isolating the vocals is actually needed.
For example, for the Pythagoras + Chop Suey video above I didn't use it, because with the vocals isolated it generated much less movement since the heavy metal backdrop was gone. But if you don't isolate, you get slightly worse lip sync.
>>
>>6069549
please give the webm of that elf
>>
>>6070212
Ah, I think I misread you. The workflow I posted adds video to audio. To add audio to video it's this one, but I haven't tried it:
https://pastebin.com/4w4g3fQE
>>
File: LTX-2_00010-audio.mp4 (1.54 MB, 928x512)
>>
>>6070219
this real?
>>
>always the same images for i2v
even my memes folder has more images than here...
>>
>>6070224
gib image
>>
File: 0853_LTX2_00001-audio.mp4 (932 KB, 480x480)
>>6070217
>>
File: LTX_2.0_i2v_00208_.mp4 (1.2 MB, 704x960)
used the wrong prompt :(
>>
File: LTX-2_00023-audio.mp4 (3.75 MB, 928x512)
>>
File: LTX-2_00122_.mp4 (2.52 MB, 1216x768)
>>
Does anyone have a video-to-video workflow that extends the video and maintains the same quality as input video? Every setting I tried just results in the original video looking degraded in quality compared to the text-to-video results that actually look nice. The Spatial upscaler seems to do a lot of the work making text-to-video look nice.
>>
File: audio.mp4 (1.56 MB, 1848x360)
>>
>>6070250
LTX is probably too new for such a thing but for wan 2.2 there's Stable Video Infinity.
https://github.com/vita-epfl/Stable-Video-Infinity/tree/svi_wan22
>>
File: audio.mp4 (1.94 MB, 1848x360)
>>
File: LTX_2.0_i2v_00210_.mp4 (1.7 MB, 640x896)
>>
File: audio.mp4 (2.55 MB, 1536x360)
>>
File: LTX_2.0_i2v_00211_.mp4 (1.52 MB, 832x768)
>>
File: LTX_2.0_i2v_00214_.mp4 (1.56 MB, 960x512)


