[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/wsg/ - Worksafe GIF

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • Supported file types are: GIF, WEBM, MP4

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1739548272015919.mp4 (150 KB, 720x720)
150 KB
150 KB MP4
Miku Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>>/g/107791088

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
>>
wrong board
>>
BASED
>>
File: 1742250089513206.webm (2.02 MB, 1080x1920)
2.02 MB
2.02 MB WEBM
>>
File: 1758852517981157.mp4 (280 KB, 576x1024)
280 KB
280 KB MP4
>>
>>6067153
this is nicer and nobody will ask me to install gentoo
>>
File: LTX-2_00001-audio.mp4 (1.08 MB, 576x832)
1.08 MB
1.08 MB MP4
>>
File: LTX_2.0_i2v_00020_.mp4 (850 KB, 448x832)
850 KB
850 KB MP4
>>
File: LTX_2.0_i2v_00021_.mp4 (832 KB, 448x832)
832 KB
832 KB MP4
>>
>>6067724
lul
>>
File: 1760276029449595.mp4 (1.73 MB, 1184x800)
1.73 MB
1.73 MB MP4
I still don't get why /g/ doesn't let you upload video with sound, it's retarded
>>
File: LTX_2.0_i2v_00073_.mp4 (2.15 MB, 1280x736)
2.15 MB
2.15 MB MP4
>>
File: LTX_2.0_i2v_00027_.mp4 (1.84 MB, 448x832)
1.84 MB
1.84 MB MP4
>>6067725
>>
>>6067729
can you try and see if ltx 2 can do some ytp kino like sora 2
>>
>>6067731
>ytp kino
qrd
>>
>>6067732
>>6067731
I mean in the context of sora
>>
File: LTX_2.0_i2v_00032_.mp4 (1.42 MB, 448x832)
1.42 MB
1.42 MB MP4
>>
>>6067734
this model sure loves to do some powerpoint shit, I wonder if going for an abliterated version of gemma 3 could fix it
>>
File: LTX_2.0_i2v_00038_.mp4 (654 KB, 448x832)
654 KB
654 KB MP4
>>6067740
Oh for sure, I'm getting a fuck ton of powerpoints and posting the least bad ones
>>
File: LTX_2.0_i2v_00043_.mp4 (1.06 MB, 448x832)
1.06 MB
1.06 MB MP4
>>
File: 1763374024836227.mp4 (3.08 MB, 960x960)
3.08 MB
3.08 MB MP4
When base
>>
>>6067746
they're saying boo-urns
>>
File: sad.mp4 (1.22 MB, 960x960)
1.22 MB
1.22 MB MP4
>>6067746
>When base
if they don't release it before Chinese's new year (Feb 17, 2026) it's definitely over
>>
File: LTX_2.0_i2v_00049_.mp4 (813 KB, 448x832)
813 KB
813 KB MP4
fucking powerpoints
>>
File: LTX_2.0_i2v_00050_.mp4 (1.5 MB, 960x960)
1.5 MB
1.5 MB MP4
>>6067746
>>
>>6067750
I give you the original image input so that you can get a better result kek (we can't upload images on this place? this sucks wtf)
https://files.catbox.moe/1jwczb.jpg
>>
File: LTX_2.0_i2v_00053_.mp4 (2.41 MB, 960x960)
2.41 MB
2.41 MB MP4
>>6067756
thanks, already had this one
>>
>>6067750
>>6067757
absolute kino, love those ending transitions
>>
File: 1741358928983775.mp4 (4.1 MB, 1920x1080)
4.1 MB
4.1 MB MP4
https://www.reddit.com/r/StableDiffusion/comments/1q6zb57/comment/nycrhpl/
seems like it's working better on Wan2GP
>>
File: output.webm (3.88 MB, 960x960)
3.88 MB
3.88 MB WEBM
migu left :(
>>
>>6067773
lmaooo, I guess you tried to stitch the videos together by going for the last frame but it's getting more and more horrific for each iteration kek
>>
>>6067774
got so bad miku left the video and made me end it, svi when
>>
>>6067773
>migu left
catch her back! without the sacrifice we won't get Z-image turbo!
>>
>>6067768
i'm having a tinker with it, I'm upping the res and frames each time but the resource usage never moves, could it be infinite?
>>
https://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668
Oh, looks like we'll finally get it!
https://files.catbox.moe/lney3m.JPG
>>
File: WERE BACK.gif (2.73 MB, 498x498)
2.73 MB
2.73 MB GIF
>>6067795
>https://github.com/modelscope/DiffSynth-Studio/commit/0efab85674f2a65a8064acfb7a4b7950503a5668
oh shit it's from Modelscope, finally something is happening
>>
File: LTX_2.0_i2v_00060_.mp4 (720 KB, 512x640)
720 KB
720 KB MP4
waow
>>
>>6067795
I thought they would've released it right before Chinese's new year, but if it's sooner than that I'll definitely take it, gimme gimme gimme
>>
File: LTX_2.0_i2v_00076_.mp4 (3.21 MB, 1280x704)
3.21 MB
3.21 MB MP4
>>
did I do something wrong?
>>
>>6067807
sounds correct to me
>>
File: kek.mp4 (2.17 MB, 1312x704)
2.17 MB
2.17 MB MP4
>>6067795
>>
>>6067803
>>6067811
those powerpoints zoom/de-zoom is killing this model, without that it would be way more fun to play with
>>
File: 1753819783513759.mp4 (1.46 MB, 960x1280)
1.46 MB
1.46 MB MP4
>>6067803
>>
File: LTX_2.0_i2v_00086_.mp4 (3.86 MB, 1280x736)
3.86 MB
3.86 MB MP4
>>
>>6067789
>could it be infinite?
there has to be a resource usage increase, but maybe they found some tricks to make it minimal, this is a huge deal desu
>>
File: LTX_2.0_i2v_00084_.mp4 (3.28 MB, 1280x736)
3.28 MB
3.28 MB MP4
>>
>>6067823
seed lotto or is this a good prompt
>>
>>6067845
starting to notice this music in a lot of videos
>>
File: LTX_2.0_i2v_00089_.mp4 (4.17 MB, 1280x736)
4.17 MB
4.17 MB MP4
>>6067863
Probably the most generic suspense sounds all mashed into one homogeneous suspense slop.
>>
>>6067859
My first try. I am using qwen 8b to enhance the prompt other than that it's the standard comfy flow for the distill model.
>>
File: LTX_2.0_i2v_00090_.mp4 (3.43 MB, 1280x736)
3.43 MB
3.43 MB MP4
>>
>>6067789
it happened
looks like the limit is 960x960_240, or more frames for fewer pixels and vice versa
pretty good, especially considering comfy won't even try at 832x480_121
>>
>ltxv2
>input picture of woman
>prompt her to say something and do a simple action
>every single gen it hangs on the static input image for several seconds while audio plays then the last second it cuts to show an unrelated woman doing the action I prompted (while also being garbled slop)
what the fuck gives?
>>
is chroma better than lumina?
>>
>>6067970
Side grade
>>
>>6067970
lateral step
>>
File: LTX_2.0_i2v_00061_.mp4 (1.7 MB, 704x384)
1.7 MB
1.7 MB MP4
>>
File: LTX_2.0_i2v_00062_.mp4 (1.51 MB, 1408x768)
1.51 MB
1.51 MB MP4
>>
File: LTX_2.0_i2v_00063_.mp4 (2.54 MB, 1408x768)
2.54 MB
2.54 MB MP4
>>
File: LTX_2.0_i2v_00064_.mp4 (2.58 MB, 1408x768)
2.58 MB
2.58 MB MP4
>>
>>6067964
>what the fuck gives?
they censored the model, so we're getting the API cuck treatement, but in local!
>>
File: LTX_2.0_i2v_00065_.mp4 (3.01 MB, 768x1216)
3.01 MB
3.01 MB MP4
>>
File: LTX_2.0_i2v_00066_.mp4 (2.45 MB, 768x1216)
2.45 MB
2.45 MB MP4
bruh
>>
File: LTX_2.0_i2v_00067_.mp4 (2.58 MB, 768x1216)
2.58 MB
2.58 MB MP4
good enough I guess
>>
>>6068165
>>6068167
it's terrible when the movement is fast, not a big fan of the blurry shit lol
>>
File: 1756895147045515.mp4 (1.59 MB, 832x1088)
1.59 MB
1.59 MB MP4
gens really shouldn't look this fake in year of our Lord 2026. Even on good rolls everything always goes a bit blurry. Colours change. Weird motions.
Is it comfy's fault?
>>
>>6068170
its ai slop but the sound makes it funny
>>
>>6068171
>gens really shouldn't look this fake in year of our Lord 2026.
I agree, Z-image turbo showed that you can make good and small models, the others need to learn a thing or two from Tongyi
>>
>>6068171
>dried cum moving when she moves tummy
>>
File: LTX_2.0_i2v_00068_.mp4 (2.56 MB, 1088x832)
2.56 MB
2.56 MB MP4
>>6068171
I mixed height with width, grim
>>
File: LTX_2.0_i2v_00069_.mp4 (2.56 MB, 832x1088)
2.56 MB
2.56 MB MP4
>>6068179
eh
>>
>>6068171
>>6068181
are you using the upscaler? if yes, remove that shit and go for a vanilla render with more pixels (like 0.9 megapixels)
>>
>>6067150
d*bo status?
>>
>>6068189
you can't post images there so you can't be an avatarfag, we're safe from those fuckers lol
>>
>>
File: LTX-2_00011_.mp4 (328 KB, 832x448)
328 KB
328 KB MP4
>>
I want to try this out even if I'm a 16+32 ramlet

What's the best UI to pick up?
>>
>use LTX-2 to create the audio
>then use Wan 2.2 S2V with the audio for better video quality
I'm too lazy to set it up but someone should try this.
>>
>>6068462
wangp
>>
>>6068462
Pinokio + wan2gp if you are lazy and/or have no idea what you are doing. ComfyUI for more speed, but you need to learn a few things first.
>>
File: LTX_2.0_i2v_00070_.mp4 (1.13 MB, 768x1344)
1.13 MB
1.13 MB MP4
>>
>>6068165
lmao
>>
File: LTX-2_00002-audio.mp4 (1.2 MB, 512x736)
1.2 MB
1.2 MB MP4
breh why
>>
File: LTX-2_00004-audio.mp4 (1.8 MB, 512x736)
1.8 MB
1.8 MB MP4
i give up
>>
File: LTX-2_00005-audio.mp4 (958 KB, 704x704)
958 KB
958 KB MP4
ltx hates migu
>>
>>6068647
>>6068641
>>6068639
>>6068625
wtf 4chan supports audio now?
>>
>>6068650
Only chad boards like /wsg/ do
>>
>>6068651
i thought i was on /g/ lol
>>
File: LTX-2_00007-audio.mp4 (1.52 MB, 704x704)
1.52 MB
1.52 MB MP4
Not what I asked, but kinda cute ngl
>>
File: LTX_2.0_i2v_00134_.mp4 (1.64 MB, 1664x960)
1.64 MB
1.64 MB MP4
>>
File: 4.mp4 (2.55 MB, 768x1344)
2.55 MB
2.55 MB MP4
cozy bread
>>
>>6068656
is this real?
>>
>>6068639
SONGIK
>>
>>6068655
moar
>>
>>6068655
Would watch.

Hiroshima is a greedy gook.
>>
File: LTX-2_00008-audio.mp4 (2.93 MB, 1056x608)
2.93 MB
2.93 MB MP4
Attempt 1
>>
File: LTX-2_00009-audio.mp4 (2.27 MB, 1056x608)
2.27 MB
2.27 MB MP4
>>6068655
Attempt 2
>>
File: LTX-2_00010-audio.mp4 (2.32 MB, 1056x608)
2.32 MB
2.32 MB MP4
>>6068655
>>
File: ComfyUI_00005-audio.mp4 (1.98 MB, 640x1024)
1.98 MB
1.98 MB MP4
>>
>>6068655
Ok this is awesome.

>Glad you could bake it, Uther.
>>
File: LTX_2.0_i2v_00071_.mp4 (1.11 MB, 1152x768)
1.11 MB
1.11 MB MP4
>>
File: LTX_2.0_i2v_00006_.mp4 (1.29 MB, 704x384)
1.29 MB
1.29 MB MP4
>>
File: LTX_2.0_i2v_00072_.mp4 (1.6 MB, 640x1024)
1.6 MB
1.6 MB MP4
>>6068675
>>
File: LTX_2.0_i2v_00073_.mp4 (4.02 MB, 768x1344)
4.02 MB
4.02 MB MP4
>>
File: LTX_2.0_i2v_00074_.mp4 (1.03 MB, 832x1216)
1.03 MB
1.03 MB MP4
thanks for the powerpoint
>>
Just heads up that if you aren't using the Q8 ggufs yet, you might want to consider it.
>>
>>6068696
link?
>>
File: LTX_2.0_i2v_00147_.mp4 (1.54 MB, 960x512)
1.54 MB
1.54 MB MP4
>>
>>6068698
https://huggingface.co/Kijai/LTXV2_comfy/tree/main/diffusion_models
>>
>>6068698
https://huggingface.co/Kijai/LTXV2_comfy/tree/main/diffusion_models
>>
>>6068702
>>6068700
thank
>>
>3 difsferent thread s
bruh
>>
>>6068704
this is the shelter from schizos plus we got audio
>>
File: LTX_2.0_i2v_00075_.mp4 (983 KB, 832x1152)
983 KB
983 KB MP4
>>
desu, I'm finding having the audio ready and genning the i2v over it gives some pretty awesome results.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.