[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Ltx 2 Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107780632

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: hayao3-250953126.jpg (65 KB, 672x672)
65 KB
65 KB JPG
>censored to ass LTX2
>still no ZIT base
>>
>>107782231
>censored to ass LTX2
This is literally good, if local models can do what cloud models can't it would be terrible
>>
File: 1748109329550175.png (45 KB, 708x343)
45 KB
45 KB PNG
Give it to me diagonal, chat. Am I on the latest version? I went to the jeethub and 0.7.0 was the latest.
>>
>chat
>>
File: file.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
Whjo let the fucking jeets bake a model
>>
>>107782219
>webm collage
epicly hollywoodly
>>
>>107782281
>This is literally good, if local models can do what cloud models can't it would be terrible
>>
>>107782239
>They literally said on discord that this is explicitly what they were doing and that's why the model was delayed to 2026
lmao, DOA
>>
>devs:hey we are spending a month censoring the model
>anons: hmm I wonder if they censored it
>>
All I get from LTX 2 are segmentation faults and it kills Comfy. 32 Ram/16 VRAM. Is there any way I can make it work?
>>
>>107782355
yeah
>>
File: 1747969353158520.png (2 KB, 204x63)
2 KB
2 KB PNG
>>107782355
>32ram
anon.. I...
>>
>>107782370
vram?
>>
So t2v is working on a 3090 with the patch, but if I try i2v with the patch it just crashes without error, anyone else getting this?
>>
File: LTX_2.0_i2v_00012_.mp4 (2.62 MB, 1280x704)
2.62 MB
2.62 MB MP4
Fast paced dynamic footage from Mario 3D video game. Mario throws a pizza at the player. The player chases mario and Mario jumps into the lava. The player jumps into the lava and is underwater chasing Mario.
>>
>>107782399
I'm impressed it followed your retard prompt.
>>
>>107782399
this is not safe
>>
>>107782232
>>
>>107782399
why is it so melty?
>>
>The only way to keep up to date is to know exactly what super secret discord kijai is currently posting on.

I hate this.
>>
File: 2026-01-06_08-59.png (15 KB, 453x220)
15 KB
15 KB PNG
>>107782394
how did you get passed gemma being a bitch?
>>
>>107782399
now ask peach so we can have a powerpoint zoom
>>
File: 00199-3936312558.jpg (247 KB, 1824x1248)
247 KB
247 KB JPG
>>107782337
i think its the t2v that's more censored than i2v.
prompt: high quality 4k dark photorealistic Hollywood movie scene of a girl with large blue eyes, fair skin, and brown pigtails tied with green hairbands. She is wearing a partially open, orange floral kimono that reveals her large, round breasts with prominent cleavage. She holds a glass of milk in her left hand. The background is a modern kitchen with dark cabinets, a white tiled backsplash, a stainless steel sink, and a black countertop with a kettle and toaster. The lighting is warm, casting a soft glow on her smooth skin and the kitchen surfaces. The woman is cheerful, innocent and playful. The camera remains in the same focused position on the subject throughout the entire scene. she is looking directly at the viewer. she drinks a sip of milk, then she says "ahhh" in a relaxing and satisfying tone, then wipes her mouth with her hand. she talks to the viewer in a playful, innocent and lovely voice and tone. she says to the viewer "prompt master needs his future A.I waifu to have big titties, so i need to drink up more milk.".
t2v: https://files.catbox.moe/9y9160.mp4
i2v: https://files.catbox.moe/gxrdo0.mp4
here is seedance1.5 quality i2v: https://files.catbox.moe/adkp98.mp4
>>
>>107782435
>>107781457
>>107781812
>>
>just hack your comfy
>>
>>107782399
the good news is that the motion is well done. I think it will be possible to dismantle the censorship of ltx, as was done with wan
>>
File: 2026-01-06_09-03.png (15 KB, 894x71)
15 KB
15 KB PNG
>>107782446
yes anon i did that but its still OOM for some reason
>>
>>107782462
increase the value of reserve vram?
>>
>>107782444
I cringed irl
>>
>>107782465
yeah i tried reserve vram 1-4 even increased to 10 and nothing. Is that Gemma text encoder really needed cant i replace it with something else?
>>
>>107782491
ofc you can, you can even not use any encoder
>>
>>107782491
you might be handicapped
>>
File: ovrt.mp4 (744 KB, 640x752)
744 KB
744 KB MP4
>mfw cucked model
>>
File: 1751137920343968.mp4 (517 KB, 960x512)
517 KB
517 KB MP4
>RTX3090
>cfg 1
>960x512
>121 frames
>8/8 [00:47<00:00, 5.89s/it]
>"The man throws the plush out the window, and everything explodes."
that's pretty fast but I'm afraid the distilled version has worse prompt adherence
>>
>AI needs to be safe, saar
>>
>>107782516
no it doesnt
>>
>>107782517
Why do you need to gen porn?
>>
>>107782537
what else would I gen?
>>
>>107782503
lets be real we all are but nobody is more handicapped than me
>>
Definitely something weird going on with i2v for me. It skips the ram loading to ram process entirely, loads up 3gb on to my vram then just crashes without error.

Works fine for t2v otherwise.

Only flag I'm using is reserve vram 4. These are with the comfy native workflows, both the ltx provided workflows get a dimension size error (I've already turned off previews) so I don't think it's that. Ideas?
>>
>>107782491
There is a fp8 version, I haven't tried it yet.
https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main
>>
File: 00065-3389537484.jpg (350 KB, 1344x1728)
350 KB
350 KB JPG
>>
>>107782516
kek
>>
https://www.reddit.com/r/StableDiffusion/comments/1q5jgnl/ltx2_runs_on_a_16gb_gpu/
>>
https://files.catbox.moe/dlmpiw.mp4
>>
ok i'm bored of ltx2 already. when is the next model coming out?
>>
>>107782561
>Only flag I'm using is reserve vram 4
you're definitely trolling are don't know wtf you are doing... your other processes typically will only need 1 GB and its wrote as a float 1.0
a setting of 4 is far from optimal.
>>
File: zimg_00043.png (1.57 MB, 864x1280)
1.57 MB
1.57 MB PNG
>>
>>107782605
Fuck off dude. I just want to know why my i2v workflow is immediately crashing. Me not using the meta that was decided literally an hour ago is not "trolling".
>>
File: 123.mp4 (262 KB, 640x752)
262 KB
262 KB MP4
>>107782598
why did this make laugh
>>
got complete shit results with ltx
like wan 5b level of shit
>>
>>107782634
proof?
>>
>>107782537
EVERYONE wants to gen porn, i dont care about genning animals with hats or glasses or whatever is the retardfaggot meta
>>
File: 1742501574934749.jpg (453 KB, 2048x1902)
453 KB
453 KB JPG
>show z-image turbo a dataset of highly detailed, well lit photographs of nipples
>z-image interprets it as a blurry reddish brown blob without any structure or real shape
>>
>>107782639
if that were the case it wouldn't be censored and there wouldn't be so many people using it despite it being censored, you coombrains lost
>>
>>107782648
>try to train trained model
>it fails
no way
>>
>>107782655
masaka sonna...
>>
File: LTX_2.0_i2v_00014_.mp4 (3.57 MB, 1280x704)
3.57 MB
3.57 MB MP4
A car crashes into Tony's Oranges stand, sending him flying across the street, causing a massive car crash.
>>
>>107782674
>oranges
>selling bananas
>>
File: z-image_00026_.png (1.74 MB, 864x1280)
1.74 MB
1.74 MB PNG
>>107782639
pornography addiction is bad for you
>>
>>107782674
you should rewrite the prompts tbqh
>>
https://files.catbox.moe/dwoywj.mp4
lmao this is pretty fun with sound
>>
>>107782680
tony's not the smartest guy

>>107782685
yes probably, but i can't be bothered running an LLM as well alongside all this heavy vram eating shit
>>
>>107782651
lmao everyone will just go back to wan when this dogshit is no longer new and shiny
>>
>>107782708
you wish, ai being used to porn is a very tiny market
>>
im going to wait for optimizations cause im not downloading 100 gigs of shit if it needs 128GB ram and a 5090.
>>
>>107782616
And I'm telling you that you are doing this wrong, higher value does not equal better. You should also use --lowvram so that offloading occurs. reserve just reserves vram for your browser and everything else so that opening a youtube video during inference does not cause an OOM

but go ahead and waste your fucking time. also what error are you even getting?
>>
>>107782714
read the thread retard.
>>
>>107782719
You're scolding me for something completely unrelated and if you even read my post before being an ass about it, you'd see I said there was no error.
>>
>>107782714
6000* unless you break your comfy
>>
https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main
>>
>>107782760
anon this yells at me about missing tokenizer
>>
>>107782712
delusional, all i need is 1 look at most downloaded content on civitai
>>
>>107782712
>ai being used to porn is a very tiny market
there's a reason civitai decided to cuck on everything except porn, without porn loras their site dies the next day
>>
i fucking hate subgraphs
>hey you know what would be cooler than 2d spaghetti? a 3d spaghetti maze!
>>
>>107782786
I like the idea of subgraphs. I do not like being exposed to someone else's subgraphs.
>>
https://files.catbox.moe/7k0c53.mp4
>Hatsune Miku appears on the scene and begins to dance, jazz music plays in the background, and she sings in Japanese.
D O A
>>
>>107782786
>i fucking hate subgraphs
same, this shit needs to die
>>
>>107782796
It's actually nuts, this is the first model I think I've ever seen that doesn't know miku.
>>
>>107782773
>delusional, all i need is 1 look at most downloaded content on pornhub
yeah bro lmao
>>
>>107782810
nice cope, all it does is prove my point
>>
>>107782807
>this is the first model I think I've ever seen that doesn't know miku.
sorry saar but the model had to memorize all the bollywood actors first, praise the izzat
>>
>>107782816
no it doesn't, compare the most popular ph video views with the most popular yt video views, most people don't care about porn
>>
>>107782826
>most people don't care about porn
tumblr died instantly when they removed porn in 2018
>>
>>107782826
civitai is a general model sharing site moron, just because porn is THAT popular it got so dominant doesnt make it pornhub
>>
>>107782835
because it was only used by coombrains, everyone else had already left, im sure pornhub would die if it removed porn
>>
>>107782516
>we have sora 2 at home
>>
>>107782843
Nta but if porn is so big why are the smartest models like zit and flux shit at making it compared to pony and illustrious?
>>
>>107782859
porn is illegal in china
>>
>>107782859
because everyone is scared of training on it because of regulations, lawsuits from retards, bad press and visa/mastercard raping you
>>
>>107782861
And which country is leading on local AI? Yep.
>>
>>107782796
https://files.catbox.moe/r5l0nj.mp4
>The man brandishes his sword, he is angry, he looks at the camera and says: “WHAT DO YOU MEAN, LTX 2 DOESN'T KNOW MIKU? THIS IS OUTRAGEOUS!”
wtf?
>>
>>107782786
Subgraphs are cool for hiding stuff that you never, ever change. The rest is much better off left as is
>>
>>107782880
>censored and indianed
>>
>>107782870
ironically the least censored base local model is chinese, and it's HunyuanVideo, this shit can do pussies and dicks perfectly
>>
https://files.catbox.moe/lhy6jj.mp4
>>
>>107782890
Are you using it? Thought so.
>>
>starts genning normal 1girl pictures in aesthetic settings
>diverges into porn a few gens into the day.
every single time
>>
>>107782901
>Are you using it?
I was using it until a better model (Wan) got released
>>
>>107782880
lol doa
>>
>>107782880
seriously though, why is it so biased towards indians??
>>
>>107782796
i'm officially branding LTX as DEI Woke
>>
>>107782920
>trained to never ever make a video with good looking women
>biased towards indians
indian men = opposite of beautiful women confirmed?
>>
File: LTX_2.0_i2v_00018_.mp4 (3 MB, 1152x896)
3 MB
3 MB MP4
it's so melty, people's faces and hands and anything that moves just melt into amorphous blobs
i don't like this model at all personally
maybe good for low effort meme gens
>>
>>107782796
first time in my life I see a model unable to render Miku, really (((impressive)))
>>
This model is so dogshit.
There's only two things going for it, audio and speed, everything else is complete shit and worse than Wan by a lot.
Also absolute zero nsfw capabilities, I wonder if people will care after one week or if it will actually stick around.
>>
>>107782938
it gens the original at 1/4 of the size you ask for, so for 1920 you need to ask for 7680 or it will look like yours
>>
File: plebindian.jpg (96 KB, 998x1189)
96 KB
96 KB JPG
>>107782920
Because Indian and Chink are the new meta.
>>
>>107782938
>>107782958
this, remove the retarded upscaler and ask the model to render high res natively
>>
>>107782953
wan only works because of gorillions of loras
>>
File: 1750561973645927.jpg (1.08 MB, 1248x1824)
1.08 MB
1.08 MB JPG
>>
>>107782968
>6 toes
>>
>>107782965
I don't know what you mean, wan only need the lightx loras and completely destroys ltx2 in any capacity, not to mention the actual full model.
Also can do nsfw out of the box other than pen and vagoo.
>>
>>107782972
show me >>107782399 with wan, pro tip you can't
>>
ltx2 is full of pajeet it was not worth patching it to run on a 3090ti stick to WAN image to video
>>
>>107782965
no one will bother to make loras of a giant 19b model if they know they have a chance to end up with a powerpoint lol >>107782880
>>
File: 1754137868235941.jpg (29 KB, 355x400)
29 KB
29 KB JPG
>>107782796
>no Miku
into the trash it goes.
>>
>>107782977
Anon that looks like absolute dogshit trash, don't know what are you smoking.
>>
>>107782998
then it should be easy for wan to outdo it
>>
Sure if a lack of objectivity in this thread.
>>
File: LTX_2.0_i2v_00020_.mp4 (2.79 MB, 1152x896)
2.79 MB
2.79 MB MP4
>>107782958
>>107782963
ok i removed the upscaler and genned at original resolution. this is supposed to be better how?
>>
>>107782796
https://files.catbox.moe/m85nhz.mp4
>>
>>107783022
It really is Will Smith spaghetti level of quality.
>>
>>107783022
it is though
>>
>>107783022
told you, wan 5b
>>
>>107783022
doa
>>
>>107783022
looks worse than wan sampler previews kek
>>
And people were hoping this model would force wan 2.5 to be open sourced
>>
>>107783022
if you're using the base model don't, the distilled version is much better
>>
>>107783055
that is the distilled one
>>
Damn I was hoping this model was somehow good, what a massive disappointment.
>>
ltx looks like a better voice model than a video one
>>
>>107783059
no it isn't
>>
5090 users, don't update your drivers to the latest. It fucked up ltx2 for me permastuck on 29%, with my previous driver I breezed through gens.
>>
File: 1755662491975012.png (156 KB, 480x315)
156 KB
156 KB PNG
>>107783059
oof, it's over
>>
File: file.png (62 KB, 927x440)
62 KB
62 KB PNG
>>107783080
yes it is. i am the anon who posted the gen.
>>
File: 1739305736747123.png (1.42 MB, 1120x1496)
1.42 MB
1.42 MB PNG
the anime girl is holding another white can of energy drink with her left hand.

neat, from 1 to 2. for some reason the model with the baked 4 step lightning lora works better for me than q8 + separate lora. idk why.

https://huggingface.co/lightx2v/Qwen-Image-Edit-2511-Lightning/blob/main/qwen_image_edit_2511_fp8_e4m3fn_scaled_lightning_comfyui_4steps_v1.0.safetensors
>>
File: 1760452848956326.png (46 KB, 604x379)
46 KB
46 KB PNG
wut
>>
>>107783059
>But doctor, that IS the distilled one.
>>
>>107782880
https://files.catbox.moe/mgkbzy.mp4
the fuck is wrong with that model, I can't stop having those powerpoint zoom shit
>>
>>107783055
>>107783059
oh no no no no
>>
File: 1751395067756230.png (1.33 MB, 1120x1496)
1.33 MB
1.33 MB PNG
>>107783088
shoe on head:

the anime girl is sitting at a computer, with a black shoe on her head. the background is white.
>>
>>107783101
try not using good looking women
>>
>>107783101
they spent months training it that woman = powerpoint slide
it's a form of built in censorship, that's why DOAanon was right
>>
>>107783107
this is torture
>>
>>107783101
Sheboon voice is like a cherry on top lmao
>>
I have not had an issue with power points yet, but some of my gens that ask the camera to zoom out and show the full body of the subject result in random Indian people squatting in the corner of the screen looking up at the subject. It's really creepy.
>>
File: thehorror.jpg (10 KB, 480x360)
10 KB
10 KB JPG
Is it normal that using ZiT with loras results in worse image quality? It's still lightyears ahead of XL even with loras but there seems to be noticeably more defects and anatomy errors when loras are in use, or am I just seeing things
>>
The reason LTX-2 is so fast is the same reason it makes everything look like melty blobs: the VAE is highly compressed.

Wan uses 16x16x4 compression going into the model (8x8x4 in the VAE, with 2x2 spatial patch size at model input).
LTX-2 uses 32x32x8, all in the VAE with patch size 1 at the model.

Every single LTX model is like this. It will always look melted. For some reason the devs find the speed increase preferable over looking good.
>>
>>107783142
why cant xl be smart
>>
>>107783145
Is not only the vae, unfortunately the model itself is really shitty, feels like a 1.3b model.
>>
>>107783142
Distilled model, loras destroy it unless you run them very low like 0.5 or less weight.
>>
File: 1757423560215292.png (1.4 MB, 1120x1496)
1.4 MB
1.4 MB PNG
>>107783107
pizza slice in left hand

I like how the style/shading stays consistent even without prompting it. also copying fonts works well too.
>>
>>107782880
THIS is what you faggots were hyping up? lmao
>>
>>107783101
https://files.catbox.moe/7r115i.mp4
>>
We'll never get a truly uncensored model are we
>>
>>107783158
>feels like a 1.3b model.
You're being melodramatic. It's not that bad.
>>
>>107783166
make your own
>>
>from slowmo 20 second videos to hallucinations of indians
ai bubble gonna burst any moment now
>>
>it's been hours since the release
>not a single video of a character saying nigger
Don't tell me ltx can't even do that
>>
>>107783160
why do you spam these threads with your useless experiments?
>wow llmao qwen edit caN DO shoe on head XDDD
at least better than your garbage low effort floyd/pepe/miku edits
>>
File: IMG_20260106_111827.jpg (236 KB, 720x977)
236 KB
236 KB JPG
Hello Goy are you enjoying your censored Saarmaxed model?
>>
>>107783022
What nodes did you bypass to remove the upscaling?
>>
>>107783191
None of this matter, don't matter how much you seethe at joos.
The model will fall by it's own weight because is just actually shit.
>>
>>107783223
i didn't actually manage to
>>
>>107783116
>they spent months training it that woman = powerpoint slide
which usually breaks a model due to false positives, these cunts never do they. DOA.
>>
ltx2 workflows? please spoonfeed me.
>>
>>107783223
not sure why the other anon is pretending to be me. i just deleted the upscaling part and raised the resolution on the first part (there was a 0.5 downscale)
>>
you need to use the prompt rewriter bros
>>
>>107783224
>No matter how much your seethe at kikes
Anon the kike in the pic releated is the CEO of LTS. This model is shit, and nobody can save the local IS SO OVER .
>>
>>107783311
do i also need to put "melting faces, powerpoint slide, indian person" in the negatives?
>>
File: z-image_01462_.png (2.94 MB, 1440x1440)
2.94 MB
2.94 MB PNG
>>
>>107783305
Have you not seen the quality of the videos posted itt? Wouldn't waste your time.
>>
>>107783305
>Open comfy
>go to templates
>type "light"

you now have all the templates everyone here is using.

>>107783323
couldn't hurt
>>
>>107783185
only actual useless thing is a post, criticizing other images, without making something yourself.
>>
>cannot make lolis
>cannot make mikus
>Jeeted to the moon
DOA model
>>
>>107783320
That's exactly what I was saying you idiot.
That the fuck that is mossad backed (unironically also backed by jewvidea) won't save the model, because is fundamentally trash.
>>
>He hasn't found the secret izzat input node to improve gens.
>>
how can it not know miku
>>
>>107783360
tell the fucking nips to make a model then
>>
>>107783364
fact*
>>
anyone using ltx2 with multigpu nodes? I get "mat1 and mat2 shapes" bullshit. I think the distorch2 clip loader doesn't support the text encoder properly
>>
File: 1692368089.png (2.57 MB, 1248x1824)
2.57 MB
2.57 MB PNG
>>
Wan 2.2 is going to be the "two more years of SDXL" of video gen
>>
Two more weeks before z base anyway
>>
File: 693847353.png (1.21 MB, 1152x896)
1.21 MB
1.21 MB PNG
>>
post some discord screenshots i love behind the scenes secret club stuff
>>
>everyone throwing hands with cumfart code and retarded UI/UX to use a model that sucks ass
why does Nvidia work with cumfart again?
>>
I don't think the model is as fucked as you're saying it is.
>>
>z-image base
>we will release it soon
>really, very soon
>soon to be released
>surprised awaiting for you soon
What the fuck are they waiting for anyways?
>>
>>107783454
dude the damn thing cucks on prompt for hot woman, its trash no one will give a fuck about it in 4 days time.
No boobs
no ass
= into the trash it goes
>>
>>107783488
That's not the issue, if the model was actually good, people will do anything to make it less cucked, but the model itself is really bad.
As the other anon said, we are stuck with wan 2.2 for a long time it seems.
>>
>>107783481
soon
>>
>>107783454
>an artistic render of a classical painting?
>yeah the face should be pixar animated slop
I don't think you think at all
>>
>>107783488
>>107783509
melodrama here. You barely even touched the model.
>>
>>107783454
>>107783534
try to get a gen with a woman spreading her legs
ill wait
>>
File: 1323850336.png (912 KB, 832x1216)
912 KB
912 KB PNG
>>
>>107783534
>>107783509
unlike wan they actually hard trained it in, so good luck uncucking that you will need a lot of money and compute. its flux all over again, the deliberately broke.
>>
>shit model
>shit license
>shit censorship
>shit shill employees in the thread
yeeeeeeeeeeep sounds like a western model release
>>
>>107783307
I get this if I bypass the entire upscale section.
>>
>>107783552
>>107783544
abloobloobloo
>>
>>107783552
>its flux all over again
Nigga, is not flux at all, flux was the best that there was back then, ltx2 is trash, no one will bother using this shit after a week when the "let's see what it can do" and hype phase is gone.
>>
>>107783544
>spreading her legs
Just a traditionally attractive woman standing would be enough for me
Haven't seem any yet
>>
>>107783552
>>107783574
And if you actually know how to read I said IF THE MODEL WAS ACTUALLY GOOD.
>>
>>107783565
move that last node out of the bypass
>>
chinese ops itt
>>
File: LTX_2.0_i2v_00032_-1.mp4 (3.8 MB, 704x1088)
3.8 MB
3.8 MB MP4
By removing the upscale part, resetting the downscale and img_compression back to 1, I'm getting different results. Will explore further.

>>107783592
I figured that out just as I posted..
>>
stop using unsupported resolutions, that seems to be the source of people's issues, must be vramlets trying to make it work using low resolutions it was never trained at
>>
You will not make me updoot. You will not.
>>
File: 1441266190.png (2.35 MB, 1248x1824)
2.35 MB
2.35 MB PNG
>>
>>107783663
alright then rtx pro 6000 haver why dont you show us how its done
>>
why was anon excited about LTX again? we knew it would be shit
>>
>>107783716
because i had hope :(
>>
Someone make a quickstart guide for ltxv in comfyui, what models to download and where. Seething vramlets don't (You) me
>>
>>107783360
>cannot make lolis
Wow its shit
>>
>>107783723
click the comfyui logo, go to Browse Templates, click on the template
>>
What's this offloading tech nvidia talked about?
>>
>>107783684
Just use a higher res
https://files.catbox.moe/gf613x.mp4
https://files.catbox.moe/um302k.mp4
https://files.catbox.moe/iugwvw.mp4
>>
>>107783563
you forgot shit ui
>>
>>107783749
offload all compute to the cloud you will own nothing
>>
File: 1765142632630860.png (1.59 MB, 1216x1216)
1.59 MB
1.59 MB PNG
>>107783716
>tfw downloaded all the loras
>fp8 distilled and dev
>>
>>107783755
these all look like shit even with a high res
>>
>this thread coping with shit gpus
https://files.catbox.moe/7zs4se.mp4
>>
>>107783775
He also got all those from the banojeetdoco discord server.
>>
So if I have a 50XX it'll run super fast did I read that correctly?
>>
im just gonna wait for memory optimizations, but from what I saw, its not looking good broskis. sadge.
>>
>>107783813
install the new drivers goy
>>
File: asdasdas.png (213 KB, 1755x617)
213 KB
213 KB PNG
Don't use downscale / upscale. That fixes the glitching. Bypass the two purple nodes here
>>
File: 1757087968784307.png (2.53 MB, 960x1600)
2.53 MB
2.53 MB PNG
>>
also increase steps to 30 maybe, 20 is too few imo. Still working on playing with setting like shift and sampler.
>>
File: 1755514577630596.png (2.76 MB, 960x1600)
2.76 MB
2.76 MB PNG
did you watch my show today, bros?
>>
wtf why isn't the gradio port 7860 working?
>>
>>107783191
>jewish CEO
>model likes to gen indians
>india also hates china
noooticing
>>
>a robot mech says "This shit absolutely sucks" then explodes
https://files.catbox.moe/r9hg9r.mp4
>>
>>107783876
holy retard
0.0.0.0 is a global binding, meaning it's binding on all interfaces, but it's not a real address.
do localhost or 127.0.0.1
>>
>>107783876
Try http://localhost:7860/
>>
>>107783876
sarrr
>>
kek
>>
>>107783876
someone didnt pay attention in their networking class tsk tsk rakesh
>>
File: 1754683012317641.png (2.23 MB, 1632x896)
2.23 MB
2.23 MB PNG
>>
Gonna sleep and hope ltx 2 is cleaned up tomorrow. Using it now just eats away my pagefile and even when it gens, after a few gens it just bricks my machine.
>>
>>107783755
compared to wan it's quite shit.
>>
>chinese trying to smear competitor
I'm nooticing...
https://files.catbox.moe/xsys6k.mp4
>>
>>107783896
thanks it worked :) guess its time download the models for a 5090+64gbram setup.
>>
File: 1745198651141168.png (2.14 MB, 1216x1248)
2.14 MB
2.14 MB PNG
>>
>>107783977
What are you using to make these?
>>
>>107783993
sdxl
>>
>>107783965
weird looking car
>>
Seriously? New video model and not a single anime gen to test if its 3d slop or sakuga kino?
>>
File: 1748750039102244.png (917 KB, 1859x1103)
917 KB
917 KB PNG
>>107783993
>>
>model literally can't make anything good
>actual proof ITT
>not even the "good" examples look good
>"s-stop this s-s-smear campaign..."
It's shit, Rajesh.
>>
File: file.png (68 KB, 270x577)
68 KB
68 KB PNG
>>107784018
what do these do
>>
>>107784049
its made quite a few good videos. And remember, its doing this in under 10 seconds
>>
>>107784049
>rajesh
wrong, it should be 'it's shit mister silverstein'
>>107784052
that just automatically let's say gets the AR from the input image then snaps it to 1.5MP density on steps of 32
>>
you wait 100s to get a slideshow
fuck this model
>>
>>
>>107784086
promptlet skill issue
>>
>israel makes a model
>india gets the blame
fuck you benchod
>>
>>107784086
ive been using this for hours and still have not seen this
>>
>>107783965
is the car using a mosquito-drive?
>>
>>107784018
what package is the prompt rewriter node from
>>
>>107784096
indians say they're israel greatest ally tho, sanjeev?
>>
>>107784123
that's america
>>
reduce max shift to 1.2, seems much better
>>
For ltx2, what's an alternative Save Video node that lets me do more settings?
>>
https://files.catbox.moe/8ko3r1.mp4
>>
>>107784018
how long is the llm text generation thing on fresh runs?
>>
File: 1736794590109145.png (166 KB, 1497x643)
166 KB
166 KB PNG
>>107784117
it's my own autistically linked shit
I can suggest a couple:
https://github.com/sebagallo/comfyui-sg-llama-cpp/
https://github.com/BigStationW/ComfyUI-Prompt-Rewriter

for my own stuff I'm using llama-server.exe in router mode, with nodes that call my server to manually unload the model when necessary
>>
>>107783876
average shitter giving his opinion online btw
>>
>>107784106
i2v?

>>107784095
Enlighten me then, faggot
>>
>>107782569
I guess I'll try and make GGUFs of the Gemma IT version. Hopefully it doesn't need any architectural modifications to llama.cpp or ComfyUI-GGUF

Can't imagine city isn't working on it as we speak though but I have nothing else to do at work, and honestly figuring out how to quantise the text encoder sounds like more fun than actually using this model based on the outputs I'm seeing
>>
>>107784185
i wish people weren't so quick to help him
we're part of the problem
>>
>>107784189
there's already plenty goofs of gemma it, the city gguf loader needs to implement the arch (along with mmproj loading) since he kinda hardcoded stuff for qwenvl.. it's going to require a little bit more effort
>>
Is it still not yet possible for local gen to do image2image, but changing more than just artstyle and resolution? Like can it change camera position, character position, character actions?
>>
>>107784200
/g/ is sadly infested with low iq retards who know shit about anything related to computers, I'm not sure if having a dedicated board (like /prog/ with captchas tailored to programming/pc knowledge) would help
>>
just woke up and called it >>107778617
>>
>>107784226
if only the software was an exe and automatically set things up properly instead of webdevs running everything
>>
>>107784224
Qwen Image Edit, but it's also slopped and not as good as Nano Banana
>>
>>107783053
>people were hoping this model would force wan 2.5 to be open sourced
Here's some copium: we're at the equivalent stage of when LumaLabs first announced their model 3 months after Sora was shown off

Sora2 is about 4 months old. HunyuanVideo took 9 months to come out from the announcement of Sora. So we should only be dooming if we don't get the Sora 2 killer by June


LTX has always had an incredibly optimised architecture but this release is the canary that real-time video generation is going to be the norm very soon
>>
>>107784241
>muh exe
you're part of the problem
>>
>>107784242
Can I run it on my RTX 5070 Ti?
Can it do NSFW?
Googleshit isn't an option because it's censored.
>>
>>107784263
maybe you don't have the programming expertise to make one :^) you would be filtered by your own suggestion
>>
File: 1754799038552485.png (2.09 MB, 1632x928)
2.09 MB
2.09 MB PNG
>>
>>107783167
>feels like a 1.3b model.
>You're being melodramatic. It's not that bad.
It's uncanny how similar it feels to wan 5b kek
>>
>>107784273
why dont you make one then genius
>>
geeeg i cant tell what's worser: the comfyui software or this new ltx meme
>>
File: z-image-fp_00059_.png (2.92 MB, 1024x2048)
2.92 MB
2.92 MB PNG
>>
>>107784297
i did
>>
https://blog.comfy.org/p/official-amd-rocm-support-arrives
AMD people may finally be able to switch from SD.Next
>>
>>107783716
>why was anon excited about LTX again?
I was excited because I didn't know its audio was garbage. Luckily, the LTX team also know it's audio is garbage and said they already have a fix just couldn't ship it in time

I am really shocked at how garbage their text to video is. I don't even know how you train a text to video worse than the original HunyuanVideo from over a year ago
Oh anons, we have one final hope for video: The guys who make GLM are also the CogVideo guys so hopefully we get CogVideo3 but they probably won't stop focusing on LLMs since they're mogging right now
>>
>>107784363
AMD people exist?
>>
>>107784410
yes remember raja?
>>
>>107783876
>These are the anons telling you to use fp8 scaled
>>
downloaded the models and now i'm getting this error ModuleNotFoundError: No module named 'triton'
>>
>>107784438
this has to be bait i know you have critical thinking skills rakesh use them
>>
File: zimg_00168.png (1.65 MB, 864x1280)
1.65 MB
1.65 MB PNG
>>107784332
>>
>>107783966
>trying to "flex" his shit rig after being clowned on for being tech illiterate on the technology board
lmao, rakesh's 12vhpwr is going to start a house fire by the end of this month for sure, if he even has a 5090
>>
>>107784363
it's the electron garbage which is useless
>>
>>107784211
>there's already plenty goofs of gemma it
Share them because I can't find them on HF for some reason.

Comfyui-GGUF is all hardcoded so that doesn't surprise me.


Is there any way to get the tokenizer config etc without having to login to HF? Why is there no mirror of this model man...
>>
people are finding that ltxv2 like most models these days are highly dependent on long LLM style prompts. Quality goes up with prompt length / detail.
>>
>>107784457
man we need someone to make a workflow for zimage that shittifies the images with filmgrain because its the only thing missing

it'd have to be done in 3 passes using a depth extractor - basically the further away from the camera the more grainier it is, and up close its minimalistic and the grain is less subtle on light pixels
>>
more subtle*
>>
>>107784478
proof?
>>
>>107784521
using it? common sense? the fact all these fucking companies keep using automated LLM captioning?
>>
Oh unsloth has them nvm. Damn guess I have nothing to contribute until I get home

I was thinking of practicing prompts on the HF space that ltx is running but I'm really not motivated anymore

>>107784363
I would test this right now but the 7900XTX sitting right next to me has the wrong bios flashed so I can't install the driver
>>
I hope someone makes an nvfp8 of wan 2.2 model.
>>
local remains a joke
>>
>>107784474
>>107784474
>>107784474
move when ready
>>
>>107784454
so do i have to install it? there is no mention of installing it on the main github page.
>>
yet another cozy bread :]
>>
>>107784612
>waits 10 min to post the link to new
gay but ty for bread
>>
>>107784612
i am not ready
>>
File: 00160-2731996871.png (3.62 MB, 1920x1080)
3.62 MB
3.62 MB PNG
what's the best model for abstract, wallpaper-type gens? i don't mind if it's older

z image turbo is really good but it seems to excel at photorealism. picrel is the type of stuff I like to gen
>>
>>107784718
probably best to just go on civitai.com and search the images for those kind of gens and what checkpoints they used.
>>
File: img_00453_.jpg (293 KB, 1216x1376)
293 KB
293 KB JPG
>>
>>107785252
VLCkeks on suicide watch
>>
>>107785252
ackchually i use mpv
>>
>>107785284
qrd
>>
>>107785556
LTX-2
>>
>>107785595
vlc?
>>
>>107785607
mpc



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.