/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1759355754737965.jpg (1.03 MB, 2740x1248)
Discussion of Free and Open Source Diffusion Models

Prev: >107792305

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107794550
i saw somewhere that the issue is i'm using main instead of the v4 branch, is that it?
>>
>>107794563
yeah v4 branch doesnt have the memory freeing buttons sadly
>>
File: 1764107928052329.png (1.49 MB, 1376x1088)
>>
>>>107794552
>>Prev: >107792305
>Prev: >>107792305
>>
>>107794583
fuck
>>
>>107794552
>107792305
DING DING DING
RETARD ALERT
ALL HANDS MAN YOUR BATTLESTATIONS
WE'VE GOT A RETARDED FAGGOT HERE
>>
Man, did Ran really sneak in the troll links again? It's so fucking tiresome
>>
>>107794552
>Maintain Thread Quality
This shit needs to stop. Take your fucking off topic schizo drama to discord where it belongs
>>
>>107794599
im sorry bro no one will use your UI
>>
>>107794607
on topic and related to thread health. it has been there for far more threads than it was without. if you are not satisfied with the thread culture, migrate somewhere else, there are plenty of AI threads on this board
>>
File: ComfyUI_temp_cnzpx_00037_.png (1.93 MB, 1120x1400)
>>107794552
Total 1girl supremacy
>>
>>107794609
Take your meds, faggot. We just had one of the best threads ldg ever had and now you bring this garbage here again. Do you really hate this general so much?
>>
>>107794629
>it has been there for far more threads than it was without
Maybe this logic works in your shithole of a country, but appeal to tradition is one of the worst arguments you could possibly have, ever. It's not "thread culture", this is your personal vendetta against a dev.
>>
>>107794629
>give attention to attention whores who get off on said attention, especially when it's negative
>continue giving that attention despite the fact the attention whores in question are still here
winning move right there, good job anon
>>
>>107794632
last thread it was basically ltx shitty videos... barely any image. wish we had more 1 girls posted or images in general
>>
>>107794639
I don't browse this general often and thanks to this discussion I saw those links and read them and will now spread the word
>>
>>107794652
dude get the memo: he's replying to himself
>>
so tired of ran's antics
>>
ltxv is amazing
https://files.catbox.moe/ydn07o.mp4
https://files.catbox.moe/g9dzvj.mp4
https://files.catbox.moe/upb2vk.mp4
>>
how long does it take half-decent loras to be made for new models, i was using hunyuan1.5 before and there weren't many but ltx2 is a gamechanger
>>
File: AnimateDiff_00693.png (1.09 MB, 1248x720)
>>
>>107794718
we just need porn loras, people get to work
>>
>>107794762
https://files.catbox.moe/uetngb.mp4

Forgot the video. Fuck this catbox shit, why can't we get sound on here or an AI board.
>>
>>107794677
soon bro i downloaded some creepshot loras yesterday
>>
Blessed thread of truth friendship and justice for all
>>
>>107794768
dont downscale / use the upscaler, they made it ugly, at least atm, it seems broken
>>
>>107794599
>>107794607
>>107794639
>barely 5 hours of sleep
Lol
Your dogshit frontend is killing you
>>
File: 1751328321406740.png (2.25 MB, 1344x1120)
>>
>>107794802
But anon... that WAS with no upscaler.
>>
>>107794834
then why did yours look so much worse
>>107794718
>>
>>107794851
I don't know. But I assure you. No upscaling or downscaling.
>>
>>107794818
He doesn't even work on it lol
>>
>>107794607
It's actually very important to keep the attention whores and avatartroons in check
Also "anistudio" is malware and newfrens need to be warned
>>
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon

yep. this is the real thread. carry on.
>>
>>107794983
Why did you repeat my post?
>>
>schizoid trying to cover for his own schizo mistake
lol.
>>
File: 1758435341417149.png (2.17 MB, 1056x1440)
>>
Good model for nsfw pic edits? Flux kontext dev is censored
>>
>>107795052
wondering this too 3bh
qwen edit worked for clothes but there's nothing as simple as "give her big bouncy breasts"
>>
even a jewish team can't make an optimized ltx2 day one. religions kek
>>
>be trani
>piss against the wind
>"stop pissing on me ranfaggot!"
lolcow
>>
>prompt, input image and audio file combo keep giving me a powerpoint presentation zoom in with no motion
>nothing spicy about it at all
MOTHER FUCKER
>>
>>107795203
sadge
>>
how difficult is wan or other video generations on vram? whats the minimum acceptable without genning it for an hour?
>>
>>
>>107795320
bro its been 3 days, change subjects
>>
>>
>>107795298
bout three fiddy
>>
>>107795298
16GB, but you need lightning loras or ggufs
>>
>>107795343
i should change subjects
>>
LTX-2 gguf when?
>>
File: 1748957828319670.webm (3.9 MB, 1376x2048)
>>
>>107795418
is this ai?
>>
>>107795345
>>>/wsg/6067316
so much quality
>>
>>107795421
Yes, but plottwist: grok was trained on real pictures of Elon in a swastika bikini.
>>
File: zimg_00007.png (3.3 MB, 2016x1408)
>>107795379
please broh
>>
>>107795320
>>>/wsg/6067319
>>
>>107795467
>>107795567
:|
>>
>>107795581
>:)
>>
File: LTX-2_00002.mp4 (1.42 MB, 512x1024)
>>107795345
>>
has anyone tried gemma 3 abliterated instead of the original one to test if it works or changes anything?
>>
>>107795655
yeah
>>
File: LTX-2_00003.mp4 (1.67 MB, 512x1024)
>>107795641
>>
File: zit_00014_.png (1.18 MB, 1152x864)
>>
>>
File: LTX-2_00005.mp4 (1.94 MB, 512x1024)
>>107795715
>>
>>107795764
Consider me spooked anon
>>
File: 1756522535864917.jpg (397 KB, 1328x1328)
>>
thanks for beta testing guys
>>
>>107795796
my dick thanks you
>>
>>107795800
proof?
>>
ltx is too powerful
>>
whats comfy-kitchen?
>>
so far only things ltx2 is able to do are memes about floyd, hitler, trump, from what i see anything sexy is out of the question
>>
>>107795866
But anon those are sexy
>>
File: zimg_00279.png (1.65 MB, 864x1280)
>>107795830
they made a home for all the 1girls
>>
File: z-image-fp_00079_.png (2.99 MB, 2048x1024)
>>
another day, another lora
>>
File: zit_00025_.png (1.18 MB, 1152x864)
>>107795918
Looking good. Every day is failbake day for me.
>>
>>107795932
whatcha making
>>
>>107795918
what loss values do you typically start and end with? do you just look at the samples to decide whether the checkpoint is good or too far gone?
>>
To make a style lora, can I just grab random varied screenshots from a tv show etc and it'll work for any sort of character/environment gen, as long as I tag them properly?
>>
File: 1740865966401884.png (2.66 MB, 1056x1408)
>>
>>>/wsg/6067150
>>>/wsg/6067150
>>>/wsg/6067150
Migrate.
>>
>>107795948
Should work, yeah.
>>
File: z-image-fp_00082_.png (2.9 MB, 2048x1024)
>>
https://files.catbox.moe/djvo4z.mp4
https://files.catbox.moe/l67j93.mp4
https://files.catbox.moe/0wvsmj.mp4
https://files.catbox.moe/03noxe.mp4
>>
File: zit_00026_.png (1.43 MB, 832x1248)
>>107795933
Trying to port some old IL loras based on some strong styles but characters end up looking fucked up. I'm starting to really appreciate how easy it is to train loras for IL.
>>
>>107795938
in general with ZiT i see a very fast drop to around .04 but I can't really seem to get it to converge completely. so i train to about 1500 steps and see if any samples are acceptable then gen about 50 images. if it's overcooked i'll try earlier checkpoints, if it's undercooked i'll run another 1k steps then rinse and repeat. i think until we get base this is definitely more of an art than a science.
>>
>>107795988
damn, ltx got real boring real fast
>>
>>107796015
nigger ive done nothing else since it came out, and this is before loras. Once loras come out it will be crazy. Uncensored veo 3 at home is crazy
>>
So looking at some of the examples, some look decent, people are already training loras, and it (apparently) takes around 10 - 30 secs to generate a video. wondering if it's worth getting into ltx2

https://www.reddit.com/r/StableDiffusion/comments/1q6j2v7/trained_my_first_ltx2_lora_for_clair_obscur/
>>
File: z-image-fp_00083_.png (3.12 MB, 2048x1024)
>>107796007
the fact that this isnt base is the main reason why im put off baking new loras
who knows how well the lora baking process will roll over from turbo to base
>>
>>107796029
already? that is awesome
>>
File: 1739897611321617.png (44 KB, 933x303)
>>107796007
.04 or 0.4?
all of my onetrainer trainings look more or less like this, no matter how many images i have, caption or no caption, rank 16 or 32, lr 1e-4 or 3e-4, etc etc
validation will go up at some point, which is where i usually stop training.
but i honestly dont understand why it all looks the same
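
The "stop when validation goes up" habit above can be sketched roughly like this. It's a minimal illustration, not anything OneTrainer does itself: `pick_checkpoint` and `patience` are hypothetical names, and the loss values in the usage note are made up.

```python
def pick_checkpoint(val_losses, patience=2):
    """Return the index of the saved checkpoint with the lowest validation loss,
    stopping the search once the loss has failed to improve `patience` times in a row.

    val_losses: validation loss per saved checkpoint, in training order.
    patience:   hypothetical knob, not a OneTrainer setting.
    """
    if not val_losses:
        raise ValueError("need at least one validation loss")
    best_idx = 0
    bad_streak = 0
    for idx in range(1, len(val_losses)):
        if val_losses[idx] < val_losses[best_idx]:
            best_idx = idx      # new best checkpoint so far
            bad_streak = 0
        else:
            bad_streak += 1     # validation did not improve
            if bad_streak >= patience:
                break           # validation keeps rising: stop looking
    return best_idx
```

e.g. `pick_checkpoint([0.50, 0.42, 0.40, 0.41, 0.43])` picks index 2, the last checkpoint before validation started climbing. Samples still matter more than the number, so treat it as a first filter only.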
>>
Can you block swap ltx2? If so, how?
>>
>>107796044
two more weeks
>>
File: z-image_00059_.png (1.22 MB, 864x1280)
>>107795990
i had to start from scratch when i switched over from flux. i just use the z defaults and my old datasets with tags. i had all kinds of problems trying to use settings from flux training.

>>107796044
fair enough. it's definitely sinking time into a dead end.

>>107796055
yeah down to .4. i wouldn't focus so much on the numbers if you're happy with the outputs, especially with z-image training since it's all a hack anyway
>>
>>107796057
its automatic. Just use --reserve-vram 2 or so so it knows how much vram to keep for windows
>>
>>107796023
yeah uncensored veo 3 at home would be crazy
ltx looks god fucking awful though
>>
>>107796076
can you stop with this cope? it runs on 12GB cards now, just disable the downscaling and upscaling
>>
File: file.png (87 KB, 2248x374)
>>107796074
I'd rather do it myself.
>>
File: z-image-fp_00084_.png (3.17 MB, 1024x2048)
>>107796071
soon
>>
ltx2 better than wan 2.2?

I'm literally returning to the thread after being busy with other things for almost the entire year. Last time I left Hunyuan was still top dog in video generation.
>>
>>107796090
i can run wan just fine, nobody's coping
you're just blind on hype and failing to see how fucking awful this looks. you'll be ashamed of shit you posted in a few days.
>>
how to avoid static jpg output with ltx2?
>>
File: 1746235807427816.png (381 KB, 1189x977)
has anyone else noticed that people have generally started to accept that the release of the z-image base model was cancelled?
>>
>>107796108
nigger post me a wan gen that looks anywhere near as good as >>107794718
>>
>>107796029
training this model looks like its gonna be crazy easy then, awesome. Lets see if first nsfw lora drops today or tomorrow
>>
File: LTX-2_00003.png (491 KB, 640x640)
>>107795777
So this is the power of ltx, such quality I kneel
https://files.catbox.moe/8ikajr.mp4
>>
>Frame count must be divisible by 8 + 1.
LTX2 frame rate seems to be 25 in the comfy workflow, so why is that 25fps instead of 24 or 16fps if we want divisible by 8?

So to get 5 seconds of video, we'd need 121 frames (8 x 15 + 1 at 24fps), not 126 (125 + 1 at 25fps, and 125 isn't divisible by 8).
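
A quick way to check the arithmetic, assuming the rule really is "frame count = 8 * n + 1" as the quoted message says. The function names are made up for illustration and are not part of LTX-2 or ComfyUI.

```python
def is_valid_frame_count(frames, step=8):
    """True if frames has the form step * n + 1 with n >= 1."""
    return frames > step and (frames - 1) % step == 0


def nearest_valid_frame_count(seconds, fps, step=8):
    """Closest frame count of the form step * n + 1 for the requested duration."""
    target = seconds * fps                # frames you would want without the constraint
    n = max(1, round(target / step))      # nearest multiple of `step`
    return step * n + 1
```

With this, 5 seconds at 24fps gives 121, while 5 seconds at 25fps lands on 129 because 126 does not satisfy the rule.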
>>
ltxv 2k res looks GOOD
https://files.catbox.moe/uju19s.mp4
>>
>>107795988
i don't remember wan2.2 being this mushy
>>
>>107796147
lmao did you chain 10s gens? poor miku looking plasticer by the second
>>
>>107796105
Maybe in terms of speed and producing sound. Suppose it comes down to what you want to generate.

>>107796114
Good, the 'where base' spam was obnoxious.
>>
>>107796164
yes it decided to make the purple girl exist out of thin air and steal migu's spot
>>
reminder you can only get a good LTX outcome if you rent it out and generate on a b100
>>
>>107796159
24GB vramlets need not apply, this is a 5090 chad only model
>>
>>107796186
based fuck vramlets I hate them
>>
>>107796180
this took 3 mins on 5090, just stop being poor
>>107796159
>>
Proompt me this
1girl, standing, very beautiful
>>
File: file.png (201 KB, 480x281)
>>107796164
>>
>>107796158
the 8n+1 requirement is correct, so it should be 24fps. I don't know why the workflow goes with 25
>>
>>107796201
post workflow or you are lying
>>
>>107796212
grim
>>
File: 1746664318488354.png (2.45 MB, 1024x1472)
>>
>>107796201
i dont see how 5090 is a flex. you can rent that for cents on runpod lol
>>
File: 1739357649185899.png (2.17 MB, 1024x1472)
>>107796209
here saar
>>
>>107796147
>>107796164

Ah, the issue that plagues every long chained video generation, the sudden jank. Wonder if they'll make an SVI versi...

https://github.com/vita-epfl/Stable-Video-Infinity/issues/67
>>
>>107796226
he supported nvidia and you didn't. they deserve our money for providing this incredible tech
>>
>>107794552
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
why is this still in the OP? tran is supposed to be in a witness protection program and move to africa
>>
>guys ltx quality is amazing look at this barely moving scene with no organics
>oh i also wiped out metadata so you can't even verify if i lied about timings
lmao
>>
>>107796232
thank you saar very beautiful saar
darker skin would look beautifer
>>
>>107796218
its official inference code, not comfy
>>
File: .png (6 KB, 441x117)
>>107796201
>cloud gen
lmaoing my ass off
>>
File: ZiMG_00073_.png (1.5 MB, 1440x1280)
>>107796114
i never expected it to be released in the first place. i understand Chinese Culture
>>
>>107796266
>cheating
>>
File: 1740852708310885.jpg (25 KB, 555x246)
What is it for?
>>
>>107796268
we just making shit up now? nothing in that points to anything being cloud based? and are you so poor that someone owning a 5090 is that hard to believe?
>>
>>107796248
kek. ngl tho genning on runpod from my mac while cozied up in a blanket is comfy af
>>
>>107796280
if it was local there'd be a comment
>>
>>107796280
Stop lying bastard fuck you
>>
>>107796291
i'm probably gonna be confined to my laptop for a few months soon. why runpod instead of something like paperspace?
>>
File: 1745533334139119.png (2.02 MB, 1024x1472)
>>107796262
>darker skin would look beautifer
here u go sar please be of upvote for good looks bless ganesha for fortune
>>
>>107796297
what would it say? I'm on arch linux with strengthened kernel so I'm not on some shitty windows spyware, if that is something windows does
>>
all I fucking get are jpg with sounds
>>
>Im on arch linuix with strengthened kernel so im not on some shitty windows spyware if that is something windows does
severe mental illness
>>
>>107796308
try it yourself retard
https://github.com/Lightricks/LTX-2/blob/main/packages/ltx-pipelines/src/ltx_pipelines/ti2vid_two_stages.py
>>
>>107796212
kek
>>
>>107796317
comfy adds spyware to everything
>>
>>107796280
>>107796291
>1 minute apart
see nigga i knew you were full of shit. it's way too easy to tell a cloud gen from a local ltx gen
>>
>>107796335
I stopped getting complete still when I reduced the strength on the LTXVImgToVideoInPlace node. Still not getting a lot of movement though which is irritating.
>>
>>107796396
thanks I'll try that, for now half of what I get is OOM, the other is these jpg stills
>>
are people retards, ltxv looks great if you gen at high res without the down scale bs. What is this strange coping about it looking better than wan? Are people being paid by alibaba to shit on anything else?
>>
oh and actually use this https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/blob/main/ltx-2-19b-ic-lora-detailer.safetensors
>>
>>107796463
post comparison
>>
the lack of 1girls from ltx gens got me worried
>>
What is the best way to enhance breasts and genitals on anime characters in local gen? I know for video gen you use loras; are there similar loras in image gen specific for translating simplified boobs to more appealing boobs? Something that works like the automatic detailer for hands and faces from the rentry guide would be perfect.
>>
lol now I'm getting oom no matter the number of frames
>>
>>
>>107796502
here is ltxv, now you do a wan one
https://files.catbox.moe/3udh6d.mp4
>>
So how is LTX-2 for t2i?
>>
>>107796571
stop reposting crap from the bandoco discord, tranny
>>
Why does the guide in the OP not include LTX-2?
>>
>>107796605
I accept your concession
>>
>>107796613
its doa
>>
>>107796613
Too close to the character limit.
And no we have to mention the meme anime model that 3 autists use and those discord drama rentries MUST remain there, please understand.
>>
>>107796571
wheres the workflow?
>>
>>107796637
>Too close to the character limit.
remove the off-topic shit from the OP then maybe?
>>
>>107796105
Wan 2.2 is still better overall, the video quality is better. However ltx2 has some clear advantages: lip syncing is pretty good, it's 25fps natively, and it's faster. You could conceivably use each in different situations. Wan 2.2 for scenes with higher motion and ltx 2 for more static scenes with dialog.
>>
>>107796648
ltxv one, remove the downscale and upscale parts, use higher res with full non distill model at 30+ steps and 4 cfg
>>
testicles
>>
>>107796750
benchod
>>
you can do proper quality ltxv video with less vram, you just need enough ram https://files.catbox.moe/secyev.mp4
>>
>>107796663
>i won't share
fuck off then
>>
>>107796637
Stuff never usually gets added that quickly DESU. I guess ZImage did but it's an exception. Neta earned its spot over time too lol, IDK why you think it's the problem
>>
File: ComfyUI_06074.png (2.62 MB, 1280x2048)
>>107796637
...Just rip the band-aid off, they won't miss it!
>>
>>107796764
share your wf anon
>>
>>107796797
everything is here, it was done on a 3060 12GB
https://www.reddit.com/r/StableDiffusion/comments/1q6k2a3/definition_of_insanity_ltx_20_experience/
>>
File: file.png (3.64 MB, 1536x1536)
>>107796562
>>
>>107796787
why are you obsessed with this weird faced woman
>>
>>107796825
I don't think he included it, the video file doesn't have it
>>
>>107796860
>The workflow is the I2V Comfyui template one, including the models, the only change is VAE decode is LTXV Spatio Temporal Tiled Vae Decode and Sage Attention node.

even has the full prompt
>>
>>107796825
>The video took 16:21 minutes on a RTX 3060 12GB
>16:21 minutes

Thats a deal breaker sadly. Despite my 4070tis, I'll just wait for the speed boosts
>>
>>107796868
yeah I was looking into someone making their own without the 2 sampler stages overcomplicating things
>>
>>107796396
Oh, i'll also be trying that.
>>107796417
nta but try
python3.12 main.py --disable-pinned-memory --disable-smart-memory --lowvram --reserve-vram 1.0
that is what got it working on my piece of junk 3060 12GB, my issue was not OOM it was the way comfyui is so aggressive with memory swapping causing my system to just freeze up in the vae decode stage or some other stage.

it's also worth doing this if you use linux and are hitting the swap file/partition a lot
sudo sysctl vm.swappiness=10

add it to /etc/sysctl.d/99-swappiness.conf so its changed at boot
vm.swappiness = 10
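
If you'd rather script the config edit than do it by hand, here is a rough sketch. It only rewrites text in the `key = value` format shown above; `set_sysctl_value` is a made-up helper, and actually writing the result to /etc/sysctl.d/99-swappiness.conf (as root) is left to you.

```python
def set_sysctl_value(conf_text, key, value):
    """Return conf_text with `key = value` set.

    An existing entry for the key is replaced in place, duplicate entries
    are dropped, and comment lines are left untouched.
    """
    new_line = f"{key} = {value}"
    out = []
    replaced = False
    for line in conf_text.splitlines():
        stripped = line.strip()
        is_comment = stripped.startswith(("#", ";"))
        if not is_comment and stripped.split("=", 1)[0].strip() == key:
            if not replaced:
                out.append(new_line)
                replaced = True
            continue  # skip the old entry (and any duplicates)
        out.append(line)
    if not replaced:
        out.append(new_line)
    return "\n".join(out) + "\n"
```

The value currently in effect can be read from /proc/sys/vm/swappiness, which is handy for checking that the setting survived a reboot.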
>>
>>107796905
Does comfy contact the mothership?
>>
File: agmvlc.jpg (99 KB, 500x654)
are loras trained from dedistilled ZIT better? There's a noticeable increase of defects when using loras trained on the normal bf16
>>
Is there something better than Euler + Linear Quadratic 16+ steps for ZIT?
>>
File: 471507904.png (1.2 MB, 1344x768)
>>
File: 2.webm (1.26 MB, 768x768)
The anime woman eats soup
>>
File: 1080110282.png (1.7 MB, 896x1152)
>>107796936
Not sure. I trained a few on the adapter before ostris made the dedistill, since then I've only trained on the dedistill and they work fine, but not sure if they're better.
>>
>>107796931
according to many anons who bothered to check, yes it does, and multiple popular custom nodes do as well
>>
Where the FUCK is the Will Smith spaghetti video
>>
>>107797009
>multiple popular custom nodes do as well
fake
>>
>>107796936
Some say the dedistill is better, some say the adapter is better. See which one works better for you.
Dedistill worked better for me but both were shit ultimately and it wasn't a direct comparison (lots of other variables changed) and I am waiting for Base.
>>
>>107796870
4070 ti would prob be at least twice as fast if not three times
>>
File: amazing.webm (1.28 MB, 704x384)
>>107797020
man this is so good, i love ltx
>>
>>107797034
>Some say the dedistill is better
some are retarded then. Obviously full model is way better just a lot slower
>>
Any news on lodestone's Chroma Z suicide mission?
>>
>>107796931
yeah some custom nodes send telemetry

/home/anon/.config/Ultralytics/
settings.json

change
"sync": true,

to
"sync": false,

fucking cheek bastards it should be fucking opt-in
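
The settings change above can be scripted too. Minimal sketch, assuming the file is plain JSON with a top-level "sync" key as shown in this post; `disable_sync` is a hypothetical helper, not an Ultralytics API, and the path is whatever your install actually uses.

```python
import json
from pathlib import Path


def disable_sync(settings_path):
    """Set "sync" to false in a JSON settings file and return the updated dict."""
    path = Path(settings_path)
    settings = json.loads(path.read_text(encoding="utf-8"))
    settings["sync"] = False  # the only key this sketch touches
    path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return settings
```

Usage would be something like `disable_sync(Path.home() / ".config/Ultralytics/settings.json")`. Back the file up first, since this rewrites it with fresh indentation.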
>>
File: ComfyUI_00099_.png (1024 KB, 768x1216)
>>107796787
>>
>>107797057
sync does not send any data other than your version
>>
>>107797021
did you set the flag in ultralytics so it doesn't phone home yet?
>>
>>107797044
what the fuck are you talking about retard that looks like absolute shit
>>
>>107797064
says who?
>>
File: lighttricks.png (1.07 MB, 832x1216)
You didn't download the newest Mossad spyware, right anon?
>>
>>107797077
the fucking code
>>
don't worry goyim they're just sending the version nothing else
>>
File: 99.webm (1.26 MB, 704x384)
>>107797076
>>
>>107797056
It started training a few days ago and in the unlikely event that it doesn't turn out to be dogshit, it won't be in a usable state for a few weeks at minimum.
>>
>>107797109
qrd on chroma? why bother with it
>>
>>107797056
>hey Lodestone you should wait for Z Base and just finetune it with the Chroma dataset
>"nope I need to train now"
>okay well then, you should merge the dedistillation adapter into the model and then finetune it normally
>"absolutely not, I'm a genius and I can invent better ways to train it"
>alright, what's the plan then?
>"I'm gonna take the Turbo model, swap out the VAE, force the model to learn a whole new latent space, and make other architecture modifications, this way it will be even better"
>...
>"that'll be another $150k pls donate to my ko-fi"
it's gonna fucking suck
>>
>>107797127
you voted for this
>>
it works
https://files.catbox.moe/hrb25s.mp4
https://files.catbox.moe/p9zyxu.mp4
>>
>The female teacher dances and hums a happy song.
>30 steps
>euler simple
This works so well indeed.

https://files.catbox.moe/6nclzf.mp4
>>
File: 1744643411312574.png (2 KB, 153x26)
>telemetry
very cringe. blocked python from communicating with the net a long time ago
>>
File: LTX-2_00012_-1.mp4 (3.53 MB, 704x1024)
LTX2 is dogshit.

"a beer can is thrown in from out of the frame and the monkey catches the beer can with his hand and then opens the beer can open with his other hand and then begins to drink from the beer can and then lowers his hand holding the beer can down and cheers to the camera."

wan 2.2 can do this amazingly.
>>
>>107797152
prove it?
>>
>>107797117
>qrd on chroma
Flux but dedistilled, can do higher CFG and negative prompts, knows NSFW, furfag shit and a lot of other stuff.
But unfortunately trained by an ADHD furfag autist so it's completely schizo, unstable and unreliable. (The cope is that it's just a base model and it will be amazing once someone finetunes it. Coincidentally there is also another cope that it needs massive amount of data to make a proper finetune, how strange...)
It's also slow as shit.
Shilled sporadically by its cult on plebbit and there is one lesser schizo who occasionally goes crazy about it here.
>why bother with it
If you have an OP GPU like 5090 and don't mind playing a lot of seed lottery, looking at deformed slop, you can get good images from it, eventually.
High potential, low average returns.
>>
File: LTX-2_00012_-12.webm (1.27 MB, 640x384)
I'll be honest I think ltx is doa
>>
>local ltx is unusable
>cloud ltx is a 4/10
>grok is free and does everything ltx does lightyears ahead
>>
File: 1745533334139159.webm (512 KB, 1280x1408)
>>107797176
and wan can do this
>>
File: LTX-2_00014_-1.mp4 (3.77 MB, 1216x704)
"the camera pans out of the military humvee car window and reveals the humvee car driving in a city very fast to the right side as the camera follows the car and it drives through a crowd of rabbi jews and their bodies bounce off the car as the car keeps driving through the large crowd of rabbi jews with extreme violence and force."

Dogshit.
>>
>>107797194
no it isnt
>>
isn't it cool how we got this model which can do over 10s of video while wan 2.5 is API only?

also, 24fps base too, no more slow mo.

https://files.catbox.moe/rz0nck.mp4
>>
>>107797155
I never got chroma to work properly. No matter what I tried, the output was dogshit with artifacts, glitches, chromatic aberration and whatnot. I eventually gave up on it
>>
>>107797176
grok cannot do porn
>>
>>107797194
>humvee
lol
>>
File: 1747996101550913.png (23 KB, 1047x453)
for the anon using RES4LYF, how did you do it, I keep getting this error with ltx2
>>
File: file.png (690 KB, 1060x390)
>>107797218
who did it better?
>>
bros we need to pretend ltx2 is good so alibaba is pressured to openweights wan 2.6
>>
>>107797224
neither can ltx retard
>>
File: fuckingtextencoder.png (221 KB, 2082x1147)
>>107792671
Not that anon, but I did this and still getting the "no CLIP/text encoder weights in checkpoint, the text encoder model will not be loaded." error
>>
Someone got LTX 2 running on 8GB vram
https://www.reddit.com/r/comfyui/comments/1q5vxky/ltx2_on_rtx_3070_mobile_8gb_vram_amazing/

Guessing it was after all just a skill issue.
>>
>>107797219
Congrats, you did the correct thing anon.
You haven't wasted a lot of time trying to tard wrangle it or try comically overcomplicated furfag discord workflows like me, only to arrive at the same conclusion eventually.
I decided that I will need overwhelming evidence that it isn't shit before bothering with any lodestone model in the future.
>>
File: r.jpg (80 KB, 1488x832)
>>107797117
A Flux schnell [and for radiance a pixnerd pixel space model] variant NSFW finetune done by a single user.

Can do quite a few nsfw/anime/1girl/clothes and furry things other models can't, but it is also a fairly messy model, so you could have endless debates about whether it's good enough for whatever use this or that person had in mind.
>>
File: 1711876781486377.jpg (186 KB, 1080x811)
>start onetrainer and continue last training from backup like usual
>it's redownloading ZiT from scratch
I do not like this and wonder why it's doing this
>>
>>107797262
there is nothing it can do that illustrious doesn't do better and faster
>>
File: LTX-2_00018_-1.mp4 (3.72 MB, 720x720)
"the tiny frogman is leaping around in the puddle repeatedly multiple times during the rain very happily as the water splashes with each time he lands in the puddle and then a hunter in a similar style to the frog peaks out in the trees in the background with a rifle and aims at the tiny frogman still leaping around in the water and follows the frogmans movements with the rifle and then the hunter shoots the frogman as the frogman slumps over like a ragdoll in the water face down and the water around the frogman turns red gradually. "
>>
File: ComfyUI_temp_cnzpx_00041_.png (1.94 MB, 1120x1400)
>>107797219
>>107797257
sorry you're missing on the fun
>>
>>107797275
why not use the transformer override? just tell it your local file path
>>
>>107797275
On Win11 and edited environment variables? It sometimes shits itself
>>
>>107797278
>Slightly better text (if you are lucky with seed)
>Complex multisubject prompts that are impossible to do without regional conditioning and controlnets in illustrious (Need even more luck with seed for that.)
But yes I agree that it isn't worth it.
>>107797300
Z does that image a few times faster and would nail it first try.
>>
wan2gp chads and niggas, we eating good today :)
>>
>>107797246
get those files and the same gemma_3 file

otherwise update comfy in the folder and GUI if not updated
>>
>>107797308
I'm a retard and never got this to work, I just have to drop the bf16 and text encoder into the same folder and point it to it right? For some reason it kept refusing to load.
>>
File: ComfyUI_temp_cnzpx_00113_.png (1.58 MB, 1040x1480)
>>107797322
I love z-image too but it can't. also chroma (for now) has the same tools that flux has (redux, controlnet, unet modifiers, etc), and chroma has a more lewd/nsfw dataset so you can generate anything you like really
>>
>>107797322
>>Complex multisubject prompts that are impossible to do without regional conditioning and controlnets in illustrious (Need even more luck with seed for that.)
nta but proof?
>>
>>107797329
I did. Cloned the entire https://huggingface.co/google/gemma-3-12b-it-qat-q4_0-unquantized/tree/main and also tried the 23GB gemma_3_12B_it.safetensors file along with the other small files.

Downloaded the latest v0.8.0 but nothing is fixing this damned error.
>>
>>107797326
qrd
>>
>>107797300
I just got dogshit output with astronomical gen times and found no point in bothering, the output wasn't good enough to justify the waiting time
>>
>>107797347
oh wait a second, you're saying it completely redownloads the "Tongyi-MAI/Z-Image-Turbo" folder? the override is specifically only for the transformer as far as i know, so it still looks for TE etc in the Tongyi-MAI/Z-Image-Turbo even if you use a safetensor from somewhere else
>>
File: ComfyUI_temp_cnzpx_00117_.png (1.56 MB, 1040x1480)
>>
>>107797246
i get that error also, hmm... Right, i know that you can't clone them, you have to download them individually. put the safetensor file into its own sub-directory/folder inside of the text encoder directory
ComfyUI/models/text_encoders/gemma
put it all in there and restart comfyui and then load the clip from that in the clip loader.
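
A quick way to confirm the files actually landed where the post says is to list that folder. This is only a sketch: the `ComfyUI` root path and the `gemma` subfolder name are taken from the post above, and the helper name `list_text_encoder_files` is made up for illustration.

```python
from pathlib import Path

def list_text_encoder_files(comfy_root: str, subfolder: str = "gemma") -> list[str]:
    """Return the .safetensors file names under models/text_encoders/<subfolder>."""
    folder = Path(comfy_root) / "models" / "text_encoders" / subfolder
    if not folder.is_dir():
        return []  # folder missing: nothing for the loader to find here
    return sorted(p.name for p in folder.glob("*.safetensors"))

if __name__ == "__main__":
    # "ComfyUI" is an assumed install location; adjust to your own path.
    files = list_text_encoder_files("ComfyUI")
    print(files or "no .safetensors files found in ComfyUI/models/text_encoders/gemma")
```

If the list comes back empty, the files are in the wrong place. If it is not empty and the error persists, the problem is more likely the loader node or the file itself.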
>>
File: file.png (15 KB, 647x337)
fuck
>>
Finally a good collage, thank you anon >>107792352
>>
File: ComfyUI_temp_cnzpx_00118_.png (1.34 MB, 1040x1480)
>>107797382
I feel your pain anon, I was there too. chroma is a fucking schizo model and it doesn't help that its discord community shares crazy workflows too. there are too many chroma versions out there, but I have been using it since the weekly releases and I can tell you that the fp8 is good, and combined with the latest comfy it's just as fast as flux.1 now
>>
File: 1745531110605342.png (82 KB, 543x834)
>>107797370
I get messages like that when it loads stuff too, i'm also using the fp8 distil model, try that instead of full if you are using it maybe

this is my setup:
>>
>>107797395
>put the safetensor file into its own sub-directory/folder inside of the text encoder directory
>ComfyUI/models/text_encoders/gemma
>put it all in there and restart comfyui and then load the clip from that in the clip loader.

Yeah I downloaded each one. Also have it in a subfolder. Really frustrating that this isn't working correctly. I still get an output but I have to assume it isn't using the prompt.
>>
>>107797392
yep downloading everything from scratch for some reason
>>
i cut myself everytime one of my images isnt on the collage
>>
>>107797428
forgot workflow:

https://files.catbox.moe/7gvw08.json
>>
>>107797369
The proofs I have are fetishes I won't admit. (Not pedo stuff, for the glowies among us)
As much as I can say: Imagine two completely unrelated fetishes that never get paired on the booru. One takes place on the left side and the other on the right.
I have never seen an SDXL booru tune pull it off without slopping both together, even with regional conditioning.
Chroma nails it, with varying degrees of deformed anatomy, but they are kept separate and not slopped together.
>>
File: LTX-2_00024_-1.mp4 (3.65 MB, 704x960)
"the woman is squatting down as a man leaps in very fast from the side and dropkicks the woman in her back as she is violently shoved out of frame as the man leans down to the camera and shouts "no gooks allowed!!!""
>>
File: ComfyUI_temp_cnzpx_00119_.png (1.54 MB, 1040x1480)
>>107797456
DOA
>>
File: img_00299_.jpg (412 KB, 1368x1784)
>>107797437
>i cut myself
*another slice of cheesecake
>>
>>107797426
What changed? I am not seeing any commit regarding chroma inference speed in comfy last few weeks.
>>
>>107797326
Buy an ad.
>>
>>107797428
>fp8 distil model, try that instead of full if you are using it maybe
Yeah, I'm using that but still getting the "no CLIP/text encoder weights in checkpoint"
>>
>>107797430
oh man i think its because we've both downloaded some other safetensor file from another location, i have to go track that down.
>>
>>107796029
>civitai still did not create a section for ltx2
>>
>>107797437
I just let out a little autistic "nnnnNNNNNg" and then exhale sharply like I'm an angry bull. Then I pick up my foam mousepad and whip my desk with it, it makes a wicked loud noise but doesn't damage anything. Then I use Claude to gen a Haitian voodoo curse against OP and I read it aloud.
>>
>>107797430
>I still get an output but I have to assume it isn't using the prompt.
yeah it's not, because none of the videos i genned have motion. then i saw your post, looked back carefully at my comfyui output in the terminal, and it's not even loading the clip model.
>>
>>107797485
another user had a similar issue:

>If I remember correctly, this message:

No CLIP/text encoder weights in checkpoint; the text encoder model will not be loaded.

only occurs when the model is loaded from the node, but you are supposed to load the CLIP model with a separate node. This message is normal.

try a diff node or workflow for the encoder
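
One way to see what a given file actually contains is to read the safetensors header and count tensor names by prefix. A rough sketch, assuming a standard `.safetensors` file (8-byte little-endian header length followed by a JSON header); the helper name `tensor_prefixes` is made up, and which prefix a particular loader treats as text-encoder weights is not stated in this thread.

```python
import json
import struct
from collections import Counter

def tensor_prefixes(path: str, depth: int = 1) -> Counter:
    """Count tensor names in a .safetensors file by their leading dot-separated prefix."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))  # 8-byte little-endian header size
        header = json.loads(f.read(header_len))         # JSON map: tensor name -> dtype/shape/offsets
    names = [k for k in header if k != "__metadata__"]  # skip the optional metadata entry
    return Counter(".".join(n.split(".")[:depth]) for n in names)
```

Running `print(tensor_prefixes("your_file.safetensors"))` on the diffusion checkpoint and on the gemma file shows whether they hold different sets of weights, which would fit the explanation that the text encoder is meant to come from a separate loader.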
>>
>>107797498
>i have to go track that down.
I would appreciate it. I also tried gemma_3_12B_it_fp8_e4m3fn.safetensors but that didn't work either.
>>
File: zimg_00065.png (1.59 MB, 960x1280)
>>107797357
>>
>>107797326
Does wan2gp work on amd+linux? I need to gen some goon clips
>>
>>107797518
>you are supposed to load the CLIP model with a separate node
What other node should be used beyond " Gemma 3 Model Loader"?
>>
File: ComfyUI_temp_cnzpx_00121_.png (1.94 MB, 1600x1000)
>>107797476
dunno, but I updated comfy a few days ago and it handles fp8 much faster now, I even dropped all the gguf models I was using. I think it was because LTX-2 was close to being released and comfy was working closely with nvidia to get better speeds with fp8/fp4 models. I haven't even updated to comfy-kitchen yet

also https://huggingface.co/silveroxides/Chroma1-HD-fp8-scaled/tree/main got released a few days ago too, its only 9.1 GB, so its not a heavy model anymore
>>
>>107797430
wait are you trying to use the fp8 version that was posted on reddit? i think that might be the issue because that is one i've been trying to use just now and its giving me those errors.
>>
File: r.jpg (136 KB, 848x1488)
>>107797278
chroma has often more details related to clothes, jewelry, backgrounds and stuff, but with the tradeoffs (stuff it can't do) and the speed difference - yea, illustrious/noob are overall better for most people.
>>
File: r.jpg (154 KB, 848x1488)
>>
File: 1762966624275412.png (79 KB, 543x774)
>>107797537
all my workflow has at the left is these nodes, try a diff workflow cause that node is probably causing an issue
>>
>>107797520
bingo, i've found our problem. i'll try and see if i can find the huggingface page where i got that from last night.
>>
File: ComfyUI_temp_cnzpx_00122_.png (2.15 MB, 1600x1000)
>>
>>107797537
>What other node should be used beyond " Gemma 3 Model Loader"?
And I should note that I tried "LTXV Audio Text Encoder Loader" but that gave an error and I'm not doing anything with audio besides passing it through to the output video node.

>>107797549
>wait are you trying to use the fp8 version that was posted on reddit?
No. Tried all that were on the official LTX2 repo; ltx-2-19b-dev, ltx-2-19b-dev-fp8, ltx-2-19b-distilled, and ltx-2-19b-distilled-fp8.
>>
>>107797587
now do 2 girls. yeah, you can't, ltx can
>>
File: ComfyUI_temp_cnzpx_00123_.png (2.2 MB, 1600x1000)
>>
>>107797538
>I currently recommend using "small_rev3".
>small_rev3 inside do_not_use folder
It feels like a humiliation ritual to go through this autistic furtroon shit.
I am curious about what this fp8 speed up is about, it seems architecturally interesting even if the quality is still bad (and I likely won't benefit from it on my RTX 3000).
Do you know which paper this experiment stemmed from?
>>
>>107797373
it's a gradio webui for multiple ai video models and large image models released after sdxl. Got 51 seconds on my rtx5090+64gb
>>
>>107797622
but comfyui
>>
>>107797527
it should work, check the discord. the dev is active at the moment and the community support is very friendly.
>>
>>107797278
Try two custom subjects with consistent details in illustrious, i fucking dare you.
>>
>>107797640
Thank you
Let the gooning begin
>>
>>107797573
>try a diff workflow cause that node is probably causing an issue
What other node can load the Gemma 3? I'm using the LTX2 V2V_Detailer workflow. https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/LTX-2_V2V_Detailer.json
>>
>>107797598
well i got the same problem, i'm trying to figure it out. once I fix it I'll let you know what I did.
>>
File: 1740894668548836.jpg (1.2 MB, 1248x1824)
>>107797369
I dare you to make two OCs in illustrious interacting without comfy couple. You will get insane detail bleed, or the subjects won't be recognizable whatsoever.
Although i think neta yume is even better if you want anime AND several OCs.
>>
>>107797633
soulless corpos now. just going to die like invoke
>>
File: ComfyUI_temp_cnzpx_00126_.png (2.49 MB, 1600x1000)
>>107797621
yeah I know, just use fp8-mixedfinal. about the fp8 speed up, my experience has been anecdotal so far, but from reading about comfy-kitchen it seems they did something with nvidia
https://github.com/Comfy-Org/comfy-kitchen

I haven't migrated yet
>>
File: 1745210707526794.jpg (887 KB, 1248x1824)
>>107797667
...and here's neta yume
>>
>>107797674
Just as a reference what GPU are you on anon?
I am not exactly sold, but I will give it a try, just to see if there is any speed difference if not anything else.
Probably Saturday or Sunday, too busy right now to try it properly.
>>
File: zimg_00079.png (1.41 MB, 1280x960)
>>
They've added separate models as noted here
https://huggingface.co/Lightricks/LTX-2/discussions/11
that should help.
>>
>>107797642
show it in chroma
>>
>>107797653
use the workflow I linked and try i2v.
>>
File: zimg_00081.png (1.19 MB, 1280x960)
>>
>>107797701
>>107797667
can either do goon?
>>
File: illust.png (621 KB, 700x900)
>>107797701
>>107797667
*yawn*
>>
File: 00072-1282581050.jpg (240 KB, 1344x1728)
>>
>>107797764
theres a literal watermark bro, lmao
>>
>>107797518
yes that is what google gemini told me, now I'm trying to find what node can load gemma...
>>
>>107797829
>making shit up
i accept your concession
>>
File: 1748163114020594.png (31 KB, 331x73)
>>107797846
not even him but its right there
>>
>>107797864
>ai cant reproduce watermarks because i said so
lolmao
>>
>>107797864
>>107797829
you lost
>>
>>107797872
https://www.dreamstime.com/romantic-connection-anime-young-couple-head-held-gently-sweet-romance-romantic-connection-anime-young-couple-image411446948
>>
>>107797888
illustrious can goon you can't with your chroma and the other one
>>
>>107797840
I linked a workflow, save it and load it in comfy.
>>
File: 1646529615024.png (1.53 MB, 1024x1024)
>>107797674
Hmm well with v0.8.0 on a 5060ti I got 1.9it/s with svdq-fp4_r128-z-image-turbo vs. 1.5it/s with z_image_turbo_bf16 before. And its size is 4GB vs original 12GB, so.
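
For scale, the numbers in this post work out as follows. Just arithmetic on the figures quoted above, with nothing measured independently:

```python
fp4_its, bf16_its = 1.9, 1.5   # it/s figures from the post (5060ti, comfy v0.8.0)
fp4_gb, bf16_gb = 4, 12        # approximate file sizes from the post

speedup = fp4_its / bf16_its   # relative iteration speed
size_ratio = bf16_gb / fp4_gb  # how many times smaller the fp4 file is

print(f"speedup: {speedup:.2f}x, size: {size_ratio:.0f}x smaller")
# prints "speedup: 1.27x, size: 3x smaller"
```

So roughly a 27% speed gain for a file a third of the size, on that one card.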
>>
File: 1757767064009251.jpg (522 KB, 1248x1824)
>>107797749
Yes, both of them can do NSFW. Chroma is far better but has issues with anatomy, Neta tries but has very limited knowledge of poses.
>>107797764
This is not impressive at all, and i bet the dude's painted nails are leaking from the girl's prompt.
>>
>>107797922
>Neta tries but has very limited knowledge
cant lora fix that?

>has issues with anatomy
can't lora fix that?
>>
>>107797905
is this real?
>>
>>107797922
>dude's painted nails are leaking from the girl's prompt
bigot
>>
File: 1747342277083633.webm (3.94 MB, 464x688)
>>107797934
They can but barely anyone makes neta loras. Certainly easier than coping with noob/illust.
>>
>>107797901
I got it I think, just going to see if using the basic comfy checkpoint loader would actually work after dragging the file into the checkpoints folder.
>>
>>107797922
You can spin the gen gacha so quickly compared to using either of those models that it doesn't matter, you can just gen enough to get the right gen
>>
install this i guess https://github.com/Lightricks/ComfyUI-LTXVideo

but i thought it was comfy core thing now ugh ffs, what a fucking mess.
>>
File: 262982259.png (3.36 MB, 1664x1338)
>>107797219
>>107797257
>>107797322
I actually managed to get good results from Chroma, even trained a few loras and it captures the style better than ZIT, but it's so much slower that it's hard to justify using it.
>>
File: 1740761083453844.jpg (1.51 MB, 2048x2048)
>>107797972
It abso fucking lutely matters, you can roll illustrious gacha 100 times, and it will be worse than a single chroma prompt when it comes to character consistency with two custom subjects. This is an area where SDXL shows its age the most.
>>
>>107797009
So comfy is not really "local"?
>>
>>107798117
is any webslop that uses npm and pip really local?
>>
File: 985973512.png (1.16 MB, 1216x832)
>>
>>107798058
I don't mind the slow speed, but the stripes that just sometimes appear for no god damn reason annoy me the most
>>
>>107798140
Holly sexo which model!
>>
>>107798134
>is any webslop that uses npm and pip really local?
Which governments are monitoring you?
>>
File: 00097-1886089240.jpg (528 KB, 1664x2432)
Daily reminder that you don't need more than 1girl.
>>
Any updates to the no clip/text encoder shit?
>>
File: 4209558790.png (1.34 MB, 1152x896)
>>107798168
Chroma1-HD
>>
8k is really not that much if you think about it, RTX 6000 pro might actually be worth it.
>>
Bros... I think LTX2 might be bad...
>>
>>107798274
>24fps
>10s+ gens
>emotive
>can use audio + image or i2v
it's great.
>>
>>107798274
Only problem with it is how heavily censored it is, but other than that it's better than WAN 2.2 by every metric.
>>
>>107798307
it's censored, right?
>>
New thread:
>>107798332
>>
File: 1015842457.png (1.17 MB, 1152x896)
>>
>>107798335
>but other than that it's better than WAN 2.2 by every metric
yeah I'm not seeing that with the comfy workflow. can't get the workflow from ltx to run at all
>>
>>107798347
Nice
>>
>>107798243
Not yet, it's still giving me problems despite using that other anon's workflow method of loading.
>>
File: star wars chroma.jpg (110 KB, 800x782)
>>107797382
from time to time it hits well but for most part what you say is true.
speed is major issue.
has face gen issues as well,
but it has good prompt understanding.

overall nothing comes close to z-image right now.


