Discussion of Free and Open Source Diffusion ModelsPrev: >>107791088https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2https://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2485296https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg
Now that we know for sure ltx is shit how about get back toHunyuanVideo-1.5
fish'n'chips
>>107792316kek
Blessed thread of frenship
>>107792316so you were just trolling then, got it
is the Kandinsky pron a joke? or is it with LoRA, asking for a fren
Should I use Stability Matrix??
>>107792347>>107792087
>>107792347that's real. when a slavic nigger says it's uncensored, it actually is
>>107792351nobody asked schizo
>>107792359>you can make 10 secs of porn if you let your 6000 bake for an hourmight as well hire a hooker, cheaper too
>>107792352if you think you are dumb and need handholding at the cost of disk space then sure
>ask anon for workflow>he actually gives me>get home and pop it on comfy>its the default template and obviously not what he used
This was genned in comfyui!
>>107792395>posts real photo>says it is aiboring
>>107792384ranfaggot doesn't code and he didn't make anistudio
Kijai fp8 Kandinsky pro works perfect with comfyui wf and CFG1/8steps... anon need to readhttps://huggingface.co/Kijai/Kandinsky5_comfy/tree/main/fp8_scaled/Pro/T2V
>>107792393And the person who defends its creator 24/7 (totally not its creator!) spams this thread and routinely gets his posts nuked, often 100 posts in individual threads.
>>107792305can someone give the full resolution pic?
>>107792384>>107792393>software made by a doxer>ANISTUDIO is literal malwareProofs?
>>107792399its clearly the default zit girl thougheverbeitdoe
>>107792406oh? they released distilled versions? Maybe speed wont be nearly as bad then. How are they?
>>107792416proof is the author is a retarded frenchoid sneething for years about comfy and derailing threads to shit on comfy/prop up his ui
>>107792367>>107792235
>>107792423do you have the death threat post lmao, that was fucking GUCCI, wwhat a loser
>>107792423doxxing implies he did drop info. this doesn't prove anything. do you have proof of anything you are claiming?
anon can post more NSFW from Kandinsky pro with Kijai FP8?
oh dear. ranschizo should probably leave forever for her safety. she will be missed
>>107792467>You don't need prooflmfao. way to come out as a clown
>>107792467seems to me there's some sort of agenda. are you perhaps falseflagging trying to make ani seem unhinged, ran?
>>107792470nta but even I hope you get hit by a bus. You are insufferable
>>107792482model anon?
Getting this when I try the ltx template, what do I do?
>>107792492Your mental gymnastics is malware for the brain
How do you turn off Comfy offloading to memory? It's leaving 7GB of my VRAM unused for some reason and completely tanking the performance.
>>107792497check if the height is connected? reconnect the height input?
>>107792495chroma dc 2k t2 sl4
works rather nice with qwen edit 2511 gens too:41 seconds for this gen, try this workflow:https://github.com/Lightricks/ComfyUI-LTXVideo/blob/master/example_workflows/LTX-2_I2V_Distilled_wLora.jsonvid: https://files.catbox.moe/kqtusp.mp4
>>107792532what the actual fuck is wrong with you?
people just let this pussy drama faggot put whatever he wants in the op. disappointed in you anons
>>107792543Mods only care if you upload or even talk about pedo stuff. They don’t care about threats of violence because it doesn’t hurt their advertiser dollars.
the black man says "man, I cant wait to do some fent and eat some mcdonalds." After he finishes speaking, he holds up a bag of white powder and says "shiiiiiiiiet!"kek, 58 seconds for 9 seconds with audiohttps://files.catbox.moe/e0thma.mp4
>>107792513I did and that fixed it, but now I see this one here, why is my shit all broken, this weird node is passing null values
It is of vital importance to the local diffusion community that newcomers know to stay safe from this security threat. When someone spends their entire time defending a no-name dev and also makes dox threats against other posters, you must NOT run their code on your computer.
awesome news for a ltxv lora traininghttps://www.reddit.com/r/StableDiffusion/comments/1q6asqd/ltx2_lora_training/apparently works on 5090, its fast and audio training works as well for voices
>>107792570try this workflow, it's fast and it fixes the stupid enhancer nodes that were fucking uphttps://files.catbox.moe/y6y37b.json
>kandinsky 20B took 10 mins for this https://files.catbox.moe/uljslp.mp4oof
>>107792580thanks king
lmao only thing I need is a better prompt than canned laughter: using the workflow from >>107792580 btw, 60 seconds, 9s/240 frames.https://files.catbox.moe/5r47u2.mp4
Something is fucked up with ltx2 for me, I get one gen off, then anything after that is taking forever to generate.
>>107792604not enough ram I guess. Use vram debug node to offload before whereever it hangs up for you. Though then you gota reload models which take forever 64GB is barely enough with all fp8
>>107792604try the workflow above, I have 16GB and 64GB ram and it works fine, no OOMalso try --reserve-vram 4 in startup settings/flags
>>107792612*this is with fp8 distilled btw, which works fine
>>107792611>>107792612I have a 5090 and 192gb of ram. There has to be a bug.
yea kandinsky 20B sucks, its prompt adherence is horrible and its ungodly slow
im impressed how emotive the gens are desu:https://files.catbox.moe/yomurf.mp4
>>107792622try --reserve-vram 4 to see if comfy is not trying to steal the last of your vram or something while something else is using it
>>107792622try updating comfy in the update folder and in the manager, has to be a bug of some type
>>107792512--highvram or --gpu-only startup arguments, but it'll use your gpu as the sole allocating device. You MAY get away with manually setting up offloading with distorch multigpu nodes but I'm not sure
>>107792636this is only advisable on rtx 6000 or H100
>>107792612>>107792617nta but i tried this and it doesnt work, get unet unexpected errors and "no CLIP/text encoder weights in checkpoint, the text encoder model will not be loaded", crashing comfy instantly, 50gigs in the trash lmao
which anime was this?
yea I see why people didn't both with kandinskyhttps://files.catbox.moe/cd0lbx.mp4
>>107792649get the gemma 3 it safetensors filethen, go here and get all the small files NOT the safetensors and put them in text encoder folder:https://huggingface.co/google/gemma-3-12b-it-qat-q4_0-unquantized/tree/mainI had an error till I did that, another anon suggested it, these are my files:
>>107792649your comfy is not up to date, git pull, activate the venv and pip install -r requirements in the comfy folder and then in the ltxv custom node folder
>>107792512jenny my beloved...
it's so fast once the kinks are worked out, 55 seconds, even on a 4080:https://files.catbox.moe/1joxow.mp4
seems like you can trick gemma to do sort of nsfw. We will need to make abiliterated gemma to work with it it seems.https://files.catbox.moe/1sql2m.mp4https://files.catbox.moe/8ydf0z.mp4https://files.catbox.moe/r6kz3e.mp4https://files.catbox.moe/lju7xd.mp4https://files.catbox.moe/jd1biy.mp4
>>107792675im on latest master and all requirements satisfied
>hitler speech audio with trumpSuch a cucked model, holy shit..>>107792633Seems that was it, I only updated through the folder earlier.
holy shit it actually worked. this model is crazy fast and actually works. wan 2.5 has a lot of work to do.https://files.catbox.moe/imb2yw.mp4
>>107792695and you have the correct text encoder, the full gemma folder of files with the safetensors inside of it? If so then something must be fucked. Remove all your other custom nodes, maybe one is conflicting. That has happened before
>>107792556>people just let this pussy drama faggot put whatever he wants in the opYou wanna know what I think? I don't give a fuck about the OP so long as it has the right name in the name field.
>>107792706sometimes it does a panning effect, I got that with the floyd stuff too now and then. it works.try prompting "the man is singing" for the audio.
>>107792717>he fell for it
>>107792712i didnt get the gemma jsons yet since i dont have a huggingface account at this moment
>>107792728ah, yea you need all that, then put the safetensors in it.
Don't get what I'm doing wrong, no matter what I do she doesn't want to lipsynchttps://files.catbox.moe/6gdphw.mp4
>>107792583It's t2v or I2V?
>>107792708anon help me pretty pleasehttps://files.catbox.moe/lvjye3.mp4
>>107792744try prompting "the girl is singing with passion"
>>107792766I2V
>>107792774for the audio input workflow, what worked for my floyd gens was "the man is singing (with passion/loudly/etc)
>>107792779I tried, but I think my toy is broken fuck https://files.catbox.moe/ohjpw5.mp4
>>107792792for audio workflow use the kijai one he posted:https://files.catbox.moe/f9fvjr.json
https://files.catbox.moe/7zs4se.mp4
>>107792809its funny that wan 2.5 was beaten to the punch by a faster model, I love wan 2.2 but sound is the missing element.
kekhttps://files.catbox.moe/7ifa8v.mp4
>>107792831is this real
These custom nodes are a bitch and comfyui manager won’t install them no matter what i do. Is there a place I can get them so I can manually install them.
>>107792831that is actually great lol
>>107792849use stability matrix
>>107792636>but it'll use your gpu as the sole allocating deviceSo you can't turn it off... goddammit! It's doing this for the VAE because it's roughly 400MB, but it's wasting so much fucking time setting that up when it could just dump that in VRAM (again, there's 7GB free!) and leave it there, you know, like it used to.Guess I'll just have to live with the extra 15-20s gen times because I don't think it's something he'll fix. FUCK!
>>107792855That is what I’m using.
>>107792831Look how dangerous this shit is, your average facebook boomer would eat it without thinking twice, that's why we can not allow something like this to work for porn
>>107792859comfyui manager should tell you the packages github if you click install missing nodes
>>107792862AI porn could make onlyfans obsolete, it's a net good for society.
>>107792884what do you think happens if you make ethos lose their jobs?
>>107792714>You wanna know what I think?no. anyone complacent with trannies shitting on their lap is not worth listening to
>>107792891sitting*
>>107792888they are forced to get jobs that aren't degrading for income.
>>107792891>>107792894*sharting
>>107792903yeah like there are plenty of these that pay as much
>>107792636>>107792856Highvram still loads into RAM, just using different logic. Give it a try.
lmaoa group of jewish rabbis wearing a yarmulke walk in from the right and grab the man in the middle wearing glasses. The rabbis say "THIS IS A SHOAH!" before dragging him off camera to the right.https://files.catbox.moe/gjbkzh.mp4
training a lora on a video with audio legit voice clones them, this is crazy.
>>107792931like better than eleven labs level I mean. the ltxv trainer uses deepspeed so it can work with low vram as well
>>107792927did you add the music?
>>107792574>29gb vram with int8 quantobro was training in italian
>>107792954no lol thats what made it even more funny
>>107792934deepfakes and scams are gonna fucking take on a whole new dimension. This will make new laws for sure
>troonlien threadshan't be postingshan't be using trannystudio
>>107792517How many Chroma versions are there? Which ones are actually good?
We have come so far.https://files.catbox.moe/rk3ul0.mp4 (loud)https://files.catbox.moe/svuige.webm (loud)Now we just need SVI tech to work with ltx2.
>>107792962you could easily get people fired with stuff like this as most normies dont know shit about AI.https://files.catbox.moe/g8hn6n.mp4
>>107792979all, new ones are just constantly being put out as its trained
>>107792974There's nothing to post anyway, it's just mindless ltx spam. I wonder if people who praise it are just some sort of shills or are just as blind and retarded as the migufloyd spammer.
>>107792996>fast>video and image>can do audio + image to video, or i2vbest model since qwen edit, of course it will be used/discussed.
>>107793001*correction, qwen edit, and zimage.
>>107792988dog?
>>107792979>Which ones are actually good?I use the ones I've trained loras for.
>>107793001Drop the sales pitch. Guess it's shills.
Saars, please stop dumping everything to catbox, use /wsg/ or /gif/ threads for dumping and post links
can ltx do this?
>>107793017no shilling, wan 2.2 is good but those kikes are holding wan 2.5 hostage for API only, and this is open source.
>>107793026>he thinks israelis won't sell out
>>107793026We have WAN 2.2.
Is it true that the developer of AniStudio runs a tranny grooming discord focused on children?
>>107793035proof?
apparently the cause of the smudginess when there are fast movements is cause the temporal upscaler is not being used but comfy does not have support for it yet
holy shit, it does a perfect trump with no lora.https://files.catbox.moe/q7bd6t.mp4
>>107793060that's literally perfect wow. ltx is just really good
>>107792921Nope. It OOM'd without doing anything on first launch and didn't get past the ksampler when I close the error window and hit run again. It did use all my VRAM that time though!
with wan 2.2 we needed 6-8 steps instead of 4 what's about ltx? 8 isn't enough that's where your shitty results come from
git pull comfy then set it up like this, temporal upscaler is needed along with the spatial upscaler, latents were not lining up correctly, that is causing the issues
can you tell if this is AI?https://files.catbox.moe/uxdz1z.mp4
temporal upscaler is herehttps://huggingface.co/Lightricks/LTX-2/blob/main/ltx-2-temporal-upscaler-x2-1.0.safetensors
>>107793133nah no fucking way. it's so real
>>107793022cricketsltxfags in shambles
>>107793124where node from
>>107793133this one has even better expressions:https://files.catbox.moe/sb1aqd.mp4
>>107793150Wowww okay I think we're onto something guys... this is revolutionary
>>107793150too much quality I kneel
>>107793133How is the audio so bad? I used some random website a year ago to generate AI audio clips with just 30 second voice samples and they sounded extremely realistic.
>>107793172turns out trying to make a single model do multiple things isn't smart
wise words, Frieren...https://files.catbox.moe/z3vech.mp4
>>107793180deep voice did the trick, imoeven though subs > dubshttps://files.catbox.moe/iqmy7j.mp4
>>107793180this is south park levels of animation
>>107793189something a bit more wholesome:https://files.catbox.moe/ahud98.mp4
you might have to manually pull ltxv nodes then reinstall requirements.txt for it for the spatial and temporal upscalershttps://github.com/Lightricks/ComfyUI-LTXVideo/commit/63c8a9285c5c17bcd19c7088da8a6597719d336e
Has anyone been able to do regular 2D cartoon animation with LTX-2 yet? I haven't seen any examples and I don't have time to test it myself right now.
>>107793227hold on things were broke >>107793225
>Vram shit the bed and have to send card in for RMAReeeeeeeeee
>>107792856Post that Jenny lora. We're family here.
well, now you can shit on other fanbases with i2v and audio.https://files.catbox.moe/93fci7.mp4
>>107793251so close, just need a good toss now.https://files.catbox.moe/inhzb2.mp4
>>>/wsg/6067201Finally got this shit working
>>107793185How do i make i2v with this
>>107793268why does it make it so cringe
>>107793270shutup benchod i deleted it
this shit is so fucking bad
fuck, didn't change samplerhttps://files.catbox.moe/2hohuv.json
>>107793292mf make it i2v
>>107793276anon you can literally do anything. like so:https://files.catbox.moe/dlcydq.mp4
>>107793292that should look MUCH better now. No more smudging from fast movement. ComfyuiWF did not use the temporal upscaler which has to be used alongside the spatial upscaler otherwise the latents dont line up
>>107793288say that again >>>/wsg/6067209
>>107793314>mfw ltx
>>107793323i fucking kneel, this is a masterpiece
>>107793331did you know...https://files.catbox.moe/8oojbb.mp4
>>107793342who these 13%
>>107793311you just plug in a image instead of a empty image
>>107793355benchod
kneel to uncensored AI, sora 2 can't do this:https://files.catbox.moe/g7xf14.mp4
>>107793372sora 2 can say it as long as you don't type it
Sorry to be that guy, but are there any examples of LTX2 image to video/audio or text to video/audio NSFW (realistic, not cartoon bullshit). I don't really use this website much. Can't be bothered setting up comfy etc but keen to see what the latest open source looks like these days. Any other boards good for this specifically?Kind regards
>>107793292why this shit not connected to anything
>>107793380you have to trick sora into doing edgy stuff, this is up to you (plus loras)also, same prompt but diff image: neat sky imohttps://files.catbox.moe/5zgepu.mp4
here fixed I2V https://files.catbox.moe/jwmw2j.json
>>107793403who are you
>>107793398god damn it, I should stop rushinghttps://files.catbox.moe/20yi3n.json
>>107793394It just came out, dumbo. It's a censored, baseline model. You need to wait for LoRA's to be trained.
lmao I forgot to adjust the image setting so it cropped jotaros head off and didnt prompt anything about jotaro.https://files.catbox.moe/45vulb.mp4
brehs this is it, we are inside the singularity now>>>/wsg/6067213
>>107793429benchod moment
son of a bitch the I2V still had a disconnected node https://files.catbox.moe/b3mwgz.json
>>107793452Stop being retarded.FUCKING STOP IT.
cozy bread
>>107793467
>>107793407nta but at a quick glance this is a wf with the temporal upscaler included too.
https://files.catbox.moe/l3wumz.mp4
>>107790806blessed thighs
Do you think Julien treats the proxy abuse as a business expense?
The jews... won?
i h8 this inconsistency so much, just randomized seeds everything else all teh same, even down to resolution and aspect ratio on the pics
Am I just wasting my time trying to get Wan 2.2 to do the Nazi salute? Is it the guardrails?
>>107793452kinda far from fastest model now
>>107793524Ask yourself how many videos of nazi salutes are out there. Then ask yourself how many do you think made it into Wan's dataset.This is something you need to specifically train into it via a LoRA.
>>107793452>Allocation on device This error means you ran out of memory on your GPU.do i need to pass some args to comfyui? sorry just woke up so i'm out of the loop. i have a 5090
>>107793526the latent was half as long as it was supposed to be, still very fast
it works very well, look at the mouth/lips/teeth:https://files.catbox.moe/8g1vgb.mp4
>>107793524"reaches arm out in front of him with fingers straight forward and palm down"
>>107793551--reserve-vram 4
>>107793551ah, the I2V WF might be too high res / length for your gpu, I had it really high
>>107793520Watch RAM and SSD usage when genning, I bet sometimes it correctly offloads everything, sometimes it spills to a pagefile
>>107793556that should do it will offloading as long as you have enough ram though yea, windows does not manage memory well
>>107793452why am i getting 16 s video with length 200
>>107793556i can try this, ty>>107793560yeah it was set to full HD. does LTX2 not support offloading like Wan? or will it offload when i pass that reserve vram 4
>>1077935566 or 8 is better
>>107793574Depends on the res though, right? 4 is enough for 1 megapixel up to like 250 frames/10 seconds @ 24gb vram
finally. need an applause lora though.https://files.catbox.moe/552o0c.mp4
I swear this is the final fix for the I2V one, I accidently only set one sampler to res_2 https://files.catbox.moe/h6v3f9.json
>>107793566200/12=16
he did it, he said the thing.https://files.catbox.moe/6qrc0x.mp4
>>107793584what's going on here mate? i think you still fucked up
>>107793584play with image strength though, it might need to be a little higher to stick to the given image but that also might hurt motion a bit
>>107793607>>107793584benchod fix your script or are you trolling?
>>107793607? do you not have the checkpoints in the checkpoint folder?
>>107793619>>107793622nvm i refreshed comfyui and it's fine now. ignore me
and roll credits:the blonde man driving the car says "I am LITERALLY me.". he then exits the car to the right and closes the door. 1980s synth pop music plays in the background.https://files.catbox.moe/38mcc8.mp4
>>107793624benchod fucker
>>107793624you could also just get rid of the combo and put it directly in the loader, I forgot I dont need that anymore
question for the elders, can ltx quality be fixed to be at least wan 2.2 level?
did comfy break auto increment? it's set to increment before generation but it just doesn't.
kneel to Todd:https://files.catbox.moe/y297tw.mp4
so what are the advantages of ltx2 over wan2.2 beside the sound part?
>>107793670fidelity, knowledge, speed, and audio is huge And training it will be MUCH easier since only 1 model
>>107793670FasterI2V and T2V in the same modelNo high noise/low noise split
>>107793670It can gen beyond 5 seconds out of the box, without SVI? Everything else looks worse, especially the video quality.
>>107793683>>107793681>fasterIt's not if you want same quality as wan
>schizobake
>>107793686oh that too, it does best at 15 secs but can do like 50 ive seen without losing too much coherence
>>107793627I don't care about audio, just wish we had this degree of control with wan 2.2
>>107793704this as well, its prompt following is crazy goodhttps://ltx.io/model/model-blog/prompting-guide-for-ltx-2
>>107793342can it do jp audio too or only english/chinese?
>>10779369450 steps with this is still faster than 50 with wan and looks better + audio
>>107793725ive seen people do Italian, Spanish, Russian, Japanese and Chinese
>>107793721havent tried yet
>>107793712>prompt following is crazy goodBtw, for best prompt following set CFG for FIRST STAGE to 4 ish instead of 1, 2nd stage is fine at 1
>>107793725but 50 steps with this is like 15 steps with wan
>>107793746speed? yes. quality, your lying
anyways goodbye
>>107793584OOM's on a 24GB, while anon's WF from one of the last threads can do the same rest, but twice as many frames on my setup and still not OOM.
>>107793686>Everything else looks worse, especially the video qualityThe quality is so much worse that it makes it not worth it. Are people just blind to how bad it is?The sound is awful as well, it's unnerving. Is prompt-adherence really worth it when shit looks fucking atrocious?
>>107793452Pro tip for this one, if the upscale is ooming you, switch off the meme res2 sampler. (you can just do more steps if you absolutely must)
you're a big guyhttps://files.catbox.moe/is9px7.mp4
>>107793786The sound, you can get around by genning the voice with a dedicated model like VibeVoice and using that as the reference, but the output quality... yeah, it's pretty fucking bad.
>>107793802>>107793786I don't think the quality is nearly as bad as what you are saying. A lot of portrait stuff looks downright real.
>>107793802you can get perfect cloned voices if you use the workflow with the audio input + image input.not related, this is just i2vhttps://files.catbox.moe/hxq5mv.mp4
>>107793584oom's, fuck nigga
>>107793826>perfect cloned voicesSo you're actually trolling, right?
Show me something cool
Are we pretending wan 2.2's slow motion 16fps videos look good now?
>>107793840You didn't see the Jensen Huang one?
>>107793840another anon did it too, or you can just clone a voice fast and use that as the audio input and the video will match the audio.
>>107793847No, but this one sucks all of the detail out of the input image. Skin goes ultra smooth, for example. It looks like liquid shit.
>>107793854heres a sample just with swapped audio:https://files.catbox.moe/1a4t0s.mp4
>>107793855sour grapes
>>107792305Why is AniStudio not in OP? It'sa local UI and should be in OP
Don't bite, anons. You're better than that.
>>107793876thissss, ltx is so good, vramlets are fucking seething. keep using your paycuck saarsshit loool
>>107793884it isnt even that demanding. I am using fp8 distilled with 16gb (4080) and 64 physical ram. faster than wan, also.
Questionable qualitywith static camera lorahttps://litter.catbox.moe/0b8nl1lzt51asy01.mp4
cia guy does persona:https://files.catbox.moe/872zwe.mp4
>>107793904is this ai?
>>107793925no its real
>>107793904and one more with miku: didn't prompt a female singer but there you go.https://files.catbox.moe/z7dj2x.mp4
>>107793931you saying it recognised the sex of the singer, determined that the image contained no subjects of the appropriate sex and so conjured one to fill the role?
>>107793584There's definitely some fuckery going on with that temporal upscaler. Obviously it's way more demanding on vram. Not a big deal. But no matter what length you put in it stretches the video to 10 seconds. A lot of weirdness going on there.Are they even supposed to be used at the exact same time together? You can't run one then the next?
>>107793939it works for anyone I just used a basic prompt so it probably inferred female singer.also, ltx2 knows Trump natively:https://files.catbox.moe/3j4wda.mp4
>>107793946use spatial, I havent had issues with vids of various frame lengths
>>>/wsg/6067234Is there no way to hook the upscalers to the KJ's audio reference workflow?
>>107793949this would trick 99% of boomers btw:https://files.catbox.moe/rz0nck.mp4
Yep he's trolling.
if I'm being honest. The best results come from just not fucking with the upscalers and genning natively. The time rounds out the same due to model loading and offloading anyway.
>>107793991true but that is a 4x is processing time, there may be a middle ground
>>107793946sorry, you need to increase the fps from 24 to 48, I forgot to change that in the WF
>>107794018Yeah, go fuck yourself.
>>107793980thank u mr president
how come sometimes it ooms, and sometimes it shuts down completely with no word
we need to save the price of memory guys:https://files.catbox.moe/r0drv0.mp4
ltx is fucking weird.
>>107794018Okay so it's basically interpolating the frames to avoid smearing? Does it really need to run on the same pass as the normal upscale then?
>>107794028that is legit the issue there
>nesting subgraphs inside other subgraphsKill yourself if you do this
>>107794018are you a troll or do you just not test your own workflow before posting 7 different variations of it on here?
yeah back to the 1girl plastic factory for me
this model is incredible.the ugandan man says "why are you no generatin one girls? are you gay?". he has a heavy ugandan accent.https://files.catbox.moe/4y5xau.mp4
>>107794057subgraphs are completely retarded>hurr duur let's just hide all the settings you want to change and constantly tinker with in a random maze of arbitrarily nested bolognesenice humiliation ritual
What's the best workflow around? I've tried two from here and they were both shit.
>>107794075I've noticed people using subgraphs as an excuse not to clean their shit up.
>>107794074Kekd. Low res image makes the shit audio quality not seem so out of place
American Jammer:https://files.catbox.moe/5ltpq5.mp4
Meh, audio on an easier to setup model is nothing burger, this is the same level of hype for SD 1.4 or something, and I'm bout 156,022 gens in, wake me up when it can do 1080p technicolor with sound without face glitches
>>107794112not even sora 2 can do that so I guess wait 100 years for nvidia to give us 1TB GPUs
chinkoid vram monster when
we are getting there, most of us really should be checking our time and use it wiser, none of this is considered fomo, I predicted the ram and gpu hysteria 18 months before it hit and now have 2 4090s and 384 gb ram, and im also just chilling waiting for something good, some of these are funny but I still won't whip out my setup to run them, though Ill hit up ZIT in a month or so, seems good, TLDR dont waste time, nothing here is fomo, unless you anticipate loras and models being deleted, then you are trapped forever
we will soon have the best of both worlds for some people, kijai is working on a video to audio WF so people could feed it wan2.2 videos
>>107794146>nothing here is fomoI think you're using that term wrong. Maybe you mean "nothing here is worth fomoing over."?
Chat if I don't generate an epic meme right the heck now I fear the humanity might go to waste
>>107794160true but I don't think those are at all corrolary
My main issue with LTX's lipsyncing is they really exaggerate the mouth and jaw like it's a cartoon, so it looks a bit uncanny for real life photos. Every time something is spoken it's done with the full extent of the mouth, no subtle movements. Some times it looks like the mouth+jaw is just too large to be real.
so can ltx do goon stuff yet?
>>107794216for some reason its far better when given audio to work with like https://www.reddit.com/r/StableDiffusion/comments/1q627xi/kijai_made_a_ltxv2_audio_image_to_video_workflow/
>>107794224Depends entirely on what you goon to. But for what I assume the majority goons to? No.
>>107794224Yes and no. Depends on what you're into. It'll run with whatever image you feed it, but it knows nothing about sexual actions yet
>>107794237k guess i'll wait for a few days until everything optimized
>>107794249You're looking at weeks to months until there's decent LoRA's.
>>107794254nah, theres a official trainer and people are already training AND its a 1 model setup so its 10x easier than wan2.2 to train https://www.reddit.com/r/StableDiffusion/comments/1q6asqd/ltx2_lora_training/
>>107793513What confuses me whoever is doing this does it for so long that he crashes for a full day only to start up again. I think this is a combo of both pastebin schizoa
>>107794314no shit?
last 1 girl mayhaps?
>>107794333chroma + lora?
>>107794327Many anons forget, they think it's only the dev. The other one is even more schizo and is happy to see the second one become like him. You know who used to be sort of normal and he's becoming a clone of the first one.
So if the provided training code has deepsneed It can use multi gpu out of the box to train right?
>no bakeAI is kill
>>107794333im sorry I said LAST 1 girl
kinda underwhelmed by ltx2 for now, but imma wait a few weeks and reserve my judgement until people figure out workflows and train some loras, etc.i remember early Wan, people were giving up, before we had the speedup loras, cause you had to wait 40 minutes to generate a 5 second video
>>107794362we're still page4
>>107794333Yes, chroma, ever since comfy optimized the fp8 performance of comfy + chroma fp8, it has been running really good for me.I don't know why people don't like it, its basically flux nsfw uncensored, wish the furry author kept training it or someone made a proper finetune for it
no cap this is too good, ltx is amazing
>>107794362mfw we find out that it was just the two schizos baking in tandem all along and no one else has the willpower
Anyone else never even read the OP and just get annoyed when two schizos get into arguments over some formatting or some shit.
>>107793563afterburner graphs showed ram & vram being 99% maxed at one point so it definitely must have spilled into the pagefile. weird but it just went away after using the reboot function from within comfyui. same launch parameters and everything and its back to normal speeds
>>107794395yep, i've never clicked a single link from the OP, the only valuable thing is the collage image
my wife
>>107794379I have>Chroma1-HD-fp8mixed-finaldownloaded, but I forgot to try it. Have to give it a go later
>>107794411mutated feet yummy
>>107794379he moved on to z image it seems
>>107794420mutated feet? here you go!
>>107794432
With temporal upscalehttps://files.catbox.moe/4xmhy8.mp4Spatial upscale onlyhttps://files.catbox.moe/sapzgh.mp4No upscale at all. https://files.catbox.moe/40s59z.mp4Who wins?
>>107794444no upscale of course is gonna be better but its also 4x as expensive. The point of it is to make it faster
more poses for this thot
>>107794453I don't think the time savings are there. There were all pretty comparable. But the temporal upscale resulting in fucky movement, huge uptick in vram use and weird pacing that wasn't fixable even by doubling the framerate.
>>107794476hmm, maybe its still broken in how comfy is handling it then cause that is the entire point of downscaling 0.5 and using the upscaler for the 2nd stage
>>107794495Probably. I don't think there is anything wrong with the upscaler itself. But I don't think it's being used right.
Had to reinstall comfy and now I don't have the free vram button anymore, help pls, top is how it used to be, bottom is now
https://files.catbox.moe/5v9yju.mp4
>>107793249I don't know... would you keep it classy?
>>107794516Reinstall comfyui manager
when ready migrate>>107794552>>107794552>>107794552
>>107792395what class is she
>>107796010healer
>>107796026how does the healing procedure look?
>>107796036she points her pointy staff at the recipient, chants a spell and light/energy begins to flow
>>107796053thank (you) and (her)