Discussion of Free and Open Source Text-to-Image/Video Models and UIPrev: >>106884374https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://huggingface.co/neta-art/Neta-Luminahttps://civitai.com/models/1790792?modelVersionId=2203741https://neta-lumina-style.tz03.xyz/>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
When is ostrix gonna add ramtorch to other models?
>>106891139what happened to the 16 channel vae for sdxl?
>>106891143best gen in literal months
>>106891150he has the lodestone syndrome, he can't stay on a single project for too long, once a new shinny thing appears, he jumps from the previous boat to this one
New UI is going to be sick.ComfyUI truly is the best. Nothing else comes close. Such talented individuals involved. I kneel.
>>106891176is there anything ComfyUI can't do!?
>>106891182be a good ui is one
>>106891176lipstick on a pig vibes
tinker tranis out in full force I see
>>106891176>hides the CPU and memory usagehmmmmm
>>106891176I think we should start posting more Comfy news every thread from now on. Everyone uses it, so it's important information pertaining to local diffusion.
>>106891223is it some sort of humiliation ritual or something? I'd rather not see the impending enshitification fall straight from the asshole
>>106891219That is not part of core.>>106891241Don't reply it to it. Anyone using the term "local diffusoin" is a troll
~desu THIS
This will blow your mind... to smithereens.
>Anyone using the term "local diffusoin" is a troll>anon misspelled it for some reason>/ldg/ - Local Diffusion General
nobody calls genning diffusion, but nice try
>>106891277chat what did they mean by thisare they implying our entire general is a troll
>>1068912764got my image
>>106891277
so I set the 2.2 lora to 3 strength and the 2.1 lora to 3 strength and got chaos.
>>106891337damn, poor George got turned into a pile of ashes kek
>>106884426https://files.catbox.moe/dkr9yn.mp4
>>1068913376 strength for both!okay, too much.
if all my .safetensors files are saved on an NTFS drive, can I access them via Linux and still gen at the same speed? Or will it be shot and slow as all hell
>>1068913731 strength high (2.2), 1 low:the anime girl picks up the black man on the floor and throws him into the sky.
>>106891366He was begging for it with that gen lel
>>106891448very aesthetic
>>106891447What is it? Doesn't load for me.
>>106891493Catbox is down out of nowhere, will probably be back soon. But I bet you can imagine what it is
update comfyui to insta cancel video gens
>>106891554we can thank that anon that shared the insta cancel custom node, it forced comfy add it officially as well (even though it's a basic feature that should've been added years ago but heh... that's another story)
>>106891575Does that fox asshole lurk here still? T-thanks, I guess.
>>106891453missing a finger.
>>106891606retard. fuck off
>>106891606wat?
repost.AAAAHAHAHAHAHAHAHAHAHhttps://github.com/leejet/stable-diffusion.cpp/issues/396>closed>not fixedstable-diffusion.cpp does NOT use clip_l.>>106891601lmao great acting.
>>106891631Yeah, there it is again, the fingers, man. Freaky.
This means I can't use stable-diffusion.cpp. I rely on being able to prooompt clip_l
the anime girl fires her pistol, causing the man on the right to fall down. she waves hello.warning shot! get out sam.
AHHHHHH I HATE THE ANTICHRIST (comfyui)
>>106891670did you see his interview with Tucker? the dude looks emaciated, sleep-deprived and seriously disturbed. I dont touch chatgpt
What is the proper way of doing video continuation without being an independently glued i2v mess with no proper continuity?My mind is ripe with degenerate goon ideas for genning multi-staged stuff where each section of the video has its own prompt and lora. It has potential to make me abandon real porn forever but its sucking ass the way I'm doing on wan2gp
>>106891670
>>106891730it's obvious the guy is a sociopath, but I think they all big CEOs are, you have to sell your soul to get to that position, and some are more than ok to do that, since they didn't have much soul to begin with
miku kick!
>>106891782That's that Mike guy with the blue hair.
[high and squeaky voice]>"I'm here to kick ass and fuck bitches!"
>>106891176I don't understand? I don't see anything that's actually different in terms of ease of use.Will the inpaint workflow feature will have canvas functionality with layers, bounding boxes like Krita or InvokeAI? Or can I use multiple adetailers more intuitively, or multiple ControlNets without bloating with nodes and wires?Or is this only good news for people who spam MikuTests, Radiance shills, or those who just gen "1girl, crunching, pointing at viewer" Netaslop? So they can have a prettier, more attractive UI to press the generate button and spam more?
AHHHHH FORGIVE ME FATHER FOR I HAVE INSTALLED COMFYUI
the man holds up a game box saying "SKYRIM 2" with a knight in armor holding a sword on the cover, and smiles.
>>106891176>log outHuh?
>>106891840
>>106891554and various enhancements to amd gpus
Wansisters, do we finally have long video? Safetensors loras seem to be already available. I cant try them now, gotta go workhttps://github.com/vita-epfl/Stable-Video-Infinityhttps://huggingface.co/vita-video-gen/svi-model/tree/main/version-1.0
>>106891647simpsons never had 5 fingers
>>106889520The only one who needs to be in charge of every model ever created is Lodestone. Literally just takes one guy with limited compute to destroy every image model out there, and he did that on a dated Flux architecture with censorship on top of that. Imagine if you gave him unlimited compute like some of these huge corps have, we'd have uncensored video models that would make us laugh at Sora.
>>106891915>gotta go workfuck off sphere earth shill
>>106891919Sounds demonic.>>106891923He's about to fall lol
>>106891636>>closedyou can see the merge before the close
>>106891923lol is this a bait or something? there's no way someone can glaze lodememe that much right?
>>106891935it was created by a freemason
>>106891820none of that.
>>106891939psst(it's him)
>>106891923Now that the dust has settled, why did Chroma get so much hate? There is literally no better photoreal model for the details. It's like every hating Plebbitor was acting like those SDXL photoreal models were SOTA or could follow prompts or something.
>>106891926Another attention trip seeker to block, thanks
The retards at comfyui made an installer that doesn't actually work with amd.How are they so retarded? They have amd in the installer, but it doesn't work.
>>106892025this. it doesn't work. They are retards.pip install comfy-clicomfy install
>>106892025>>106892033lmfao
>>106891923>>106891990>why did Chroma get so much hatePeople who need/want it to fail are upsetti spaghetti
the anime girl is sitting at a computer desk with a white CRT monitor.qwen edit + osaka
>>106892053Ever try it?
>>106892085the anime girl is sitting at a computer desk with a white CRT monitor. keep her expression the same.
I would like a straight diffuser. I'm not a homosexual.
>>106891990I don't hate it and think it's the best NSFW realism model, but it's not without it's flaws and they might've been avoidable if the furry had listened to what some of the more knowledgeable guys were saying.
Reinstalling torch. This will work >:|>>106892123Chroma is easily the best painterly paintings generator yet.
>>106891915>unique prompt for multi minutes videoAm I reading that right, it's one prompt for like 8min Tom & Jerry cartoon?>wan i2vGood, tired of everything made for t2v only.>wan 2.1Hope it works with 2.2.
change the text in the white box to "what the fuck happened to Pokemon, man?"
>>106892164>what the fuck happened to Pokemon, man?they make turds and have their games sold really well, you have to blame the consoomers, they're the ones tolerating this, they vote with their wallet and by buying those games on masse they send the message that it's all right
>>106892164change the text in the white box to "We make any shit and it sells, that's why the games are bad.". change the location to a grass field.kek
>>106892164Why is she dressed like a hooker?
>>106892182How's life in Afghanistan?
change the text in the white box to "The devs are really bad, they never even try to improve.". change the location to an office in japan.holy shit they should just use AI to make their backgrounds.
pip is stupid. It's redownloading the same file.
> mfw he's been doing this for a month straight> not even creative prompts> literally the same slop every thread> "guys look what qwen can do"> yeah we know, you've posted it 500 times> realize he's just shilling qwen edit>probably works for alibabaThanks /ldg/ cointainment thread for corpo ads.
>>106892187Fertile. How's your "life", in terms of tfr?
>>106892226use uv
>>106892234I don't know what tfr is, but I'm pretty happy.
>>106892227god forbids people experimenting and having fun with local models in a local model general
>>106892227Missing the other two musketeers, the Radiance and Neta shill and we have the bingo!
>>106892241I'll ask your grandchildren.
>>106892227qwen image edit, huh? That sounds like hot shit.
>>106892247there's experimenting and there's spamming 20 times slight variations of the original image, unfortunately for turboautists like you, you can't see the difference, and I can't blame you, you are born with that brain you can't unwire that shit
I prefer someone shilling a local model they like with gens to complaining about local models. Thread space is unlimited. You have a scroll wheel. Use it. Comment on gens you do like.
>>106892257Sure.
>>106892272>I don't care about the quality of the poststhis mindset is what made /sdg/ what is it today btw
>>106892269it's better than the nth drama about meta stuff, like we are doing right now, or the same schizo rant about comfyui eating babies and torturing kittens
>>106892283Or, I have a different metric of "quality" than you do. As in, my metric of quality is not "things I personally like" and is instead "furthers a discussion on local models in general." You are not the quality police. You are not a mod. You are an anon, the same as everyone else, and I bet you there are anons who think your gens are also not High Quality. Learn to play nice.
my doro is augmented.
>>106892299>furthers a discussion on local models in general.oh yeah, spamming the same image 20 times sure furthers a discussion on local models in generalTake your (You) kind sir, you deserved it
I am... disappointed so far. The older lightx2v seems to be better. Maybe need to balance strengths and find a sweet spot. Needs more testing.
>>106892307>yeah, spamming the same image 20 times sure furthers a discussion on local models in generalSignificantly more than any of your posts in this conversation, yes.
>>106892311hard to say how 0 can beat 0 but go on king
>>106892310You're correct in terms of which animation is actually better, but that lower punch is genuinely hilarious.
>>106892310yeah, it's definitely a bust, I'm really dissapointed
>>106892319he only used 0.0001% of his power on it, it was enough.
>>106892326that's one punch man, if he went for more power her head would've left her body
>>106891907:O
>>106892301
>>106892123Don't get me wrong, it's not perfect, but neither is any other model we have. This is perfectly fine for cooming even with its imperfections. Well, I guess a lot of people can't really infer it at decent speeds to get different seeds which is why they complain about issues.
>>106892273They said they don't exist lmao>lifehue hue hue
>>106892310what strengths on the high and low for the older lora?
>comfui is 2x as fast as stable-diffusion.cppuh
Anyone with a 5090 can try 4/4 steps wan in 720p/129 frames and time it?I can gen in 11-12min using res2m.
>>106892339I think what probably set people off is the percieved lost potential. In a way, it's kind of like the reaction to GPT-5, where the hype of what it could be made the reveal of what it actually was that much more disappointing. Obviously OpenAI continued to shoot themselves in the foot after that, but if they hadn't promised so much ("AGI is Here!") then there would be less backlash. Likewise, as the previous anon said, there were multiple moments where choice A vs B made Chroma less than what it could be. Personally, I loved the aesthetics, even if as a ramlet it took me ages to make an image. But if that image is completely schizo, I can't really do much but play seed gacha or feed it to IPAdapter/i2i.
>>106892333
>>106892365High 3Low 1
what causes the first frame in wan to be super blurry (then the rest is fine) in i2v?
>>106892372>I think they finally moved on from transformersif google has found another better architecture, we're soo back, Sora 2 at home soon babyyyyyyyy
>>106892375I thought vulkan would be worse. that's good news
>>106892375false hope. bad wf.
>>106892332>>106891907>update for amd enhancement>s/it increasesnow this is very cool
>>106892372can i finally make my indie game now as a no-coder?
>>106892428rip
the man is wearing the blue hatpretty cool
>>106892421Or... :Oit really is 2x as fast???11 seconds / it with res multistep 1024x1024, cfg >1Normally this would be 20 seconds, I think...welp
>>106892458is there an offical comfyui wf for qwen image edit?I AM NOT GAY
>>106892479yes there is even a 2509 specific one just search qwen in templates
>>106892070I guess some people just can't have fun.>>106892384Makes sense as well. However, the model continues to exceed my expectations with what it can do. Though someone with different needs might see it differently. Hopefully the bigASP dev could further refine the model.
>>106891990It really is capable of producing top tier results, but you have really investigate it and learn its quirks, otherwise it happily spits out slop, not to mention the best version (2k) was dumped on Civitai with no elaboration. Its a PR problem
>>106892508If only we could prompt things like deformed hands like that.
>>106892508>Hopefully the bigASP dev could further refine the model.Every model is kinda bad when it first comes out, right? I'm hoping this is a more "SDXL" situation than an "SD2/3" situation. You can clearly see tons of potential when it works right.
>>1068925102k???also
>>106892508>pircelI guess some people just can't have standards.
>>106892528lmao this wf
>>106892539I'm naming this style "slopcore"
>>106892310you are using the older lora for both low and high?
>>106892553Yes.
>>106892539Euler being nice and normal.
fwiw, I don't think stable-diffusion.cpp needs to support clip_l, since Chroma doesn't use it. But it should change its name to Chroma.cpp
>>106892572https://github.com/leejet/stable-diffusion.cpp/pull/397fixed it for u
>https://zhuang2002.github.io/FlashVSR/Non snake oil super resolution is here.
>>106892567So Chemo never learned how to do middle fingers?
what scheduler should i be using with chroma flash? currently using simple
>>106892580kek
>>106892595Now you just have to fix Chemo to support clip_l again.
>>106892620I took chemo for u but now i dying :(
>>106892609I'm using beta...>>106892608he passes, tbqfwy
>>106892625Can I have your computer when you die?
>>1068926021girls rise again
the character in image1 is wearing the outfit of the anime girl in image2. keep their expression the same.neat
>>106892508
>>106892510>not to mention the best version (2k) was dumped on Civitai with no elaboration.Eh? HD Flash is still best for me. But I guess there is a slight learning curve.
>>106892664put on some socks
>>106892650
chroma is the best realism model?
>>106892687chroma pre v30 is the best realism model
>>106892679
>>106892667
>>106892692I already told you anon, this is a skill issue. Chroma HD is the best Chroma model, and HD Flash best refinement of that (albeit losing some style variety, but it might be my shit prompting).
>>106892692Are there non-quant ggufs of those?
>>106892704thanks
>>106892146Fuck knows, says 2.2 is todo. Looks like 4 loras? I cant try it out until later, some brave anon will have to test them, surprised no one else is talking about it>>106892580lmao, gold
>>106892710and I already told you anon, your brain is a skill issue, you can't admit that lodestone has slopped the model when he decided to make it work for fewer steps starting for v30, but one day you'll get out of this cult, I believe it, I don't think you're that dumb
>>106892690doing this in 3d turned out pretty nice
Yeah, I don't think we can expect any more big speedups for my 6950xt. I'm at max watt utilization
>>106892710i think i agree with you anon. nothing better than chroma HD
sanna, no!
>>106892710>Chroma HDusingChroma1-HD-dc-fl-BF16.ggufworks with cfg=1
>>106892776
Kek
>>106892801air niggas
i think i also agree, that chroma hd is just clearly the only good local model for realism
>>106892822lol
>>106892301nice augmented doro>>106892601samples look pretty good, yes. did you use it on your own videos yet?
we talking about realism? oh chroma HD is the only one worth even trying
>>106892710>Chroma HD is the best Chroma mode>>106892783>nothing better than chroma HD>>106892860>chroma HD is the only one worth even trying
yeah i wouldn't even dream about using anything other than Chroma HD (tm) for generating realism with local diffusion
>>106892664Not bad, Qwen Edit is getting close.>>106892783Wan gets coherence better, but y'know anon, Chroma can do some cool stuff.
IdleAttack 1Attack 2RunGuardEvadeTaking DamageAt Low HPIncapacitated>TriumphFlourishThe new LoRa seems to does occlusion better and has higher object consistency at the expense of fast movements. Increasing strength beyond 1 on high noise introduce weird fogging, so sadly balancing high/low isn't an option.
>>106892882what about adding one first step without the light lora on high?
>>106892882I tried the 2.2 kijai lora on high and low, 1 str for both:the man holding the bag jumps out the window of the office.
>>106892878my point (if you can't tell from the sarcasm) is that it very clearly is not the best model. >>106892872this is the only actual chroma gen i'm posting and it only really works as security cam footage
>>106892899next ill try with rCM low. pretty sure 2.2 high isnt meant to be used with low, but it somewhat works.
would a 3070 8gb be trash for this stuff?I just remembered I have one laying around.
>>106892907you can use sdxl, not much else
>>106891493>>106891447>>106884426>>106891366 up again
>>106892906and with rCM lowyep, this is a winning combo.
>>106892907it's not ideal but it'll work for most stuffvery slow for most videogen but you can even do that (idk, run some while you do other stuff)
new wan lightx2v high lora weight 1rcm wan low lora weight ???
>>106892915you can use almost all models if you can trade system RAMit'll raise your imagegen times to other people's times to do 5s WAN videos or w/e but it can work pretty well
>>106892919>ACK
>>106892940beautiful dancer
>>1068929241 and 1 seems fine
>>106892922>>106892940could I run qwen edit with it? I'm curious how it would compare to my 16gb amd card
>>106892946a nuclear explosion hits the buildings in the background outside the office. the characters fall on the ground.cinematic.
i still get slow motion with the new high lora, like at the end of this vidor maybe it's not slow and it's just me?
>>106892954thanks anon
>>106892940working a few days until you can buy a decent card would be a much better use of time
the man in the blue shirt walks to the computer on the left, sits in the chair, and starts typing.neat, it worked.setup: 2.2 kijai lora (from today), rCM lora from kijai (same huggingface), 1 str for both, 6 steps (3/3).
>>106892905We can both agree that Chroma is not perfect, but none of the models you posted are uncensored, nor is their photorealism anywhere near Chroma.
>>106892999use image2 node with a meme image and say "change the faces in image1 to the appearance of image2"
>>106892951yea. chroma radiance absolutely has good 1girl aesthetics - though of course it still has flaws it shows why everyone should train the boorus.>>106892960i haven't practically tried it with your hardware, but i believe so.e.g. use the gguf q8 quant with the comfyui-multigpu *gguf*distorch2* model loader where you can tell the node to offload 13GB or whatever is exactly needed to system RAM.or even just use a smaller quant.
Undressing lora for wan:https://civitai.com/models/2044832/wan2-undressing?modelVersionId=2314347Get it before it gets banned.
>>106893013I think it just struggles with all the characters and mental illness involved. Picrel original if you want to have a shot at it.
>>106892823we. wuz
>>106892831saved lol
My setup was working fine. Then I installed ComfyUI_Comfyroll_CustomNodes. Now startup doesnt finish. Comfy opens a blank screen, and the loading circle rotates endlessly.The log doesnt say anything out of the ordinary, and does not give an import failed error.As soon as I delete the ComfyUI_Comfyroll_CustomNodes folder, my setup goes back to working.Any ideas?
oh shit.
>>106892918Thanks anon. Looks nice.
>>106893009you got me there. no way to generate nsfw content with any of these models.https://files.catbox.moe/27k6tn.pnghttps://files.catbox.moe/2ce3mq.pnghttps://files.catbox.moe/7ovjsk.png
>>106893067Ensure ComfyRoll does not have additional install requirements on its github page. If it does, follow the additional instructions. If it doesn't, try switching versions.
>>106893106>ComfyRollwhat is it?
>>106893131nigger get some friends
>>106892983just live twice as good as the average one wage / one housewife household in the 1900s and take the other 8x+ per capita productivity gains for gpu, it's that easy
>>106893135I have made a lot of progress in this area. I'm trying to choose a quant.
>>106893131bloat nodes
>>106892801
the white car drifts around a corner in Tokyo, creating lots of smoke on the tires.
>>106893157doesn't drift hard, but doros hard
The new LoRas are cucked with less gore. I think I'll conclude my personal testing. It's not worth it.
>>106891606kek
I'll fix the hands in post
me when the project page for new thing doesn't have a comfyui workflow json on it
>>106893281>comfyuistay asleep
>>106893009i will concede that if you fuck around with chroma for long enough (and add loras) it can be really creative
>>106892990This is gonna be the Simpsons in 30 years.>>106893211Have you tried adjusting the strength to match the old one? A little strength seems to go a long way with the new one, it might actually add that blood splash back in.
qwen image edit...
I'm trying to gen on Fedora KDE with my 10 VRAM, Qwen is crashing on VAEdecode every time. It was working fine on Win10. What do
>>106893507>Qwen is crashing on VAEdecode every time.go for vae decode tilted
>>106893517does that take a lot longer? im thinking it might be wayland causing GPU conflicts. Grok suggests I should use an older nvidia driver for the 30-series. Sigh
>>106893527>does that take a lot longer?nah it's not that much longer
>>106893507>i'm trying anything on Fedorafound your issue
>>106893556wtf am i supposed to do then? I am not making a windows account
>>106893563Why would you do that?
>>106893586Now put a silly hat on her
>>106891820>"1girl, crunching, pointing at viewer" Netaslopthats my jam
Apparently Grok Imagine videogen can be jailbroken, as I have seen people making undress videos of celebs through I2V by only prompting for pasties or censor barsNot gonna lie, the motion and consistency from that shit is better than what we currently got with Wan. Imagine the possibilities if it was open...I wonder if Wan 2.5 (which is uncertain if it will be open-sourced) is comparable to it
>>106893563>I am not making a windows accountyou dont have to, you dumb fucking monkey.
>>106893721you will eventually
>>106893740>you will eventuallyi assume so as well, but you dont have to right now. just keep a partition with windows so you can gen if you cant figure it out on linux.
Can I take a group of nodes and cosmetically turn them into a single node to clean up the workflow?
>>106893863Ctrl+left click on the nodes you want to merge, right-click on one of them and select [Convert to Group Node].
>>106893740you will with that attitude
>>106893884Oh wow, it retains all the parameters, very useful. Is it possible to just make it cosmetic to save space/make it less chaotic for a new user?
Well I got Qwen to work on Linux but only after changing desktop settings to 1080p/60hz. Sad
Been using sd.webui for a bit now and it does all that I need.I'm kinda stuck on what to do when it freezes. It'll occasionally do that and I can't seem to fix it besides closing out the CMD window and hitting run.bat again.Refreshing just puts any new gens "In Queue" but it'll never finish.Hitting "Restart WebUI" turns off the WebUI but it doesn't restart...The main issue is I access WebUI remotely so when it freezes I have to screenshare to my PC to do all this
>>106891601Pool's Closed
babe open wide its time for your slop
Welp I found a permanent fix: add --cpu-vae so the CPU takes care of business instead. I won't have to go back to Windows after all. Such relief
>>106893920Click on the grey dot/circle next to the node title, it will minimize the node and make it as small as possible.
>>106891915Needs kjboss integration into comfy, looks promising though
>>106894023These seem to already be safetensors, they would need further converting? https://huggingface.co/vita-video-gen/svi-model/tree/main/version-1.0
>>106894215There's inference code on the github, its not just a plug n play lora. It'll need a custom node.
>>106894228No indication of vram requirements and how it scales with length either. Does it work in batches, keeping a number of previous frames memory for context so there's no abrupt motion shifts? Fuck knows
>>106894228Hmm, suppose its just a case of if kj or anyone else will. There's not much buzz about it, so cant see it happening anytime soon. Similar to (I think) nvidia released a long video model or something a few weeks ago?
>>106894263>Does it work in batches, keeping a number of previous frames memory for context so there's no abrupt motion shifts?Seems to be precisely what it does with this:>--num_motion_framesYou can set how many previous to use as context for the next clip it generates after 81 frames. I guess there'd be a sweet spot somewhere that keeps the motion coherent between segments without bloating VRAM too high, I guess it snips off the same amount of frames at the end. Set it to use 16 frames for context, the video it generates is 65 frames, and it keeps chaining them together. It looks like you can set individual prompts per segment too.
>>106894478>Chunk 1:>Input: A single static image.>Generation: pipe() runs and generates a full 81 frames.>Stitching: The code executes video_list += video[:-16]. This takes the first 65 frames (Frame 1 to 65) and adds them to our final video.>Next Context: The code executes rand_ref_frame_final = video[-16:]. The last 16 frames (Frame 66 to 81) are saved to be used as the input for the next chunk.>Chunk 2:>Input: The 16-frame clip from the end of Chunk 1 (Frames 66-81).>Generation: pipe() takes this motion context and generates a new, full-length clip of 81 frames that seamlessly continues the action.>Stitching: The code again executes video_list += video[:-16]. It appends the first 65 frames of this new clip to our final video.>Next Context: The last 16 frames of the new clip are saved for Chunk 3.The VRAM requirements should be the same as regular WAN, since it's always generating 81 frames. It should be compatible with LoRA's too. Seems almost too good to be true
>genning porn while animating 3d porn for a livingWhat a time to be alive. Only so much gooning I can do in between breaks.
B-bros, w-where you at?
>>106894945im here but im busy watching the latest animeslop during paid work hours (wfh chad).have this korbo
i cant believe the first gen in comfyui still doesnt allocate vram properly and takes x3 times to finish because of it before continuing normally from then on
>>106895004The reason the first gen takes longer is because it takes time to load the model. Then when the model is loaded all subsequent images are fast.
>>106895004There's also remnants left in VRAM even if you try to unload. You need to restart Comfy to fully purge it. Really noticable when switching between two high VRAM requirement workflows, you can get OOM's while restarting between each makes them load and run perfectly.
>>106895040no nigger i specifically said allocation, it overallocates like double the memory over 24gb and spills into 16gb extra ram before on the next gen it takes 18gb to gen the entire next video
>tfw going from 720p to 800p fixed the broken faces completely>oom at 860p
>>106895064>24gb(vram)i have 24+128gb and dynamic pagefile that comfy uses to allocate additional 100+gb to btw when genning videos, i have to set blockswap to 31 for it to not oom during the first q8 wan 2.2 gen anyway and then later change it after the first gen finishespicrel is what happens and was talked about before here, first gen overfils vram and spill into ram despite having enough space to fit, it takes take to finish, and then the next gen allocates the proper amount of vram and works from then on
>>106895108offload some of the model to the ram dudehttps://github.com/pollockjj/ComfyUI-MultiGPU
>>106895045yeah, the unload all models node fixes some of those but there is still a problem
>>106892682tummy
>>106895109why would it need the pagefile with 128GB of RAM available? Wan isn't that big.
>>106895179because comfy memory allocation is a meme
>>1068951093090?
>>106895197yes
>>106895139Already am, my dude. I am stocked up on loras.
>>106894952Korbo?Isn’t it Holo?
>>106893454fuck now i have to go watch the annotated series again
>>106895164an ugly, child onei prefer adult somewhat muscular with lineae semilunaris visible
>>106895216>he doesnt know
>>106895208If you are using --fast flag, try --fast fp16_accumulation fp8_matrix_mult cublas_ops instead. There was a commit a month ago that added convolution autotuning but it fucks up on our 3090s holding VRAM during the first run causing either a hang or extreme slowdown on WanImageToVideo node or VAE Decode.
>>106895231kys faggot
any way to fix the face in wanimate? looks slopped, not really like the reference image face
>>106895257yeah i know about that too, i got fucked by that update which made this problem x10 worse, but with now with just fp16 accumulation or without --fast at all im still getting this problem which is older that that addition to --fast
>>106895322bro just buy a blackwell gpu, you arent poor, right?
>>106895333which also wont be used to its fullest since comfyui will overallocate no matter what
>>106895339i mean yeah my rtx 6000 pro has plenty of vram free, so it doesnt matter if it overallocates. stop being poor
>>106895245This steered me to a tumbler PowerPointFking hell It’s Horo btw
>>106895355it's holo the wise bitch wolf, korbo is the meme name since she has garbage calligraphy
>>106895355>inteGIRL(X) Doubt
>>106895348>buys a component to be cucked out of using it fullygood goy
>>106895361True, but it’s still HoroL = RForever +1 no takebacksies no reset even when you go home
Horo
>>106895887>>106895887>>106895887
>>106895355who writes "r" as just a vertical line? It clearly says Holo.
Is there any way to get a NovelAI-like system of giving individual characters a tag, as well as the scene with it's own tags? I love the system NovelAI has but I'm a broke-ass bum and can only use local models like Stable Diffusion.