Previous thread >>5557207Dedicated Suno/Udio thread >>5590329Post AI generated stuff. Song covers, animations, etc.OC encouraged, but not required.This thread focuses on audio and video with an audio component.Let me know if you have more links to add. This thread is a work in progress.> Voice-to-Voicehttps://github.com/Mangio621/Mangio-RVC-Forkhttps://github.com/Vali-98/XTTS-RVC-UIhttps://github.com/voicepaw/so-vits-svc-fork> Text-To-Speechhttps://github.com/collabora/WhisperSpeechhttps://github.com/myshell-ai/OpenVoicehttps://github.com/yl4579/StyleTTS2https://github.com/BoltzmannEntropy/xtts2-uihttps://github.com/daswer123/xtts-webui (Warning: Windows version uses prebuilt binaries that anons haven't verified. Use at your own discretion)> Musichttps://github.com/facebookresearch/audiocrafthttps://rentry.org/AudioCraftRemix> Animation and Videohttps://haiper.aihttps://lumalabs.ai/dream-machinehttps://github.com/ToonCrafter/ToonCrafter> Audio CleanupUVR Walkthrough: https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit#heading=h.n8ac32fhltgghttps://github.com/Anjok07/ultimatevocalremoverguihttps://github.com/resemble-ai/resemble-enhancehttps://github.com/yinruiqing/pyannote-whisper> Related boards>>>/aco/asdg>>>/aco/csdg>>>/b/degen>>>/d/ddg>>>/e/edg>>>/g/sdg>>>/g/lmg>>>/g/aicg>>>/h/hdg>>>/trash/sdg>>>/u/sdg>>>/vg/aids>>>/vt/vtai
Additional reminder that there is a dedicated suno/udio thread due to high interest relative to other posts >>5590329Also, luma/haiper stuff is fine, but it's more interesting do something with it like this instead of posting the default 5 second clip with no sound.
>>5603777Alright the 1950s stuff was never that good and it's getting pretty fucking old now
>>5604100normies peak AI content. they get hard over the worst low effort shit
>>5604100Alright, alright. They are a dime a dozen on youtube. Was trying to find an interesting thumbnail for the OP to switch it up but you can't win 'em all I guess.
>>5603782Fucking kek!
>>5603777>trips of thread bakingDoes anyone have any spectacular results and/or prompts that are still (kinda) SFW? I've been trying to get a shot of a young adult redhead woman walking away from a burning/exploding building but so far all I got was trash.>>5604350this one is amazing
>>5604623>so far all I got was trashUsing what tool, specifically? Your advice will vary depending on what you're using.
>>5604626Oh, you are right! I just tried out Luma Dream Machine
>>5604628Somebody else will probably have better advice, but I think most people use a starter image, so generate what you want using Dall-E, Stable Diffusion, etc. and extend that static image using dream machine. If you're already doing that, don't know what to tell you. Seems like people are generally getting disappointing results lately because their servers are overloaded.
>>5603783watching this during a snowstorm sounds peak comfy
>>5603777Chrome Lords (1988) when?
runway gen 3 came out, but it's only text to video, and you need to pay $15 to gain access to it (and they don't explain how many seconds you get with the tokens, but with gen 2 you get 125 seconds, hopefully it's the same)I was hoping it was in the free tier but it's not, but I was not really interested in text to video anyways.
>>5605446my bad it's 60 seconds for gen 3.
>>5605446I wonder if anyone's made something fun with gen 3 yet. Just want to see how good it is. Their own promo demos don't count.
>>5603782Literally just pissed myself laughing at this. Like real piss.
Shogun, but its about Genghis Khan.
Is there any AI tool that can colorize a greyscale photo or painting? I know about Palette.fm, but I am looking for alternatives.
>>5606604i've used https://github.com/jantic/DeOldify and it works well
>>5606608This looks great, I will give it a go. Thank you!
>>5603782My sides lmao
Luma's got some room for improvement alright...Although I can't deny, I enjoy these creations.
>>5607387kino
>>5609221So this is the power of runway. Neat.
>>5607387>the double headless back spring, followed by the limb merge dismount, and she's vanished. Flawless performance
>>5607387>>5608075>>5610067I'd be interested in how this compares to shit people on drugs "see".Maybe "hallucination" is a much more fitting term than initially expected.
>>5610351It's really nothing like this>source: weed, edible weed, LSD, ecstacy, shrooms, ayahuascaBut it's an interesting idea that "the AI is hallucinating"
>>5610432Thanks for clearing that one up.4chan really is kind of a "mixture of experts".>But it's an interesting idea that "the AI is hallucinating"That idea has been floating around in its current interpretation since about late 2022 (GPT-3.5).https://en.wikipedia.org/wiki/Hallucination_(artificial_intelligence)Maybe there wasn't enough gymnastics in the training data for luma, but I don't know: there could also be other factors at play.
>>5610440I'm no AI engineer, but from what I can tell, the big challenge for video is temporal consistency and there are various approaches attempting to solve this. One approach is to basically smooth out between keyframes. Well, those keyframes may not be very consistent from one to the next, which becomes evident in a high motion situation like gymnastics.Take a look at the clips here >>5609221 and notice that even though there is a lot of temporal consistency, the details that do change appear to be blending like there are a few keyframes with in-betweening.I also have a suspicion that the short clip time isn't just because of the compute power, it's also due to its inability to keep it temporally consistent longer than that.
AI seems to want these guys to be sword fighting.
Has there really been no advancement in the speech-to-speech/voice cloning field?
>>5610067
>>5610440a lot of the time it really resembles dreaming, which is a sort of hallucination
>>5612349>>5610067lmao very nice
>>5609221mezmerizing
>>5612868Thinking about it, I clearly remember having had disappearance-after-occlusion events in dreams, which is something, luma produces at times as well.You realize, there is something missing/off, that something isn't working like it should have, but you simply don't "care" in a dream.Is current-day GAI maybe less like a conscious human brain and more like the brain's dream-production machinery?Which may, in the end, be more alike than one would initially think.
>>5604100commit supuku you dumb commie fuck
>>5606241>doesnt turn into a russianbull shit
>>5612990It's often said that dreams are a kind of synthesis of your experiences that your brain does while you're asleep. That's not far off from an AI recalling what it has been trained on.That said, neural networks don't "think" in the sense that we do. We use terms like neurons, hallucination, and dreaming because they're good shorthand, not because they really have potential to be human-like.
>>5603781what kind of workflow is involved in making something as long as that?
>>5614350>We use terms like neurons, hallucination, and dreaming because they're good shorthandSure, but...>not because they really have potential to be human-likewhat makes you think, humans are more powerful than a Turing Machine?
>>5614551The current tech that we call generative AI doesn't "think" nor will it be capable of developing consciousness. It's a bunch of matrix math. Maybe some system in the future can become sentient or intelligent, but despite sensational claims that we're approaching the singularity, we're not even close. We just have a bunch of math utilities that output an approximation of what we want to see.
>>5615327>>5615351I'm fuken' dyin'. Kek.
>>5615283>It's a bunch of matrix math.Yes, but what makes you think, the human is more powerful than "a bunch of matrix math"?I'd be extremely surprised, if there was anything more powerful than precisely "Turing completeness" in this universe, because there is exactly zero indication of that in anything mankind has been documenting and thinking.The "singularity"-talk is obviously nonsense - and reminiscent of the bullshit Marx and Co. were thinking during the industrialization -, but I don't think there is anything stopping a computing device from becoming effectively indistinguishable from a human.
>>5615624I'm not saying that Turing machines in and of themselves don't have potential. I'm just saying that the current building blocks of this wave of AI stuff is a far cry from actual intelligence in any real sense. It's all based on a common architecture that doesn't lend itself well to becoming general intelligence. It's not the only way to implement an AI, it's been done differently in the past and it'll be done differently in the future.>>5614415I didn't make it but there are watermarks all over the thing that give some hints. Haiper clips are all strung together in a video editor, Pika is used to do lip sync, with audio probably from Elevenlabs, and a Suno/Udio song is playing in the background. The Vinny text effect is just some standard video editing. You can do the lip sync stuff with open source tools like these:https://github.com/ajay-sainy/Wav2Lip-GFPGANhttps://github.com/Mozer/wav2lip
>>5617055>Ilsidorkek
>>5617079lol omg I didn't catch that. fixed.
>>5617109Hell it would probably be funnier if he just randomly fucked up names and events like he was having a senile moment.
>>5615351LOL, Way to close to reality. Tubba Tay Tay chocking on a Big Mac.
>>5617123lol that's pretty good. Elrond yapping like Biden about shit that happened thousands of years ago.
>>5615351This needs to be extended to 30 seconds for the full hamburger song.
>>5617140We've got Ilsidor, Godnor, and Argadorn! He was there!
>>5615351I hate it but I can't look away.
>>5616110>I'm not saying that Turing machines in and of themselves don't have potential.I'm saying, the universe and any subset of it, including any human, is probably effectively equivalent to a Turing machine in prowess (given unlimited time and storage).There is nothing a human can do, that in theory would be impossible to a computing device and vice-versa.Practically it's currently obviously a different story on planet Earth.
>>5604350>>5609045
>>5617966i fuckin love dubbing over these old amvs with the dbz characters they're about lol
>>5617966