/wsg/ - Worksafe GIF






Previous thread >>5557207
Dedicated Suno/Udio thread >>5590329
Post AI generated stuff. Song covers, animations, etc.
OC encouraged, but not required.
This thread focuses on audio and video with an audio component.
Let me know if you have more links to add. This thread is a work in progress.

> Voice-to-Voice
https://github.com/Mangio621/Mangio-RVC-Fork
https://github.com/Vali-98/XTTS-RVC-UI
https://github.com/voicepaw/so-vits-svc-fork

> Text-To-Speech
https://github.com/collabora/WhisperSpeech
https://github.com/myshell-ai/OpenVoice
https://github.com/yl4579/StyleTTS2
https://github.com/BoltzmannEntropy/xtts2-ui
https://github.com/daswer123/xtts-webui (Warning: Windows version uses prebuilt binaries that anons haven't verified. Use at your own discretion)

> Music
https://github.com/facebookresearch/audiocraft
https://rentry.org/AudioCraftRemix

> Animation and Video
https://haiper.ai
https://lumalabs.ai/dream-machine
https://github.com/ToonCrafter/ToonCrafter

> Audio Cleanup
UVR Walkthrough: https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit#heading=h.n8ac32fhltgg
https://github.com/Anjok07/ultimatevocalremovergui
https://github.com/resemble-ai/resemble-enhance
https://github.com/yinruiqing/pyannote-whisper

> Related boards
>>>/aco/asdg
>>>/aco/csdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/g/sdg
>>>/g/lmg
>>>/g/aicg
>>>/h/hdg
>>>/trash/sdg
>>>/u/sdg
>>>/vg/aids
>>>/vt/vtai
>>
File: vinny.webm (4.85 MB, 960x540)
Additional reminder that there is a dedicated suno/udio thread due to high interest relative to other posts >>5590329
Also, Luma/Haiper stuff is fine, but it's more interesting to do something with it like this instead of posting the default 5 second clip with no sound.
>>
>>
>>
>>5603777
Alright the 1950s stuff was never that good and it's getting pretty fucking old now
>>
>>5604100
normies peak AI content. they get hard over the worst low effort shit
>>
>>5604100
Alright, alright. They are a dime a dozen on youtube. Was trying to find an interesting thumbnail for the OP to switch it up but you can't win 'em all I guess.
>>
>>5603782
Fucking kek!
>>
>>5603777
>trips of thread baking

Does anyone have any spectacular results and/or prompts that are still (kinda) SFW? I've been trying to get a shot of a young adult redhead woman walking away from a burning/exploding building but so far all I got was trash.

>>5604350
this one is amazing
>>
>>5604623
>so far all I got was trash
Using what tool, specifically? The advice will vary depending on what you're using.
>>
>>5604626
Oh, you are right! I just tried out Luma Dream Machine
>>
>>5604628
Somebody else will probably have better advice, but I think most people use a starter image, so generate what you want using Dall-E, Stable Diffusion, etc. and extend that static image using dream machine. If you're already doing that, don't know what to tell you. Seems like people are generally getting disappointing results lately because their servers are overloaded.
>>
>>5603783
watching this during a snowstorm sounds peak comfy
>>
File: 1697407588568.webm (1.24 MB, 470x646)
>>
>>5603777
Chrome Lords (1988) when?
>>
Runway Gen-3 came out, but it's text-to-video only and you need to pay $15 for access. They don't explain how many seconds you get with the tokens; with Gen-2 you get 125 seconds, so hopefully it's the same.
I was hoping it would be in the free tier, but it's not. I wasn't really interested in text-to-video anyway.
>>
>>5605446
my bad it's 60 seconds for gen 3.
>>
File: GlimpseOfHell.webm (3.75 MB, 1080x1080)
>>
>>5605446
I wonder if anyone's made something fun with gen 3 yet. Just want to see how good it is. Their own promo demos don't count.
>>
File: 1713572628930782.webm (2.35 MB, 678x688)
>>
>>5603782
Literally just pissed myself laughing at this. Like real piss.
>>
>>
File: MongolianFilm.webm (2.62 MB, 1024x576)
Shogun, but it's about Genghis Khan.
>>
Is there any AI tool that can colorize a greyscale photo or painting? I know about Palette.fm, but I am looking for alternatives.
>>
>>5606604
i've used https://github.com/jantic/DeOldify and it works well
>>
>>5606608
This looks great, I will give it a go. Thank you!
>>
>>5603782
My sides lmao
>>
File: luma_gymnastics.webm (4.06 MB, 640x364)
Luma's got some room for improvement alright...
Although I can't deny, I enjoy these creations.
>>
>>5607387
kino
>>
File: mint-fantome-fuck-yeah.webm (4.25 MB, 1066x720)
>>
>>
File: Gen-3 Hardcore AI.webm (3.84 MB, 800x450)
>>
File: I have 458 ducks.webm (1.3 MB, 1024x1024)
>>
>>5609221
So this is the power of runway. Neat.
>>
>>5607387
>the double headless back spring, followed by the limb merge dismount, and she's vanished. Flawless performance
>>
>>5607387
>>5608075
>>5610067
I'd be interested in how this compares to shit people on drugs "see".
Maybe "hallucination" is a much more fitting term than initially expected.
>>
>>5610351
It's really nothing like this
>source: weed, edible weed, LSD, ecstasy, shrooms, ayahuasca

But it's an interesting idea that "the AI is hallucinating"
>>
>>5610432
Thanks for clearing that one up.
4chan really is kind of a "mixture of experts".

>But it's an interesting idea that "the AI is hallucinating"
That idea has been floating around in its current interpretation since about late 2022 (GPT-3.5).
https://en.wikipedia.org/wiki/Hallucination_(artificial_intelligence)
Maybe there wasn't enough gymnastics in the training data for luma, but I don't know: there could also be other factors at play.
>>
>>5610440
I'm no AI engineer, but from what I can tell, the big challenge for video is temporal consistency and there are various approaches attempting to solve this. One approach is to basically smooth out between keyframes. Well, those keyframes may not be very consistent from one to the next, which becomes evident in a high motion situation like gymnastics.
Take a look at the clips here >>5609221 and notice that even though there is a lot of temporal consistency, the details that do change appear to be blending like there are a few keyframes with in-betweening.
I also have a suspicion that the short clip time isn't just because of the compute power, it's also due to its inability to keep it temporally consistent longer than that.
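
To make the in-betweening point concrete, here's a toy sketch. It is not how Luma or Runway actually work (I don't know their internals); it just assumes the simplest possible in-betweening, a linear cross-fade between two keyframes. The `inbetween` function and the 1x4 "frames" are made up for illustration.

```python
def inbetween(key_a, key_b, steps):
    """Linearly blend two keyframes (flat lists of pixel values)
    into `steps` intermediate frames."""
    frames = []
    for i in range(1, steps + 1):
        t = i / (steps + 1)  # blend weight, strictly between 0 and 1
        frames.append([(1 - t) * a + t * b for a, b in zip(key_a, key_b)])
    return frames


# Two toy 1x4 keyframes: a bright pixel that jumps from index 0 to index 3.
key_a = [1.0, 0.0, 0.0, 0.0]
key_b = [0.0, 0.0, 0.0, 1.0]

middle = inbetween(key_a, key_b, 1)[0]
print(middle)  # [0.5, 0.0, 0.0, 0.5]
```

The bright pixel never travels through indices 1 and 2. It fades out in one spot while fading in at another, which is the same kind of ghosting and merging you see when keyframes disagree during fast motion.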
>>
>>
File: 456456456433.webm (1.89 MB, 680x376)
AI seems to want these guys to be sword fighting.
>>
Has there really been no advancement in the speech-to-speech/voice cloning field?
>>
File: AI angel.webm (987 KB, 1024x1024)
>>
File: AI gfs at the beach.webm (694 KB, 1024x1024)
>>
File: close call.webm (146 KB, 852x480)
>>
File: marichka.webm (463 KB, 864x1168)
>>
File: 24703.webm (221 KB, 538x809)
>>
File: 1720027226842319-.webm (450 KB, 640x364)
>>5610067
>>
File: 1720443583068405.webm (1.2 MB, 1500x1000)
>>
>>5610440
a lot of the time it really resembles dreaming, which is a sort of hallucination
>>
>>5612349
>>5610067
lmao very nice
>>
>>5609221
mesmerizing
>>
>>5612868
Thinking about it, I clearly remember disappearance-after-occlusion events in dreams, which is something Luma produces at times as well.
You realize something is missing or off, that something isn't working like it should, but in a dream you simply don't "care".
Is current-day generative AI maybe less like a conscious human brain and more like the brain's dream-production machinery?
Which may, in the end, be more alike than one would initially think.
>>
>>5604100
commit supuku you dumb commie fuck
>>
>>5606241
>doesnt turn into a russian
bull shit
>>
>>5612990
It's often said that dreams are a kind of synthesis of your experiences that your brain does while you're asleep. That's not far off from an AI recalling what it has been trained on.
That said, neural networks don't "think" in the sense that we do. We use terms like neurons, hallucination, and dreaming because they're good shorthand, not because they really have potential to be human-like.
>>
>>5603781
what kind of workflow is involved in making something as long as that?
>>
>>5614350
>We use terms like neurons, hallucination, and dreaming because they're good shorthand
Sure, but...
>not because they really have potential to be human-like
What makes you think humans are more powerful than a Turing machine?
>>
File: 45654645687587.webm (1.74 MB, 432x584)
>>
>>5614551
The current tech that we call generative AI doesn't "think" nor will it be capable of developing consciousness. It's a bunch of matrix math. Maybe some system in the future can become sentient or intelligent, but despite sensational claims that we're approaching the singularity, we're not even close. We just have a bunch of math utilities that output an approximation of what we want to see.
>>
File: 64564562.webm (5.87 MB, 726x402)
>>
File: ErasWithExtraCheese.webm (1.11 MB, 512x512)
>>
>>5615327
>>5615351
I'm fuken' dyin'. Kek.
>>
>>5615283
>It's a bunch of matrix math.
Yes, but what makes you think a human is more powerful than "a bunch of matrix math"?
I'd be extremely surprised if there were anything more powerful than Turing completeness in this universe, because there is exactly zero indication of that in anything mankind has documented or thought up.
The "singularity" talk is obviously nonsense (and reminiscent of the bullshit Marx and Co. were thinking during industrialization), but I don't think there is anything stopping a computing device from becoming effectively indistinguishable from a human.
>>
>>5615624
I'm not saying that Turing machines in and of themselves don't have potential. I'm just saying that the current building blocks of this wave of AI stuff are a far cry from actual intelligence in any real sense. It's all based on a common architecture that doesn't lend itself well to becoming general intelligence. It's not the only way to implement an AI; it's been done differently in the past and it'll be done differently in the future.
>>5614415
I didn't make it but there are watermarks all over the thing that give some hints. Haiper clips are all strung together in a video editor, Pika is used to do lip sync, with audio probably from Elevenlabs, and a Suno/Udio song is playing in the background. The Vinny text effect is just some standard video editing. You can do the lip sync stuff with open source tools like these:
https://github.com/ajay-sainy/Wav2Lip-GFPGAN
https://github.com/Mozer/wav2lip
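
If you'd rather script the "string the clips together" step than use a video editor, ffmpeg's concat demuxer can do it. This is a rough sketch of one way, not what the video's author did. The helper names and file names below are hypothetical, and it assumes every clip shares the same codec and resolution, and that file names contain no single quotes.

```python
def concat_list_text(clips):
    """Return the text of the list file that ffmpeg's concat demuxer reads:
    one `file '<path>'` line per clip, in playback order."""
    return "".join(f"file '{clip}'\n" for clip in clips)


def concat_command(list_path, output):
    """Build the ffmpeg argument list. `-c copy` joins without re-encoding,
    which only works if all clips share codec and resolution."""
    return ["ffmpeg", "-f", "concat", "-safe", "0",
            "-i", str(list_path), "-c", "copy", str(output)]


clips = ["clip1.webm", "clip2.webm"]  # hypothetical file names
print(concat_list_text(clips), end="")
print(" ".join(concat_command("clips.txt", "joined.webm")))
```

Save the list text as clips.txt and pass the command list to `subprocess.run(cmd, check=True)` to actually do the join. Lip sync and the background song would still be separate steps.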
>>
File: GuyFindsOut.webm (4.31 MB, 680x376)
>>
>>
>>5617055
>Ilsidor
kek
>>
>>5617079
lol omg I didn't catch that. fixed.
>>
>>5617109
Hell it would probably be funnier if he just randomly fucked up names and events like he was having a senile moment.
>>
>>5615351
LOL, way too close to reality. Tubba Tay Tay choking on a Big Mac.
>>
>>5617123
lol that's pretty good. Elrond yapping like Biden about shit that happened thousands of years ago.
>>
File: hcbm.webm (1.35 MB, 512x512)
>>5615351
This needs to be extended to 30 seconds for the full hamburger song.
>>
>>5617140
We've got Ilsidor, Godnor, and Argadorn! He was there!
>>
>>5615351
I hate it but I can't look away.
>>
>>5616110
>I'm not saying that Turing machines in and of themselves don't have potential.
I'm saying the universe and any subset of it, including any human, is probably effectively equivalent to a Turing machine in power (given unlimited time and storage).
There is nothing a human can do that would in theory be impossible for a computing device, and vice versa.
In practice, it's obviously a different story on planet Earth right now.
>>
File: broccoli.webm (4.19 MB, 320x180)
>>5604350
>>5609045
>>
>>
>>
>>
>>
>>5617966
i fuckin love dubbing over these old amvs with the dbz characters they're about lol
>>
File: Broly - 10's.webm (5.38 MB, 630x360)
>>5617966
>>


