/wsg/ - /aicg/ AI Content General - Worksafe GIF

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/wsg/ - Worksafe GIF

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. Supported file types are: GIF, WEBM


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/aicg/ AI Content General 05/21/24(Tue)20:11:14 No.5557207

File: Vegeta - ＂What Is Love＂ ((...).webm (5.28 MB, 628x360)

5.28 MB WEBM

/aicg/ AI Content General Anonymous 05/21/24(Tue)20:11:14 No.5557207

Previous thread >>5515011
Dedicated Suno/Udio thread >>5553687
Post AI generated stuff. Song covers, animations, etc.
OC encouraged, but not required.
This thread focuses on audio and video with an audio component.
Let me know if you have more links to add. This thread is a work in progress.

> Voice-to-Voice
RVC walkthrough (somewhat outdated, collab is dead): https://docs.google.com/document/d/13_l1bd1Osgz7qlAZn-zhklCbHpVRk6bYOuAuB78qmsE/edit
Models, mega links, and mirrors: https://docs.google.com/spreadsheets/d/1tAUaQrEHYgRsm1Lvrnj14HFHDwJWl0Bd9x0QePewNco/edit#gid=0
https://github.com/Mangio621/Mangio-RVC-Fork
https://github.com/Vali-98/XTTS-RVC-UI
https://github.com/voicepaw/so-vits-svc-fork

> Text-To-Speech
https://github.com/collabora/WhisperSpeech
https://github.com/myshell-ai/OpenVoice
https://github.com/yl4579/StyleTTS2
https://github.com/BoltzmannEntropy/xtts2-ui
https://github.com/daswer123/xtts-webui (Warning: Windows version uses prebuilt binaries that anons haven't verified. Use at your own discretion)

> Music
https://github.com/facebookresearch/audiocraft
https://rentry.org/AudioCraftRemix

> Audio Cleanup
https://github.com/Anjok07/ultimatevocalremovergui
https://github.com/resemble-ai/resemble-enhance

> Related boards
>>>/aco/asdg
>>>/aco/csdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/g/sdg
>>>/g/lmg
>>>/g/aicg
>>>/h/hdg
>>>/trash/sdg
>>>/u/sdg
>>>/vg/aids
>>>/vt/vtai

Anonymous
05/21/24(Tue)20:13:17 No.5557210

Anonymous 05/21/24(Tue)20:13:17 No.5557210

File: Sailor Moon - 1950's Supe(...).webm (5.91 MB, 1280x720)

5.91 MB WEBM

ATTENTION
If you're looking for text to song, there is a separate suno\udio thread due to high interest relative to other posts
>>5553687

Anonymous
05/21/24(Tue)20:13:54 No.5557211

Anonymous 05/21/24(Tue)20:13:54 No.5557211

File: 1716263603179845.webm (5.97 MB, 720x576)

5.97 MB WEBM

taking webms from dud thread

Anonymous
05/21/24(Tue)20:15:00 No.5557215

Anonymous 05/21/24(Tue)20:15:00 No.5557215

File: 1716263663146564.webm (5.82 MB, 1280x720)

5.82 MB WEBM

Anonymous
05/21/24(Tue)20:16:03 No.5557219

Anonymous 05/21/24(Tue)20:16:03 No.5557219

File: 1716263728699116.webm (2.71 MB, 320x240)

2.71 MB WEBM

Anonymous
05/21/24(Tue)22:39:20 No.5557450

Anonymous 05/21/24(Tue)22:39:20 No.5557450

post the one where commander shepard calls jacob a subhuman

Anonymous
05/21/24(Tue)22:42:20 No.5557455

Anonymous 05/21/24(Tue)22:42:20 No.5557455

>>5557210
Nice

Anonymous
05/21/24(Tue)22:43:36 No.5557457

Anonymous 05/21/24(Tue)22:43:36 No.5557457

File: Squidwock.webm (5.98 MB, 960x540)

5.98 MB WEBM

Anonymous
05/21/24(Tue)22:45:23 No.5557460

Anonymous 05/21/24(Tue)22:45:23 No.5557460

>>5557210
The good old times, when the US wasn't filled with nigs and mongrels.
Cursed be the synagogue of satan.

Anonymous
05/22/24(Wed)07:39:04 No.5557785

Anonymous 05/22/24(Wed)07:39:04 No.5557785

File: ronnie-mcnutt-dance_1.webm (3.34 MB, 1280x720)

3.34 MB WEBM

Been having some fun with Viggle AI, there doesn't seem to be filtering on who you can upload. I assume you can't upload nudity ofc, but I uploaded a shirtless guy and the Abu Ghraib prisoner without any trouble.

Anonymous
05/22/24(Wed)07:41:41 No.5557791

Anonymous 05/22/24(Wed)07:41:41 No.5557791

File: lucas-bedroom-dance_2.webm (4.15 MB, 1280x720)

4.15 MB WEBM

>>5557785

Anonymous
05/22/24(Wed)07:43:21 No.5557794

Anonymous 05/22/24(Wed)07:43:21 No.5557794

File: abu-ghriddy_3.webm (4.06 MB, 1280x720)

4.06 MB WEBM

>>5557791

Anonymous
05/22/24(Wed)08:15:17 No.5557819

Anonymous 05/22/24(Wed)08:15:17 No.5557819

>>5557785
jesus christ that's cursed

Anonymous
05/22/24(Wed)08:53:18 No.5557856

Anonymous 05/22/24(Wed)08:53:18 No.5557856

>>5557785
if i could make a request for the same dance but with
Budd Dwyer
Ricardo López
Shuaib Aslam
& Gleb Korablev?

Anonymous
05/22/24(Wed)11:10:50 No.5557969

Anonymous 05/22/24(Wed)11:10:50 No.5557969

File: george-floyd-dance_4.webm (5.09 MB, 1280x720)

5.09 MB WEBM

I tried to upload George Floyd but it got blocked, said some shit about community guidelines. Super easy to circumvent.

Anonymous
05/22/24(Wed)19:41:09 No.5558415

Anonymous 05/22/24(Wed)19:41:09 No.5558415

>>5557785
jesus.. funniest thing ive seen

Anonymous
05/23/24(Thu)11:11:14 No.5559024

Anonymous 05/23/24(Thu)11:11:14 No.5559024

>>5557210
"heroines", not heroes. so tired of people using the wrong world all the time. still saved though.

man, makes me wonder where the fixed version of little mermaid live action is?

Anonymous
05/23/24(Thu)13:28:07 No.5559115

Anonymous 05/23/24(Thu)13:28:07 No.5559115

https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit#heading=h.n8ac32fhltgg

wanted to drop this guide that I found for using UVR, just in case any anon might find it helpful.

Anonymous
05/23/24(Thu)15:02:15 No.5559226

Anonymous 05/23/24(Thu)15:02:15 No.5559226

>>5557785
fucking LMAOOOOOOOOOOOOOOO holy shit you got me good

Anonymous
05/23/24(Thu)15:03:27 No.5559228

Anonymous 05/23/24(Thu)15:03:27 No.5559228

>>5557969
Does anybody outside of us memers even give a shit about this guy anymore? He seems memoryholed already

Anonymous
05/23/24(Thu)15:04:32 No.5559229

Anonymous 05/23/24(Thu)15:04:32 No.5559229

Anybody got those of the zoomer concert entrance?

Anonymous
05/23/24(Thu)18:08:36 No.5559419

Anonymous 05/23/24(Thu)18:08:36 No.5559419

Anyone have this?
https://desuarchive.org/wsg/thread/5518247/#5538629

Anonymous
05/23/24(Thu)21:38:24 No.5559626

Anonymous 05/23/24(Thu)21:38:24 No.5559626

>>5559115
thanks for sharing anon, seems really comprehensive
https://voca.ro/11yLUnVg1lU8

Anonymous
05/23/24(Thu)21:58:43 No.5559646

Anonymous 05/23/24(Thu)21:58:43 No.5559646

>>5557785
wtf it was a prank all along?

Anonymous
05/23/24(Thu)22:35:14 No.5559693

Anonymous 05/23/24(Thu)22:35:14 No.5559693

File: Decending to the Heart of(...).webm (5.38 MB, 1024x1024)

5.38 MB WEBM

Praise be the Omnissiah for Mechanicus II: Necron Boogaloo has been announced.

Anonymous
05/23/24(Thu)22:46:41 No.5559704

Anonymous 05/23/24(Thu)22:46:41 No.5559704

>>5557785
i.... can't look away

Anonymous
05/24/24(Fri)03:26:15 No.5560022

Anonymous 05/24/24(Fri)03:26:15 No.5560022

>>5559626
How does it change key from the original like that? Isn't it taking the notes straight from the vocal track?
I've definitely heard some that do not do this that cause the ai vocal to be outside the character's normal range, but hits the notes normally.

Anonymous
05/24/24(Fri)04:19:47 No.5560068

Anonymous 05/24/24(Fri)04:19:47 No.5560068

File: ai - jej tlc jack of blad(...).webm (1.24 MB, 640x360)

1.24 MB WEBM

Anonymous
05/24/24(Fri)11:01:31 No.5560367

Anonymous 05/24/24(Fri)11:01:31 No.5560367

>>5560022
I might have left a filter on the pitch control that I shouldn't have. That might be why it sounds flatter than the original. Unless that's not your question, then if it's about the key/purchases the vocals are on, I changed it manually in RVC because without adjustment it was out of marcelines usual singing vocal range.

Anonymous
05/24/24(Fri)14:44:16 No.5560651

Anonymous 05/24/24(Fri)14:44:16 No.5560651

>>5560367
Key/pitch. Damn phoneposting got me good there

Anonymous
05/24/24(Fri)15:26:00 No.5560706

Anonymous 05/24/24(Fri)15:26:00 No.5560706

>>5560367
I am going to be honest, I think you should also match the backing with a straight pitch up to the same key as the vocals. It clashes pretty hard here without that.
It is interesting to know that you can adjust the vocal pitch though.

Anonymous
05/24/24(Fri)18:53:24 No.5560932

Anonymous 05/24/24(Fri)18:53:24 No.5560932

>>5560706
I'll try it and see if it works out.

Anonymous
05/24/24(Fri)19:34:48 No.5560981

Anonymous 05/24/24(Fri)19:34:48 No.5560981

>>5560706
so idk why i even pitch shifted the vocals to begin with. this sounds fine without it IMO. I also completely removed the pitch filter and it sounds way better.
https://vocaroo.com/16Rm1sCOcqaW (embed) (embed)

Here's what it sounds like with the instrumental pitch shifted the same as the vocal track (+1/12 octaves or +16%). Pitch shifting to the same pitch f/g 1 didn't work either and sorta made it sound like an attempt at a metal remix of the notes. but this is the funnier result because the instrumental sounds like something outa budokai 1.
https://vocaroo.com/18fxle7SzRmU (embed) (embed)

Anonymous
05/24/24(Fri)23:58:20 No.5561150

Anonymous 05/24/24(Fri)23:58:20 No.5561150

File: sides crippled.gif (117 KB, 189x292)

117 KB GIF

>>5557785

Anonymous
05/25/24(Sat)02:19:53 No.5561261

Anonymous 05/25/24(Sat)02:19:53 No.5561261

File: pitch.webm (4.17 MB, 437x389)

4.17 MB WEBM

>>5560981
This is very confusing to hear.
The vocal is closer to a half-step up from the original; the 2nd link with the shifted instrumentation overshot it. By the way, according to audacity, a half-step is approximately a 5.95% change, pic related was what I applied to match the vocal.
As for the filter, I will be honest I can't significantly distinguish the vocals apart between the three in the first minute or so, though I did not listen to the entirety of each.

Anonymous
05/25/24(Sat)03:05:30 No.5561300

Anonymous 05/25/24(Sat)03:05:30 No.5561300

File: pitch2.webm (2.23 MB, 300x300)

2.23 MB WEBM

>>5560981
Just for fun, tried shifting your sample down as an example. It is really clear that the vocal track didn't match the backing track at 0:35 when my noob attempt at vocal isolation let the guitar through.
I recommend doing something similar with your isolated vocal track instead.

Anonymous
05/25/24(Sat)10:40:06 No.5561578

Anonymous 05/25/24(Sat)10:40:06 No.5561578

>>5561300
Wait so was the issue the difference in key between the main vocals and the backing vocals or the main vocals and the instrumental?

Anonymous
05/25/24(Sat)11:38:08 No.5561608

Anonymous 05/25/24(Sat)11:38:08 No.5561608

>>5561578
Paramore track: unchanged pitch, vocal removed
AI track: pitched half-step down, vocal isolated
The clash is because of the pitched down AI track having a crude isolation that lets some of the AI instrumental backing through, which is off-key now because the AI vocal was off-key and correcting that puts the instrumentation off-key instead.

Anonymous
05/25/24(Sat)13:52:39 No.5561730

Anonymous 05/25/24(Sat)13:52:39 No.5561730

>>5561608
i can't make any real adjustments right now, but I think what you're getting at is that I need to re-isolate the vocals to ensure all of the instrumentals are pulled from them.

I've been trying changing the pitches of both the instrumental and the isolated vocals in Audacity separately (to test the effect of the pitch shift) by max a half step to .5 or even .25 half steps and every time the pitch adjustment just makes the tracks sound waaay off. I'll try restarting with the original track later and see if i can't get a better result after consulting the guide above.

Thanks for the tips though anon, I'm a giga novice when it comes to sound mixing so really i don't quite understand completely but this has been interesting to learn about. I just sorta always wrote off any mixing issues or mistakes as "imagine you're in karaoke or something and it just wont be perfect lol"

Anonymous
05/25/24(Sat)14:27:29 No.5561765

Anonymous 05/25/24(Sat)14:27:29 No.5561765

>>5561730
Try listening to the original track alongside the AI one to compare the two. It is really obvious then. In audacity, you can import the two and play them alongside easily.
I assumed you were getting an isolated vocal output already; weird and annoying that it would pre-mix it for you like that when it's only generating a vocal.
Good luck separating the two, I am clueless there as well.

Anonymous
05/25/24(Sat)18:30:35 No.5562041

Anonymous 05/25/24(Sat)18:30:35 No.5562041

>>5561765
i re-isolated the instrumentals from the original song and overlayed the vocals from my old one. I noticed the old one started a few beats earlier than the original song for some reason so I lined them up and also noticed the key was off a bit on the old one like you were saying. I think I'll double-check like this for my future covers. i had no idea something like this would even happen.

This one is a new one's instrumental sounds exactly the same as the original when I line them up. Is it?
https://voca.ro/1lSvduVJ4K4V

Anonymous
05/25/24(Sat)18:46:06 No.5562056

Anonymous 05/25/24(Sat)18:46:06 No.5562056

The joke is on me. How many times will I listen to this? Four apparently.
You are baiting or tone-deaf, and I do not care which. Fuck this.

Anonymous
05/25/24(Sat)18:51:38 No.5562066

Anonymous 05/25/24(Sat)18:51:38 No.5562066

>>5562056
man i genuinely dont hear what youre hearing then lol, my bad ¯\_(ツ)_/¯

Anonymous
05/25/24(Sat)19:02:34 No.5562088

Anonymous 05/25/24(Sat)19:02:34 No.5562088

File: crushcrushcrushcrushcrushcrush.webm (2.93 MB, 1280x646)

2.93 MB WEBM

>>5562066
>>5562056
the only thing different is the vocals and, of course they are.

Anonymous
05/25/24(Sat)19:53:36 No.5562142

Anonymous 05/25/24(Sat)19:53:36 No.5562142

>>5557785
kek
but definitely would be way funnier with some samsung tunes

Anonymous
05/26/24(Sun)00:16:14 No.5562439

Anonymous 05/26/24(Sun)00:16:14 No.5562439

>>5559228
reddit and twitter will bring him up whenever they want to kill cops

Anonymous
05/26/24(Sun)03:31:52 No.5562614

Anonymous 05/26/24(Sun)03:31:52 No.5562614

File: txf-pepe.webm (4.81 MB, 720x480)

4.81 MB WEBM

>>5557207

Anonymous
05/26/24(Sun)06:16:58 No.5562714

Anonymous 05/26/24(Sun)06:16:58 No.5562714

Does anyone have the Frank horrigan one where he sings human? The one on YouTube was deleted...

Anonymous
05/26/24(Sun)11:38:26 No.5563016

Anonymous 05/26/24(Sun)11:38:26 No.5563016

File: 1712843082351190.webm (3.71 MB, 852x720)

3.71 MB WEBM

Anonymous
05/26/24(Sun)19:41:37 No.5563527

Anonymous 05/26/24(Sun)19:41:37 No.5563527

>>5563016
Did you made this with Suno AI? What's the title?

Anonymous
05/27/24(Mon)18:32:47 No.5564740

Anonymous 05/27/24(Mon)18:32:47 No.5564740

File: Human - Frank Horrigan AI(...).webm (5.43 MB, 1280x640)

5.43 MB WEBM

>>5562714

i didnt have it, so i made one. needed to compress the shit outa it to post a webm though lol
https://voca.ro/1lVzDcXAbGfH
https://files.catbox.moe/zf8bxw.mp4

Anonymous
05/27/24(Mon)20:46:13 No.5564878

Anonymous 05/27/24(Mon)20:46:13 No.5564878

>>5564740
Thanks bro!

Anonymous
05/28/24(Tue)05:01:49 No.5565250

Anonymous 05/28/24(Tue)05:01:49 No.5565250

>>5563527
i reposted this so i could listen to it on my phone through clover, i actually have no idea

Anonymous
05/28/24(Tue)13:56:19 No.5565663

Anonymous 05/28/24(Tue)13:56:19 No.5565663

File: Credit to ADCon-wsg-.webm (5.83 MB, 1024x576)

5.83 MB WEBM

Anonymous
05/28/24(Tue)20:58:56 No.5566130

Anonymous 05/28/24(Tue)20:58:56 No.5566130

File: frankrap.webm (2.13 MB, 1280x108)

2.13 MB WEBM

>>5564740
i don't think it sounds entirely correct, maybe it's impossible to make him sing correctly.
this one is more talking than singing.

Anonymous
05/28/24(Tue)21:02:40 No.5566135

Anonymous 05/28/24(Tue)21:02:40 No.5566135

>>5566130
frank's got such a deep and crusty voice, there's no way it'll sound perfect when the AI's doing it. Best I could do is get close.

That being said, that's a banger

Anonymous
05/29/24(Wed)00:32:59 No.5566353

Anonymous 05/29/24(Wed)00:32:59 No.5566353

tried several collabs to train a voice model but they disconnect randomly for maximun capacity
any less known collab or local training?

Anonymous
05/29/24(Wed)00:54:27 No.5566366

Anonymous 05/29/24(Wed)00:54:27 No.5566366

>>5566353
RVC has a built-in trainer. you just gotta do something else for a few hours

Anonymous
05/29/24(Wed)20:50:37 No.5567515

Anonymous 05/29/24(Wed)20:50:37 No.5567515

whats the preferred text to speech in the OP? is one better than the other or are they all pretty similar in quality/ease of use?

Anonymous
05/30/24(Thu)01:46:14 No.5567812

Anonymous 05/30/24(Thu)01:46:14 No.5567812

File: 1702981627554548.webm (4.89 MB, 720x720)

4.89 MB WEBM

Anonymous
05/30/24(Thu)01:47:21 No.5567815

Anonymous 05/30/24(Thu)01:47:21 No.5567815

File: 1704837844313055.webm (5.75 MB, 720x480)

5.75 MB WEBM

>>5567812

Anonymous
05/30/24(Thu)01:48:25 No.5567818

Anonymous 05/30/24(Thu)01:48:25 No.5567818

File: 1711942758384392.webm (1.13 MB, 480x852)

1.13 MB WEBM

>>5567815

Anonymous
05/30/24(Thu)01:49:31 No.5567820

Anonymous 05/30/24(Thu)01:49:31 No.5567820

File: 1714144986247885.webm (3.53 MB, 720x720)

3.53 MB WEBM

>>5567818

Anonymous
05/30/24(Thu)11:13:06 No.5568212

Anonymous 05/30/24(Thu)11:13:06 No.5568212

File: VinDrinksDiesel.webm (4.54 MB, 960x540)

4.54 MB WEBM

Why do they call him Vin?

Anonymous
05/30/24(Thu)11:21:56 No.5568225

Anonymous 05/30/24(Thu)11:21:56 No.5568225

File: GF AC.webm (1022 KB, 1280x626)

1022 KB WEBM

>>5559228
wdym? They just made a whole video game for him.

Anonymous
05/30/24(Thu)11:27:07 No.5568234

Anonymous 05/30/24(Thu)11:27:07 No.5568234

>>5568225
the fuck is this shit

Anonymous
05/30/24(Thu)11:37:12 No.5568244

Anonymous 05/30/24(Thu)11:37:12 No.5568244

>>5568234
It's the latest woke Assassin's Creed game.

Ubisoft makes a game about ancient Japan and zoom in on the one guy in the whole country who was black. History shows that he did exist but was actually just a servant that they dressed up like a samurai for fun, but of course the devs pretend like that wasn't the case. There's an edit war on Wikipedia to "correct" the record and remove old information.
In addition to lying about history (and the fact that it would be very retarded for the one guy that sticks out the most to be an assassin) you can also play as a stronk womyn because women are totally badass and can do everything a man can do. She's at least Japanese.
Needless to say Japanese gamers are upset with the game and the left don't know how much they can bite back since they aren't as used to shit on asian people as they are shitting on white people.

Anonymous
05/30/24(Thu)11:38:59 No.5568248

Anonymous 05/30/24(Thu)11:38:59 No.5568248

>>5568244
>the left don't know how much they can bite back since they aren't as used to shit on asian people
guess we'll have to wait for the kikes to come up with the latest instructions for the golems

Anonymous
05/30/24(Thu)12:49:45 No.5568326

Anonymous 05/30/24(Thu)12:49:45 No.5568326

>>5568234
>>5568244
Don't forget, he is not only black, but also gay.

Anonymous
05/30/24(Thu)13:55:15 No.5568389

Anonymous 05/30/24(Thu)13:55:15 No.5568389

>>5568326
lulz was that actually confirmed? i think that is just a meme

Anonymous
05/30/24(Thu)17:24:18 No.5568612

Anonymous 05/30/24(Thu)17:24:18 No.5568612

File: It Is The 41St MillenniumC..webm (5.49 MB, 220x220)

5.49 MB WEBM

Trump just got convicted on all counts.

Welcome to the greatest timeline.

Anonymous
05/30/24(Thu)22:46:43 No.5568931

Anonymous 05/30/24(Thu)22:46:43 No.5568931

>>5560068
It's insane how an AI made this scene 10x more kino!

Anonymous
05/31/24(Fri)00:51:31 No.5569042

Anonymous 05/31/24(Fri)00:51:31 No.5569042

is it better with the backing vocals
https://vocaroo.com/14LObMgeLBfp

or without?
https://vocaroo.com/1281P8KEL8Qh

Anonymous
05/31/24(Fri)00:55:25 No.5569046

Anonymous 05/31/24(Fri)00:55:25 No.5569046

>I forgot this thread existed.
Damn, System Shock is a helluva drug.

Anonymous
05/31/24(Fri)01:07:58 No.5569060

Anonymous 05/31/24(Fri)01:07:58 No.5569060

>>5569042
you should use the Halsey cover of that song instead, it'll sound more natural.

Anonymous
05/31/24(Fri)01:39:01 No.5569093

Anonymous 05/31/24(Fri)01:39:01 No.5569093

>>5569060
Oh dang yeah thisll be way better. Thanks mate

Anonymous
05/31/24(Fri)09:24:02 No.5569450

Anonymous 05/31/24(Fri)09:24:02 No.5569450

>>5568612
i heard, America is dead

Anonymous
05/31/24(Fri)09:40:23 No.5569468

Anonymous 05/31/24(Fri)09:40:23 No.5569468

>>5569042
>>5569060
once again, thanks anon, the whole ensemble fits better for it too. Dont know why I didn't search for a cover last night when I usually do. musta been suffering from poo brain
https://vocaroo.com/1nSvXGUBMdr3

Anonymous
05/31/24(Fri)12:20:13 No.5569570

Anonymous 05/31/24(Fri)12:20:13 No.5569570

>>5569468
no problem, glad it worked out.
I like matching voices to songs, it's fun.

Anonymous
05/31/24(Fri)12:25:58 No.5569573

Anonymous 05/31/24(Fri)12:25:58 No.5569573

>>5569468
did you adjust the pitch when you ran it through RVC? it sounds a little off to me. if so, I'd advise against doing that unless it's by +12 or -12, or a rap song.

Anonymous
05/31/24(Fri)14:23:15 No.5569668

Anonymous 05/31/24(Fri)14:23:15 No.5569668

>>5569450
Why would America be dead just because a slave of the jews got declared guilty?

Anonymous
05/31/24(Fri)14:38:45 No.5569690

Anonymous 05/31/24(Fri)14:38:45 No.5569690

>>5569668
Busted election system, busted legal system, busted media, people not believing in free speech (swallowing concepts like hate speech, disinformation, misinformation), values not passed on to the youth. Then there's people such as yourself who simply don't get it on their own and will probably not have the mental capacity to understand what I'm saying even when I point it out in plain text (not saying it just because I'm trying to pick a fight, it's simply true). There's still a thing called "America" or "USA" but it's in name only.

Anonymous
05/31/24(Fri)15:03:47 No.5569727

Anonymous 05/31/24(Fri)15:03:47 No.5569727

>>5569690
everything is fake and gay. always has been.

Anonymous
05/31/24(Fri)20:22:55 No.5570096

Anonymous 05/31/24(Fri)20:22:55 No.5570096

>>5569573
Yeah I tried it at +0 and it didnt sound right and theres no way +12 wouldnt sound like a squeeker in a COD lobby. So i adjusted it to +2 for most of the song and +3 for the chorus because I didn't feel like the +2 sounded like she was "trying enough" if you get what I mean.

Anonymous
05/31/24(Fri)20:57:03 No.5570129

Anonymous 05/31/24(Fri)20:57:03 No.5570129

>>5570096
it's... off key, though. if it doesn't sound right at +12 or -12, or in rare cases -24 or +24, then the song isn't a good candidate for the voice you're using.

Anonymous
05/31/24(Fri)21:09:28 No.5570146

Anonymous 05/31/24(Fri)21:09:28 No.5570146

>>5570096
You're not dealing with just random pitch shifting, the 12 tone scale is music theory. The pitch might allow the voice to sound more like the character, but shifting it causes it to be off key. So it'll be discordant and sound like shit when sung against the instrumental. Basically you have to shift it in intervals of 12, but you might have a slight chance of it not sounding like absolute ass at +7 which is a perfect fifth.
https://en.wikipedia.org/wiki/Interval_(music)

Anonymous
05/31/24(Fri)21:28:58 No.5570162

Anonymous 05/31/24(Fri)21:28:58 No.5570162

>>5570146
thank you for explaining that. I knew that it didn't work, but not why. I'll have to try +7 sometime.

Anonymous
05/31/24(Fri)22:12:40 No.5570198

Anonymous 05/31/24(Fri)22:12:40 No.5570198

>>5570146
I see, guess I'll try +7 later because 0 was bad and 12 will be even worse

Anonymous
06/01/24(Sat)13:03:32 No.5570799

Anonymous 06/01/24(Sat)13:03:32 No.5570799

>>5570146
Ah fuck I had a brain blast and I finally understand what you're saying about key and frequency after someone else explained it by comparing the keys with their hz values. Damn lol

Anonymous
06/02/24(Sun)10:34:21 No.5572024

Anonymous 06/02/24(Sun)10:34:21 No.5572024

File: Carl Wheezer - My Heart W(...).webm (5.85 MB, 640x360)

5.85 MB WEBM

Anonymous
06/02/24(Sun)13:09:08 No.5572122

Anonymous 06/02/24(Sun)13:09:08 No.5572122

>>5557210
this restores my faith in ai

Anonymous
06/02/24(Sun)20:28:35 No.5572599

Anonymous 06/02/24(Sun)20:28:35 No.5572599

>>5572024
a masterpiece

Anonymous
06/03/24(Mon)07:51:30 No.5573105

Anonymous 06/03/24(Mon)07:51:30 No.5573105

File: [AI] Ella Baila Sola x Vi(...).webm (3.31 MB, 1230x682)

3.31 MB WEBM

Anonymous
06/03/24(Mon)12:58:45 No.5573422

Anonymous 06/03/24(Mon)12:58:45 No.5573422

>>5570146
idk what i was smoking lol. +0 is good. i guess I was so focused on making it sound exactly like the character I didn't really think of the big picture when it all came together. I'll keep it in mind from now on, thanks anon

Anonymous
06/03/24(Mon)22:10:35 No.5573990

Anonymous 06/03/24(Mon)22:10:35 No.5573990

>>5567515
TTS doesn't seem to be used a lot by anons here, none of it is on the same level as elevenlabs, but if I recall, a few threads back anons liked xtts due to ease of use (there are 2 front-ends linked in the OP, I don't know which is better)
I don't think there have been a lot of ground-breaking developments in self-hosted TTS lately, so that's probably why there's not a lot of buzz around it.

Anonymous
06/04/24(Tue)04:50:47 No.5574303

Anonymous 06/04/24(Tue)04:50:47 No.5574303

>>5573990
Same goes with any form of OSS audio lately. Stable Audio 1.0 got leaked but it's dogshit, and that's probably the best we'll get for a while. The struggle is real

Anonymous
06/04/24(Tue)10:35:01 No.5574583

Anonymous 06/04/24(Tue)10:35:01 No.5574583

>>5573990
>>5574303
tragic, thanks for the info tho

Anonymous
06/04/24(Tue)10:38:06 No.5574589

Anonymous 06/04/24(Tue)10:38:06 No.5574589

>>5574583
The best fully open source option for TTS at the moment seems to be XTTSv2/StyleTTS2 + RVC, but if there isn't already an RVC model of the voice you're trying to replicate you have to train it yourself.

Anonymous
06/04/24(Tue)12:31:46 No.5574727

Anonymous 06/04/24(Tue)12:31:46 No.5574727

>>5574589
i'll check it out, thanks anon. i was planning on using it mostly to check the quality of RVC voices I train anyway

Anonymous
06/04/24(Tue)13:52:18 No.5574818

Anonymous 06/04/24(Tue)13:52:18 No.5574818

File: tinytim-hl2zombie.webm (1.8 MB, 288x360)

1.8 MB WEBM

Anonymous
06/04/24(Tue)15:31:57 No.5574973

Anonymous 06/04/24(Tue)15:31:57 No.5574973

>>5574589
welp, looks like ive temporarily bricked my local RVC by fucking with the text to speech programs in the OP.

Anonymous
06/04/24(Tue)17:55:08 No.5575180

Anonymous 06/04/24(Tue)17:55:08 No.5575180

>>5574973
Did you reuse the same python virtual environment or something? That's not a good idea, venvs are there specifically to avoid dependency conflicts.

Anonymous
06/04/24(Tue)17:59:31 No.5575184

Anonymous 06/04/24(Tue)17:59:31 No.5575184

>>5567515
>>5573990
Don't forget that you can apply your own voice models to elevenlabs audio. Just a matter of finding a model on that site that has a similar enough accent.

Anonymous
06/04/24(Tue)18:18:26 No.5575201

Anonymous 06/04/24(Tue)18:18:26 No.5575201

>>5575180
i mean, i've learned that now lul

>>5575184
probably the better idea but I was curious

Anonymous
06/04/24(Tue)19:12:10 No.5575231

Anonymous 06/04/24(Tue)19:12:10 No.5575231

File: course-of-empire-destruct(...).webm (3.25 MB, 1200x690)

3.25 MB WEBM

>tfw people keep memeing about America while forgetting about the true imperial collapse

Anonymous
06/04/24(Tue)20:00:08 No.5575249

Anonymous 06/04/24(Tue)20:00:08 No.5575249

>>5570146
The guy's clearly a troll. Wouldn't bother.

Anonymous
06/04/24(Tue)20:53:17 No.5575297

Anonymous 06/04/24(Tue)20:53:17 No.5575297

>>5575184
sure wish i did this first lol, woulda saved the headache.
https://voca.ro/1e3MnmkdXHsp

Anonymous
06/05/24(Wed)02:20:45 No.5575587

Anonymous 06/05/24(Wed)02:20:45 No.5575587

File: ai - jej vocals shards of(...).webm (5.5 MB, 640x360)

5.5 MB WEBM

out of all the messing around i've done with this I think this one turned out the best so far.

Anonymous
06/05/24(Wed)02:25:03 No.5575590

Anonymous 06/05/24(Wed)02:25:03 No.5575590

>>5575587
link for the vader model if anyone else wants to fuck around with it. have fun niggers.
https://huggingface.co/OwlCity/OwlCityRVC/resolve/main/Darth%20Vader%20Ultimate.zip?download=true

Anonymous
06/05/24(Wed)12:58:21 No.5576033

Anonymous 06/05/24(Wed)12:58:21 No.5576033

I wonder if Marceline-Mate is still here.

Anonymous
06/05/24(Wed)13:40:41 No.5576072

Anonymous 06/05/24(Wed)13:40:41 No.5576072

>>5576033
well, one of us is here anyway. i haven't had any inspiration for songs that sound good with her voice

have any suggestions?

Anonymous
06/05/24(Wed)14:19:26 No.5576106

Anonymous 06/05/24(Wed)14:19:26 No.5576106

>>5576033
I'm here. I. Was thee one getting scolded/taught about using off key octaves to get the right voice. I've mostly just been way more busy with other things to spam the thread like before, but my mixtape grows (slowly)

Anonymous
06/05/24(Wed)15:23:31 No.5576163

Anonymous 06/05/24(Wed)15:23:31 No.5576163

File: marcy_friendly_teeth.webm (3.09 MB, 500x709)

3.09 MB WEBM

>>5576072
>>5576033
here's a quick little something
https://voca.ro/12XuNzb5XTBM

Anonymous
06/05/24(Wed)15:57:14 No.5576197

Anonymous 06/05/24(Wed)15:57:14 No.5576197

File: marcy_friendly_teeth_fix.webm (3.09 MB, 500x710)

3.09 MB WEBM

>>5576163
i was getting decoding errors with this webm on firefox so i re-encoded

Anonymous
06/05/24(Wed)20:09:53 No.5576445

Anonymous 06/05/24(Wed)20:09:53 No.5576445

>>5563016
damn this is good.
i legit thought it was some AI voice cover of an existing song and been googling the lyrics like a retard.
can't wait for the voices to become lifelike so i can finally create all the random songs and melodies that've been rolling around in my head throughout the years.
future's bright

Anonymous
06/05/24(Wed)23:20:42 No.5576597

Anonymous 06/05/24(Wed)23:20:42 No.5576597

File: Necromancin Dancin - Marc(...).webm (2.92 MB, 1280x720)

2.92 MB WEBM

>>5576163
nice one anon, glad you're still around

my RVC is being fucky right now and probably looping or frozen on something so I cant make anything new. I found this song a while ago and it always made me think of early Marceline.
https://voca.ro/18fgeMfqyt33

Also, I don't know how to upload my trained models into the google sheet so the Vader anon inspired me to share this way. all three models are 300e and pretty good imo
Eclipsa: https://mega.nz/file/qY4XUDjB#SAbKNf1GhyHGL2Nw4b1V5TCetjed70kYf9gpucpAeVs
Huntress Wizard: https://mega.nz/file/KQRDlTLB#WHF_SiEeNX6oor86gQBsMBRlMEPSQxM7Fv7sEq8MDW4
Nicole Watterson: https://mega.nz/file/PRRSnC7R#VjcxY8eykAXZYQt_xy7XdqsH5ZgMMOfeYfHMNPW8vxg

Anonymous
06/06/24(Thu)12:14:10 No.5577075

Anonymous 06/06/24(Thu)12:14:10 No.5577075

File: marcys_starter_guitar.webm (2.95 MB, 750x750)

2.95 MB WEBM

>>5576197
https://voca.ro/1kxL7lqAkBjj

Anonymous
06/06/24(Thu)20:30:32 No.5577668

Anonymous 06/06/24(Thu)20:30:32 No.5577668

>>5576597
i think RVC isnt seeing my GPU anymore for some reason. Any ideas on how to fix that?

>>5577075
rockin anon, good vibes on that one

Anonymous
06/06/24(Thu)22:16:11 No.5577760

Anonymous 06/06/24(Thu)22:16:11 No.5577760

>>5577668
okay so that wasnt the issue, stuff may be much more fucked than anticipated

Anonymous
06/07/24(Fri)01:44:24 No.5577894

Anonymous 06/07/24(Fri)01:44:24 No.5577894

>>5577075
Would you mind doing one for https://www.youtube.com/watch?v=I-ed7GhM3F0 ?

Anonymous
06/07/24(Fri)06:27:11 No.5578095

Anonymous 06/07/24(Fri)06:27:11 No.5578095

File: marcy_sk8er_boi.webm (2.21 MB, 500x500)

2.21 MB WEBM

>>5577894
https://voca.ro/1dijnFjloZuY

Anonymous
06/07/24(Fri)08:03:28 No.5578168

Anonymous 06/07/24(Fri)08:03:28 No.5578168

>>5578095
King shit

Anonymous
06/07/24(Fri)19:04:31 No.5578823

Anonymous 06/07/24(Fri)19:04:31 No.5578823

File: Wake Me Up When September(...).webm (3.99 MB, 1024x1024)

3.99 MB WEBM

https://vocaroo.com/18a5WYI0wBUN

I need to make more high-energy song covers. maybe I'll do a bunch of Avril or Pink for my next lot

Anonymous
06/07/24(Fri)19:24:19 No.5578840

Anonymous 06/07/24(Fri)19:24:19 No.5578840

>>5568225
A Nigger's Creed

Anonymous
06/08/24(Sat)20:57:27 No.5579960

Anonymous 06/08/24(Sat)20:57:27 No.5579960

>>5578840
Is the game gonna be that bad?

Anonymous
06/09/24(Sun)06:18:34 No.5580294

Anonymous 06/09/24(Sun)06:18:34 No.5580294

>>5579960
It's a modern Ubisoft game. Yes, it's going to be that bad.

Anonymous
06/09/24(Sun)07:51:41 No.5580380

Anonymous 06/09/24(Sun)07:51:41 No.5580380

What's the best TTS tool that does voice cloning rn? I just wanna have my waifu read me audiobooks.

Anonymous
06/09/24(Sun)14:10:27 No.5580713

Anonymous 06/09/24(Sun)14:10:27 No.5580713

>>5580380
find a free model on Elevenlabs that sounds close and use RVC to change the voice. works great for me

Anonymous
06/09/24(Sun)14:30:12 No.5580729

Anonymous 06/09/24(Sun)14:30:12 No.5580729

>>5580713
I used to use elevenlabs and it was so convenient cause I am too much of a brainlet to mess with models and all the stuff on the OP but it no longer does cloning for free.

Anonymous
06/09/24(Sun)15:16:39 No.5580755

Anonymous 06/09/24(Sun)15:16:39 No.5580755

>>5580729
yeah same, but they have some free models you can find in the voicelab>Add generated or cloned voice>voice library that you can use. After you get a good-sounding reading, you can use RVC to change the voice from the elven labs voice to whoever you want

Anonymous
06/09/24(Sun)15:20:21 No.5580756

Anonymous 06/09/24(Sun)15:20:21 No.5580756

>>5580729
As long as you're not training your own models its fairly easy to use RVC. Problem is that someone has to have cloned your waifu already

Anonymous
06/09/24(Sun)16:57:59 No.5580845

Anonymous 06/09/24(Sun)16:57:59 No.5580845

File: techpriherm.webm (2.61 MB, 512x512)

2.61 MB WEBM

>>5580756
>>5580380
Also if you already have the audiobook you don't even need to do any TTS. Male-to-female voice alteration is fairly trivial.

Anonymous
06/09/24(Sun)20:00:53 No.5581011

Anonymous 06/09/24(Sun)20:00:53 No.5581011

>>5580845
Oh true. Since it's an audio book, half of it is done already. Unless they have multiple readers or they try to make different voices for characters. I've had a few do that and idk if it's sound good if a female voice is dubbed over a male reader making a female pitch voice lol.

Anonymous
06/09/24(Sun)20:09:30 No.5581018

Anonymous 06/09/24(Sun)20:09:30 No.5581018

>>5581011
>they try to make different voices for characters.
That still works fine a lot of the time unless the original reader has larger vocal range and the model simply spazzes out when they go up a bunch of octaves and suddenly Emma Watson is making dog whistling noises.

Anonymous
06/09/24(Sun)20:43:46 No.5581041

Anonymous 06/09/24(Sun)20:43:46 No.5581041

>>5574818
I hate this a lot but also love it

Anonymous
06/10/24(Mon)01:23:46 No.5581293

Anonymous 06/10/24(Mon)01:23:46 No.5581293

>>5568389
They confirmed that the game's protagonists are lgbtqiaetcetc
didn't specify gay, but highly likely

Anonymous
06/10/24(Mon)03:03:39 No.5581344

Anonymous 06/10/24(Mon)03:03:39 No.5581344

File: AverageBlackPreacher.gif (2.78 MB, 498x270)

2.78 MB GIF

Frens... The power of Christ compelled Suno to make a certified banger. And I can't grooving to it.
https://suno.com/song/3e68a014-37b7-4c85-80e4-dceaa9075dca

Anonymous
06/10/24(Mon)03:04:42 No.5581346

Anonymous 06/10/24(Mon)03:04:42 No.5581346

>>5581344
*can't stop grooving

Anonymous
06/10/24(Mon)04:40:20 No.5581386

Anonymous 06/10/24(Mon)04:40:20 No.5581386

>>5581346
*can't stop grooming

Anonymous
06/10/24(Mon)06:04:00 No.5581455

Anonymous 06/10/24(Mon)06:04:00 No.5581455

>>5568931
I wanna redo the whole fight with the jack of blades vocals, it's honestly pretty fucking cool I just gotta not be a lazy ass and put it together.

Anonymous
06/10/24(Mon)11:59:31 No.5581793

Anonymous 06/10/24(Mon)11:59:31 No.5581793

>>5559228
there are city statues of him now

Anonymous
06/10/24(Mon)12:06:25 No.5581802

Anonymous 06/10/24(Mon)12:06:25 No.5581802

>>5559228
of course. he gets mentioned almost every time time american internal politics get brought up.
like seriously it's almost like godwin's or mutt's law.
>As a discussion concerning the state of American police or civil rights or any remotely political topic for that matter grows longer, the probability of George Floyd being brought up approaches 1

Anonymous
06/10/24(Mon)13:06:01 No.5581872

Anonymous 06/10/24(Mon)13:06:01 No.5581872

>>5581793
>>5581802
that's mad
cant wait for this era to end

Anonymous
06/10/24(Mon)16:01:31 No.5582040

Anonymous 06/10/24(Mon)16:01:31 No.5582040

>>5580729
How's the project going anon?

Anonymous
06/10/24(Mon)17:19:04 No.5582132

Anonymous 06/10/24(Mon)17:19:04 No.5582132

yeah to anyone wondering rvc is really easy to use, my pc is too shit to run it locally but i use this https://colab.research.google.com/github/hinabl/AICoverGen-Colab/blob/main/Hina_Mod_AICoverGen_colab.ipynb

it's really easy, just look at it and fuck around if you're not a total nigger brain you'll figure it out pretty quick.

Anonymous
06/10/24(Mon)21:18:18 No.5582414

Anonymous 06/10/24(Mon)21:18:18 No.5582414

>>5582132
if you are running it locally though, just be sure you have the C++ from the windows dev tools and generic python libraries installed. i got a headache trying to install it on a laptop I've got before I figured that out

Anonymous
06/11/24(Tue)05:53:58 No.5582757

Anonymous 06/11/24(Tue)05:53:58 No.5582757

>>5582132
>you're not a total nigger brain you'll figure it out pretty quick.
I must be a niggerbrained cause I don't know how to get models to work. The list doesn't show up and I don't know where to look for them.

Anonymous
06/11/24(Tue)09:42:56 No.5582896

Anonymous 06/11/24(Tue)09:42:56 No.5582896

>>5582757
to get a voice in RVC, you want to put the [name].pth in the RVC>assets>weights folder and the folder with the same [name] that has the .index file in the RVC>logs folder.

If you don't have the index file in the logs folder, you can also put the direct path to it in RVC, but there's no point to doing that.

Then just click the inferencing voice drop-down and select the one you want. RVC will select the index file if the folder is named the exact same. slide the re-sampler to 48000 because why not and mess with stuff until it sounds right.

Anonymous
06/11/24(Tue)10:56:06 No.5582945

Anonymous 06/11/24(Tue)10:56:06 No.5582945

>>5557215
Is this sora? Where can I download?

Anonymous
06/11/24(Tue)11:56:07 No.5582995

Anonymous 06/11/24(Tue)11:56:07 No.5582995

>>5582896
I don't know what literally any of this means. Assume I just have a whole load of sample audio. How do I make a pth file. How do I make an index file.

Anonymous
06/11/24(Tue)12:25:08 No.5583021

Anonymous 06/11/24(Tue)12:25:08 No.5583021

>>5582995
ooh, you're trying to train the voice. unlucky that you need to. i assume it's not a character in the google sheet above.

Go to "Train" in RVC

On the first row, name your model, then select 48k, true, and version 2.

On the second row, enter the path for the folder, leave speaker ID as 0, and click process data.

Third row, leave as is, but select rmvpe_gpu is available. click feature extraction.

fourth row, save every x (I usually do 20 just to be safe) epochs, set the training to around 200 epochs, leave batch size, save only the latest .ckpt, don't cache the training sets, and select no for the saving each save points.

Finally, for model G path, put assets/pretrained_v2/f0G40k.pth and for the model D path, put pretrained assets/pretrained_v2/f0D40k.pth. GPU indexes are 0, and click one-click training

After a while, RVC will save the voice and you can select it from the dropdown

Anonymous
06/11/24(Tue)13:13:56 No.5583062

Anonymous 06/11/24(Tue)13:13:56 No.5583062

>>5557210
B-B-BUT AI WILL MEAN THE CONTENT WILL BE LOW QUALITY
NONONONO CHUDS YOU CANT HAVE FUN PLEASE STOP NOOOOOOOOOOOOO

Anonymous
06/11/24(Tue)13:14:26 No.5583063

Anonymous 06/11/24(Tue)13:14:26 No.5583063

>>5557215
>>5557207
more songs pls

Anonymous
06/11/24(Tue)14:45:28 No.5583149

Anonymous 06/11/24(Tue)14:45:28 No.5583149

>>5583021
But there's no train function in
>>5582132

Anonymous
06/11/24(Tue)14:55:20 No.5583158

Anonymous 06/11/24(Tue)14:55:20 No.5583158

>>5583149
ooh, i have no idea how that collab works. i was giving instructions for local/virtual machine RVC

Anonymous
06/11/24(Tue)15:08:04 No.5583168

Anonymous 06/11/24(Tue)15:08:04 No.5583168

>>5583149
>>5583158
okay, i opened the webui for the colab the other anon posted. There's no training so id recommend downloading RVC yourself if you cant find the voice in the OP.

if you cant run RVC (idk how taxing it is on GPUs) send a mega or huggingface of the choice clips and I can make it for you overnight

Anonymous
06/11/24(Tue)15:30:36 No.5583197

Anonymous 06/11/24(Tue)15:30:36 No.5583197

>>5583168
I'd so appreciate it if you did, anon. Here's the clips I've got
https://files.catbox.moe/asjdgu.zip
Ripped from this vid
https://youtu.be/zwkWWXVt-vY
I'm sorry if I may've come off as rude earlier. It's just all so confusing cause there's a million tutorials all using different methods and stuff and they all assume you know the fuck you're doing and I very much don't.

Anonymous
06/11/24(Tue)15:50:47 No.5583217

Anonymous 06/11/24(Tue)15:50:47 No.5583217

>>5583197
oh hell yeah, curie, good choice. it's wild no one made a model for her. also it's all good. kinda hard to talk and troubleshoot when we cant see the same thing lol. that, and there are a shit ton of outdated tutorials.

I'll get started after work

Anonymous
06/11/24(Tue)16:04:49 No.5583234

Anonymous 06/11/24(Tue)16:04:49 No.5583234

>>5583217
Tysm anon!

Anonymous
06/11/24(Tue)16:45:17 No.5583286

Anonymous 06/11/24(Tue)16:45:17 No.5583286

>>>5582132
I've been using this all day and now it suddenly hits me with
>Found no NVIDIA driver on your system.

Anonymous
06/11/24(Tue)19:13:33 No.5583441

Anonymous 06/11/24(Tue)19:13:33 No.5583441

>>5583234
aight, i got a bunch of clips from curie dialogue so its ready to train, but it's gonna take like 3 hours so I'm gonna let it run overnight. I'll post the results tomorrow evening or in the morning if I have the time/energy to do it before work lol

Anonymous
06/11/24(Tue)21:28:46 No.5583565

Anonymous 06/11/24(Tue)21:28:46 No.5583565

got inspired by the audiobook anon, but I think my "I want my waifu to direct guided meditation" has an actual air of sad around it lol.
https://voca.ro/1fSWPihDAzUd
https://voca.ro/19L7l16IynQ0
https://voca.ro/14dWHta5REFU

Anonymous
06/12/24(Wed)00:23:28 No.5583714

Anonymous 06/12/24(Wed)00:23:28 No.5583714

>>5583565
Which character's that?

Anonymous
06/12/24(Wed)03:36:08 No.5583850

Anonymous 06/12/24(Wed)03:36:08 No.5583850

File: 1718015480718870.webm (4.77 MB, 1280x720)

4.77 MB WEBM

Anonymous
06/12/24(Wed)08:47:26 No.5584057

Anonymous 06/12/24(Wed)08:47:26 No.5584057

>>5583234
So it's done. I think it's okay, but using it for English loses out on her inflections / a lot of her accent or throaty sounds. (unless I did it wrong, but idk) It's not like elevenlabs where it made the voice from the sound files, but it sounds pretty good when I used a french accent in elevenlabs to cover. I wonder if I try again but with maybe double the audio files its do it, but it may be more than the program can handle.

American Accent Elevenlabs Vocals dubbed over: https://voca.ro/12XThtMgaJKC

French Accent Elevenlabs Vocals dubbed over: https://voca.ro/1oboDLM2JOXs

Voice files: https://mega.nz/file/vc4CRKaT#CrNLPnZshH5ekgmbWlARL7vxS2lIEEAZtHREN-iJLyU

Anonymous
06/12/24(Wed)10:51:45 No.5584138

Anonymous 06/12/24(Wed)10:51:45 No.5584138

>>5584057
Oh my god, anon, thank you so much! I see what you mean, it's not as good as elevenlabs when I tried it out, but it's still pretty fantastic all things considered. Still, thank you, anon. I was pretty bummed out I couldn't do it myself, but you made my day.

Anonymous
06/12/24(Wed)14:43:41 No.5584330

Anonymous 06/12/24(Wed)14:43:41 No.5584330

>>5584138
>>5584057
i tried cloning her with xtts2 and it came out... ok. this is a sample with no postprocessing and almost no input processing. just posting to have some other examples of other processes
https://voca.ro/1kkgCxtrAPqR

Anonymous
06/12/24(Wed)14:53:36 No.5584339

Anonymous 06/12/24(Wed)14:53:36 No.5584339

>>5584138
No prob anon, hope it works out for you

>>5584330
This sounds pretty good imo. When I tried xtts2 I wasn't very impressed but this is good. I may have been moreso mad that it ruined my rvc python setup but thats 100% my fault.

Anonymous
06/12/24(Wed)15:05:52 No.5584353

Anonymous 06/12/24(Wed)15:05:52 No.5584353

>>5584339
are you using separate python versions between RVC and xtts2? how did you install each, or what was making them overlap? i cant remember if xtts2 needed a different version of python than RVC but with venv you shouldnt have issues with them overlapping/conflicting (i dont, anyway)

Anonymous
06/12/24(Wed)16:20:44 No.5584417

Anonymous 06/12/24(Wed)16:20:44 No.5584417

>>5584353
Yeah I'm not using a venv so when I tried to use both it imploded. I'm kinda a brainlette and was happy enough using just rvc so I never saw the point.

Anonymous
06/12/24(Wed)18:46:25 No.5584554

Anonymous 06/12/24(Wed)18:46:25 No.5584554

>>5584330
That sounds pretty good too, though the reverb-ish sound ai voices have is more pronounced here. Really cute choice of text, too.

Anonymous
06/12/24(Wed)18:59:08 No.5584562

Anonymous 06/12/24(Wed)18:59:08 No.5584562

>>5584330
as a second test i trained an rvc model from ~2hrs of her human dialog and applied that over the xtts2 results and that definitely helps with the raspy/reverby ai xtts2 render. you could totally do a whole audio book this way but for something production quality you'd have to do multiple gens and lots of hand picking from best results

https://voca.ro/12UJWWvc6Dhb

Anonymous
06/12/24(Wed)19:17:19 No.5584572

Anonymous 06/12/24(Wed)19:17:19 No.5584572

>>5584562
When you do rvc models, do you split it into a bunch of small clips or do you leave it as a long mp3? Because mine was about 20 min worth of 80 small clips

Anonymous
06/12/24(Wed)19:23:52 No.5584575

Anonymous 06/12/24(Wed)19:23:52 No.5584575

>>5584572
i yt-dlp'd the dialog video posted earlier, cut out just the human dialog in ~5 minute long sections, and used the directory with those clips as the training path. there were 22 five minute long clips and 1 two and a half minute long clip. if i can find a good host for it i'll post my training data and checkpoints for e150, e200, and e250 if you want it. it's ~5 gigs tho

Anonymous
06/12/24(Wed)19:28:51 No.5584578

Anonymous 06/12/24(Wed)19:28:51 No.5584578

>>5584575
>>5584572
but ultimately because of the rvc step 2a (the first "process data" button), it shouldnt matter how the original data is formatted. it should be just as good to leave them separated or concatenated. originally i tried to feed it the entire 2hr long dialog as one audio file and it really really didnt like that so i split it up

Anonymous
06/12/24(Wed)19:47:33 No.5584598

Anonymous 06/12/24(Wed)19:47:33 No.5584598

>>5584575
>>5584578
that would have been way easier, In the only tutorial video I've watched the guy said to use ~10-second clips so I've been doing that for all of my models lol. been a real time sink

Now I know though, and thank god, that'll be way easier than making 80-100 tiny clips.

Anonymous
06/12/24(Wed)21:35:43 No.5584672

Anonymous 06/12/24(Wed)21:35:43 No.5584672

>>5584598
The 10ish seconds is because processing takes more vram the longer the clip, so it's generally advisable to keep it under 30 seconds or less depending on your GPU.
I made myself a script that calls ffmpeg and auto splits all audio files in a directory into 30 second segments. Even though preprocessing should split it, it's not perfect.

Anonymous
06/12/24(Wed)21:53:36 No.5584695

Anonymous 06/12/24(Wed)21:53:36 No.5584695

Dream Machine just dropped
https://lumalabs.ai/dream-machine

how long will it stay up

Anonymous
06/12/24(Wed)22:02:58 No.5584708

Anonymous 06/12/24(Wed)22:02:58 No.5584708

>>5584672
does it matter if it splits in the middle of a word or sentence? because I've been manually clipping between words when possible and if it doesn't then that would be nice

Anonymous
06/12/24(Wed)22:14:11 No.5584718

Anonymous 06/12/24(Wed)22:14:11 No.5584718

>>5584708
In the middle of a sentence? No. It might have some detrimental effect if it cuts it off in the middle of a word, but with like an hour of stuff I can't imagine it would be a huge deal.
When processing, it identifies phonemes (every sound that you can make to form a word) so it's already splitting things into fragments of words in a way. Worst case it would identify the hacked up audio as a different phoneme, but there should be plenty other examples to grab if there's enough audio. I'm sure not gonna spend time manually splitting 2 hours of audio when I can do it in a script.

Anonymous
06/13/24(Thu)01:08:54 No.5584834

Anonymous 06/13/24(Thu)01:08:54 No.5584834

File: oragnge3d_1.webm (2.1 MB, 1024x1024)

2.1 MB WEBM

>>5584695
Luma image input

Anonymous
06/13/24(Thu)01:41:34 No.5584855

Anonymous 06/13/24(Thu)01:41:34 No.5584855

>>5568212
insane. hardcore. beautiful.

Anonymous
06/13/24(Thu)04:34:58 No.5584932

Anonymous 06/13/24(Thu)04:34:58 No.5584932

File: ai - jack of blades as in(...).webm (2.62 MB, 640x360)

2.62 MB WEBM

quick test of an injured hayden christensen model speaking tweaked jack of blades lines. excuse the second obi wan, I forgot to drag it out.
>>5583149
If you look around on huggingface there are a lot like that one with the training function intact. If use find one on colab that lets you train, don't do it on an account you care about, google is very butthurt over competing ai shit.

Anonymous
06/13/24(Thu)12:31:50 No.5585292

Anonymous 06/13/24(Thu)12:31:50 No.5585292

>>5584057
>>5584138
I'm gonna try and make a new model with way more data than this one to see if it can get her accent down. I don't know why it wouldn't if rvc can make a good neco arc

Anonymous
06/13/24(Thu)13:00:55 No.5585323

Anonymous 06/13/24(Thu)13:00:55 No.5585323

>>5585292
your rvc model isn't the problem. your input needs a french accent for it to come through in the conversion. i'll just give you my model since i've already trained more than enough for whatever you need.

>example spoken in french then converted
https://voca.ro/1jPnq7WuQJim

>weights+index (e945, 48k)
https://files.catbox.moe/ssa773.zip

Anonymous
06/13/24(Thu)13:15:50 No.5585337

Anonymous 06/13/24(Thu)13:15:50 No.5585337

>>5585323
converting speaker doing no french accent, then a french accent highlights the difference. this was converted all at once with that rvc model
https://voca.ro/1kd0QQPS3fPi

Anonymous
06/13/24(Thu)13:44:05 No.5585369

Anonymous 06/13/24(Thu)13:44:05 No.5585369

>>5572024
This isn't AI.

Anonymous
06/13/24(Thu)14:25:19 No.5585429

Anonymous 06/13/24(Thu)14:25:19 No.5585429

>>5585337
If you say so. I do t understand why the French accent isn't pulled out like the sounds neco arc makes. Though I guess in curies case it's moreso between words and letters that have the most impact on the accent rather than a nya sound to some words

Anonymous
06/13/24(Thu)18:24:06 No.5585702

Anonymous 06/13/24(Thu)18:24:06 No.5585702

>>5576197
>i was getting decoding errors with this webm on firefox so i re-encoded
So this shit is why I can't open videos sometimes?

Anonymous
06/13/24(Thu)19:04:28 No.5585734

Anonymous 06/13/24(Thu)19:04:28 No.5585734

>>5585702
the technical details were using a jpeg without specifying pix_fmt in ffmpeg. to re-encode i used pix_fmt yuv420p explicitly. it might have had to do with the height not being divisible by 2, but that's usually only a libx264 requirement

Anonymous
06/13/24(Thu)19:25:16 No.5585749

Anonymous 06/13/24(Thu)19:25:16 No.5585749

>>5585429
if accent A uses phoneme X in a word while accent B replaces for phoneme Y, no matter how much you train on accent A any input with accent B will use the model speaker's phoneme Y. afaik there's nothing in rvc to translate phonemes between accents

Anonymous
06/13/24(Thu)19:53:00 No.5585771

Anonymous 06/13/24(Thu)19:53:00 No.5585771

>>5585749
Interesting. Thanks for spelling it out for me and saving my graphics card another 3-4 hours of suffering lol

Anonymous
06/13/24(Thu)21:57:29 No.5585861

Anonymous 06/13/24(Thu)21:57:29 No.5585861

>>5585749
To add to this, different languages have different phonemes. Like Japanese doesn't have "th" for instance. So like if you train on a voice that is only speaking Japanese, it will be lacking certain phonemes used in English. In this sense, sometimes an accent can bleed through, but even then it's pretty slight.
I feel like current voice-to-voice tech is very misunderstood. Basically nobody knows that it's phoneme based, and probably not a lot of people even know what a phoneme is.

Anonymous
06/14/24(Fri)01:06:09 No.5585993

Anonymous 06/14/24(Fri)01:06:09 No.5585993

>>5576197
There was a song by Leonard Cohen called 'The captain", which was probably the second coolest song he did. If Hallelujah didn't take off the way it did, the captain would have probably turned out to be his sleeper hit.
Tom Waits did a slow burner in his younger years before his larynx turned into tire rubber called 'Martha' in the Closing Time album. As time has gone by, I appreciate it more than Swordfish Trombone, but anyway - food for thought.

Anonymous
06/14/24(Fri)01:07:06 No.5585994

Anonymous 06/14/24(Fri)01:07:06 No.5585994

File: 1718251636008379.webm (383 KB, 1024x1024)

383 KB WEBM

Anonymous
06/14/24(Fri)01:14:31 No.5586000

Anonymous 06/14/24(Fri)01:14:31 No.5586000

>>5581344
Holy shit based and blessed

Anonymous
06/14/24(Fri)02:44:59 No.5586084

Anonymous 06/14/24(Fri)02:44:59 No.5586084

File: Postal Dude- How Bad Can (...).webm (1.63 MB, 532x300)

1.63 MB WEBM

Anonymous
06/14/24(Fri)02:53:32 No.5586089

Anonymous 06/14/24(Fri)02:53:32 No.5586089

File: AI - It wasn't me, it was(...).webm (2.95 MB, 512x768)

2.95 MB WEBM

>>5581872
The statues are made to demoralise you, it's to make you feel powerless against them. Which itself is an admission that you aren't as otherwise they wouldn't need to instill that belief.

Anonymous
06/14/24(Fri)08:31:37 No.5586292

Anonymous 06/14/24(Fri)08:31:37 No.5586292

>>5586089
>all these months and suno is still metallic as fuck
shite ai

Anonymous
06/14/24(Fri)08:33:21 No.5586294

Anonymous 06/14/24(Fri)08:33:21 No.5586294

>>5586084
fucking gold

Anonymous
06/14/24(Fri)10:18:54 No.5586363

Anonymous 06/14/24(Fri)10:18:54 No.5586363

File: improved AI.webm (5.79 MB, 428x480)

5.79 MB WEBM

Anonymous
06/14/24(Fri)10:24:58 No.5586370

Anonymous 06/14/24(Fri)10:24:58 No.5586370

>>5585861
That's fair to say. I never really looked into how it wors, so RVC is essentially a scroll that I cast to get my waifu to sing or read. I imagine that's how it is for most people.

Anonymous
06/14/24(Fri)14:13:16 No.5586540

Anonymous 06/14/24(Fri)14:13:16 No.5586540

File: never-forgetti.webm (5.35 MB, 1280x720)

5.35 MB WEBM

>>5586363
Soul vs soulless
>>5586370
It's cool that it's easy enough for anyone to use. It just helps to know more about it so that you know what to expect and have the knowledge to train better models.

Anonymous
06/15/24(Sat)10:00:16 No.5587593

Anonymous 06/15/24(Sat)10:00:16 No.5587593

File: vegeta-youre-the-best.webm (5.93 MB, 1280x720)

5.93 MB WEBM

Anonymous
06/15/24(Sat)14:40:19 No.5587855

Anonymous 06/15/24(Sat)14:40:19 No.5587855

File: It's so over.webm (4.47 MB, 720x720)

4.47 MB WEBM

I'm not ready bros...

Anonymous
06/15/24(Sat)14:46:03 No.5587865

Anonymous 06/15/24(Sat)14:46:03 No.5587865

File: 1718388831148.webm (4.4 MB, 1280x720)

4.4 MB WEBM

>>5584695

Anonymous
06/15/24(Sat)15:32:35 No.5587955

Anonymous 06/15/24(Sat)15:32:35 No.5587955

>>5587865
I was not ready for the AT-AT turning into a fucking mech, kek.

Anonymous
06/15/24(Sat)16:39:07 No.5588036

Anonymous 06/15/24(Sat)16:39:07 No.5588036

>>5587593
amazing. Fore sure his theme song lol. what Vegeta model did you use?

Anonymous
06/15/24(Sat)17:47:12 No.5588086

Anonymous 06/15/24(Sat)17:47:12 No.5588086

>>5588036
> what Vegeta model did you use?
Made it myself just yesterday. It's so-vits-svc, can't speak for compatibility with rvc. I don't mind sharing it if you want but very few people are interested in sovits models.

Anonymous
06/15/24(Sat)18:01:08 No.5588098

Anonymous 06/15/24(Sat)18:01:08 No.5588098

File: sarumanallthesmallthings.webm (3.62 MB, 602x258)

3.62 MB WEBM

Anonymous
06/15/24(Sat)18:38:58 No.5588138

Anonymous 06/15/24(Sat)18:38:58 No.5588138

>>5588098
anon, it's beautiful. I'm in tears.

Anonymous
06/15/24(Sat)23:04:50 No.5588355

Anonymous 06/15/24(Sat)23:04:50 No.5588355

>>5588086
might as well, you never know.
also do you have an mp3 of the song? i can pull it from the webm but it doesn't hurt to ask

Anonymous
06/15/24(Sat)23:20:46 No.5588373

Anonymous 06/15/24(Sat)23:20:46 No.5588373

>>5588355
Webm's audio is 192kbps opus, which should be good enough, but here:
https://vocaroo.com/1b4vzGmxpp0s
https://huggingface.co/chameleon-ai/so-vits-svc-models/tree/main/vegeta

Anonymous
06/16/24(Sun)13:55:46 No.5589068

Anonymous 06/16/24(Sun)13:55:46 No.5589068

File: Vegeta ft Nappa - Break Stuff.webm (4.89 MB, 960x540)

4.89 MB WEBM

Anonymous
06/16/24(Sun)15:52:22 No.5589227

Anonymous 06/16/24(Sun)15:52:22 No.5589227

File: Watermarked Video0da4611e(...).webm (1.38 MB, 752x1080)

1.38 MB WEBM

Anonymous
06/16/24(Sun)16:10:15 No.5589252

Anonymous 06/16/24(Sun)16:10:15 No.5589252

File: sonyanthem_1.webm (5.47 MB, 854x480)

5.47 MB WEBM

console war inspired oc FUCK XBOX

Anonymous
06/16/24(Sun)17:17:47 No.5589326

Anonymous 06/16/24(Sun)17:17:47 No.5589326

>>5589252
>nigger has to get the final word
>it's something gay, retarded, and lame
no surprise there

Anonymous
06/16/24(Sun)19:34:20 No.5589487

Anonymous 06/16/24(Sun)19:34:20 No.5589487

>>5588373
nice, thanks
>>5589068
again, another banger lol

Anonymous
06/16/24(Sun)20:40:45 No.5589551

Anonymous 06/16/24(Sun)20:40:45 No.5589551

>>5589487
lol, different anon, but yeah Vegeta anon does make some great covers.

Anonymous
06/16/24(Sun)22:06:19 No.5589621

Anonymous 06/16/24(Sun)22:06:19 No.5589621

File: Animal I Have Become - Ve(...).webm (3.53 MB, 1024x576)

3.53 MB WEBM

>>5589551
ooh well, doesnt change what I said.
these Vegeta covers made me want to remake this absolute classic lol.

https://files.catbox.moe/hjxhga.webm
https://www.youtube.com/watch?v=qlsLmo08E-4

Anonymous
06/16/24(Sun)22:14:31 No.5589630

Anonymous 06/16/24(Sun)22:14:31 No.5589630

>>5589621
ah damn, i can make this better. i needed to copy the second animal over to the first so it doesn't warble.

Anonymous
06/16/24(Sun)23:07:35 No.5589670

Anonymous 06/16/24(Sun)23:07:35 No.5589670

>>5589630
https://voca.ro/1aGf5rrQPCOq
aight, thats better

Anonymous
06/17/24(Mon)12:40:22 No.5590265

Anonymous 06/17/24(Mon)12:40:22 No.5590265

>>5557207
how can I replicate Patrick Batemans voice?
is there a resource anywhere for just voice models, only seeing artists in this

Anonymous
06/17/24(Mon)13:21:48 No.5590296

Anonymous 06/17/24(Mon)13:21:48 No.5590296

>>5590265
RVC is voice models, people just use them for songs for meme reasons.

Anonymous
06/17/24(Mon)16:59:35 No.5590495

Anonymous 06/17/24(Mon)16:59:35 No.5590495

File: Deutschland by swirlinggp(...).webm (2.8 MB, 1280x720)

2.8 MB WEBM

the real german hymn

Anonymous
06/18/24(Tue)02:55:57 No.5591059

Anonymous 06/18/24(Tue)02:55:57 No.5591059

>>5587865
That is fucking crazy

Anonymous
06/18/24(Tue)07:10:43 No.5591193

Anonymous 06/18/24(Tue)07:10:43 No.5591193

>>5589227
>shooting strings

Anonymous
06/18/24(Tue)23:22:08 No.5592015

Anonymous 06/18/24(Tue)23:22:08 No.5592015

>>5557207
Could Vegeta ever sing "You're out of touch"?

Anonymous
06/19/24(Wed)00:08:42 No.5592052

Anonymous 06/19/24(Wed)00:08:42 No.5592052

>>5592015
https://www.youtube.com/watch?v=xGY9Q86d1gE

Anonymous
06/19/24(Wed)04:52:11 No.5592225

Anonymous 06/19/24(Wed)04:52:11 No.5592225

File: ronnie-mcnutt-time-traveler_6.webm (3.29 MB, 1280x720)

3.29 MB WEBM

>>5557785
The thrilling sequel:

Anonymous
06/19/24(Wed)10:25:48 No.5592438

Anonymous 06/19/24(Wed)10:25:48 No.5592438

>>5587855
wait what is AI here? The song, the video or has AI been used some other way

Anonymous
06/19/24(Wed)11:26:01 No.5592468

Anonymous 06/19/24(Wed)11:26:01 No.5592468

File: collapsev2.webm (2.12 MB, 512x512)

2.12 MB WEBM

Anonymous
06/19/24(Wed)17:04:19 No.5592747

Anonymous 06/19/24(Wed)17:04:19 No.5592747

File: 1692449946245694.webm (5.57 MB, 240x240)

5.57 MB WEBM

>>5585429
NTA, but Japanese has fewer sounds than English, so they have a distinct IPA phonetic sound when you train them with RVC when they are trying to make sounds they aren't trained on. So they'll sound normal when speaking Japanese, but they'll sound like they have an accent when speaking English. Since English and French are almost identical in IPA phonetics, you don't hear a difference.

Anonymous
06/19/24(Wed)20:19:13 No.5592906

Anonymous 06/19/24(Wed)20:19:13 No.5592906

>>5563016
Pretty please if anyone knows what prompts this was made with, please post

Voice sounds like something I'd hear in the 90s when streets were safe to come home from a concert at 4 am

Anonymous
06/19/24(Wed)20:54:25 No.5592933

Anonymous 06/19/24(Wed)20:54:25 No.5592933

File: poo poo in the loo.webm (5.47 MB, 512x768)

5.47 MB WEBM

>>5592906
since the anon who posted it doesn't have the title, i'm afraid we have to get lucky for the creator to show up sometime.
pretty sure it's inspired by your typical 80s european synth-pop band like depeche mode, new order, camouflage, ultravox, duran duran and several dozens others nowadays only heard on nostalgia baiting shows and movies. this sound was insanely popular during the 80s and 90s
my first thought hearing the text and melody was "this gotta be camouflage" but when i started shazaming it and googling the lyrics i realized it was something completely new.
here's the two most wlel known camouflage titles that feel somewhat similar to the AI upload:
https://www.youtube.com/watch?v=k6s1-caKRtQ
https://www.youtube.com/watch?v=9DL_Pxgqmno

pretty sure some day there'll be capable algorithms that are better at finding the source of even the most obscure songs or remixes. or you could find similar songs to the one you already enjoy. while the royal discipline will be creating your own favourite music either from thought or the machine feeding you stuff it knows you'd like.

video unrelated

Anonymous
06/19/24(Wed)21:30:35 No.5592975

Anonymous 06/19/24(Wed)21:30:35 No.5592975

File: bill_loveless.webm (503 KB, 864x1168)

503 KB WEBM

Anonymous
06/19/24(Wed)21:46:46 No.5592991

Anonymous 06/19/24(Wed)21:46:46 No.5592991

>>5557207
do any of the text to speech have a fucking GUI?

Anonymous
06/19/24(Wed)21:49:26 No.5592995

Anonymous 06/19/24(Wed)21:49:26 No.5592995

>>5592991
As the names imply, xtts2-ui and xtts-webui

Anonymous
06/19/24(Wed)21:53:23 No.5592999

Anonymous 06/19/24(Wed)21:53:23 No.5592999

>>5592995
god i wish python wasnt so fucked on windows.

Anonymous
06/19/24(Wed)21:59:02 No.5593006

Anonymous 06/19/24(Wed)21:59:02 No.5593006

>>5592999
Yeah. Unfortunately, most FOSS AI stuff is based on pytorch since data scientists don't know how to program in low level languages.

Anonymous
06/19/24(Wed)22:11:13 No.5593024

Anonymous 06/19/24(Wed)22:11:13 No.5593024

>>5592999
why are you running windows
>m-m-m-muh gaymes
I just streamed bg3 on my shit-ass xubuntu laptop from my stupid windows partition downstairs, I didn't even know it was a thing, flawless
>I don't have two computers
still, dual-boot at the least, also proton and also not being a manchild.

Anonymous
06/19/24(Wed)22:12:24 No.5593028

Anonymous 06/19/24(Wed)22:12:24 No.5593028

>>5593006
like the fuck is this shit? i have to know basic python just to run this shit? christ on a fucking crutch.

shit fucking sucks. figuring out why python doesnt want to do, what its supposed to do, fucking sucks. goddamnit fuck, like what the fuck.

i cant get python to fetch pip, or requests. like i cant get it to do shit.

Anonymous
06/19/24(Wed)22:13:39 No.5593030

Anonymous 06/19/24(Wed)22:13:39 No.5593030

>>5593024
nigger my ubuntu server is sitting right next to me. using it to host shit, kill yourself and install gentoo right now you stupid fucking faggot.

Anonymous
06/19/24(Wed)22:17:09 No.5593034

Anonymous 06/19/24(Wed)22:17:09 No.5593034

>>5593030
>gentoo
nice meme, but I think I'll pass on this one babe ;)
while we're being angry at each other you rotten goblin, I hope things out a little worse than the best case scenario for you.

Anonymous
06/19/24(Wed)22:19:13 No.5593036

Anonymous 06/19/24(Wed)22:19:13 No.5593036

when I said "I hope things out a little worse than the best case scenario for you"
what I meant was "I hope things work out a little worse than the best case scenario for you."

But now that I think about it, I don't hope that for you, it was my mistake.
Now you might be wondering: "Did he hope for the second-worse outcome, or the best outcome?"
Well, I will leave that for your imagination.

Anonymous
06/19/24(Wed)23:02:13 No.5593067

Anonymous 06/19/24(Wed)23:02:13 No.5593067

>>5593028
For the most part just follow the setup instructions on the github page.
At worst you have to know how to set up a virtual environment and install requirements.txt but that's it unless it's some really work-in-progress project that doesn't have a gui and has poor dependency management.
Like basically as long as you have a working python 3.11 install you're good. It doesn't even have to be the same as your default system python (as long as you're pointing to the correct thing in your virtual environment) and you can use conda if you want.

Anonymous
06/19/24(Wed)23:24:19 No.5593087

Anonymous 06/19/24(Wed)23:24:19 No.5593087

god I can't imagine trying to navigate the hurdles of Windows, why even? What is the point, like is it Stockholm Syndrome to get ads in your OS and have to pay for various functionality?

And it's not even like modern linux is matrix waves of nonsense, just fucking put xubuntu or lubuntu on a thumbdrive and live an easier life

Anonymous
06/19/24(Wed)23:40:58 No.5593103

Anonymous 06/19/24(Wed)23:40:58 No.5593103

>>5593087
Are you still on about this? Did Bill Gates piss in your cereal this morning or something? Also before you start more shit I'm not the anon you think, I'm on Linux, and only use Windows at work.

Anonymous
06/20/24(Thu)09:01:15 No.5593435

Anonymous 06/20/24(Thu)09:01:15 No.5593435

>>5557785
Can someone finish this video with him blowing his head off out of the blue instead of blowing a kiss?

Anonymous
06/20/24(Thu)10:13:16 No.5593479

Anonymous 06/20/24(Thu)10:13:16 No.5593479

>>5593435
Pretty sure that video ends with us blowing our heads off. I hope that singer gets throat cancer.

Anonymous
06/20/24(Thu)11:01:38 No.5593525

Anonymous 06/20/24(Thu)11:01:38 No.5593525

>>5578095
kino

Anonymous
06/20/24(Thu)11:09:57 No.5593530

Anonymous 06/20/24(Thu)11:09:57 No.5593530

>>5568225
Wasn't he in Japan for less than 1.5 years?
And you expect me to believe he learned how to read Japanese in that time?

Anonymous
06/20/24(Thu)11:18:36 No.5593536

Anonymous 06/20/24(Thu)11:18:36 No.5593536

>>5593479
I just feel a loud bang would fit in really nicely there. Idk maybe I'll do it myself.

Anonymous
06/20/24(Thu)16:37:11 No.5593780

Anonymous 06/20/24(Thu)16:37:11 No.5593780

File: lache moi.webm (445 KB, 568x726)

445 KB WEBM

Anonymous
06/20/24(Thu)16:43:10 No.5593783

Anonymous 06/20/24(Thu)16:43:10 No.5593783

>>5593780
Context?

Anonymous
06/20/24(Thu)18:59:49 No.5593891

Anonymous 06/20/24(Thu)18:59:49 No.5593891

>>5589068
I need this with the TFS versions of them.

Anonymous
06/21/24(Fri)07:40:16 No.5594346

Anonymous 06/21/24(Fri)07:40:16 No.5594346

>>5592933
Hey. Hey, you. Thank you for your diligence

Anonymous
06/21/24(Fri)08:18:24 No.5594367

Anonymous 06/21/24(Fri)08:18:24 No.5594367

>>5592747
I see. That makes perfect sense actually.

Anonymous
06/21/24(Fri)19:35:12 No.5594886

Anonymous 06/21/24(Fri)19:35:12 No.5594886

File: b9768266-2bbd-4d19-9ef8-0(...).webm (1.24 MB, 512x512)

1.24 MB WEBM

Anonymous
06/21/24(Fri)22:16:18 No.5594988

Anonymous 06/21/24(Fri)22:16:18 No.5594988

>>5572024
That's clearly not Ai

Anonymous
06/22/24(Sat)11:32:43 No.5595508

Anonymous 06/22/24(Sat)11:32:43 No.5595508

File: Beautiful Things - Marcel(...).webm (2.96 MB, 576x1024)

2.96 MB WEBM

Anonymous
06/22/24(Sat)12:19:05 No.5595549

Anonymous 06/22/24(Sat)12:19:05 No.5595549

File: wayne_june_dune_00.webm (592 KB, 512x512)

592 KB WEBM

audiobook test. Dune read by Wayne June

Anonymous
06/22/24(Sat)13:46:21 No.5595647

Anonymous 06/22/24(Sat)13:46:21 No.5595647

>>5595549
nice, did you use voice cloning over an existing audio book reading or did you put the words into elevenlabs for the base?

speaking of audiobooks though, hows the project going anon? >>5584138

Anonymous
06/22/24(Sat)13:53:05 No.5595658

Anonymous 06/22/24(Sat)13:53:05 No.5595658

>>5595647
xtts2 to generate a base voice and rvc to squeeze some extra quality out of it same as i did here >>5584562 so it's 100% ai

Anonymous
06/22/24(Sat)14:12:37 No.5595687

Anonymous 06/22/24(Sat)14:12:37 No.5595687

>>5595658
awesome, using both is a great idea

Anonymous
06/22/24(Sat)18:44:22 No.5595959

Anonymous 06/22/24(Sat)18:44:22 No.5595959

>>5595658
Nice. It sounds listenable enough. It's no professional human read but neither is elevenlabs, and you were able to do it with all FOSS to boot.

Anonymous
06/22/24(Sat)21:46:17 No.5596103

Anonymous 06/22/24(Sat)21:46:17 No.5596103

File: 72047_966_4.webm (24 KB, 512x320)

24 KB WEBM

Chameleon anon, I just watched your video on open source animation and saw you were still using AnimateDiff.
I believe most people now are using a more modern and powerful model called ToonCrafter. Rather than generating video directly, you input key frames (usually with in painting or manual cleanup) and it interpolates the animation. It's Apachev2 licenced. Animation attached is only given the start and end frame.
https://github.com/ToonCrafter/ToonCrafter

Anonymous
06/22/24(Sat)22:11:48 No.5596114

Anonymous 06/22/24(Sat)22:11:48 No.5596114

>>5596103
Thanks my dude. I'll definitely check it out. Seems like it's more in the vein of Ebsynth rather than AnimateDiff. Not a replacement but it looks dope.

Anonymous
06/22/24(Sat)23:19:16 No.5596171

Anonymous 06/22/24(Sat)23:19:16 No.5596171

>>5596114
Damn, I don't seem to have enough VRAM for this. It started inferring but ran out after a couple steps. Guess it's something to keep on the wishlist for when either there's ability to offload to CPU or I upgrade my GPU. I have 16GB, my card's not exactly a slouch, but it's AMD so not as memory efficient as Nvidia with xformers.

Anonymous
06/23/24(Sun)09:22:32 No.5596609

Anonymous 06/23/24(Sun)09:22:32 No.5596609

>>5596171
It's unfortunate AMD cards don't have xformers, since it needs 28 GB of vram without it. Xformers is the only reason it fits on my 3090.

Anonymous
06/23/24(Sun)09:48:03 No.5596627

Anonymous 06/23/24(Sun)09:48:03 No.5596627

>>5596609
>28GB
Damn son. I really hope there is some development to offload some of that because that's just not accessible to most people. One of these days I'll probably upgrade to a multi-GPU rig but not any time soon.

Anonymous
06/23/24(Sun)11:03:16 No.5596730

Anonymous 06/23/24(Sun)11:03:16 No.5596730

>>5593780
this nigga sounds like a drill lmao

Anonymous
06/24/24(Mon)03:08:51 No.5597706

Anonymous 06/24/24(Mon)03:08:51 No.5597706

>>5592975
bill's such a patrician

Anonymous
06/24/24(Mon)05:23:03 No.5597797

Anonymous 06/24/24(Mon)05:23:03 No.5597797

>>5592975
vid should loop with a MBV track playing tbqh

Anonymous
06/24/24(Mon)11:59:19 No.5598060

Anonymous 06/24/24(Mon)11:59:19 No.5598060

>>5596609
>>5596627
does xformers knocks it down to 14gb? if so, i'll try running it on my 4060ti

Anonymous
06/24/24(Mon)18:02:37 No.5598427

Anonymous 06/24/24(Mon)18:02:37 No.5598427

>>5598060
That would be a 50% reduction. I doubt it, but the worst that can happen is that it doesn't work, so just try it.

Anonymous
06/24/24(Mon)18:33:51 No.5598433

Anonymous 06/24/24(Mon)18:33:51 No.5598433

Is there some tool that can identify voices and separate audio clips? I'm trying to gather audio from a tv show and just don't want to sit and manually edit out every single line the characters say. Sometimes there's like four characters in a room talking and they only say like one sentence at a time.

Anonymous
06/24/24(Mon)18:41:04 No.5598437

Anonymous 06/24/24(Mon)18:41:04 No.5598437

>>5598433
The technical term for this is "speaker diarization". I don't know the best solution but I know whisper is a thing.
https://github.com/yinruiqing/pyannote-whisper

Anonymous
06/24/24(Mon)18:46:14 No.5598443

Anonymous 06/24/24(Mon)18:46:14 No.5598443

>>5598437
thanks I'll try and see if it works for my stuff

Anonymous
06/24/24(Mon)19:04:27 No.5598454

Anonymous 06/24/24(Mon)19:04:27 No.5598454

>>5592225
This music is creepy as fuck

Anonymous
06/24/24(Mon)19:48:22 No.5598492

Anonymous 06/24/24(Mon)19:48:22 No.5598492

>>5574818
KEK

Anonymous
06/24/24(Mon)19:54:35 No.5598498

Anonymous 06/24/24(Mon)19:54:35 No.5598498

>>5598454
yeah, it's sort of like some vidya/movie music that plays during thrilling sequences but it has this atonal sound to it tht makes it slightly unnerving

Anonymous
06/24/24(Mon)19:56:24 No.5598501

Anonymous 06/24/24(Mon)19:56:24 No.5598501

>>5598427
OoM unfortunately. Maybe there's a config I can tweek.

Anonymous
06/24/24(Mon)20:12:30 No.5598515

Anonymous 06/24/24(Mon)20:12:30 No.5598515

File: adrian.webm (2.7 MB, 864x1080)

2.7 MB WEBM

Anonymous
06/25/24(Tue)02:03:33 No.5598777

Anonymous 06/25/24(Tue)02:03:33 No.5598777

>>5557210
If this was real it would make a billion.

Anonymous
06/25/24(Tue)02:46:44 No.5598787

Anonymous 06/25/24(Tue)02:46:44 No.5598787

File: out.webm (2.95 MB, 400x400)

2.95 MB WEBM

>>5557207

Anonymous
06/25/24(Tue)17:07:23 No.5599314

Anonymous 06/25/24(Tue)17:07:23 No.5599314

File: Kid holding fried chicken(...).webm (730 KB, 864x1168)

730 KB WEBM

Anonymous
06/25/24(Tue)19:34:45 No.5599388

Anonymous 06/25/24(Tue)19:34:45 No.5599388

>>5557207
>Models, mega links, and mirrors: https://docs.google.com/spreadsheets/d/1tAUaQrEHYgRsm1Lvrnj14HFHDwJWl0Bd9x0QePewNco/edit#gid=0
Is this thing dead?

Anonymous
06/25/24(Tue)21:53:07 No.5599467

Anonymous 06/25/24(Tue)21:53:07 No.5599467

File: 1709306924916197.webm (1.98 MB, 864x1168)

1.98 MB WEBM

Anonymous
06/25/24(Tue)22:01:48 No.5599474

Anonymous 06/25/24(Tue)22:01:48 No.5599474

>>5599388
I suppose it's time to update the links. I don't check them that often so thanks for letting me know if something's whack.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.