/g/ - Is there FOSS software that uses an AI model that - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
10/13/25(Mon)06:09:40 No.106872917

File: speaking.jpg (97 KB, 1920x1080)

Anonymous 10/13/25(Mon)06:09:40 No.106872917

Is there FOSS software that uses an AI model that can take text, voice, and tone (specific, not just style tags) as 3 separate inputs to generate speech? Or even just text and tone and then I can try to apply a voice onto that as another step.

Anonymous
10/13/25(Mon)06:11:40 No.106872929

Anonymous 10/13/25(Mon)06:11:40 No.106872929

Yes

Anonymous
10/13/25(Mon)06:12:34 No.106872935

Anonymous 10/13/25(Mon)06:12:34 No.106872935

Yes

Anonymous
10/13/25(Mon)06:17:33 No.106872965

Anonymous 10/13/25(Mon)06:17:33 No.106872965

>>106872917
Elevenlabs. Ignore the tards

Anonymous
10/13/25(Mon)06:19:20 No.106872976

Anonymous 10/13/25(Mon)06:19:20 No.106872976

File: 1757329026610341.png (223 KB, 512x512)

223 KB PNG

>>106872965
>Ignore the tards
Usecase for ignoring the tards?

Anonymous
10/13/25(Mon)06:33:22 No.106873066

Anonymous 10/13/25(Mon)06:33:22 No.106873066

>>106872917
There are models that do tts from text prompt where you can set up tone like gpt sovits tts and then you do voice clonning with stuff like rvc. I think this will be your pipeline.
Im too lazy to google it rn

Anonymous
10/13/25(Mon)06:35:11 No.106873087

Anonymous 10/13/25(Mon)06:35:11 No.106873087

>>106872965
>>106873066
Thank you Anons

Anonymous
10/13/25(Mon)06:38:29 No.106873114

Anonymous 10/13/25(Mon)06:38:29 No.106873114

>>106872976
Usecase for ebussy post?

Anonymous
10/13/25(Mon)06:39:54 No.106873125

Anonymous 10/13/25(Mon)06:39:54 No.106873125

>>106873114
It's the new "4you".

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.