TheITN3RD
Contributor
Hi all,
As a hobby project I have started making audiobooks; my YouTube channel is Audio Books Galore. I am trying many verticals, such as Mental Health, Bitcoin, and Blockchain. I have mastered generating the text for an audiobook, the audio, the YouTube thumbnails, and the subtitles, and I use CapCut/Filmora to combine everything into the final content. My primary problem is getting neural, natural-sounding audio with voice modulation. I have tried dozens of AI websites, but all of them have limitations. My books range from 5 minutes to over an hour, and I have yet to come up with a solution for that. I have used offline tools like Balabolka and 2nd Speech Center, combined with Google TTS and Microsoft TTS. I have tried multiple voices and am now using Microsoft George as my default voice, but I am not satisfied with the quality; it is too robotic. I found a good website, but again it is not free. I am looking for neural voices like this. I have tried cloning it online, but again that is not free. Is there a way I can clone a voice offline from any audio file, from an online voice, or from a Windows app like PlayHT or Minimax, and then use it in offline tools like Balabolka or 2nd Speech Center, or with any offline web interface?
Alternatively, can I get high-quality voices like these that are free to use? I have sifted through many projects on GitHub, but all of them are written in Python. Even after installing Python 3.x or Miniconda, I cannot get them to install or work, as I have absolutely zero knowledge of programming or of how to install things using Python.
This is a side hobby project, so I don't want to spend money; each of these voices can cost up to 3 £/$, and I need various modulations (commercial, narration, empathy) for different types of content.
Any help is much appreciated.