My TOP 3 Tips for Training Better AI Voices - RVC Voice Cloning

My TOP 3 Tips for Training Better AI Voices - RVC Voice Cloning

Jarods Journey

10 месяцев назад

12,196 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@denblindedjaligator5300
@denblindedjaligator5300 - 13.01.2024 17:17

Hello Jarod's Journey. I would like to know if you would like to train a module for me, where I have set it to false `You can get up to a higher batch size I can only get up to 26 It sounds like there is an autotuner on, when I have trained over 200 epoches. but it could well be, if you train with 35 batches, that it became more precise. How can I send you my dataset set the pitch to false thanks.

Ответить
@reedmoon3630
@reedmoon3630 - 13.01.2024 06:58

Thanks for the tips. I'm swapping singer voices. I have good data of about 20 minutes. 200 epocs. I used Harvest and RMVP_gpu for both training and processing. The results are ok but I still hear too much of the original singer's voice. What can I adjust to make the cloned voice totally replace the original voice?

Ответить
@miinyoo
@miinyoo - 06.12.2023 06:36

I've banged my head against this for two solid days.
I think the noticeable AI sound is a combination of things. #1 on the list is compressed source audio. #2 is leaving silence pre-processed bits in the dataset. #3 is not enough variety in a dataset. #4 Parameters and turning knobs etc.
I have found making convincing RVC is really really fucking hard. You can do it with other noise in the background and no one notices, but once it's "alone in the room" it always seems to fall on its own face.

Ответить
@victorhugodasilva7285
@victorhugodasilva7285 - 11.11.2023 20:38

Great tutorial! Also, will you marry me? 🥺

Ответить
@rushiahravat701
@rushiahravat701 - 05.11.2023 10:39

hello i am trying to train my voice i have a 2.03 minutes but i am getting this error


torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 4.00 GiB total capacity; 3.41 GiB already allocated; 0 bytes free; 3.44 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

i have a 3050 laptop gpu and amd 5600
i saw in task manager that my gpu is not utilazing
what should i do?

Ответить
@MaisnerProductions
@MaisnerProductions - 07.09.2023 15:48

great tips

Ответить
@motokorcle
@motokorcle - 05.09.2023 08:26

can I use this software on fortnite like live?

Ответить
@synthmaster4959
@synthmaster4959 - 29.08.2023 15:37

Hey man is there a rvc ai download with working tensorboard?

Assuming its a clean windows install

Ответить
@hdhdhvdjgdjjdbjdb5541
@hdhdhvdjgdjjdbjdb5541 - 29.08.2023 13:40

How to minimize the delay when streaming? Get better vga? Is 4060 laptop has better delay than 3060 12gb?

Ответить
@denblindedjaligator5300
@denblindedjaligator5300 - 28.08.2023 15:25

where can i find the guitar model? How can i get the mpeg working on the mac side? i can not train my voices ore make my Model Inference. Should me and my frend use Xformers ore not?

Ответить
@klaurcschwackerberg1880
@klaurcschwackerberg1880 - 28.08.2023 14:39

Would you know if it is already possible to make a training which allows me text to audio from acapella's , but I want to avoid the nightmare training from Tacotron 2 , and use n RVC v2 kind of training nice and easy. So I mean I want to train a model by adding acaopella's to the model, in an easy way like you can do in RVC v2 , without having to transcript every sentence as that is needed for tacotron2 training, , and then when inferencing the model , use the type text to audio ! is that not possible yet ? Wouldn't that be great ? Or did I miss something ?

Ответить
@klaurcschwackerberg1880
@klaurcschwackerberg1880 - 28.08.2023 14:24

Does anyone know a good model for UVR5 that can extract acapella's from music but now without the backing vocals ? I Know X-minus can do this but I want to use UVR5. I just don't know what model I need to choose, thanks

Ответить
@heyheybackup
@heyheybackup - 28.08.2023 02:11

could you do a tutorial connecting this to OBS?

Ответить
@LindaSummer27
@LindaSummer27 - 27.08.2023 23:45

How to download RVC?

Ответить
@AdvancedGamingYT
@AdvancedGamingYT - 27.08.2023 23:24

Any tips for the real time voice changer? I can't get it to sound right :/

Ответить
@Avax84
@Avax84 - 27.08.2023 22:26

Does it help if you clean up your device? I’m having the voice changer with cpu (and AMD Radeon graphics card) and what ever I do, on discord it’s extremely slow. Also i can’t use CUDA because when I check if it works in the console it keeps saying “false”

Ответить
@76es84
@76es84 - 27.08.2023 21:09

Bro you looks gay

Ответить
@moriakiinamine1372
@moriakiinamine1372 - 27.08.2023 17:03

Hello! The new RVC update makes training with CUDA faster. With my RTX4070ti it takes 30 seconds per epoch

Ответить
@Antonsetiady
@Antonsetiady - 27.08.2023 16:43

Thanks sir

Ответить
@macdoctorsg
@macdoctorsg - 27.08.2023 15:53

great tutorial mate! I realized a lot of your videos have your voice (audio) outta sync with your visual, i.e. seems like your video couldn't catch-up with your voice.

Ответить
@greenockscatman
@greenockscatman - 27.08.2023 15:17

Solid tips all around! You're right to put the dataset first because "garbage in, garbage out" is probably the first thing you're going to learn through trial and error. Appreciate this vid is mostly geared towards AI voice changing, but if you're doing any AI music where you want to change the vocals, my tip is to not go overboard with UVR in trying to "clean up" the target voice (singer in the song you're wanting to replace the vocals for). Lots of times just a single pass through of Kim Vocal 1 sounds miles better than doing that + de-echo, dereverb etc. It's easy to end up losing some of the little qualities of the song that make it sound good if you clean it up too much.

Ответить
@jlobstertv
@jlobstertv - 27.08.2023 14:26

The UVR tool is effective at separating music vocals from instrumentals; however, in certain instances, there may be some static noise present in the background of the UVR Vocal output. Therefore, it is not guaranteed to work flawlessly for removing background noise in general audio recordings. To ensure clean recordings, it's advisable to use a microphone with noise cancellation capabilities in conjunction with Krisp, a noise-canceling AI app, during the recording process. Additionally, I wish I had known to "start with small datasets" earlier, as I've already set 1000 total epochs for my voice model and it is still training as of now🤣. 15 more hours is my estimated time of completion, I just hope it will turn out well🙏

Ответить
@Joe-hp6jz
@Joe-hp6jz - 27.08.2023 13:46

Is it recommended to train voice samples (talking) and singing voice samples together, or would that compromise the overall quality? Would it be better to train only singing voice samples to make an AI song cover?

Ответить
@jaimeleau
@jaimeleau - 27.08.2023 12:45

Thanks man 💪

Ответить
@SparkysTechCorner
@SparkysTechCorner - 27.08.2023 11:00

Good video, good info stuff Iv came across in my own trial and error. Keep up the good work man

Ответить
@luzmartineza
@luzmartineza - 27.08.2023 10:34

Nice set up. Better lightning and new background. A lot of improvements there.

Ответить
@nickysingha39
@nickysingha39 - 27.08.2023 10:27

Any voice changer for mobile phone

Ответить