Audio Data Processing in Python

Rob Mulla

2 years ago

146,239 views

Comments:

@omart9411 - 05.12.2023 14:19

It's a shame that this is such a low-level tutorial but you assume that I'm already familiar with the meaning of the terminology.

@amuigayle2231 - 06.11.2023 23:49

I'll probably never get a reply to this, but is it either/or with the STFT and the Mel spectrogram? Why did you not create the Mel spectrogram from the transformed data?

@shaimaalbalushi1739 - 21.10.2023 16:33

What are you using as a notepad to write the code?

@user-xm5wm4zf2r - 16.10.2023 05:20

You're not working in Jupyter?

@user-kb6yo4gq2o - 10.10.2023 02:00

What are the y values that you first extract?

@googul2041 - 09.10.2023 15:57

Glob, librosa, wavered, ivi kakundaa hasalu dsp audio yenduku work avataledu adigo librosa

@mudasserahmad6076 - 24.09.2023 13:36

Hi Rob, interesting video. My task is to create mel spectrograms with window lengths of 93 ms, 46 ms, and 23 ms, and then combine them into one. I am confused by a shape like (128, 216, 3): what does the 3 represent here? The 128 is n_mels and 216 is the number of frames.
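
One possible reading of that shape, offered as a sketch rather than a definitive answer: the three window lengths correspond to three FFT sizes, each producing its own 128-band mel spectrogram, and stacking the three along a last axis gives the 3 (much like image channels). The sample rate of 22050 Hz, the shared hop length of 512, and the 5-second clip below are assumptions chosen because they reproduce the numbers in the question; the helper names are hypothetical.

```python
import math


def window_to_n_fft(window_ms: float, sr: int) -> int:
    """Convert a window length in milliseconds to the nearest power-of-two FFT size."""
    samples = window_ms / 1000 * sr
    return 2 ** round(math.log2(samples))


def frame_count(n_samples: int, hop_length: int) -> int:
    """Frames produced by a centered (edge-padded) STFT: 1 + n_samples // hop_length."""
    return 1 + n_samples // hop_length


SR = 22050        # assumed sample rate
HOP = 512         # assumed hop length, shared by all three resolutions
CLIP_SECONDS = 5  # assumed clip length

# One FFT size per requested window length
n_ffts = [window_to_n_fft(ms, SR) for ms in (93, 46, 23)]

# With a shared hop length, every resolution yields the same frame count
frames = frame_count(CLIP_SECONDS * SR, HOP)

# (n_mels, frames, number of window lengths)
shape = (128, frames, len(n_ffts))
print(n_ffts, shape)
```

Because the hop length is shared, the three spectrograms have identical dimensions, which is what makes stacking them into one array possible.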

@Zizos - 20.09.2023 04:22

As an "I understand what's going on but I'm not a coder" person, I understand that it would take me months if not years to create what I want.
How hard would it be to create an audio visualizer plugin? Like make a plugin for a video editor that takes an audio track, analyzes frequencies with custom ranges and drives parameters based on the loudness of the frequency ranges you've set up?
I'd have to learn how to manage data, memory, incorporate into video editor, libraries, compiling and who knows what else... ah yes, more than basic coding.

@danieldanielineto7228 - 01.09.2023 19:40

Great job, thank you.

@AnimeSyncInfinite - 17.08.2023 17:18

I want to mimic other people's voices with my voice. In short, I will give a small audio sample as input (for example, my voice) and the code will extract the various characteristics of my voice so that I can manipulate it with audio of some other person's voice. Is it possible to do this in Python?

@dannybee9068 - 07.07.2023 13:23

Hello! Thank you for the excellent video! I have a question though: What is the difference in use cases between STFT and Melspectrogram? Both methods appear to extract features for the model, but in distinct ways. I am interested in understanding when one is more advantageous than the other. For example, for sentiment analysis, I think melspec seems more appropriate, but that's nothing more than a guess with a bit of intuition. It feels like if we deal with speech it's better to use melspec, and for any other sound, STFT.

@nixboaski - 07.07.2023 01:20

This is so interesting.

A few days ago I wanted to produce a digital reproduction of a particular musical note, using the note as the basis and its harmonics (I was analysing A=440Hz, but I wrote the script in such a way that I could alter that). So I had basically two aspects to take into account: the frequencies and their amplitudes.

I recorded a note from the piano, cleaned it of noise as much as I could and extracted the amplitudes from it for each frequency that forms an A note. It was terrible! The final result sounded ghastly.

Your video will help me understand how I must proceed to make a digital sound that makes more sense. I totally would like to learn how to use machine learning on audio processing too.
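
One plausible reason a sum of fixed-amplitude harmonics sounds harsh is that it has no amplitude envelope, whereas a real piano note decays over time. A minimal additive-synthesis sketch of that idea follows; the harmonic amplitudes, the exponential decay, and the function name `synthesize_note` are all hypothetical choices for illustration, not values measured from a recording.

```python
import math


def synthesize_note(f0, harmonic_amps, duration=1.0, sr=22050, decay=3.0):
    """Sum sine harmonics of f0 and apply an exponential decay envelope.

    harmonic_amps[k] is the amplitude of harmonic (k + 1) * f0.
    The output is normalized by the sum of amplitudes so it stays in [-1, 1].
    """
    n_samples = int(duration * sr)
    total = sum(harmonic_amps)
    samples = []
    for i in range(n_samples):
        t = i / sr
        envelope = math.exp(-decay * t)  # loudness fades as the note rings out
        value = sum(
            amp * math.sin(2 * math.pi * f0 * (k + 1) * t)
            for k, amp in enumerate(harmonic_amps)
        )
        samples.append(envelope * value / total)
    return samples


# A = 440 Hz with four harmonics of made-up, halving amplitudes
note = synthesize_note(440.0, [1.0, 0.5, 0.25, 0.125])
print(len(note), max(abs(s) for s in note))
```

Changing `f0` moves the note, and a per-harmonic decay rate would be a natural next refinement, since higher harmonics of a real instrument tend to fade faster.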

@durgaganesh423 - 19.06.2023 20:22

Hi
Can you help with finding glitches or audio abnormalities in a WAV file?

@theTenorDrummer - 01.06.2023 21:09

Hey, I'm learning Python and want to eventually be able to analyze a drummer's rhythmic timing vs. a "perfect" performance. Definitely stealing a few nuggets from this. Thanks! Anyone out there want to help me out???

@footkol - 25.05.2023 22:21

Thank you for the informative video. May I ask what software you are using in it? Is it JupyterLab?

@user-xg1hf2ek2v - 08.05.2023 18:02

ana mafdtinich

@TheFullofrage - 03.05.2023 20:04

I have a question about sample rate. Is the sample rate (that integer sr) defined by the librosa.load() method or in some other way? By the way, amazing video! Thank you so much!
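
For context on this question: the sample rate is a property of the recording and is stored in the file, but librosa.load() resamples to 22050 Hz by default unless sr=None is passed, in which case the native rate is kept. The sketch below avoids librosa and uses only Python's standard wave module to read the native rate of a WAV file; the helper name `native_sample_rate` and the temporary silent file are assumptions made so the example is self-contained.

```python
import os
import tempfile
import wave


def native_sample_rate(path):
    """Return the sample rate stored in a WAV file's header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


# Write a tiny silent 16-bit mono file at 44.1 kHz to have something to read
path = os.path.join(tempfile.mkdtemp(), "silence.wav")
with wave.open(path, "wb") as wav_file:
    wav_file.setnchannels(1)      # mono
    wav_file.setsampwidth(2)      # 2 bytes per sample = 16-bit
    wav_file.setframerate(44100)  # the rate recorded in the header
    wav_file.writeframes(b"\x00\x00" * 441)  # 10 ms of silence

rate = native_sample_rate(path)
print(rate)
```

Comparing this header value with the `sr` returned by librosa.load() is a quick way to see whether resampling took place.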

@T0berius - 01.04.2023 18:22

Sir, you are the one.
I'm a hobbyist and this video was useful to me.
Thanks for sharing your time and expertise with us.

@fudgenuggets405 - 11.03.2023 18:38

This guy's videos are so awesome. Big fan.

@fahnub - 08.03.2023 19:00

great vid!

@cerita-ceritanabi3343 - 07.03.2023 05:23

Hi, I want to analyze SNR using a CSV file. I want to know the signal from 0-200 Hz. The CSV file only has the data in G. What should I do for SNR analysis?

@lesptitsoiseaux - 12.02.2023 08:31

Hi Rob, could you give me some pointers please? I suffer from severe sleep apnea which is getting worse, and I'm hoping to build a Raspberry Pi microphone that buzzes me gently when it doesn't hear the CPAP machine's regular whirring up and down with my respiration for more than twenty seconds. How do I write a bot to listen in? My prototype is just to listen for a certain decibel level and constancy of noise, but I'm hoping I can write something I can share so folks can train it on their own specific sample to detect. Thank you from Vancouver!

@youngzproduction7498 - 06.02.2023 11:23

Nice. It’s what I need.

@ErickCalderin - 03.02.2023 05:24

Awesome, thank you. Some months ago I built a music genre classifier using spectrograms and a convolutional neural network. It was the best thing ever, since I got high accuracy on the first attempt.

@bradydyson65 - 03.02.2023 03:21

Why does it seem like the frequency values in the spectrograms are much higher than they should be? I tried to use the same method with a piano sample of an e minor scale and found that the primary frequencies ranged from 1000 Hz to 3000 Hz. Then I noticed that a lot of the frequency content from the speech examples also seems high. Am I doing something wrong?

@anirbanc88 - 01.02.2023 13:30

It's so cool!

@pythonmini7054 - 22.01.2023 22:49

what book can you recommend to learn everything that is needed for machine learning?

@joppofiss4634 - 20.01.2023 17:37

This intrigued me as a data scientist who works with EEG data (brain signals). Signal is signal in the end :)

@conquestech - 19.01.2023 12:08

Awesome!
I wish for a tutorial on TTS & STT technology, audio dataset, in python, to create a model for my indigenous language using IPA phonemes. Thanks

@meditation-tunes - 31.12.2022 20:06

Thanks Rob. Do you know how to compare two audio files and show, for example, the matching rate? Can Python help? For example, I want the user to record a sentence, and then we compare it with our library and check whether it matches, for example, a sad tune.

@shubhamkapoor5152 - 29.12.2022 00:05

Hi, I have an audio dataset with a .gz extension. I don't know how to load it in Python, preprocess it, and extract MFCCs from it. Can you give me a brief idea of what to do? I am very lost about this.

@zachfarrell7162 - 27.12.2022 23:14

Using Python 3.11 and having an awful time with the librosa module. So many dependencies, and it says that librosa is not compatible with 3.11, so 3.7 is required. Anyone know of a way around this?

@mir_intizam - 24.11.2022 19:41

Hello everyone, I have audio data. I want to train on this audio data and use it offline. Is there anyone who can help me?

@larcomj - 22.11.2022 06:38

Great video. Kind of odd to see "sr" for sampling frequency (Fs in the DSP world), but that's me being particular... I'm trying to make the jump from MATLAB to Python, ughhh...

@sandhiyaprabhakannuraj1079 - 19.11.2022 11:59

Given a flute music file, how can we convert the music to notes and decompress the file back to audio blocks using literally any method (trained spectrograms, any ML algorithm, ...)?

@UnleashedDaemon - 18.11.2022 16:30

That's nice, but how do I get multiple of them into an .arff or .csv format to analyze them with, e.g., Weka 3?

@KJ7JHN - 17.11.2022 08:40

Hey Rob, could you please create a video that takes the trimmed audio data, data set, and apply this to the actual mp3 file? This should result in a trimmed MP3 file. Saving need not be necessary. I'd like to open a file, and play specific time stamps within the file. Thanks.

@sandhiyaprabhakannuraj1079 - 14.11.2022 11:29

How can we split an audio file into several equal-sized chunks (with padding)? (I'm dividing the audio into chunks to apply DCT on every chunk.)
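
A minimal sketch of one way to do the chunking, assuming the audio is already loaded as a flat sequence of samples and that the final short chunk should be zero-padded at the end. The function name `split_into_chunks` and the `pad_value` parameter are hypothetical, and the per-chunk DCT is left out.

```python
def split_into_chunks(samples, chunk_size, pad_value=0.0):
    """Split samples into consecutive chunks of exactly chunk_size items.

    The last chunk is right-padded with pad_value when the input length
    is not a multiple of chunk_size.
    """
    chunks = []
    for start in range(0, len(samples), chunk_size):
        chunk = list(samples[start:start + chunk_size])
        # Only the final chunk can be short; extend it to full length
        chunk.extend([pad_value] * (chunk_size - len(chunk)))
        chunks.append(chunk)
    return chunks


chunks = split_into_chunks([0.1, 0.2, 0.3, 0.4, 0.5], chunk_size=2)
print(chunks)  # [[0.1, 0.2], [0.3, 0.4], [0.5, 0.0]]
```

Each resulting chunk has the same length, so a transform such as a DCT can then be applied to every chunk in a simple loop.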

@trevorwelsh - 11.11.2022 07:51

I'm currently an iOS engineer and am looking to get into audio processing/ML. Would you recommend Python over C++? Most recommend Python for beginners and ML in general, but I have also mostly seen C++ recommended for audio work.

@kavinyudhitia - 11.11.2022 05:49

nice thanks!!

@rs9130 - 03.11.2022 09:09

Thanks for the tutorial.
How can I convert to time series data with respect to frames?
ex: (time_step, feature_dim)

@ademhilmibozkurt7085 - 02.11.2022 21:44

I love the content, thank you. Please make more :)

@irgiahmadmaulana5915 - 31.10.2022 09:51

Does anyone know how to compute descriptive statistics based on audio like this? I mean, what should we count? Or you can send a reference instead. Thank you.

@Pxmuchim - 24.10.2022 10:44

Hi! Would you be able to create a tutorial on how we can use the processed audio data (such as the one in this video) to train a machine-learning model? Thanks for the great video!

@danielolmos5484 - 23.10.2022 05:53

Thanks for this! Very nice for beginners in this area

@DialectDialogue - 12.10.2022 08:59

This is really awesome! What's your setup to get to this? I'm on a Mac and so far librosa has not been successful... any tips? SQL background & new to Python 😬

@liamgough479 - 05.10.2022 22:41

Hi, as a newcomer to Python, I want to get to a stage where I can use librosa to play around with audio in raw form, as opposed to using Audacity etc. I'm specifically interested in band filtering. If I were to follow your excellent videos, where will I find the basics to start with, and which Python app are you using or would you recommend for Windows 10? I am pretty good with Excel and VBA and have a reasonable understanding of audio, so I'm hoping that I can get a good start on learning Python.

@shubhampadekar2590 - 28.09.2022 16:30

Hi,
I hope you are doing well. Excellent tutorial.

May I know how I should approach the problem of detecting (I just want to detect the presence of) background noise in an audio file?

Which Python audio libraries can be useful?
