DSP Background - Deep Learning for Audio Classification p.1

DSP Background - Deep Learning for Audio Classification p.1

Seth Adams

5 лет назад

158,241 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@mannibimmel09
@mannibimmel09 - 27.10.2023 00:17

thank you)

Ответить
@mudasserahmad6076
@mudasserahmad6076 - 27.09.2023 20:38

Hey what does it mean dealing with mel spectrograms (128,216,3). By using 3 windows length 93ms,46ms and 23ms and in the end they have write 128,216,3 what does 3 shows here??

Ответить
@TheGroundskeeper
@TheGroundskeeper - 27.05.2023 19:56

Halfway into this video and already love your no BS approach. You're my new "that guy". Solid

Ответить
@user-cn5kp8yf3f
@user-cn5kp8yf3f - 01.05.2023 00:20

explained really clearly!!! thanks a lot.

Ответить
@hlosche
@hlosche - 02.04.2021 22:22

what an angel. What a great playlist!! Thank u

Ответить
@harshmirdhwal
@harshmirdhwal - 13.03.2021 19:24

Hey bro you look like somewhat neville longbottom

Ответить
@marioandresheviacavieres1923
@marioandresheviacavieres1923 - 01.01.2021 01:31

Dude awesome!

Ответить
@MrGailomatico
@MrGailomatico - 11.12.2020 12:22

This was amazing. Thank you!

Ответить
@jcims
@jcims - 16.11.2020 17:38

Still in the video but this is one of the best explanations of STFT i've seen (I'm coming from amateur radio background and have been using these for years without full understanding how these contribute to the spectrogram views we see)

Ответить
@FarhanArRafi
@FarhanArRafi - 03.09.2020 08:44

What's that thumping noise in the video?

Ответить
@merebhayl5826
@merebhayl5826 - 07.08.2020 12:38

You need a highpass filter for your mic though, the banging sub noise is all over.

Ответить
@ibrahimabarry8839
@ibrahimabarry8839 - 13.07.2020 15:57

Hello I hope you are well, I followed your videos on deep learning for audio classification and it was very interesting thanks for everything.
but please ask her something:
if i want to create a machine learning model for transcribing (audio -> written) in a brand new language, like an african language for example, how should i proceed.
thank you

Ответить
@BruinChang
@BruinChang - 01.07.2020 17:45

very instructive talks.

Ответить
@alirachidi2440
@alirachidi2440 - 13.06.2020 01:16

Top video. Thank you!

Ответить
@raulsena3917
@raulsena3917 - 11.06.2020 15:35

Great videos Seth. Thanks a lot!

Ответить
@jayshaligram4474
@jayshaligram4474 - 10.06.2020 08:48

How to deal with and overcome overlapping sounds?

Ответить
@reenathomas4458
@reenathomas4458 - 06.06.2020 13:13

what is your tensorflow version

Ответить
@vladymyrmelnyk2755
@vladymyrmelnyk2755 - 02.06.2020 18:54

ehh.... you should remove your camera video it is distracting . It fine at the beginig but throwing around the screen it really terrible idea.

Ответить
@PauraviWagh
@PauraviWagh - 31.05.2020 06:22

Hi, This is a cool video and has really helped me understand ML in audio, but could you recommend a good source for audio source separation or other audio ML collabs?
Thanks!

Ответить
@TheManInCommand
@TheManInCommand - 19.05.2020 12:35

I’m an Audio Engineer and this is a VERY COOL Channel! Keep creating content! 🤓

Ответить
@powerhub6447
@powerhub6447 - 11.05.2020 12:12

😍😍

Ответить
@sunystudent3513
@sunystudent3513 - 30.04.2020 09:10

That deep noise just threw me off. Maybe I'm the only one but it was very distracting.

Ответить
@alikharabsha5045
@alikharabsha5045 - 14.03.2020 20:33

hello , I have problem in below line :
rand_index=np.random.randint(0,wav.shape[0]-config.step)
type of error :
ValueError: Range cannot be empty (low >= high) unless no samples are taken

can you help me?

Ответить
@notalkguitarampplug-insrev784
@notalkguitarampplug-insrev784 - 13.02.2020 12:39

Hi! I as a fan of guitar amp profiling do you think it could be a game changer for next Kemper or guitar amp plugins in real time?
Thanks :)

Ответить
@harshavardhandonga1137
@harshavardhandonga1137 - 12.02.2020 03:39

Can you suggest some good course to learning deep learning using tensorflow for beginners?

Ответить
@Finite8614
@Finite8614 - 31.12.2019 00:36

This is fantastic! Thanks for sharing

Ответить
@ProudProductions27
@ProudProductions27 - 01.12.2019 09:20

Would I need a different example from this in order to detect a distinct human sound such as a sneeze or cough?

Ответить
@kopalsoni4780
@kopalsoni4780 - 30.11.2019 03:28

Can we use this example for human speech recognition?

Ответить
@muhamad6336
@muhamad6336 - 29.11.2019 17:58

amazing work..keep it up...

Ответить
@fremsoft
@fremsoft - 29.10.2019 16:37

Thank you, this is a very good job! 👨🏻‍🏫

Ответить
@vardantheunderdog
@vardantheunderdog - 08.09.2019 22:52

Nice. Any other resources you could suggest for audio signal processing and deep learning?

Ответить
@mubintirsaiwala9275
@mubintirsaiwala9275 - 24.08.2019 12:19

Hey, Can you please direct me to the haythemfayek's blog because the link that you have provided seems to have expired. Thanks in advance.

Ответить
@saiteja1997
@saiteja1997 - 22.08.2019 20:21

In video you mentioned nyquist frequency = highest frequency...A simple google search gave me : nyquist frequency = 2*(highest frequency)...still very good video.

Ответить
@Batbi1eg
@Batbi1eg - 13.08.2019 21:17

Very helpful, it is like ELI5

Ответить
@Thehoneydroppers
@Thehoneydroppers - 04.08.2019 15:45

Not to split hairs, but when you are talking about the FFT you are showing the equation for the continuous-time Fourier transform.

Ответить
@alexmuhr311
@alexmuhr311 - 27.07.2019 00:01

Excellent video! Great explanations of a very challenging topic.

Ответить
@eduugr
@eduugr - 05.07.2019 22:21

Very useful video, thanks!

Ответить
@maybelle3652
@maybelle3652 - 11.06.2019 17:21

Was verrrrrrrrrrry helpful. You did a good job in explaining. Thank you very much

Ответить
@WizardForPresident
@WizardForPresident - 25.05.2019 19:34

Thanks for the lecture! Just a quick question, can it be used to classification on human voices?

Ответить
@deepanshusingh2527
@deepanshusingh2527 - 25.05.2019 19:17

Thanks, Man. I'm working on Audio Denoising using Deep Learning can you make a video or something relevant to this ??

Ответить
@daniel341996
@daniel341996 - 08.05.2019 19:48

Note that the FFT window size DOES NOT allways have to be power of 2, e.x numpy does not use radix-2 method, so it might be useful if you want accurate analysis on specific frequencies

Ответить
@buzzylogic
@buzzylogic - 23.04.2019 16:29

I have an application that I need to monitor a (fairly noisy) machine for proper operation, it changes sound when it fails, Would Deep Learning for audio would be a good fit?

Ответить
@praburocking2777
@praburocking2777 - 07.04.2019 19:09

hi bro, it seems ur voice is not clear.... i cant get most of the important things

Ответить
@prashanthkolaneru3178
@prashanthkolaneru3178 - 07.04.2019 16:04

Hello, please explain me the final step how 13 coefficients are selected out of 26 coefficients,all 26 filter bank energies are prominent features.

Ответить