Best Voice Transcription AI is now the FASTEST - WHISPER JAX!

1littlecoder

1 год назад

29,581 Просмотров

Скачать видео

Комментарии:

Golden City - 02.10.2023 02:57

Any updates to this?

Ответить

marvin jordan Montalban - 18.09.2023 18:25

is it possibe to have label like speaker a or speaker b.. to label who said this or that

Ответить

Devesh Prabhu Vlogs - 04.09.2023 04:44

Is there any windows app based on this for offline transcription using audio (mp3, wav) files as inputs?

Ответить

Sowjanya Kanchi - 02.08.2023 11:37

can you please share the python code link

Ответить

Carlos Giraldo - 01.08.2023 18:12

Thank you so much!

Ответить

ROMMIX - 25.07.2023 13:05

Late comment, but Whisper even surpasses VOSK api (which was the best offline ASR engine prior to Whisper) which I used to use in some Python scripts for transcriptions and forced alignment.

Ответить

lobsang tenzin - 24.07.2023 22:26

But i wonder y would one need in Text when its easier to listen audio.. :)

Ответить

vivian yglesias - 21.07.2023 10:17

I've used this for Spanish language videos and on some of the transcript it repeats words even though the next dialogue is completely different.

Ответить

RadKrish - 10.07.2023 18:03

How do is convert these transcription to srt file anyone help please

Ответить

Patryk Wojtkowski - 05.07.2023 16:58

Is there any way to use this Whisper-jax in python. I am trying to have a program that transcripts audio files for me and would like to use this to do so. Is it possible to do so.

Ответить

Codie Petersen - 01.07.2023 02:56

Hey man, just subscribed. Normally I see a bunch of hype or garbage ass videos, but yours just gives a nice quick overview, no bs on the latest AI stuff. So I subscribed. Thanks.

Ответить

Salil Jakhadie - LLMs & Generative AI - 30.06.2023 07:16

This is brilliant 🤩 I am curious if we wanted to isolate and identify each individual speaker in an audio, how we go about doing that? What kind of libraries would we use?
For example: If the audio has 3 individuals, I want to identify and label each individual as Person 1-3.
Ideally, if the person introduces themselves or is introduced in the audio label the part of the audio to that person. Adobe Audition, Premiere do something similar.
Can we perhaps us Meta's Segment Anything or is that only for images?
Would love to hear your thoughts

Ответить

Getz Solution - 22.06.2023 21:38

HUGGING FACE IS NOLONGER WORKING ITS SAYING TIMEOUT ERROR 502

Ответить

Rida Choubai - 07.05.2023 19:36

Hi sir, thanks a lot for this tutorial, can you show me how to input my own audio file?

Ответить

KindMulberry - 30.04.2023 15:30

Does this work with any mp3 file and does the program save the files onto a seperate server or are these files deleted after a few hours?

Ответить

Jawad Mansoor - 30.04.2023 09:40

are there any gains of using jax on cpu compared to original model whisper on cpu?

Ответить

Saif AbuGoura - 28.04.2023 15:57

Can anyone help me setting this up? And I’ll pay them of course

Ответить

Mannuel - 26.04.2023 07:11

How can I modify the code to upload my own .mp3 file? It appears pretty straightforward according to the instructions of the Kaggle notebook: "Note that you can also pass any .mp3, .wav, or .flac audio file directly to the Whisper JAX pipeline, and it will automatically load the audio file for you." However, I don't know how to code. Please help.

Ответить