Best Voice Transcription AI is now the FASTEST - WHISPER JAX!

1littlecoder

1 year ago

29,581 views

Comments:

Golden City - 02.10.2023 02:57

Any updates to this?

marvin jordan Montalban - 18.09.2023 18:25

Is it possible to have labels like Speaker A or Speaker B, to show who said what?

Devesh Prabhu Vlogs - 04.09.2023 04:44

Is there any Windows app based on this for offline transcription using audio files (mp3, wav) as input?

Sowjanya Kanchi - 02.08.2023 11:37

Can you please share the link to the Python code?

Carlos Giraldo - 01.08.2023 18:12

Thank you so much!

ROMMIX - 25.07.2023 13:05

Late comment, but Whisper even surpasses VOSK api (which was the best offline ASR engine prior to Whisper) which I used to use in some Python scripts for transcriptions and forced alignment.

lobsang tenzin - 24.07.2023 22:26

But I wonder why one would need it in text when it's easier to listen to the audio. :)

vivian yglesias - 21.07.2023 10:17

I've used this for Spanish-language videos, and in some of the transcripts it repeats words even though the next line of dialogue is completely different.

RadKrish - 10.07.2023 18:03

How do I convert these transcriptions to an SRT file? Can anyone help, please?
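
One possible way to do this, sketched in plain Python. It assumes the transcription was produced with timestamps and arrives as a list of chunks shaped like `{"timestamp": (start, end), "text": "..."}`, with times in seconds; that shape is an assumption about the Whisper JAX pipeline output when `return_timestamps=True` is passed, so check it against what your run actually returns. The helper names `format_srt_time` and `chunks_to_srt` and the sample data are made up for illustration.

```python
def format_srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks) -> str:
    """Build SRT text from chunks shaped like
    {"timestamp": (start, end), "text": str} (assumed shape)."""
    blocks = []
    for index, chunk in enumerate(chunks, start=1):
        start, end = chunk["timestamp"]
        if end is None:
            # Assumption: a final chunk may come back without an end time.
            end = start
        blocks.append(
            f"{index}\n"
            f"{format_srt_time(start)} --> {format_srt_time(end)}\n"
            f"{chunk['text'].strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


# Hypothetical sample data standing in for real pipeline output.
sample_chunks = [
    {"timestamp": (0.0, 2.5), "text": " Hello and welcome."},
    {"timestamp": (2.5, 5.0), "text": " Today we look at Whisper JAX."},
]
srt_text = chunks_to_srt(sample_chunks)
```

To save the result, write `srt_text` to a file ending in `.srt` with `open("output.srt", "w", encoding="utf-8")`.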

Patryk Wojtkowski - 05.07.2023 16:58

Is there any way to use Whisper JAX in Python? I am trying to write a program that transcribes audio files for me and would like to use this for it. Is that possible?

Codie Petersen - 01.07.2023 02:56

Hey man, just subscribed. Normally I see a bunch of hype or garbage ass videos, but yours just gives a nice quick overview, no bs on the latest AI stuff. So I subscribed. Thanks.

Salil Jakhadie - LLMs & Generative AI - 30.06.2023 07:16

This is brilliant 🤩 I am curious: if we wanted to isolate and identify each individual speaker in an audio file, how would we go about doing that? What kind of libraries would we use?
For example: if the audio has 3 individuals, I want to identify and label each individual as Person 1-3.
Ideally, if a person introduces themselves or is introduced in the audio, that part of the audio would be labeled with that person's name. Adobe Audition and Premiere do something similar.
Can we perhaps use Meta's Segment Anything, or is that only for images?
Would love to hear your thoughts.

Getz Solution - 22.06.2023 21:38

HUGGING FACE IS NO LONGER WORKING. IT'S SAYING TIMEOUT ERROR 502

Rida Choubai - 07.05.2023 19:36

Hi sir, thanks a lot for this tutorial, can you show me how to input my own audio file?

KindMulberry - 30.04.2023 15:30

Does this work with any mp3 file, and does the program save the files onto a separate server or are these files deleted after a few hours?

Jawad Mansoor - 30.04.2023 09:40

Are there any gains from using JAX on CPU compared to the original Whisper model on CPU?

Saif AbuGoura - 28.04.2023 15:57

Can anyone help me set this up? I'll pay, of course.

Mannuel - 26.04.2023 07:11

How can I modify the code to upload my own .mp3 file? It appears pretty straightforward according to the instructions of the Kaggle notebook: "Note that you can also pass any .mp3, .wav, or .flac audio file directly to the Whisper JAX pipeline, and it will automatically load the audio file for you." However, I don't know how to code. Please help.
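
A minimal sketch of what that notebook note describes: pass the path of your own file to the pipeline as a string. The wrapper `transcribe_file` and the loader `load_pipeline` are hypothetical helpers written for this example. The `FlaxWhisperPipline` class (spelled that way in the whisper-jax package) and the `openai/whisper-large-v2` checkpoint are taken from the library's documented usage; the sketch also assumes the pipeline returns a dict with a `"text"` key, so verify that against your installed version.

```python
from pathlib import Path

# Formats named in the Kaggle notebook note quoted above.
SUPPORTED_SUFFIXES = {".mp3", ".wav", ".flac"}


def transcribe_file(path, pipeline):
    """Check the file extension, then hand the path to the pipeline.

    `pipeline` is any callable that accepts a file path string and
    returns a dict with a "text" key (assumed output shape).
    """
    audio = Path(path)
    if audio.suffix.lower() not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {audio.suffix!r}")
    outputs = pipeline(str(audio))
    return outputs["text"]


def load_pipeline(checkpoint="openai/whisper-large-v2"):
    """Create the Whisper JAX pipeline (requires whisper-jax installed)."""
    # Imported lazily so the helper above can be used and tested
    # without the library. Note the spelling "Pipline".
    from whisper_jax import FlaxWhisperPipline

    return FlaxWhisperPipline(checkpoint)


# Example usage, with "my_audio.mp3" as a placeholder file name:
# print(transcribe_file("my_audio.mp3", load_pipeline()))
```

In a Kaggle or Colab notebook, upload the audio file first and replace the placeholder with the path the notebook shows for it.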

Pankaj Pandit - 26.04.2023 04:44

This is awesome. Any guidance on transcribing FB live videos? Do I need to download them first and then have Whisper transcribe them?

NicodemPL - 25.04.2023 12:49

Can you run it on a Pixel with the Tensor TPU?

nox player74 - 24.04.2023 19:09

Is it possible to run this with a GUI on your own computer, like Const-me's Whisper on GitHub?

Milind Doshi - 24.04.2023 15:09

You're awesome

Rafael P. - 24.04.2023 14:16

When I use Whisper JAX on Hugging Face, which model am I using? I normally like to use the large one.

Erik Schiegg - 24.04.2023 12:28

huggingface link: 502 Bad Gateway

Utkarsh - 24.04.2023 11:01

Hugging Face link not working: 502 Bad Gateway

Ajit Kumar - 24.04.2023 08:58

Is there any GPU version available that can transcribe faster than the OpenAI version?

Rasel Hossain - 24.04.2023 08:57

Is there any model available that can transcribe the Bangla language?

Michael Zumpano - 24.04.2023 07:30

This is spectacular! Thanks for a great video. I was looking for something like this.

The AI Lifestyle - 24.04.2023 07:25

This is going to be revolutionary in medical transcription.

Trillion Knowledge - 24.04.2023 04:56

Is it possible to use this for live transcription?

dyjamerson freire - 24.04.2023 01:34

Is there a cost to use this in my own application?

hitlab - 24.04.2023 01:27

Do you know if this is runnable locally?

crazyfaint - 24.04.2023 01:21

What TPU version does it support?

Victor Multanen - 23.04.2023 23:55

Will it transcribe voice to text in multiple languages?

Devasheesh Mishra - 23.04.2023 23:48

Amazing

NicOri - 23.04.2023 23:25

Thanks.
