Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
707
3
43
Sanchit Gandhi
sanchit-gandhi
Follow
cruelpleasure's profile picture
noxinc's profile picture
manvinder01's profile picture
509 followers
·
13 following
sanchitgandhi99
sanchit-gandhi
AI & ML interests
Open-Source Speech
Articles
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
Feb 27
•
31
Speculative Decoding for 2x Faster Whisper Inference
Dec 20, 2023
•
13
AudioLDM 2, but faster ⚡️
Aug 30, 2023
•
4
A Complete Guide to Audio Datasets
Dec 15, 2022
•
17
Fine-Tune Whisper with 🤗 Transformers
Nov 3, 2022
•
92
Organizations
sanchit-gandhi
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
hf-audio/open_asr_leaderboard
about 2 months ago
Drop common voice and update rtfx
#17 opened about 2 months ago by
sanchit-gandhi
New activity in
openai/whisper-large-v3
2 months ago
Update README.md
#144 opened 2 months ago by
sanchit-gandhi
New activity in
facebook/multilingual_librispeech
2 months ago
Missing audio files.
3
#12 opened 5 months ago by
grzegorz700
New activity in
sanchit-gandhi/whisper-jax-spaces
2 months ago
Update app.py
2
#1 opened 2 months ago by
Satyam-Singh
New activity in
google/gemma-2-2b
2 months ago
Update README.md
#10 opened 2 months ago by
sanchit-gandhi
New activity in
google/gemma-2-2b-it
2 months ago
Update README.md
#10 opened 2 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
3 months ago
GGML format please!
2
#8 opened 3 months ago by
dnhkng
New activity in
speechcolab/gigaspeech2
3 months ago
[Help Wanted] Support for GigaSpeech 2 Splits
14
#4 opened 4 months ago by
ruby11dog
New activity in
distil-whisper/distil-medium.en
4 months ago
Just can't run!
15
#14 opened 7 months ago by
awesomeandy
New activity in
openai/whisper-large-v3
4 months ago
list index out of range with word level timestamps
3
#131 opened 4 months ago by
mkvbn
New activity in
eustlb/distil-large-v3-fr
4 months ago
Update README.md
#1 opened 4 months ago by
sanchit-gandhi
New activity in
openai/whisper-large-v3
4 months ago
errors when running whisper locally
2
#130 opened 4 months ago by
luigimontaleone
New activity in
facebook/mms-1b-all
4 months ago
Translation - Will this model works with translation as well, i.e. if we have audio in Spanish, and set the model to give transcription in English?
1
#19 opened 4 months ago by
rizwanishaq
New activity in
openslr/librispeech_asr
4 months ago
Enable Dataset Viewer
3
#6 opened 4 months ago by
sanchit-gandhi
New activity in
openai/whisper-large-v3
4 months ago
Set temperature and prompt possible?
1
#128 opened 4 months ago by
jeffuli755
New activity in
utter-project/mHuBERT-147
4 months ago
How did you convert your HuBERTs to .pt formats?
8
#1 opened 4 months ago by
NeuroDonu
New activity in
distil-whisper/distil-large-v3
4 months ago
Set temperature and prompt possible?
2
#6 opened 4 months ago by
jeffuli755
New activity in
facebook/multilingual_librispeech
4 months ago
Fix streaming mode
#13 opened 4 months ago by
sanchit-gandhi
Corrupted texts in French train set
1
#11 opened 6 months ago by
lukespeech
New activity in
openai/whisper-large-v3
4 months ago
Update README.md
3
#126 opened 4 months ago by
reach-vb
New activity in
distil-whisper/distil-large-v3
4 months ago
Update README.md
1
#5 opened 4 months ago by
reach-vb
New activity in
facebook/mms-tts-eng
4 months ago
Create preprocessor_config.json
2
#12 opened 4 months ago by
adityaedy01
New activity in
facebook/mms-tts-som
4 months ago
where is the preprocessor_config.json for this model?
1
#1 opened 4 months ago by
adityaedy01
New activity in
facebook/wav2vec2-xls-r-1b-21-to-en
4 months ago
Incorrect config file
4
#5 opened 7 months ago by
shrey-jasuja
Update Example Code Snippets
#6 opened 4 months ago by
sanchit-gandhi
New activity in
Aspik101/distil-whisper-large-v3-pl
4 months ago
Model Discussion
7
#2 opened 9 months ago by
sanchit-gandhi
New activity in
facebook/wav2vec2-large-960h-lv60-self
5 months ago
facing issues while using access token of the following model facebook/wav2vec2-large-960h-lv60-self
1
#8 opened 5 months ago by
Webster9
New activity in
openai/whisper-large-v3
5 months ago
KeyError: 'whisper'
1
#116 opened 5 months ago by
aiyaqingzheng
New activity in
parler-tts/parler-tts-mini-expresso
5 months ago
What to use for [train] ? pip install -e .[train]
2
#2 opened 5 months ago by
Kimsui
New activity in
openai/whisper-large-v3
5 months ago
how to transcribe hundreds of local audio files once?
1
#114 opened 5 months ago by
myspace-ai
New activity in
sweet-dreambooths/musicgen-songstarter-v0.2-hf
5 months ago
Upload processor
#2 opened 5 months ago by
sanchit-gandhi
Upload MusicgenMelodyForConditionalGeneration
#1 opened 5 months ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
5 months ago
Error loading dataset
2
#9 opened 6 months ago by
FlarkAI
New activity in
LIUM/tedlium
5 months ago
FileNotFoundError when loading the LIUM/tedlium data on Windows
4
#4 opened 7 months ago by
wondav
New activity in
sanchit-gandhi/musicgen-streaming
6 months ago
Song doesn't appear to play (regardless of any browser)
3
#5 opened 6 months ago by
Nothsa
New activity in
openai/whisper-large-v3
6 months ago
How to get accuracy of transcription from the model?
5
#98 opened 6 months ago by
Atulad
How we can use this model to achieve a real-time trans?
4
#99 opened 6 months ago by
Von-violet
New activity in
parler-tts/parler_tts
6 months ago
Fixed . on a different line.
1
#2 opened 6 months ago by
blaise-tk
minor ui fix
1
#4 opened 6 months ago by
mrfakename
New activity in
parler-tts/parler_tts_mini_v0.1
6 months ago
Inference speed
6
#2 opened 6 months ago by
andreasrath
Link model to the training datasets in metadata
1
#3 opened 6 months ago by
julien-c
Add training datasets to metadata
1
#5 opened 6 months ago by
sanchit-gandhi
Update README.md
#4 opened 6 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
6 months ago
Update alignment heads in gen config
#3 opened 6 months ago by
sanchit-gandhi
New activity in
facebook/voxpopuli
6 months ago
LICENSE question
2
#8 opened 7 months ago by
phoneme
New activity in
sanchit-gandhi/musicgen-streaming
6 months ago
Streaming doesn't work yet with gradio 4.0
#4 opened 6 months ago by
ylacombe
New activity in
distil-whisper/distil-large-v3
6 months ago
about multiple languages?
2
#2 opened 7 months ago by
obtion
New activity in
sanchit-gandhi/whisper-small-hi
6 months ago
Adding `safetensors` variant of this model
#17 opened 11 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-lv-60-espeak-cv-ft
6 months ago
Adding `safetensors` variant of this model
1
#4 opened 11 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-large-xlsr-53
6 months ago
Adding `safetensors` variant of this model
1
#3 opened 7 months ago by
SFconvertbot
New activity in
facebook/wav2vec2-base
6 months ago
Adding `safetensors` variant of this model
1
#2 opened 9 months ago by
SFconvertbot
New activity in
distil-whisper/distil-large-v3-ct2
6 months ago
Update README.md
3
#2 opened 7 months ago by
muhtasham
New activity in
distil-whisper/distil-large-v3-ggml
6 months ago
is it fp16?
3
#1 opened 6 months ago by
supercharge19
New activity in
distil-whisper/distil-large-v3-ct2
7 months ago
Update alignment heads
#1 opened 7 months ago by
sanchit-gandhi
New activity in
distil-whisper/distil-large-v3
7 months ago
How to do multilingual transcription?
3
#1 opened 7 months ago by
emraza110
New activity in
facebook/mms-tts-tao
7 months ago
Reference of the Dataset
1
#1 opened 7 months ago by
ChiaLingWeng
New activity in
openai/whisper-large-v3
7 months ago
How to save the loss value for each step during the training process?
2
#91 opened 7 months ago by
zhouwen999
New activity in
hf-audio/open_asr_leaderboard
7 months ago
[Average WER Calculation] Drop Common Voice WER.
4
#14 opened 7 months ago by
reach-vb
New activity in
openai/whisper-large-v3
7 months ago
Transcript an Spanish audio
4
#86 opened 7 months ago by
Andrews99
New activity in
sanchit-gandhi/whisper-medium-fleurs-lang-id
7 months ago
How do you fine tune Whisper for classification task rather than transcription?
6
#1 opened over 1 year ago by
nkburns
Load more