Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
diwank
's Collections
K
S1.1
Sam
Audio
thought
Audio
updated
2 days ago
Upvote
-
espnet/yodas2
Updated
Jun 10
•
56.9k
•
23
Flux9665/BibleMMS
Viewer
•
Updated
Jun 16
•
736k
•
18
•
61
google/MusicCaps
Viewer
•
Updated
Mar 8, 2023
•
5.52k
•
329
•
124
ShoukanLabs/AniSpeech
Viewer
•
Updated
Jan 29
•
23.7k
•
47
•
34
muzaik/captioned-audio-1k
Viewer
•
Updated
May 28
•
1.05k
•
2
•
4
aoxo/text2asmr-uncensored
Preview
•
Updated
Feb 19
•
2
•
1
google/fleurs
Updated
Aug 25
•
31.4k
•
242
phongdtd/youtube_casual_audio
Updated
26 days ago
•
2
•
4
ProgramComputer/voxceleb
Updated
Jul 27
•
10
•
49
jhu-clsp/seamless-align
Preview
•
Updated
Jun 2
•
6
•
9
IVLLab/MultiDialog
Updated
Aug 29
•
28
•
10
PetraAI/PetraAI
Updated
Sep 14, 2023
•
5
•
20
ReDUB/SoundHarvest
Viewer
•
Updated
Dec 14, 2023
•
2
•
4
•
2
jhu-clsp/seamless-align-expressive
Updated
Feb 22
•
4
•
3
jg583/NSynth
Updated
Apr 26
•
18
•
17
voice-is-cool/voxtube
Viewer
•
Updated
Feb 13
•
4.46M
•
9
•
9
google/speech_commands
Updated
Jan 18
•
888
•
29
Fhrozen/FSD50k
Preview
•
Updated
May 27, 2022
•
50
•
4
nvidia/parakeet-tdt-1.1b
Automatic Speech Recognition
•
Updated
Apr 30
•
352k
•
76
yl4579/StyleTTS2-LibriTTS
Updated
Nov 21, 2023
•
39
coqui/XTTS-v2
Text-to-Speech
•
Updated
Dec 11, 2023
•
794k
•
1.77k
facebook/wav2vec2-large-robust
Updated
Nov 5, 2021
•
17.2k
•
30
laion/links_to_pocasts_lecture_and_shows_for_tts
Viewer
•
Updated
May 29
•
331k
•
2
•
7
laion/youtube-urls-for-emotional-tts
Viewer
•
Updated
May 21
•
78.3k
•
2
•
3
laion/chirp-v2-dataset
Viewer
•
Updated
Mar 25
•
64
•
665
•
5
speechcolab/gigaspeech
Viewer
•
Updated
Nov 23, 2023
•
364k
•
2.53k
•
87
fixie-ai/boolq-audio
Viewer
•
Updated
Jun 12
•
12.7k
•
184
•
7
fixie-ai/soda-audio
Viewer
•
Updated
Jul 24
•
102k
•
299
•
3
amphion/Emilia
Preview
•
Updated
30 days ago
•
1
•
75
google/cvss
Updated
Feb 10
•
139
•
12
PolyAI/minds14
Updated
26 days ago
•
6.54k
•
73
Qwen/Qwen2-Audio-7B-Instruct
Text2Text Generation
•
Updated
Aug 9
•
74.9k
•
209
infgrad/dialogue_rewrite_llm
Viewer
•
Updated
Feb 17
•
1.64M
•
2
•
11
FBK-MT/Speech-MASSIVE
Viewer
•
Updated
Aug 8
•
97.6k
•
179
•
22
Qwen/Qwen2-Audio-7B
Text2Text Generation
•
Updated
Aug 9
•
27.8k
•
56
Mozilla/whisperfile
Updated
4 days ago
•
1.62k
•
231
vucinatim/spectrogram-captions
Viewer
•
Updated
Jan 3, 2023
•
1k
•
2
•
2
rachit8562/mel_spectogram_bird_audio
Viewer
•
Updated
Jan 7, 2023
•
72.2k
•
2
•
2
novateur/WavTokenizer
Text-to-Speech
•
Updated
9 days ago
•
38
gpt-omni/mini-omni
Text-to-Speech
•
Updated
Sep 4
•
4
•
372
amphion/Emilia-Dataset
Viewer
•
Updated
30 days ago
•
52.9M
•
7.34k
•
70
FLUX that Plays Music
Paper
•
2409.00587
•
Published
Sep 1
•
31
feizhengcong/FluxMusic
Updated
Aug 31
•
57
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
12 days ago
•
6.6k
•
369
ICTNLP/Llama-3.1-8B-Omni
Updated
22 days ago
•
2.7k
•
340
HuggingFaceFV/finevideo
Viewer
•
Updated
13 days ago
•
43.8k
•
337
•
243
kyutai/moshiko-pytorch-bf16
Updated
18 days ago
•
9.56k
•
136
kyutai/moshika-pytorch-bf16
Updated
18 days ago
•
1.47k
•
43
Revai/reverb-asr
Automatic Speech Recognition
•
Updated
1 day ago
•
11
•
39
Upvote
-
Share collection
View history
Collection guide
Browse collections