Audio - a diwank Collection



Audio

Updated 2 days ago

espnet/yodas2

Updated Jun 10 • 56.9k • 23
Flux9665/BibleMMS

Viewer • Updated Jun 16 • 736k • 18 • 61
google/MusicCaps

Viewer • Updated Mar 8, 2023 • 5.52k • 329 • 124
ShoukanLabs/AniSpeech

Viewer • Updated Jan 29 • 23.7k • 47 • 34
muzaik/captioned-audio-1k

Viewer • Updated May 28 • 1.05k • 2 • 4
aoxo/text2asmr-uncensored

Preview • Updated Feb 19 • 2 • 1
google/fleurs

Updated Aug 25 • 31.4k • 242
phongdtd/youtube_casual_audio

Updated 26 days ago • 2 • 4
ProgramComputer/voxceleb

Updated Jul 27 • 10 • 49
jhu-clsp/seamless-align

Preview • Updated Jun 2 • 6 • 9
IVLLab/MultiDialog

Updated Aug 29 • 28 • 10
PetraAI/PetraAI

Updated Sep 14, 2023 • 5 • 20
ReDUB/SoundHarvest

Viewer • Updated Dec 14, 2023 • 2 • 4 • 2
jhu-clsp/seamless-align-expressive

Updated Feb 22 • 4 • 3
jg583/NSynth

Updated Apr 26 • 18 • 17
voice-is-cool/voxtube

Viewer • Updated Feb 13 • 4.46M • 9 • 9
google/speech_commands

Updated Jan 18 • 888 • 29
Fhrozen/FSD50k

Preview • Updated May 27, 2022 • 50 • 4
nvidia/parakeet-tdt-1.1b

Automatic Speech Recognition • Updated Apr 30 • 352k • 76
yl4579/StyleTTS2-LibriTTS

Updated Nov 21, 2023 • 39
coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 794k • 1.77k
facebook/wav2vec2-large-robust

Updated Nov 5, 2021 • 17.2k • 30
laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29 • 331k • 2 • 7
laion/youtube-urls-for-emotional-tts

Viewer • Updated May 21 • 78.3k • 2 • 3
laion/chirp-v2-dataset

Viewer • Updated Mar 25 • 64 • 665 • 5
speechcolab/gigaspeech

Viewer • Updated Nov 23, 2023 • 364k • 2.53k • 87
fixie-ai/boolq-audio

Viewer • Updated Jun 12 • 12.7k • 184 • 7
fixie-ai/soda-audio

Viewer • Updated Jul 24 • 102k • 299 • 3
amphion/Emilia

Preview • Updated 30 days ago • 1 • 75
google/cvss

Updated Feb 10 • 139 • 12
PolyAI/minds14

Updated 26 days ago • 6.54k • 73
Qwen/Qwen2-Audio-7B-Instruct

Text2Text Generation • Updated Aug 9 • 74.9k • 209
infgrad/dialogue_rewrite_llm

Viewer • Updated Feb 17 • 1.64M • 2 • 11
FBK-MT/Speech-MASSIVE

Viewer • Updated Aug 8 • 97.6k • 179 • 22
Qwen/Qwen2-Audio-7B

Text2Text Generation • Updated Aug 9 • 27.8k • 56
Mozilla/whisperfile

Updated 4 days ago • 1.62k • 231
vucinatim/spectrogram-captions

Viewer • Updated Jan 3, 2023 • 1k • 2 • 2
rachit8562/mel_spectogram_bird_audio

Viewer • Updated Jan 7, 2023 • 72.2k • 2 • 2
novateur/WavTokenizer

Text-to-Speech • Updated 9 days ago • 38
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4 • 4 • 372
amphion/Emilia-Dataset

Viewer • Updated 30 days ago • 52.9M • 7.34k • 70
FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1 • 31
feizhengcong/FluxMusic

Updated Aug 31 • 57
fishaudio/fish-speech-1.4

Text-to-Speech • Updated 12 days ago • 6.6k • 369
ICTNLP/Llama-3.1-8B-Omni

Updated 22 days ago • 2.7k • 340
HuggingFaceFV/finevideo

Viewer • Updated 13 days ago • 43.8k • 337 • 243
kyutai/moshiko-pytorch-bf16

Updated 18 days ago • 9.56k • 136
kyutai/moshika-pytorch-bf16

Updated 18 days ago • 1.47k • 43
Revai/reverb-asr

Automatic Speech Recognition • Updated 1 day ago • 11 • 39
