Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sugatoray
's Collections
LLMs
LLM Tools
AV LLMs
LLM Training Datasets
Papers
Leaderboards 🔥
Papers-MoE
Papers-LLMEval
LLM LLAMA3
Papers-Fundamentals
TFM: TimeSeries Foundation Models
Papers-Benchmarks
LLMs-EmbeddingModels
LLMs + Mamba
LLM + Datasets : Finance
AV LLMs
updated
3 days ago
A collection of Audio, Video and Visual LLMs.
Upvote
2
myshell-ai/OpenVoice
Text-to-Speech
•
Updated
Apr 24
•
388
Running
954
🤗
OpenVoice
dataautogpt3/ProteusV0.3
Text-to-Image
•
Updated
Feb 12
•
41.3k
•
91
ByteDance/SDXL-Lightning
Text-to-Image
•
Updated
Apr 3
•
75.9k
•
1.89k
openai/whisper-large-v3
Automatic Speech Recognition
•
Updated
Aug 12
•
4.11M
•
•
3.52k
stabilityai/TripoSR
Image-to-3D
•
Updated
Aug 9
•
25.4k
•
447
Efficient-Large-Model/VILA-7b
Text Generation
•
Updated
Mar 4
•
481
•
25
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19
•
116k
•
107
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20
•
102k
•
895
stabilityai/stable-audio-open-1.0
Text-to-Audio
•
Updated
Jul 31
•
18.1k
•
880
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
•
2406.09246
•
Published
Jun 13
•
36
aiola/whisper-medusa-v1
Updated
Aug 3
•
191
•
174
merve/idefics3llama-vqav2
Updated
25 days ago
•
8
black-forest-labs/FLUX.1-schnell
Text-to-Image
•
Updated
Aug 16
•
965k
•
•
2.47k
Running
on
Zero
99
😻
Llama3.1 S V0.2 Checkpoint 2024 08 20
gpt-omni/mini-omni
Text-to-Speech
•
Updated
Sep 4
•
4
•
373
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
12 days ago
•
6.6k
•
369
Running
on
Zero
142
📲🫴🏻👁
Tonic's GOT OCR
GOT - OCR (from : UCAS, Beijing)
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
18 days ago
•
174k
•
917
apple/coreml-sam2-large
Mask Generation
•
Updated
23 days ago
•
179
•
18
coreml-projects/sam-2-studio
Updated
5 days ago
•
14
mistralai/Pixtral-12B-2409
Updated
5 days ago
•
9
•
357
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
1 day ago
•
2.29k
•
209
openai/whisper-large-v3-turbo
Automatic Speech Recognition
•
Updated
1 day ago
•
30.5k
•
•
557
Revai/reverb-asr
Automatic Speech Recognition
•
Updated
1 day ago
•
11
•
40
Upvote
2
Share collection
View history
Collection guide
Browse collections