Llama 3.2 This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 Collection by meta-llama 17 days ago 354 meta-llama/Llama-3.2-1B Text Generation • Updated 12 days ago • 232k • 447 meta-llama/Llama-3.2-3B Text Generation • Updated 15 days ago • 77k • • 180 meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated 17 days ago • 305k • • 330 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated 17 days ago • 339k • • 314
Molmo Artifacts for open multimodal language models. Collection by allenai 16 days ago 243 allenai/Molmo-72B-0924 Image-Text-to-Text • Updated 1 day ago • 3.92k • 226 allenai/Molmo-7B-D-0924 Image-Text-to-Text • Updated 1 day ago • 31.5k • 350 allenai/Molmo-7B-O-0924 Image-Text-to-Text • Updated 1 day ago • 4.81k • 127 allenai/MolmoE-1B-0924 Image-Text-to-Text • Updated 1 day ago • 11k • 101
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen 24 days ago 242 Running 351 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated 17 days ago • 91k • 66 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated 17 days ago • 115k • 62 Qwen/Qwen2.5-1.5B Text Generation • Updated 4 days ago • 25k • 29
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. Collection by nvidia 11 days ago 38 nvidia/NVLM-D-72B Image-Text-to-Text • Updated 4 days ago • 21.3k • 603
Salamandra 🦎 Collection by BSC-LT 11 days ago 28 BSC-LT/salamandra-2b Text Generation • Updated 2 days ago • 432 • 16 BSC-LT/salamandra-7b-instruct Text Generation • Updated 2 days ago • 1.59k • 19 BSC-LT/salamandra-7b Text Generation • Updated 2 days ago • 936 • 9 BSC-LT/salamandra-2b-instruct Text Generation • Updated 2 days ago • 479 • 10
Sapiens Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens Collection by facebook 24 days ago 38 Sapiens: Foundation for Human Vision Models Paper • 2408.12569 • Published Aug 22 • 86 facebook/sapiens Updated 22 days ago • 72 • 210 Running on Zero 19 📊 Sapiens Pose Running on Zero 94 🌍 Sapiens Segmentation
Gemma 2 Release Collection by google Sep 9 181 google/gemma-2-2b Text Generation • Updated Aug 7 • 7.47M • 364 google/gemma-2-2b-it Text Generation • Updated Aug 27 • 339k • 603 google/gemma-2-9b Text Generation • Updated Aug 7 • 145k • 564 google/gemma-2-9b-it Text Generation • Updated Aug 27 • 313k • 495
LLaVA-Video Models focus on video understanding (previously known as LLaVA-NeXT-Video). Collection by lmms-lab 7 days ago 46 Video Instruction Tuning With Synthetic Data Paper • 2410.02713 • Published 9 days ago • 33 lmms-lab/LLaVA-Video-178K Viewer • Updated 1 day ago • 1.63M • 517 • 45 lmms-lab/LLaVA-Video-7B-Qwen2 Video-Text-to-Text • Updated 2 days ago • 56.4k • 17 lmms-lab/LLaVA-Video-72B-Qwen2 Text Generation • Updated 5 days ago • 1.03k • 8
Moshi v0.1 Release MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi Collection by kyutai 24 days ago 206 kyutai/moshiko-pytorch-bf16 Updated 24 days ago • 15.9k • 138 kyutai/moshika-pytorch-bf16 Updated 24 days ago • 10.7k • 42 kyutai/moshiko-mlx-q4 Updated 24 days ago • 2.69k • 22 kyutai/moshika-mlx-q4 Updated 24 days ago • 533 • 9
Phi-3 Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. Collection by microsoft 24 days ago 474 microsoft/Phi-3.5-mini-instruct Text Generation • Updated 24 days ago • 395k • • 548 microsoft/Phi-3.5-MoE-instruct Text Generation • Updated 15 days ago • 37.8k • 497 microsoft/Phi-3.5-vision-instruct Image-Text-to-Text • Updated 15 days ago • 334k • 517 microsoft/Phi-3.5-mini-instruct-onnx Text Generation • Updated about 1 month ago • 567 • 8