Llama 3.2
This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3
Collection by meta-llama • 19 days ago • 362
  meta-llama/Llama-3.2-1B • Text Generation • Updated 14 days ago • 270k • 495
  meta-llama/Llama-3.2-3B • Text Generation • Updated 18 days ago • 88.8k • 198
  meta-llama/Llama-3.2-1B-Instruct • Text Generation • Updated 19 days ago • 342k • 347
  meta-llama/Llama-3.2-3B-Instruct • Text Generation • Updated 19 days ago • 437k • 332
Qwen2.5
Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.
Collection by Qwen • 26 days ago • 249
  Running • 361 • 🚀 Qwen2.5
  Qwen/Qwen2.5-0.5B • Text Generation • Updated 19 days ago • 97.3k • 72
  Qwen/Qwen2.5-0.5B-Instruct • Text Generation • Updated 19 days ago • 147k • 70
  Qwen/Qwen2.5-1.5B • Text Generation • Updated 7 days ago • 29k • 30
Molmo
Artifacts for open multimodal language models.
Collection by allenai • 19 days ago • 247
  allenai/Molmo-72B-0924 • Image-Text-to-Text • Updated 4 days ago • 4.35k • 232
  allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated 4 days ago • 34.9k • 363
  allenai/Molmo-7B-O-0924 • Image-Text-to-Text • Updated 4 days ago • 5.13k • 130
  allenai/MolmoE-1B-0924 • Image-Text-to-Text • Updated 4 days ago • 11.6k • 106
NVLM 1.0
A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks.
Collection by nvidia • 14 days ago • 40
  nvidia/NVLM-D-72B • Image-Text-to-Text • Updated 6 days ago • 24.2k • 627
Sapiens
Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens
Collection by facebook • 26 days ago • 39
  Sapiens: Foundation for Human Vision Models • Paper • 2408.12569 • Published Aug 22 • 87
  facebook/sapiens • Updated 25 days ago • 41 • 210
  Running on Zero • 19 • 📊 Sapiens Pose
  Running on Zero • 95 • 🌍 Sapiens Segmentation
Ichigo
The experimental family designed to train LLMs to understand sound natively.
Collection by homebrewltd • about 9 hours ago • 7
  homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-3 • Updated about 9 hours ago • 74 • 9
  homebrewltd/mini-Ichigo-llama3.2-3B-s-instruct • Updated about 8 hours ago • 22 • 8
  homebrewltd/Ichigo-llama3.1-s-base-v0.3 • Updated about 9 hours ago • 15 • 1
  homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-2 • Updated about 9 hours ago • 340 • 4
INTELLECT-1 Dataset
INTELLECT-1 Training dataset
Collection by PrimeIntellect • 7 days ago • 6
  PrimeIntellect/fineweb-edu • Viewer • Updated 5 days ago • 1.2B • 3
  PrimeIntellect/fineweb • Preview • Updated 4 days ago • 4
  PrimeIntellect/StackV1-popular • Viewer • Updated 7 days ago • 93M • 2 • 1
  open-web-math/open-web-math • Viewer • Updated Oct 17, 2023 • 6.32M • 4.76k • 270
Qwen2-VL
Vision-language model series based on Qwen2
Collection by Qwen • 26 days ago • 135
  Running • 463 • 🌖 Qwen2-VL-72B
  Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • Updated 24 days ago • 231k • 214
  Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated 24 days ago • 899k • 714
  Qwen/Qwen2-VL-72B-Instruct • Image-Text-to-Text • Updated 24 days ago • 25.8k • 130
Qwen2.5-Coder
Code-specific model series based on Qwen2.5
Collection by Qwen • 19 days ago • 74
  Running • 94 • 🥸 Qwen2.5-Coder-7B-Instruct
  Qwen2.5-Coder Technical Report • Paper • 2409.12186 • Published 26 days ago • 123
  Qwen/Qwen2.5-Coder-1.5B • Text Generation • Updated 20 days ago • 5.04k • 21
  Qwen/Qwen2.5-Coder-1.5B-Instruct • Text Generation • Updated 20 days ago • 12.7k • 25
EU20-Benchmarks
Evaluation Benchmarks for 20 European languages.
Collection by openGPT-X • 3 days ago • 4
  openGPT-X/mmlux • Updated about 15 hours ago • 246k
  openGPT-X/arcx • Updated about 15 hours ago • 5.02k
  openGPT-X/hellaswagx • Updated about 15 hours ago • 2.72k
  openGPT-X/gsm8kx • Updated about 15 hours ago • 3.06k