Llama 3.2 This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 Collection by meta-llama 20 days ago 363 meta-llama/Llama-3.2-1B Text Generation • Updated 15 days ago • 270k • • 496 meta-llama/Llama-3.2-3B Text Generation • Updated 18 days ago • 88.8k • • 199 meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated 20 days ago • 342k • • 350 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated 20 days ago • 437k • • 336
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen 27 days ago 249 Running 361 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated 20 days ago • 97.3k • 73 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated 20 days ago • 147k • 70 Qwen/Qwen2.5-1.5B Text Generation • Updated 7 days ago • 29k • 30
Molmo Artifacts for open multimodal language models. Collection by allenai 19 days ago 247 allenai/Molmo-72B-0924 Image-Text-to-Text • Updated 4 days ago • 4.35k • 233 allenai/Molmo-7B-D-0924 Image-Text-to-Text • Updated 4 days ago • 34.9k • 365 allenai/Molmo-7B-O-0924 Image-Text-to-Text • Updated 4 days ago • 5.13k • 131 allenai/MolmoE-1B-0924 Image-Text-to-Text • Updated 4 days ago • 11.6k • 106
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. Collection by nvidia 14 days ago 40 nvidia/NVLM-D-72B Image-Text-to-Text • Updated about 2 hours ago • 24.2k • 632
Sapiens Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens Collection by facebook 27 days ago 40 Sapiens: Foundation for Human Vision Models Paper • 2408.12569 • Published Aug 22 • 87 facebook/sapiens Updated 25 days ago • 41 • 210 Running on Zero 20 📊 Sapiens Pose Running on Zero 95 🌍 Sapiens Segmentation
🍓 Ichigo The experimental family designed to train LLMs to understand sound natively. Collection by homebrewltd about 2 hours ago 7 Running on Zero 40 🏢 Ichigo Llama3.1 S Instruct homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-3 Updated about 17 hours ago • 74 • 12 homebrewltd/mini-Ichigo-llama3.2-3B-s-instruct Updated about 16 hours ago • 22 • 11 homebrewltd/Ichigo-llama3.1-s-base-v0.3 Updated about 17 hours ago • 15 • 1
INTELLECT-1 Dataset INTELLECT-1 Training dataset Collection by PrimeIntellect 7 days ago 6 PrimeIntellect/fineweb-edu Viewer • Updated 5 days ago • 1.2B • 3 PrimeIntellect/fineweb Preview • Updated 4 days ago • 4 PrimeIntellect/StackV1-popular Viewer • Updated 7 days ago • 93M • 2 • 1 open-web-math/open-web-math Viewer • Updated Oct 17, 2023 • 6.32M • 4.76k • 270
My most recent datasets Collection by rombodawg 6 days ago 5 rombodawg/Everything_Instruct Viewer • Updated 6 days ago • 4.05M • 216 • 22 rombodawg/Everything_Instruct_Multilingual Viewer • Updated 6 days ago • 5.81M • 19 • 6 rombodawg/code_bagel Viewer • Updated 6 days ago • 2.22M • 15 • 3 rombodawg/code_bagel_hermes-2.5 Viewer • Updated 6 days ago • 2.8M
Qwen2.5-Coder Code-specific model series based on Qwen2.5 Collection by Qwen 20 days ago 74 Running 96 🥸 Qwen2.5-Coder-7B-Instruct Qwen2.5-Coder Technical Report Paper • 2409.12186 • Published 27 days ago • 123 Qwen/Qwen2.5-Coder-1.5B Text Generation • Updated 20 days ago • 5.04k • 21 Qwen/Qwen2.5-Coder-1.5B-Instruct Text Generation • Updated 20 days ago • 12.7k • 25
Qwen2-VL Vision-language model series based on Qwen2 Collection by Qwen 27 days ago 135 Running 463 🌖 Qwen2-VL-72B Qwen/Qwen2-VL-2B-Instruct Image-Text-to-Text • Updated 24 days ago • 231k • 214 Qwen/Qwen2-VL-7B-Instruct Image-Text-to-Text • Updated 24 days ago • 899k • 715 Qwen/Qwen2-VL-72B-Instruct Image-Text-to-Text • Updated 24 days ago • 25.8k • 130