Llama 3.2 This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 Collection by meta-llama 20 days ago 364 meta-llama/Llama-3.2-1B Text Generation • Updated 15 days ago • 295k • • 498 meta-llama/Llama-3.2-3B Text Generation • Updated 18 days ago • 96k • 200 meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated 20 days ago • 406k • • 350 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated 20 days ago • 488k • • 337
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen 27 days ago 249 Running 362 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated 20 days ago • 99.7k • 73 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated 20 days ago • 155k • 70 Qwen/Qwen2.5-1.5B Text Generation • Updated 7 days ago • 30.2k • 30
Molmo Artifacts for open multimodal language models. Collection by allenai 19 days ago 247 allenai/Molmo-72B-0924 Image-Text-to-Text • Updated 4 days ago • 4.53k • 233 allenai/Molmo-7B-D-0924 Image-Text-to-Text • Updated 4 days ago • 37.2k • 366 allenai/Molmo-7B-O-0924 Image-Text-to-Text • Updated 4 days ago • 5.28k • 131 allenai/MolmoE-1B-0924 Image-Text-to-Text • Updated 4 days ago • 12.1k • 106
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. Collection by nvidia 14 days ago 41 nvidia/NVLM-D-72B Image-Text-to-Text • Updated about 4 hours ago • 25.3k • 632
Sapiens Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens Collection by facebook 27 days ago 40 Sapiens: Foundation for Human Vision Models Paper • 2408.12569 • Published Aug 22 • 87 facebook/sapiens Updated 25 days ago • 40 • 210 Running on Zero 20 📊 Sapiens Pose Running on Zero 95 🌍 Sapiens Segmentation
🍓 Ichigo The experimental family designed to train LLMs to understand sound natively. Collection by homebrewltd about 4 hours ago 7 Running on Zero 42 🏢 Ichigo Llama3.1 S Instruct homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-3 Updated about 19 hours ago • 91 • 13 homebrewltd/mini-Ichigo-llama3.2-3B-s-instruct Updated about 18 hours ago • 38 • 11 homebrewltd/Ichigo-llama3.1-s-base-v0.3 Updated about 19 hours ago • 18 • 1
INTELLECT-1 Dataset INTELLECT-1 Training dataset Collection by PrimeIntellect 7 days ago 6 PrimeIntellect/fineweb-edu Viewer • Updated 5 days ago • 1.2B • 15 PrimeIntellect/fineweb Preview • Updated 4 days ago • 10 PrimeIntellect/StackV1-popular Viewer • Updated 7 days ago • 93M • 8 • 1 open-web-math/open-web-math Viewer • Updated Oct 17, 2023 • 6.32M • 5.4k • 270
My most recent datasets Collection by rombodawg 6 days ago 5 rombodawg/Everything_Instruct Viewer • Updated 7 days ago • 4.05M • 285 • 22 rombodawg/Everything_Instruct_Multilingual Viewer • Updated 6 days ago • 5.81M • 38 • 6 rombodawg/code_bagel Viewer • Updated 7 days ago • 2.22M • 15 • 3 rombodawg/code_bagel_hermes-2.5 Viewer • Updated 7 days ago • 2.8M
Qwen2.5-Coder Code-specific model series based on Qwen2.5 Collection by Qwen 20 days ago 74 Running 97 🥸 Qwen2.5-Coder-7B-Instruct Qwen2.5-Coder Technical Report Paper • 2409.12186 • Published 27 days ago • 123 Qwen/Qwen2.5-Coder-1.5B Text Generation • Updated 20 days ago • 5.21k • 21 Qwen/Qwen2.5-Coder-1.5B-Instruct Text Generation • Updated 20 days ago • 13.3k • 25
Qwen2-VL Vision-language model series based on Qwen2 Collection by Qwen 27 days ago 135 Running 463 🌖 Qwen2-VL-72B Qwen/Qwen2-VL-2B-Instruct Image-Text-to-Text • Updated 24 days ago • 238k • 215 Qwen/Qwen2-VL-7B-Instruct Image-Text-to-Text • Updated 24 days ago • 908k • 715 Qwen/Qwen2-VL-72B-Instruct Image-Text-to-Text • Updated 24 days ago • 27.5k • 130