Llama 3.2 This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 Collection by meta-llama 21 days ago 367 meta-llama/Llama-3.2-1B Text Generation • Updated 16 days ago • 322k • 517 meta-llama/Llama-3.2-3B Text Generation • Updated 20 days ago • 105k • 206 meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated 21 days ago • 463k • 360 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated 21 days ago • 554k • 347
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen 28 days ago 254 Running 370 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated 21 days ago • 105k • 73 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated 21 days ago • 163k • 72 Qwen/Qwen2.5-1.5B Text Generation • Updated 8 days ago • 32.1k • 30
Llama-3.1-Nemotron-70B SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. Collection by nvidia about 20 hours ago 14 nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • Updated 1 day ago • 132 • 50 nvidia/Llama-3.1-Nemotron-70B-Reward-HF Updated about 20 hours ago • 1.41k • 13 nvidia/HelpSteer2 Viewer • Updated about 20 hours ago • 21.4k • 44.3k • 248 HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published 14 days ago • 10
Molmo Artifacts for open multimodal language models. Collection by allenai 20 days ago 249 allenai/Molmo-72B-0924 Image-Text-to-Text • Updated 6 days ago • 4.78k • 233 allenai/Molmo-7B-D-0924 Image-Text-to-Text • Updated 6 days ago • 42.7k • 370 allenai/Molmo-7B-O-0924 Image-Text-to-Text • Updated 6 days ago • 5.49k • 131 allenai/MolmoE-1B-0924 Image-Text-to-Text • Updated 6 days ago • 12.6k • 109
Gemma-APS Release Gemma models for text-to-propositions segmentation. The models are distilled from a fine-tuned Gemini Pro model applied to multi-domain synthetic data. Collection by google about 21 hours ago 11 Scalable and Domain-General Abstractive Proposition Segmentation Paper • 2406.19803 • Published Jun 28 • 2 google/gemma-2b-aps-it Text Generation • Updated 19 days ago • 24 • 8 google/gemma-7b-aps-it Text Generation • Updated 19 days ago • 28 • 8
LoLCATS Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! Collection by hazyresearch 2 days ago 9 hazyresearch/lolcats-llama-3.1-8b-distill Updated 2 days ago • 9 hazyresearch/lolcats-llama-3.1-8b-ft-lora Updated 2 days ago • 2 hazyresearch/lolcats-llama-3.1-70b Updated 2 days ago • 3 hazyresearch/lolcats-llama-3.1-405b Updated 2 days ago • 7
🍓 Ichigo The experimental family designed to train LLMs to understand sound natively. Collection by homebrewltd 1 day ago 8 Running on Zero 47 🏢 Ichigo Llama3.1 S Instruct homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-3 Updated 2 days ago • 116 • 16 homebrewltd/mini-Ichigo-llama3.2-3B-s-instruct Updated 2 days ago • 60 • 13 homebrewltd/Ichigo-llama3.1-s-base-v0.3 Updated 2 days ago • 27 • 2
INTELLECT-1 Dataset INTELLECT-1 Training dataset Collection by PrimeIntellect 8 days ago 7 PrimeIntellect/fineweb-edu Viewer • Updated 6 days ago • 1.2B • 15 PrimeIntellect/fineweb Preview • Updated 5 days ago • 10 PrimeIntellect/StackV1-popular Viewer • Updated 8 days ago • 93M • 8 • 1 open-web-math/open-web-math Viewer • Updated Oct 17, 2023 • 6.32M • 5.38k • 270
NVLM 1.0 A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. Collection by nvidia 15 days ago 42 nvidia/NVLM-D-72B Image-Text-to-Text • Updated about 5 hours ago • 26.1k • 646
Qwen2-VL Vision-language model series based on Qwen2 Collection by Qwen 28 days ago 138 Running 467 🌖 Qwen2-VL-72B Qwen/Qwen2-VL-2B-Instruct Image-Text-to-Text • Updated 25 days ago • 241k • 218 Qwen/Qwen2-VL-7B-Instruct Image-Text-to-Text • Updated 25 days ago • 917k • 718 Qwen/Qwen2-VL-72B-Instruct Image-Text-to-Text • Updated 25 days ago • 29.3k • 134