Collections

Discover the best community collections!

Collections trending this week

This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3.

meta-llama/Llama-3.2-1B

Text Generation • Updated 16 days ago • 322k • 517
meta-llama/Llama-3.2-3B

Text Generation • Updated 20 days ago • 105k • 206
meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated 21 days ago • 463k • 360
meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated 21 days ago • 554k • 347

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.

🚀 Qwen2.5

Space • Running • 370
Qwen/Qwen2.5-0.5B

Text Generation • Updated 21 days ago • 105k • 73
Qwen/Qwen2.5-0.5B-Instruct

Text Generation • Updated 21 days ago • 163k • 72
Qwen/Qwen2.5-1.5B

Text Generation • Updated 8 days ago • 32.1k • 30

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024.

Updated about 20 hours ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Text Generation • Updated 1 day ago • 132 • 50
nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated about 20 hours ago • 1.41k • 13
nvidia/HelpSteer2

Viewer • Updated about 20 hours ago • 21.4k • 44.3k • 248
HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published 14 days ago • 10

Artifacts for open multimodal language models.

allenai/Molmo-72B-0924

Image-Text-to-Text • Updated 6 days ago • 4.78k • 233
allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated 6 days ago • 42.7k • 370
allenai/Molmo-7B-O-0924

Image-Text-to-Text • Updated 6 days ago • 5.49k • 131
allenai/MolmoE-1B-0924

Image-Text-to-Text • Updated 6 days ago • 12.6k • 109

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from a fine-tuned Gemini Pro model applied to multi-domain synthetic data.

Updated about 21 hours ago

Scalable and Domain-General Abstractive Proposition Segmentation

Paper • 2406.19803 • Published Jun 28 • 2
google/gemma-2b-aps-it

Text Generation • Updated 19 days ago • 24 • 8
google/gemma-7b-aps-it

Text Generation • Updated 19 days ago • 28 • 8

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time!

hazyresearch/lolcats-llama-3.1-8b-distill

Updated 2 days ago • 9
hazyresearch/lolcats-llama-3.1-8b-ft-lora

Updated 2 days ago • 2
hazyresearch/lolcats-llama-3.1-70b

Updated 2 days ago • 3
hazyresearch/lolcats-llama-3.1-405b

Updated 2 days ago • 7

The experimental family designed to train LLMs to understand sound natively.

🏢 Ichigo Llama3.1 S Instruct

Space • Running on Zero • 47
homebrewltd/Ichigo-llama3.1-s-instruct-v0.3-phase-3

Updated 2 days ago • 116 • 16
homebrewltd/mini-Ichigo-llama3.2-3B-s-instruct

Updated 2 days ago • 60 • 13
homebrewltd/Ichigo-llama3.1-s-base-v0.3

Updated 2 days ago • 27 • 2

INTELLECT-1 Dataset

INTELLECT-1 Training dataset

PrimeIntellect/fineweb-edu

Viewer • Updated 6 days ago • 1.2B • 15
PrimeIntellect/fineweb

Preview • Updated 5 days ago • 10
PrimeIntellect/StackV1-popular

Viewer • Updated 8 days ago • 93M • 8 • 1
open-web-math/open-web-math

Viewer • Updated Oct 17, 2023 • 6.32M • 5.38k • 270

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks.

nvidia/NVLM-D-72B

Image-Text-to-Text • Updated about 5 hours ago • 26.1k • 646

Vision-language model series based on Qwen2

🌖 Qwen2-VL-72B

Space • Running • 467
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • Updated 25 days ago • 241k • 218
Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated 25 days ago • 917k • 718
Qwen/Qwen2-VL-72B-Instruct

Image-Text-to-Text • Updated 25 days ago • 29.3k • 134
