clem (Clem 🤗)

upvoted a paper 1 day ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published 2 days ago • 46

upvoted a collection 4 days ago

Llama 3.2 3B & 1B GGUF Quants

Collection

Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. • 4 items • Updated 10 days ago • 40

upvoted a collection 10 days ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 11 items • Updated 10 days ago • 327

upvoted a paper 15 days ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 17 days ago • 121

upvoted a collection 17 days ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 17 days ago • 201

upvoted an article 22 days ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 26

upvoted an article 23 days ago

Article

Training Flux Locally on Mac

By

•

24 days ago

• 11

upvoted a collection 23 days ago

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 23 days ago • 76

upvoted 2 papers 26 days ago

Configurable Foundation Models: Building LLMs from a Modular Perspective

Paper • 2409.02877 • Published Sep 4 • 27

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published about 1 month ago • 30

upvoted an article 29 days ago

Article

LLM Inference at scale with TGI

By

•

29 days ago

• 7

upvoted an article about 1 month ago

Article

Announcing New Dataset Search Features

Jul 8

• 22

upvoted 6 collections about 1 month ago

upvoted 2 articles about 1 month ago

Article

DEMO: French Spoken Language Understanding with the new speech resources from NAVER LABS Europe

By

•

Aug 28

• 9

Article

Scaling robotics datasets with video encoding

Aug 27

• 33

upvoted a paper about 1 month ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27 • 138

upvoted 2 articles about 1 month ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 162

Article

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Jun 19

• 11

upvoted 8 papers about 1 month ago

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22 • 33

DreamCinema: Cinematic Transfer with Free Camera and 3D Character

Paper • 2408.12601 • Published Aug 22 • 28

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22 • 50

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22 • 61

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22 • 86

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Paper • 2408.13252 • Published Aug 23 • 23

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 111

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 28

upvoted 2 articles about 1 month ago

Article

2024 Security Feature Highlights

Aug 6

• 14

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22

• 81

upvoted a collection about 1 month ago

Jamba-1.5

Collection

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22 • 75

upvoted 7 collections about 2 months ago

Gradio Spaces for Background Removal

Collection

Enhance your images by removing the background. Will ensure these Spaces are up and maintained for the community. • 5 items • Updated Aug 20 • 23

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 10 days ago • 586

XGen-MM-1 models and datasets

Collection

A collection of all XGen-MM (Foundation LMM) models! • 14 items • Updated 8 days ago • 34

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 3 days ago • 54

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 17 days ago • 473

💻 Local SmolLMs

Collection

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Aug 20 • 41

Hermes 3

Collection

The Hermes 3 Series of Models • 8 items • Updated Aug 23 • 83

upvoted 2 papers about 2 months ago

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

Paper • 2408.08189 • Published Aug 15 • 14

MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model

Paper • 2408.10198 • Published Aug 19 • 32

upvoted an article about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 78

upvoted 6 papers 2 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 103

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

Paper • 2408.00103 • Published Jul 31 • 16

POA: Pre-training Once for Models of All Sizes

Paper • 2408.01031 • Published Aug 2 • 26

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

Paper • 2408.00874 • Published Aug 1 • 41

A Large Encoder-Decoder Family of Foundation Models For Chemical Language

Paper • 2407.20267 • Published Jul 24 • 31

upvoted 2 collections 2 months ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 50

upvoted 2 articles 2 months ago

Article

Energy Star Ratings for AI Models

By

•

May 9

• 25

Article

Clarity AI Upscaler Reproduction

By

•

Jul 30

• 18

upvoted 2 articles 3 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 65

Article

Total noob’s intro to Hugging Face Transformers

Mar 22

• 41

upvoted 2 papers 3 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 154

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 48

upvoted an article 3 months ago

Article

How to run Gemini Nano locally in your browser

By

•

Jul 11

• 42

upvoted a collection 3 months ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 9 items • Updated 12 days ago • 40

Clem 🤗 PRO

AI & ML interests

Organizations

clem's activity

Serverless Inference with Hugging Face and NVIDIA NIMs

Training Flux Locally on Mac

LLM Inference at scale with TGI

Announcing New Dataset Search Features

DEMO: French Spoken Language Understanding with the new speech resources from NAVER LABS Europe

Scaling robotics datasets with video encoding

Mixture of Experts Explained

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

2024 Security Feature Highlights

The 5 Most Under-Rated Tools on Hugging Face

XetHub is joining Hugging Face!

Energy Star Ratings for AI Models

Clarity AI Upscaler Reproduction

Docmatix - a huge dataset for Document Visual Question Answering

Total noob’s intro to Hugging Face Transformers

How to run Gemini Nano locally in your browser