Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker Apr 8, 2021
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published 10 days ago • 92
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models. • 11 items • Updated 10 days ago • 327
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 16 days ago • 128
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 17 days ago • 224
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models Paper • 2408.02442 • Published Aug 5 • 18
Generative Verifiers: Reward Modeling as Next-Token Prediction Paper • 2408.15240 • Published Aug 27 • 12
Probably function calling datasets Collection Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated 10 days ago • 51
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 57
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper • 2407.08296 • Published Jul 11 • 31
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3 • 43
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs Paper • 2407.02552 • Published Jul 2 • 4
Understanding the performance gap between online and offline alignment algorithms Paper • 2405.08448 • Published May 14 • 14
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models Paper • 2407.01906 • Published Jul 2 • 34
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper • 2407.01370 • Published Jul 1 • 85
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27 • 18
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 85
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published Jun 19 • 16
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17 • 29
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 5 days ago • 156
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 62
Article Introducing the Hugging Face Embedding Container for Amazon SageMaker Jun 7 • 13
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned variants in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 18 days ago • 340
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 148
SimPO: Simple Preference Optimization with a Reference-Free Reward Paper • 2405.14734 • Published May 23 • 9
Article From cloud to developers: Hugging Face and Microsoft Deepen Collaboration May 21 • 8
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper • 2405.04434 • Published May 7 • 13
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 60
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Paper • 2404.14723 • Published Apr 23 • 10
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. • 15 items • Updated Apr 2 • 50
Aligning Modalities in Vision Large Language Models via Preference Fine-tuning Paper • 2402.11411 • Published Feb 18 • 1
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13 • 48
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 60
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 113
Distil-Whisper Models Collection The first version of the Distil-Whisper models released with the Distil-Whisper paper. • 4 items • Updated Mar 21 • 35
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 144