1104 95 283

Pedro Cuenca

pcuenq

AI & ML interests

None yet

Articles

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 63

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 273

CodeGemma - an official Google release for code LLMs

Apr 9

• 99

Welcome Gemma - Google's new open LLM

Feb 21

• 16

Mixture of Experts Explained

Dec 11, 2023

• 162

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Dec 11, 2023

• 9

SDXL in 4 steps with Latent Consistency LoRAs

Nov 9, 2023

• 10

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

Oct 3, 2023

• 3

Inference for PROs

Sep 22, 2023

• 41

Introducing Würstchen: Fast Diffusion for Image Generation

Sep 13, 2023

• 10

Spread Your Wings: Falcon 180B is here

Sep 6, 2023

• 4

Code Llama: Llama 2 learns to code

Aug 25, 2023

• 5

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Aug 8, 2023

• 19

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Jul 27, 2023

• 4

Happy 1st anniversary 🤗 Diffusers!

Jul 20, 2023

• 1

Llama 2 is here - get it on Hugging Face

Jul 18, 2023

• 20

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

Jun 15, 2023

• 4

The Falcon has landed in the Hugging Face ecosystem

Jun 5, 2023

• 9

Train your ControlNet with diffusers

Mar 24, 2023

• 16

Swift Diffusers: Fast Stable Diffusion for Mac

Feb 24, 2023

• 4

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

• 37

Using Stable Diffusion with Core ML on Apple Silicon

Dec 1, 2022

• 4

Hugging Face Machine Learning Demos on arXiv

Nov 17, 2022

Training Stable Diffusion with Dreambooth using 🧨 Diffusers

Nov 7, 2022

• 14

Stable Diffusion in JAX/Flax 🚀

Oct 13, 2022

• 1

Stable Diffusion with 🧨 Diffusers

Aug 22, 2022

• 25

Organizations

pcuenq's activity

upvoted a paper 1 day ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published 3 days ago • 21

upvoted a collection 1 day ago

DepthPro Models

Collection

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second • 2 items • Updated about 6 hours ago • 1

upvoted a collection 4 days ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 10 days ago • 218

upvoted a paper 4 days ago

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published 9 days ago • 32

upvoted an article 10 days ago

Article

Llama can now see and run on your device - welcome Llama 3.2

11 days ago

• 137

upvoted 2 articles 11 days ago

Article

Tool Use, Unified

Aug 12

• 54

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 26

upvoted an article 13 days ago

Article

Exploring the Daily Papers Page on Hugging Face

13 days ago

• 25

upvoted 2 articles 18 days ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

18 days ago

• 144

Article

Introducing the SQL Console on Datasets

19 days ago

• 17

upvoted a collection 21 days ago

Core ML Segment Anything 2

Collection

8 items • Updated 1 day ago • 22

upvoted a collection 27 days ago

Depth-Anything-V2

Collection

6 items • Updated Jun 14 • 26

upvoted an article about 1 month ago

Article

Hugging Face partners with TruffleHog to Scan for Secrets

Sep 4

• 9

upvoted a collection about 1 month ago

LM (MLX)

Collection

State-Space-Model powered Language Models for Apple Silicon • 12 items • Updated Aug 27 • 4

upvoted an article about 1 month ago

Article

Scaling robotics datasets with video encoding

Aug 27

• 33

upvoted 2 collections about 1 month ago

DiffusionKit

Collection

Models, datasets and evaluations results for DiffusionKit: https://github.com/argmaxinc/DiffusionKit • 6 items • Updated 26 days ago • 3

WhisperKit

Collection

Models, datasets and evaluation results for WhisperKit: https://github.com/argmaxinc/WhisperKit • 4 items • Updated about 1 month ago • 6

upvoted a paper about 2 months ago

Enhancing Training Efficiency Using Packing with Flash Attention

Paper • 2407.09105 • Published Jul 12 • 12

upvoted 2 articles about 2 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14

• 44

Article

Introduction to ggml

Aug 13

• 100

upvoted a collection about 2 months ago

Shared params

Collection

27 items • Updated 26 days ago • 3

upvoted an article about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 78

upvoted a paper 2 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

upvoted an article 2 months ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 58

upvoted a paper 2 months ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8 • 7

upvoted 2 articles 2 months ago

Article

How to run Gemini Nano locally in your browser

•

Jul 11

• 42

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 198

upvoted an article 3 months ago

Article

WWDC 24: Running Mistral 7B with Core ML

Jul 22

• 54

upvoted a paper 3 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10 • 65

upvoted 2 articles 3 months ago

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 118

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18

• 35

upvoted a paper 4 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 79

upvoted an article 4 months ago

Article

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Jul 27, 2023

• 4

upvoted 4 collections 4 months ago

MobileCLIP Models + DataCompDR Data

Collection

MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated 1 day ago • 23

Core ML Gallery Models

Collection

7 items • Updated 1 day ago • 30

Core ML FastViT

Collection

2 items • Updated 1 day ago • 6

Core ML Stable Diffusion

Collection

16 items • Updated 1 day ago • 14

upvoted 2 papers 4 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 64

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Paper • 2406.03184 • Published Jun 5 • 18

upvoted a collection 4 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 10 days ago • 676

upvoted 2 articles 4 months ago

Article

Space secrets security update

May 31

• 50

Article

AI has a problem with objectifying women

•

May 24

• 54

upvoted a collection 5 months ago

SD 2.x, Zero-terminal SNR

Collection

SD 2.x models with zero terminal SNR noise schedule. • 3 items • Updated Nov 3, 2023 • 3

upvoted 2 articles 5 months ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

• 27

Article

Enjoy the Power of Phi-3 with ONNX Runtime on your device

•

May 22

• 25

upvoted a paper 5 months ago

INDUS: Effective and Efficient Language Models for Scientific Applications

Paper • 2405.10725 • Published May 17 • 32

upvoted 2 collections 5 months ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 136

PaliGemma FT Models

Collection

108 items • Updated Jul 31 • 27

upvoted an article 5 months ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

Jun 23

• 33

upvoted a paper 5 months ago

End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 4

upvoted a collection 5 months ago

Depth Anything Release

Collection

Depth Anything models, foundation models for monocular depth estimation, trained on 1.5 million labeled images and 62 million unlabeled images • 8 items • Updated Jan 26 • 9

upvoted an article 5 months ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

Jun 4

• 69

upvoted 2 collections 5 months ago

OpenELM Instruct Models

Collection

4 items • Updated 1 day ago • 113

OpenELM Pretrained Models

Collection

4 items • Updated 1 day ago • 46

upvoted 3 articles 6 months ago

Article

Fine-tune Llama 3 with ORPO

•

Apr 22

• 221

Article

Design choices for Vision Language Models in 2024

•

Apr 16

• 24

Article

Custom architectures with HuggingFace 🤗

•

Apr 22

• 21

upvoted 2 collections 6 months ago

fuck quadratic attention

Collection

11 items • Updated Apr 24 • 20

CodeGemma Release

Collection

18 items • Updated Aug 2 • 77

upvoted a collection 7 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325

Pedro Cuenca

AI & ML interests

Articles

Llama can now see and run on your device - welcome Llama 3.2

FineVideo: behind the scenes

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

PaliGemma – Google's Cutting-Edge Open Vision Language Model

License to Call: Introducing Transformers Agents 2.0

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Welcome Llama 3 - Meta's new open LLM

CodeGemma - an official Google release for code LLMs

Welcome Gemma - Google's new open LLM

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

SDXL in 4 steps with Latent Consistency LoRAs

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

Inference for PROs

Introducing Würstchen: Fast Diffusion for Image Generation

Spread Your Wings: Falcon 180B is here

Code Llama: Llama 2 learns to code

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Happy 1st anniversary 🤗 Diffusers!

Llama 2 is here - get it on Hugging Face

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac

The Falcon has landed in the Hugging Face ecosystem

Train your ControlNet with diffusers

Swift Diffusers: Fast Stable Diffusion for Mac

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Using Stable Diffusion with Core ML on Apple Silicon

Hugging Face Machine Learning Demos on arXiv

Training Stable Diffusion with Dreambooth using 🧨 Diffusers

Stable Diffusion in JAX/Flax 🚀

Stable Diffusion with 🧨 Diffusers

Organizations

pcuenq's activity

Llama can now see and run on your device - welcome Llama 3.2

Tool Use, Unified

Assisted Generation: a new direction toward low-latency text generation

Exploring the Daily Papers Page on Hugging Face

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Introducing the SQL Console on Datasets

Hugging Face partners with TruffleHog to Scan for Secrets

Scaling robotics datasets with video encoding

A failed experiment: Infini-Attention, and why we should keep trying?

Introduction to ggml

XetHub is joining Hugging Face!

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

How to run Gemini Nano locally in your browser

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Stable Diffusion XL on Mac with Advanced Core ML Quantization

Space secrets security update

AI has a problem with objectifying women

Introduction to 3D Gaussian Splatting

Enjoy the Power of Phi-3 with ONNX Runtime on your device

SeeMoE: Implementing a MoE Vision Language Model from Scratch

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Fine-tune Llama 3 with ORPO

Design choices for Vision Language Models in 2024

Custom architectures with HuggingFace 🤗