Tonic (Joseph Pollack)

upvoted an article 1 day ago

Article

Introducing the Open FinLLM Leaderboard

2 days ago

• 12

upvoted a paper 1 day ago

ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

Paper • 2408.00103 • Published Jul 31 • 16

upvoted a collection 2 days ago

Salamandra 🦎

Collection

4 items • Updated 4 days ago • 22

upvoted a paper 3 days ago

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers

Paper • 2401.06461 • Published Jan 12 • 1

upvoted an article 3 days ago

Article

Tiny Test Models

By

•

3 days ago

• 4

upvoted a collection 5 days ago

Emu3

Collection

3 items • Updated 9 days ago • 47

upvoted 2 collections 7 days ago

Chinchunmei on WASSA2024 Shared-Task 1

Collection

This is the model cards collection for Chinchunmei team in the WASSA2024 Shared-Task 1: Empathy Detection and Emotion Classification. • 5 items • Updated Jul 3 • 2

emotions-extraction

Collection

This collection lists the personal studies related to extraction of emotion / emotion-cases from texts • 4 items • Updated Jul 23 • 2

upvoted a paper 9 days ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26 • 40

upvoted a collection 17 days ago

Extreme Quantization

Collection

1 item • Updated 17 days ago • 1

upvoted 2 articles 17 days ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5

• 110

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

18 days ago

• 144

upvoted a paper 19 days ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published 24 days ago • 63

upvoted a paper 21 days ago

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Paper • 2404.16022 • Published Apr 24 • 19

upvoted a collection 21 days ago

Fellows Highlights Fall '23 🎃

Collection

This collection consists of the work of our Fellows in August, September & October '23. 🍂 🍁 🍃 • 21 items • Updated Nov 8, 2023 • 2

upvoted a paper 22 days ago

DocGraphLM: Documental Graph Language Model for Information Extraction

Paper • 2401.02823 • Published Jan 5 • 34

upvoted a paper 23 days ago

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published 30 days ago • 41

upvoted an article 29 days ago

Article

LLM Inference at scale with TGI

By

•

29 days ago

• 7

upvoted 4 papers 29 days ago

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Paper • 2405.07518 • Published May 13 • 24

upvoted an article 29 days ago

Article

Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face

By

•

29 days ago

• 15

upvoted a collection about 1 month ago

Yi-Coder

Collection

4 items • Updated Sep 4 • 29

upvoted a paper about 1 month ago

InkubaLM: A small language model for low-resource African languages

Paper • 2408.17024 • Published Aug 30 • 12

upvoted a collection about 1 month ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 15 items • Updated 17 days ago • 129

upvoted a paper about 1 month ago

Enhancing Training Efficiency Using Packing with Flash Attention

Paper • 2407.09105 • Published Jul 12 • 12

upvoted an article about 2 months ago

Article

XetHub is joining Hugging Face!

Aug 8

• 78

upvoted 2 collections about 2 months ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 174

DeepSeek-Prover

Collection

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16 • 13

upvoted an article about 2 months ago

Article

The Workflow of PEFT

By

•

Aug 14

• 19

upvoted a collection about 2 months ago

synthetic-data-generation-demos

Collection

A collection of demos for various approaches to synthetic data generation • 4 items • Updated Jun 25 • 13

upvoted an article about 2 months ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 26

upvoted a collection about 2 months ago

InternLM2.5

Collection

14 items • Updated 21 days ago • 68

upvoted an article about 2 months ago

Article

Tool Use, Unified

Aug 12

• 54

upvoted 2 papers about 2 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 74

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6 • 25

upvoted an article 2 months ago

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By

•

Aug 4

• 25

upvoted a paper 2 months ago

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Paper • 2402.00892 • Published Jan 31 • 12

upvoted an article 2 months ago

Article

RAG chatbot using llama3

By

•

Jul 7

• 73

upvoted a paper 2 months ago

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Paper • 2406.06612 • Published Jun 6 • 14

upvoted 2 collections 2 months ago

SEE-2-SOUND

Collection

4 items • Updated Jul 6 • 4

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35

upvoted 2 articles 2 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23

• 49

Article

Local AI with Docker's Testcontainers

By

•

Aug 3

• 5

upvoted a collection 2 months ago

Palmyra (Writer license)

Collection

Palmyra LLMs under Writer license https://writer.com/legal/open-model-license/ • 8 items • Updated Aug 17 • 6

upvoted 2 articles 2 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 52

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29

• 212

upvoted a collection 2 months ago

Granite Time Series Models

Collection

A collection of time series models trained by IBM licensed under Apache 2.0 license. • 4 items • Updated Jul 19 • 13

upvoted a paper 2 months ago

The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

Paper • 2406.01462 • Published Jun 3 • 6

upvoted an article 3 months ago

Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

By

•

Jul 19

• 17

upvoted a collection 3 months ago

Finance Commons

Collection

A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 3

upvoted 6 papers 3 months ago

E5-V: Universal Embeddings with Multimodal Large Language Models

Paper • 2407.12580 • Published Jul 17 • 38

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17 • 33

Patch-Level Training for Large Language Models

Paper • 2407.12665 • Published Jul 17 • 16

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16 • 52

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published Jul 17 • 76

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17 • 48

upvoted an article 3 months ago

Article

Train a Llama model from scratch

By

•

Jul 29

• 43

upvoted a collection 3 months ago

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 63

Joseph Pollack

AI & ML interests

Articles

Local AI with Docker's Testcontainers

How to use Instruct Embeddings Correctly

Organizations

Tonic's activity

Introducing the Open FinLLM Leaderboard

Tiny Test Models

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

LLM Inference at scale with TGI

Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face

XetHub is joining Hugging Face!

The Workflow of PEFT

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Tool Use, Unified

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

RAG chatbot using llama3

🪆 Introduction to Matryoshka Embedding Models

Local AI with Docker's Testcontainers

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

Train a Llama model from scratch