ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget Paper • 2408.00103 • Published Jul 31 • 16
Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers Paper • 2401.06461 • Published Jan 12 • 1
Chinchunmei on WASSA2024 Shared-Task 1 Collection This is the model cards collection for Chinchunmei team in the WASSA2024 Shared-Task 1: Empathy Detection and Emotion Classification. • 5 items • Updated Jul 3 • 2
emotions-extraction Collection This collection lists the personal studies related to extraction of emotion / emotion-cases from texts • 4 items • Updated Jul 23 • 2
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs Paper • 2406.18629 • Published Jun 26 • 40
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5 • 110
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published 24 days ago • 63
PuLID: Pure and Lightning ID Customization via Contrastive Alignment Paper • 2404.16022 • Published Apr 24 • 19
Fellows Highlights Fall '23 🎃 Collection This collection consists of the work of our Fellows in August, September & October '23. 🍂 🍁 🍃 • 21 items • Updated Nov 8, 2023 • 2
DocGraphLM: Documental Graph Language Model for Information Extraction Paper • 2401.02823 • Published Jan 5 • 34
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Paper • 2409.04109 • Published 30 days ago • 41
PIE: Simulating Disease Progression via Progressive Image Editing Paper • 2309.11745 • Published Sep 21, 2023 • 3
Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts Paper • 2310.05898 • Published Oct 9, 2023 • 2
Communication Efficient Distributed Training with Distributed Lion Paper • 2404.00438 • Published Mar 30 • 2
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13 • 24
view article Article Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face By andreagagliano • 29 days ago • 15
InkubaLM: A small language model for low-resource African languages Paper • 2408.17024 • Published Aug 30 • 12
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated 17 days ago • 129
Enhancing Training Efficiency Using Packing with Flash Attention Paper • 2407.09105 • Published Jul 12 • 12
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 174
synthetic-data-generation-demos Collection A collection of demos for various approaches to synthetic data generation • 4 items • Updated Jun 25 • 13
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 26
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine Paper • 2408.02900 • Published Aug 6 • 25
view article Article The case for specialized pre-training: ultra-fast foundation models for dedicated tasks By Pclanglais • Aug 4 • 25
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks Paper • 2402.00892 • Published Jan 31 • 12
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound Paper • 2406.06612 • Published Jun 6 • 14
Probably function calling datasets Collection Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35
Palmyra (Writer license) Collection Palmyra LLMs under Writer license https://writer.com/legal/open-model-license/ • 8 items • Updated Aug 17 • 6
Granite Time Series Models Collection A collection of time series models trained by IBM licensed under Apache 2.0 license. • 4 items • Updated Jul 19 • 13
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage Paper • 2406.01462 • Published Jun 3 • 6
view article Article Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing By Pclanglais • Jul 19 • 17
Finance Commons Collection A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 3
E5-V: Universal Embeddings with Multimodal Large Language Models Paper • 2407.12580 • Published Jul 17 • 38
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17 • 33
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression Paper • 2407.12077 • Published Jul 16 • 52
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper • 2407.12327 • Published Jul 17 • 76
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17 • 48
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 63