Synthetic Data Generation Collection A curated list of papers focusing on synthetic data generation • 9 items • Updated Mar 11 • 3
Empowering SLMs Collection Collection of resources focusing on making Small Language Models (SLMs) better at various tasks • 6 items • Updated Apr 29 • 2
bruphin Collection Series of merge experiments attempting to make a small uncensored ChatML model based initially on ehartford/dolphin and rwitz/go-bruins-v2 mistral-7B • 11 items • Updated Mar 30 • 1
flammen Collection Merge & finetune experiments aiming for strong character roleplay, creative writing, and overall good performance. • 44 items • Updated Jun 1 • 1
🐶 IDEFICS 🐶 Collection Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated Apr 15 • 7
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data • 7 items • Updated Jul 31 • 33
Awesome RLHF Collection A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF). • 11 items • Updated Oct 2, 2023 • 7
Explore-Instruct Collection EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration • 13 items • Updated Aug 16 • 3
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 144
SEAHORSE release Collection The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated Jul 31 • 17
Reward models on the hub Collection UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 25
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated 11 days ago • 43
Llama2 HQQ Quantized Models Collection LLama2 models quantized using https://github.com/mobiusml/hqq • 6 items • Updated Mar 29 • 5
Chat Fine-tuning Datasets Collection Versions of the OpenAssist Dataset for chat fine-tuning different models. See https://youtu.be/71x8EMrB0Gc for a full video run through of methods. • 6 items • Updated Jan 3 • 2
Breeze-7B Collection Breeze-7B is a language model family that builds on top of Mistral-7B, specifically intended for Traditional Chinese use. • 9 items • Updated 8 days ago • 9
Constitutional AI Collection A collection of datasets and models that accompany the Constitutional AI recipe. See hf.co/blog/constitutional-ai for more details. • 9 items • Updated Feb 1 • 5
🐐 GEITje 7B ultra 🤖 Collection SFT and DPO models for GEITje 7B Ultra, including the datasets used to train them. • 10 items • Updated May 5 • 6
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 182 items • Updated about 7 hours ago • 24
DreamGen Opus V1: Story-writing & role-playing models Collection Uncensored models for steerable story-writing and role-playing. Prompting guide: https://dreamgen.com/docs/models/opus/v1 • 16 items • Updated Jun 19 • 9
My Best Models Collection These all mark personal achievements in my journey • 7 items • Updated Mar 31 • 4
Top Model Collection This model outperformed all previous phi-2 based finetunes, except for one MoE implementation • 3 items • Updated Aug 18 • 2
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15 • 25
GeoChat Collection GeoChat is the first grounded Large Vision Language Model, specifically tailored to Remote Sensing(RS) scenarios. • 4 items • Updated Jun 11 • 4
Zephyr 7B Gemma Collection Models, dataset, and Demo for Zephyr 7B Gemma. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 5 items • Updated Apr 12 • 15
MAPO: Multilingual Reasoning with Preference Optimization Collection MAPO: Advancing Multilingual Reasoning through Multilingual Alignment‑as‑Preference Optimization • 10 items • Updated Mar 26 • 2
Soft Prompts Collection Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. • 4 items • Updated Mar 22 • 2
Trained Models 🏋️ Collection They may be small, but they're training like giants! • 8 items • Updated May 13 • 16
🔍 Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized • 71 items • Updated 11 days ago • 84
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 11 days ago • 57