Makya Baylis

Makya

Makya's activity

upvoted 59 collections 7 months ago

General

2 items • Updated Mar 10 • 1

生成式AI導論 2024 (Introduction to Generative AI 2024)

https://www.youtube.com/@HungyiLeeNTU • 57 items • Updated Jun 19 • 1

🎭 Coding Models

Fine-tunes/Merges of Coding Models. • 3 items • Updated Mar 22 • 2

Through your eyes

1 item • Updated Mar 10 • 1

domain-specific

2 items • Updated Mar 10 • 1

daily assemble

26 items • Updated Apr 12 • 1

Synthetic Data Generation

A curated list of papers focusing on synthetic data generation • 9 items • Updated Mar 11 • 3

GenAI

2 items • Updated Mar 11 • 1

Empowering SLMs

Collection of resources focusing on making Small Language Models (SLMs) better at various tasks • 6 items • Updated Apr 29 • 2

bruphin

Series of merge experiments attempting to make a small uncensored ChatML model based initially on ehartford/dolphin and rwitz/go-bruins-v2 mistral-7B • 11 items • Updated Mar 30 • 1

flammen

Merge & finetune experiments aiming for strong character roleplay, creative writing, and overall good performance. • 44 items • Updated Jun 1 • 1

LLM

2 items • Updated Mar 11 • 1

Datasets - Math - Word Problems

4 items • Updated Jul 21 • 1

Stable Code

Suite of developer assistant models • 5 items • Updated Apr 8 • 36

Stable LM

Suite of LLMs trained on English • 7 items • Updated May 7 • 36

🐶 IDEFICS 🐶

Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated Apr 15 • 7

papers

34 items • Updated Jan 12 • 3

OpenChat

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data • 7 items • Updated Jul 31 • 33

Awesome RLHF

A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF). • 11 items • Updated Oct 2, 2023 • 7

Explore-Instruct

EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration • 13 items • Updated Aug 16 • 3

Long context

94 items • Updated 7 days ago • 29

Dataset generation

126 items • Updated Jul 22 • 24

Reasoning

151 items • Updated Apr 6 • 25

Knowledge distillation

88 items • Updated Feb 7 • 6

Coding

193 items • Updated 2 days ago • 16

Zephyr 7B

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 144

Math

46 items • Updated May 31 • 8

SEAHORSE release

The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated Jul 31 • 17

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an artifact of RLHF that is often not released. • 18 items • Updated Apr 13 • 25

Tulu V2 Suite

The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated 11 days ago • 43

ise-uiuc's Papers

7 items • Updated Mar 31 • 7

Llama2 HQQ Quantized Models

Llama2 models quantized using https://github.com/mobiusml/hqq • 6 items • Updated Mar 29 • 5

Interesting Papers

18 items • Updated Apr 25 • 3

Chat Fine-tuning Datasets

Versions of the OpenAssist Dataset for chat fine-tuning different models. See https://youtu.be/71x8EMrB0Gc for a full video run-through of methods. • 6 items • Updated Jan 3 • 2

Tower

Model weights and SFT data for Tower. • 10 items • Updated 2 days ago • 25

Breeze-7B

Breeze-7B is a language model family that builds on top of Mistral-7B, specifically intended for Traditional Chinese use. • 9 items • Updated 8 days ago • 9

Constitutional AI

A collection of datasets and models that accompany the Constitutional AI recipe. See hf.co/blog/constitutional-ai for more details. • 9 items • Updated Feb 1 • 5

🐐 GEITje 7B ultra 🤖

SFT and DPO models for GEITje 7B Ultra, including the datasets used to train them. • 10 items • Updated May 5 • 6

Datasets - DPO

7 items • Updated Apr 20 • 2

Merges

4 items • Updated Feb 19 • 4

AI Paper of the Day

A collection of papers that I think are interesting, one added each day • 182 items • Updated about 7 hours ago • 24

DreamGen Opus V1: Story-writing & role-playing models

Uncensored models for steerable story-writing and role-playing. Prompting guide: https://dreamgen.com/docs/models/opus/v1 • 16 items • Updated Jun 19 • 9

Fine-Tuned

41 items • Updated 27 days ago • 6

My Best Models

These all mark personal achievements in my journey • 7 items • Updated Mar 31 • 4

daily_paper_coll

62 items • Updated 26 days ago • 3

Top Model

This model outperformed all previous phi-2 based finetunes, except for one MoE implementation • 3 items • Updated Aug 18 • 2

Models - Math

11 items • Updated Jul 11 • 3

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15 • 25

FuseChat

FuseChat: Knowledge Fusion of Chat Models • 8 items • Updated Aug 16 • 2

GeoChat

GeoChat is the first grounded Large Vision Language Model, specifically tailored to Remote Sensing (RS) scenarios. • 4 items • Updated Jun 11 • 4

Zephyr 7B Gemma

Models, dataset, and demo for Zephyr 7B Gemma. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 5 items • Updated Apr 12 • 15

MAPO: Multilingual Reasoning with Preference Optimization

MAPO: Advancing Multilingual Reasoning through Multilingual Alignment‑as‑Preference Optimization • 10 items • Updated Mar 26 • 2

🇫🇷 Calme-7B

Calme fine-tuned models • 20 items • Updated Jul 21 • 7

Soft Prompts

Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. • 4 items • Updated Mar 22 • 2

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated May 13 • 16

Reading Papers

218 items • Updated 25 days ago • 10

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized • 71 items • Updated 11 days ago • 84

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated 11 days ago • 57

Hub Models

654 items • Updated about 2 hours ago • 5