File size: 1,253 Bytes
c9e3923
 
 
 
 
049090a
c9e3923
 
 
 
 
 
 
1d183f4
c9e3923
 
 
 
 
 
 
e5e7d38
 
 
 
 
 
 
 
 
0b3b05e
04f3311
 
 
 
 
 
e87e272
049090a
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
---
datasets:
- Reza8848/MUFFIN_68k
language:
- en
license: mit
---

<img src="https://cdn-uploads.huggingface.co/production/uploads/6434a6e8ea46c009904c617e/J_4FHXmtM6TuRnN3aL06y.png" width="38" height="38">


This is the model weight of **MUFFIN-T5-11B** (**Mu**lti-**F**aceted **In**structions).

We fine-tune the [T5-11B](https://huggingface.co/t5-11b) model on our [MUFFIN dataset](https://arxiv.org/abs/2312.02436).

We released both 3B and 11B models:
|Model|Number of parameters|
|-|-|
|[MUFFIN-T5-3B](https://huggingface.co/Reza8848/MUFFIN-T5-3B)|3 billion|
|[MUFFIN-T5-11B](https://huggingface.co/Reza8848/MUFFIN-T5-11B)|11 billion|

Please refer to [MUFFIN-T5-3B](https://huggingface.co/Reza8848/MUFFIN-T5-3B) for detailed documentation.




## 🥳 Citation

Please kindly cite our paper if you use any resources in this repository:

```bibtex
@inproceedings{Lou2023MUFFIN,
   title={{MUFFIN}: Curating Multi-Faceted Instructions for Improving Instruction Following},
   author={Renze Lou and Kai Zhang and Jian Xie and Yuxuan Sun and Janice Ahn and Hanzi Xu and Yu su and Wenpeng Yin},
   booktitle={The Twelfth International Conference on Learning Representations},
   year={2024},
   url={https://openreview.net/forum?id=1vrS1zwekw}
}
```