File size: 1,565 Bytes
24bfe40
1bc7ef1
24bfe40
 
 
 
 
 
 
 
 
 
 
 
ef01dff
24bfe40
 
 
 
 
1bc7ef1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: bigcode-openrail-m
---

# GPTQ-for-StarCoder
Visit [GPTQ-for-SantaCoder](https://github.com/mayank31398/GPTQ-for-SantaCoder) for instructions on how to use the model weights here.
If you want 4-bit weights, visit [starcoderbase-GPTQ-4bit-128g](https://huggingface.co/mayank31398/starcoderbase-GPTQ-4bit-128g).

## Results
| StarCoderBase                                      | Bits | group-size | memory(MiB) | wikitext2 |    ptb     |     c4     |   stack    | checkpoint size(MB) |
| -------------------------------------------------- | ---- | ---------- | ----------- | --------- | ---------- | ---------- | ---------- | ------------------- |
| FP32                                               |  32  |     -      |             |  10.172   |   15.756   |   12.736   |   1.692    |       59195         |
| BF16                                               |  16  |     -      |             |  10.173   |   15.765   |   12.745   |   1.692    |       29597         |
| [GPTQ](https://arxiv.org/abs/2210.17323)           |  8   |    128     |             |  10.174   |   15.767   |   12.739   |   1.692    |       16163         |
| [GPTQ](https://arxiv.org/abs/2210.17323)           |  4   |    128     |             |  10.387   |   16.056   |   13.005   |   1.708    |        8877         |

# License
The model is licenses under the CodeML Open RAIL-M v0.1 license. You can find the full license [here](https://huggingface.co/spaces/bigcode/license).

# Acknowledgements
Thanks to everyone in BigCode who worked so hard to create these code models.