brucethemoose/Yi-34B-200K-DARE-merge-v5

brucethemoose committed on Dec 18, 2023

Commit 0e28c29 • 1 Parent(s): 57250a5

Update README.md

Files changed (1)

README.md +1 -1

@@ -34,7 +34,7 @@ Being a Yi model, try running a lower temperature with 0.05-0.1 MinP, a little r
 I recommend exl2 quantizations profiled on data similar to the desired task. It is especially sensitive to the quantization data at low bpw. I've published my own fiction-oriented quantizations here: https://huggingface.co/collections/brucethemoose/most-recent-merge-65742644ca03b6c514afa204
-To load this in full-context backends like transformers and, you *must* change `max_position_embeddings` in config.json to a lower value than 200,000, otherwise you will OOM!
+To load this in full-context backends like transformers, you *must* change `max_position_embeddings` in config.json to a lower value than 200,000, otherwise you will OOM!
 ***
 ## Testing Notes
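
The README line changed in this commit tells users to lower `max_position_embeddings` in config.json before loading the model in a full-context backend. A minimal sketch of that edit follows, using only the Python standard library. The helper name `cap_max_position_embeddings` is hypothetical and not part of the repository, and the sketch assumes config.json is a plain JSON object in a local copy of the model.

```python
import json
from pathlib import Path


def cap_max_position_embeddings(config_path, new_limit):
    """Lower max_position_embeddings in a local config.json.

    Hypothetical helper for illustration; returns the previous value
    (or None if the key was absent).
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    old_limit = config.get("max_position_embeddings")
    # The README asks for a *lower* value, so refuse anything else.
    if old_limit is not None and new_limit >= old_limit:
        raise ValueError(
            f"new_limit ({new_limit}) must be lower than the current value ({old_limit})"
        )

    config["max_position_embeddings"] = new_limit
    # Rewrite the file in place, keeping all other keys untouched.
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return old_limit
```

A call might look like `cap_max_position_embeddings("path/to/model/config.json", 32768)`. Both the path and the value 32768 are placeholders; the right limit depends on how much context fits in available memory.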