abhinand
/

telugu-llama-7b-instruct-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

abhinand commited on Jan 23

Commit

5e77656

•

1 Parent(s): b49908f

Update README.md

Files changed (1) hide show

README.md +10 -1

README.md CHANGED Viewed

@@ -42,7 +42,16 @@ The Telugu LLaMA models have been enhanced and tailored specifically with an ext
 Benchmarking was done using [LLM-Autoeval](https://github.com/mlabonne/llm-autoeval) on an RTX 3090 on [runpod](https://www.runpod.io/).
-![Benchmark results](https://cdn-uploads.huggingface.co/production/uploads/60c8619d95d852a24572b025/P_0y9pvGg2Gn1Uz7SSJ7E.png)
 ## Related Models

 Benchmarking was done using [LLM-Autoeval](https://github.com/mlabonne/llm-autoeval) on an RTX 3090 on [runpod](https://www.runpod.io/).
+| Benchmark     | Llama 2 Chat | Tamil Llama v0.2 Instruct | Telugu Llama Instruct | Malayalam Llama Instruct |
+|---------------|--------------|---------------------------|-----------------------|--------------------------|
+| ARC Challenge (25-shot) | 52.9         | **53.75**                     | 52.47                 | 52.82                    |
+| TruthfulQA (0-shot)    | 45.57        | 47.23                     | **48.47**                 | 47.46                    |
+| Hellaswag (10-shot)    | **78.55**        | 76.11                     | 76.13                 | 76.91                    |
+| Winogrande (5-shot)   | 71.74        | **73.95**                     | 71.74                 | 73.16                    |
+| AGI Eval (0-shot)     | 29.3         | **30.95**                     | 28.44                 | 29.6                     |
+| BigBench (0-shot)     | 32.6         | 33.08                     | 32.99                 | **33.26**                    |
+| Average       | 51.78        | **52.51**                     | 51.71                 | 52.2                     |
 ## Related Models