shaowenchen
/

vicuna-33b-v1.3-gguf

Text Generation

Model card Files Files and versions Community

shaowenchen commited on Sep 13, 2023

Commit

c2e1b2f

•

1 Parent(s): 71af746

Update README.md

Files changed (1) hide show

README.md +9 -5

README.md CHANGED Viewed

@@ -45,11 +45,15 @@ docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/ggu
 ## Provided images
-| Name                                    | Quant method | Size    |
-| --------------------------------------- | ------------ | ------- |
-| `shaowenchen/vicuna-33b-v1.3-gguf:Q2_K` | Q2_K         | 3.68 GB |
-| `shaowenchen/vicuna-33b-v1.3-gguf:Q3_K` | Q3_K         | 4.16 GB |
-| `shaowenchen/vicuna-33b-v1.3-gguf:Q4_K` | Q4_K         | 4.16 GB |
 Usage:

 ## Provided images
+| Name                                    | Quant method | Compressed Size |
+| --------------------------------------- | ------------ | --------------- |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q2_K` | Q2_K         | 12.78 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q3_K` | Q3_K         | 14.81 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q4_K` | Q4_K         | 18.24 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q5_K` | Q5_K         | 21.72 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q6_K` | Q6_K         | 25.05 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:Q8_0` | Q8_0         | 31.34 GB        |
+| `shaowenchen/vicuna-33b-v1.3-gguf:full` | full         | 56.07 GB        |
 Usage: