Update README.md
Browse files
README.md
CHANGED
@@ -8,19 +8,19 @@ pipeline_tag: text-generation
|
|
8 |
This repository hosts both the standard and quantized versions of the Zephyr 7B model, allowing users to choose the version that best fits their resource constraints and performance needs.
|
9 |
|
10 |
# Model Details
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
|
17 |
|
18 |
|
19 |
# Performance and Efficiency
|
20 |
The quantized version of Zephyr 7B is optimized for environments with limited computational resources. It offers:
|
21 |
|
22 |
-
|
23 |
-
|
24 |
|
25 |
|
26 |
|
|
|
8 |
This repository hosts both the standard and quantized versions of the Zephyr 7B model, allowing users to choose the version that best fits their resource constraints and performance needs.
|
9 |
|
10 |
# Model Details
|
11 |
+
Model Name: Zephyr 7B
|
12 |
+
Model Size: 7 billion parameters
|
13 |
+
Architecture: Transformer-based
|
14 |
+
Languages: Primarily English, with support for multilingual text
|
15 |
+
Quantized Version: Available for reduced memory footprint and faster inference
|
16 |
|
17 |
|
18 |
|
19 |
# Performance and Efficiency
|
20 |
The quantized version of Zephyr 7B is optimized for environments with limited computational resources. It offers:
|
21 |
|
22 |
+
Reduced Memory Usage: The model size is significantly smaller, making it suitable for deployment on devices with limited RAM.
|
23 |
+
Faster Inference: Quantized models can perform faster inference, providing quicker responses in real-time applications.
|
24 |
|
25 |
|
26 |
|