apepkuss79 committed
Commit 802dcfc
Parent(s): 9e80949
Update README.md

README.md CHANGED
@@ -55,13 +55,20 @@ quantized_by: Second State Inc.
 - Run as LlamaEdge service
 
   ```bash
-  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-hf-Q5_K_M.gguf
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-hf-Q5_K_M.gguf \
+    llama-api-server.wasm \
+    --prompt-template llama-2-chat \
+    --ctx-size 4096 \
+    --model-name llama-2-7b-chat
   ```
 
 - Run as LlamaEdge command app
 
   ```bash
-  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-hf-Q5_K_M.gguf
+  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-hf-Q5_K_M.gguf \
+    llama-chat.wasm \
+    --prompt-template llama-2-chat \
+    --ctx-size 4096
   ```
 
 ## Quantized GGUF Models
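Once `llama-api-server.wasm` is running, it can be exercised with a quick request. This is a minimal sketch, assuming the server's default listen address of `localhost:8080` and its OpenAI-compatible `/v1/chat/completions` route; the prompt text is illustrative only:

```shell
# Send a chat request to the running LlamaEdge API server
# (assumes default port 8080 and the OpenAI-compatible route)
curl -X POST http://localhost:8080/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{
        "model": "llama-2-7b-chat",
        "messages": [
          {"role": "user", "content": "What is WasmEdge?"}
        ]
      }'
```

The `"model"` field matches the `--model-name llama-2-7b-chat` flag passed to the server above.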