flowaicom
/

Flow-Judge-v0.1-GGUF

Text Generation

Model card Files Files and versions Community

sariola commited on 18 days ago

Commit

0cb362c

•

1 Parent(s): b969bb8

Update README.md

Files changed (1) hide show

README.md +9 -2

README.md CHANGED Viewed

@@ -42,11 +42,18 @@ This repo contains GGUF quants for [Flow-Judge-v0.1](https://huggingface.co/flow
 ## Quantization config
-TBD
 ## Running the GGUF file
-TBD
 # Original model card: Flow-Judge-v0.1

 ## Quantization config
+Version used: github:ggerganov/llama.cpp/8e6e2fbe1458ac91387266241262294a964d6b95?narHash=sha256-Z3Rg43p8G9MdxiGvSl9m43KsJ1FvvhQwtzRy/grg9X0%3D
+```
+llama-convert-hf-to-gguf ./flowaicom/Flow-Judge-v0.1 --outfile flow-judge-v0.1-bf16.gguf --outtype auto
+llama-quantize flow-judge-v0.1-bf16.gguf flow-judge-v0.1-Q4_K_M.gguf Q4_K_M
+```
 ## Running the GGUF file
+```shell
+llama-server -ngl 33 -t 16 -m Flow-Judge-v0.1-GGUF/flow-judge-v0.1-Q4_K_M.gguf -c 8192 -n 8192 -fa
+```
 # Original model card: Flow-Judge-v0.1