Joseph717171's picture
Create README.md
1356e8b verified
|
raw
history blame
195 Bytes

Custom GGUF quants of arcee-ai’s gemma-2-2b-it, where the Output Tensors are quantized to Q8_0 while the Embeddings are kept at F32. 🧠🔥🚀