LPN64/LongCite-llama3.1-8b-GGUF
Tags: Text Generation, GGUF, THUDM/LongCite-45k, imatrix, importance matrix, llama.cpp, Inference Endpoints, conversational
License: llama3.1
1 contributor
History: 12 commits
Latest commit: LPN64, "Update README.md" (d6202fc, verified), 14 days ago
File                              Size      Tags         Last commit                           Updated
.gitattributes                    1.97 kB                Upload imatrix.dat                    14 days ago
LongCite-llama3.1-8B-F16.gguf     16.1 GB   LFS          Updated GGUFs with correct EOS token  14 days ago
LongCite-llama3.1-8B-IQ3_M.gguf   3.78 GB   LFS          Updated GGUFs with correct EOS token  14 days ago
LongCite-llama3.1-8B-Q4_K_M.gguf  4.92 GB   LFS          Updated GGUFs with correct EOS token  14 days ago
LongCite-llama3.1-8B-Q5_K_M.gguf  5.73 GB   LFS          Updated GGUFs with correct EOS token  14 days ago
LongCite-llama3.1-8B-Q6_K.gguf    6.6 GB    LFS          Updated GGUFs with correct EOS token  14 days ago
LongCite-llama3.1-8B-Q8_0.gguf    8.54 GB   LFS          Updated GGUFs with correct EOS token  14 days ago
README.md                         7.95 kB                Update README.md                      14 days ago
imatrix.dat                       4.99 MB   LFS, pickle  Upload imatrix.dat                    14 days ago
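
The listing above offers the same model at several quantization levels, so a common first step is choosing which file to fetch. The sketch below is one hedged way to do that: it picks the largest listed GGUF that fits a size budget. The sizes are copied from the listing as displayed; the helper names `pick_quant` and `download` and the budget heuristic are illustrative assumptions, not something the repository prescribes. File size is only a rough lower bound on memory use, since the context (KV cache) needs additional room at run time.

```python
# Sizes in GB, copied from the file listing as displayed on the page.
QUANT_SIZES_GB = {
    "LongCite-llama3.1-8B-IQ3_M.gguf": 3.78,
    "LongCite-llama3.1-8B-Q4_K_M.gguf": 4.92,
    "LongCite-llama3.1-8B-Q5_K_M.gguf": 5.73,
    "LongCite-llama3.1-8B-Q6_K.gguf": 6.6,
    "LongCite-llama3.1-8B-Q8_0.gguf": 8.54,
    "LongCite-llama3.1-8B-F16.gguf": 16.1,
}


def pick_quant(budget_gb, sizes=QUANT_SIZES_GB):
    """Return the largest listed file whose size is <= budget_gb, or None.

    Hypothetical helper: "largest that fits" is an assumed heuristic.
    """
    fitting = {name: gb for name, gb in sizes.items() if gb <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


def download(filename, repo_id="LPN64/LongCite-llama3.1-8b-GGUF"):
    """Fetch one file from the repository and return its local path.

    Assumes the huggingface_hub package is installed; the import is kept
    local so the selection logic above works without it.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=repo_id, filename=filename)
```

For example, `pick_quant(6.0)` selects the Q5_K_M file, and the result can be passed to `download(...)` to fetch only that file instead of the whole repository.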