ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit

Tags: Text Generation, text-generation-inference, Inference Endpoints, 4-bit precision

3 contributors

History: 16 commits

Latest commit: Update README.md (4ddfe5a, verified) by lrl-modelcloud, 2 months ago

File                      Size            Last commit                                Updated
.gitattributes            1.52 kB         initial commit                             2 months ago
README.md                 9.79 kB         Update README.md                           2 months ago
config.json               1.35 kB         Upload folder using huggingface_hub (#7)   2 months ago
model.safetensors         5.73 GB (LFS)   Upload folder using huggingface_hub (#7)   2 months ago
quantize_config.json      348 Bytes       Upload folder using huggingface_hub (#7)   2 months ago
special_tokens_map.json   340 Bytes       Upload folder using huggingface_hub (#3)   2 months ago
tokenizer.json            9.09 MB         Upload tokenizer.json (#6)                 2 months ago
tokenizer_config.json     50.9 kB         Upload folder using huggingface_hub (#3)   2 months ago