microsoft/Phi-3-medium-128k-instruct-onnx-cuda
Tags: Text Generation, Transformers, ONNX, phi3, DML, ONNXRuntime, nlp, conversational, custom_code, text-generation-inference
License: mit
Branch: main
Path: Phi-3-medium-128k-instruct-onnx-cuda/cuda-int4-rtn-block-32
2 contributors
History: 1 commit
Latest commit: kvaishnavi, "Upload Phi-3-medium-128k-instruct ONNX models" (2f3ef2d), 5 months ago
File                                                        Size       Flags        Last commit                                   Updated
added_tokens.json                                           293 Bytes  -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
config.json                                                 3.19 kB    -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
configuration_phi3.py                                       10.4 kB    -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
genai_config.json                                           1.75 kB    -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
phi3-medium-128k-instruct-cuda-int4-rtn-block-32.onnx       34.9 MB    LFS          Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
phi3-medium-128k-instruct-cuda-int4-rtn-block-32.onnx.data  8.09 GB    LFS          Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
special_tokens_map.json                                     569 Bytes  -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
tokenizer.json                                              1.84 MB    -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
tokenizer.model                                             500 kB     LFS, pickle  Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
tokenizer_config.json                                       3.16 kB    -            Upload Phi-3-medium-128k-instruct ONNX models  5 months ago
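
The files listed above all sit in one folder of the repository, so one plausible way to fetch just this variant is to filter the download by folder. The following is a minimal sketch, not an official recipe: it assumes the `huggingface_hub` package is installed, and the helper names `subfolder_pattern` and `download_variant` as well as the default `local_dir` value are hypothetical, chosen only for illustration.

```python
from pathlib import Path

# Repository and folder names as they appear in the listing.
REPO_ID = "microsoft/Phi-3-medium-128k-instruct-onnx-cuda"
SUBFOLDER = "cuda-int4-rtn-block-32"


def subfolder_pattern(subfolder: str) -> str:
    """Build a glob pattern matching every file under one repository folder."""
    return f"{subfolder.strip('/')}/*"


def download_variant(local_dir: str = "phi3-medium-onnx") -> Path:
    """Download only the files under SUBFOLDER and return their local folder.

    Requires network access and a little over 8 GB of free disk space,
    most of it for the .onnx.data weights file.
    """
    # Imported lazily so subfolder_pattern() works without the package installed.
    from huggingface_hub import snapshot_download

    root = snapshot_download(
        repo_id=REPO_ID,
        revision="main",  # the branch shown in the listing
        allow_patterns=[subfolder_pattern(SUBFOLDER)],
        local_dir=local_dir,  # hypothetical target directory
    )
    return Path(root) / SUBFOLDER
```

Calling `download_variant()` should leave the ten listed files under `phi3-medium-onnx/cuda-int4-rtn-block-32/`. Filtering with `allow_patterns` avoids pulling any other folders the repository may contain.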