Quantized the model to 4-bit (using the Q4_K_M method)

#2
No description provided.
ersanbil changed pull request status to closed
