[Cache Request] meta-llama/Llama-3.2-3B-Instruct

#205
by Constantine-Forever - opened

Please add the following model to the neuron cache

AWS Inferentia and Trainium org

The model is present in the neuron cache but only available with optimum-neuron 0.0.25. While waiting for the release you can build locally the development version.

Sign up or log in to comment