File size: 306 Bytes
3f061e9
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
---
pipeline_tag: text-generation
tags:
- Neuron
- Optimum-Neuron
- AWS
---


Compiled CodeLlama-7b-hf using optimum-neuron

```shell 
optimum-cli export neuron  --model codellama/CodeLlama-7b-hf --batch_size 1 --sequence_length 1024 --num_cores 2 --auto_cast_type fp16  ./models/codellama/CodeLlama-7b-hf