Llama2 13b

Description

Llama 2 is a pretrained and fine-tuned generative text models, This is the 13B pretrained model.

Dedicated Endpoints: Calculated by the instance type and the number of GPUs, you can find the details in pricing page. You can also contact us to reserve GPUs.
Serverless Endpoints: $0.18 / M tokens for using Llama2 13b, pay as you go.

System prompt

Temperature

Max tokens

Top P

Lepton AI