Llama3 8B
8K context
Description
Meta developed and released the Meta Llama 3 family of large language models (LLMs). The Llama 3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks.
Pricing
- Dedicated Endpoints: Calculated by the instance type and the number of GPUs, you can find the details in pricing page. You can also contact us to reserve GPUs.
- Serverless Endpoints: $0.07 / M tokens for using Llama3 8B, pay as you go.
Create a Dedicated Endpoint
Beyond the serverless endpoints, Lepton provides a simple way to create a dedicated endpoint for Llama3 8B, which is a fully managed endpoint for your own use cases. If this model is what you are looking for, head over our dashboard to create your endpoint.
Playground
System prompt
Temperature
Max tokens
Top P