The most cost-effective way to build and run AI applications at scale, in minutes, on a cloud-native platform.
Compare the three plans to find the best fit for your needs.
Pay exclusively for your actual compute usage, billed by the minute.
Store your models, data, and logs right in the platform, and pay only for what you use.
Enhance your application with premium open-source models while leveraging Lepton's exceptional runtime performance and reliability.
Model | Price |
---|---|
Llama3.2 1b | $0.01 / million tokens |
Llama3.2 3b | $0.03 / million tokens |
Llama3 8b | $0.07 / million tokens |
Llama3.1 8b | $0.07 / million tokens |
Llama2 13b | $0.18 / million tokens |
Llama3 70b | $0.80 / million tokens |
Llama3.1 70b | $0.80 / million tokens |
Llama3.3 70b | $0.80 / million tokens |
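
To get a feel for what per-token pricing means in practice, here is a minimal Python sketch that estimates a bill from the rates listed above. The `estimate_cost` helper and the price dictionary are illustrative only and not part of any Lepton SDK. The sketch also assumes a single flat rate applies to every token counted, since the table does not say how input and output tokens are metered.

```python
from decimal import Decimal

# USD per million tokens, copied from the table above.
PRICE_PER_MILLION_TOKENS = {
    "Llama3.2 1b": Decimal("0.01"),
    "Llama3.2 3b": Decimal("0.03"),
    "Llama3 8b": Decimal("0.07"),
    "Llama3.1 8b": Decimal("0.07"),
    "Llama2 13b": Decimal("0.18"),
    "Llama3 70b": Decimal("0.8"),
    "Llama3.1 70b": Decimal("0.8"),
    "Llama3.3 70b": Decimal("0.8"),
}


def estimate_cost(model: str, tokens: int) -> Decimal:
    """Estimate the USD cost of processing `tokens` tokens with `model`.

    Hypothetical helper: assumes one flat per-token rate for all tokens.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    rate = PRICE_PER_MILLION_TOKENS[model]  # raises KeyError for unlisted models
    return rate * tokens / Decimal(1_000_000)


if __name__ == "__main__":
    # 2.5 million tokens on a 70b model at $0.8 per million tokens.
    print(estimate_cost("Llama3.1 70b", 2_500_000))
```

Using `Decimal` rather than floats keeps small per-token amounts exact when they are summed over many requests.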
Find answers to the most common questions about Lepton AI.