WizardLM-2 7B

32K context

Description

WizardLM-2 7B is the fastest and achieves comparable performance with existing 10x larger opensource leading models.

Dedicated Endpoints: Calculated by the instance type and the number of GPUs, you can find the details in pricing page. You can also contact us to reserve GPUs.
Serverless Endpoints: $0.07 / M tokens for using WizardLM-2 7B, pay as you go.

System prompt

Temperature

Max tokens

Top P

Lepton AI