Basic

$0
+ compute+ resource
/ month

Perfect for individuals and small teams to get started.

GET STARTED

Standard

$30
+ compute+ resource
/ month

Designed for collaborative teams and growing businesses.

SIGN IN TO UPGRADE

Enterprise

Custom

For organizations requiring high SLAs, performance, and compliance.

CONTACT US

Compute costs
TypeNameCPUMEMGPUGPU Memory
CPU
cpu.small14 GB--$0.0495
cpu.medium28 GB--$0.099
cpu.large416 GB--$0.198
NVIDIA-A10
gpu.a102496 GB1 × NVIDIA-A1024 GB$1.212
NVIDIA-A100
gpu.a100-80gb12192 GB1 × NVIDIA-A100-80GB80 GB$3.21
gpu.2xa100-80gb24384 GB2 × NVIDIA-A100-80GB160 GB$6.42
gpu.4xa100-80gb48768 GB4 × NVIDIA-A100-80GB320 GB$12.84
gpu.8xa100-80gb961536 GB8 × NVIDIA-A100-80GB640 GB$25.68
NVIDIA-A6000
gpu.a6000864 GB1 × NVIDIA-RTX-A600048 GB$1.65
NVIDIA-H100
gpu.h100-sxm20240 GB1 × NVIDIA-H100-80GB-HBM380 GB$3
gpu.2xh100-sxm40480 GB2 × NVIDIA-H100-80GB-HBM3160 GB$6
gpu.4xh100-sxm80960 GB4 × NVIDIA-H100-80GB-HBM3320 GB$12
gpu.8xh100-sxm1601920 GB8 × NVIDIA-H100-80GB-HBM3640 GB$24
Serverless Endpoint costs
ModelPrice
BGE Base$0.005 / million tokens
Dolphin Mixtral 8x7b$0.5 / million tokens
Gemma 2 9B$0.07 / million tokens
Llama3.2 3b$0.03 / million tokens
Llama3 8b$0.07 / million tokens
Llama3.1 8b$0.07 / million tokens
Llama2 13b$0.18 / million tokens
Llama3 70b$0.8 / million tokens
Llama3.1 70b$0.8 / million tokens
Llama3.3 70b$0.8 / million tokens
Llama3.1 405b$2.8 / million tokens
Mistral 7B$0.07 / million tokens
Mistral Nemo$0.18 / million tokens
Mixtral 8x7b$0.5 / million tokens
MythoMax L2 13b$0.18 / million tokens
Nous: Hermes 13B$0.18 / million tokens
OpenChat 3.5$0.07 / million tokens
Qwen QwQ 32B Preview$0.8 / million tokens
Qwen2 72B$0.8 / million tokens
Toppy M 7B$0.07 / million tokens
WizardLM-2 7B$0.07 / million tokens
WizardLM-2 8x22B$1 / million tokens
Whisper$0.00007 / second
Stable Diffusion XL$0.00015 / step
Stable Video Diffusion$0.0092 / step
Lepton Search (beta)$0.015 / step