Pricing built for growth

Production inference that won't break your product or your bank.

Trusted by top engineering and machine learning teams
OpenEvidence, Writer, Superhuman, Gamma, Rime, Latent Health, Praktika AI, Oxen AI, Scaled Cognition, Aurelio, toby

Basic

Deploy custom, fine-tuned, and open-source models

Included in Basic:

Dedicated deployments
Model APIs
Fast cold starts
SOC 2 Type II and HIPAA compliant
Email and in-app chat support

$0 per month, pay as you go

Pro

Unlimited autoscaling and priority compute access

Everything in Basic plus:

Priority access to high-demand GPUs
Dedicated compute
Higher Model API rate limits
Hands-on engineering expertise
Dedicated support on Slack and Zoom

Volume discounts available

Enterprise

Full control in your cloud and ours

Everything in Pro plus:

Custom SLAs
Training (Beta)
Self-host deployments
On-demand flex compute
Use existing cloud commitments
Full control over data residency
Advanced security and compliance
Custom global regions

Volume discounts available


Pricing

Best-in-class model performance, effortless autoscaling, and blazing-fast cold starts mean you get the most out of each GPU, saving money along the way.

Dedicated Deployments

Only pay for the compute you use, down to the minute.

GPU Instances   Memory         Price per minute
T4              16 GiB VRAM    $0.01052
L4              24 GiB VRAM    $0.01414
A10G            24 GiB VRAM    $0.02012
A100            80 GiB VRAM    $0.06667
H100 MIG        40 GiB VRAM
H100            80 GiB VRAM    $0.10833
B200            180 GiB VRAM   $0.16633

CPU Instances   Resources              Price per minute
1x2             1 vCPU, 2 GiB RAM      $0.00058
1x4             1 vCPU, 4 GiB RAM      $0.00086
2x8             2 vCPUs, 8 GiB RAM     $0.00173
4x16            4 vCPUs, 16 GiB RAM    $0.00346
8x32            8 vCPUs, 32 GiB RAM    $0.00691
16x64           16 vCPUs, 64 GiB RAM   $0.01382

Talk to Sales about compute in other countries and regions.
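
As a rough illustration of how usage-based billing adds up, the sketch below estimates a bill from the listed prices. It assumes the prices are per-minute rates (consistent with billing "down to the minute") and that cost is simply rate x minutes x replicas. The helper names `estimate_cost` and `hourly_rate` are hypothetical and not part of any official SDK, and the figures ignore volume discounts, taxes, and any rounding the actual invoice may apply.

```python
from decimal import Decimal

# Listed prices in USD, assumed here to be per-minute rates.
# H100 MIG is omitted because no price is listed for it.
RATE_PER_MINUTE = {
    "T4": Decimal("0.01052"),
    "L4": Decimal("0.01414"),
    "A10G": Decimal("0.02012"),
    "A100": Decimal("0.06667"),
    "H100": Decimal("0.10833"),
    "B200": Decimal("0.16633"),
    "1x2": Decimal("0.00058"),
    "1x4": Decimal("0.00086"),
    "2x8": Decimal("0.00173"),
    "4x16": Decimal("0.00346"),
    "8x32": Decimal("0.00691"),
    "16x64": Decimal("0.01382"),
}


def estimate_cost(instance: str, minutes: int, replicas: int = 1) -> Decimal:
    """Hypothetical estimate: rate x minutes x replicas, with no discounts applied."""
    if minutes < 0 or replicas < 0:
        raise ValueError("minutes and replicas must be non-negative")
    return RATE_PER_MINUTE[instance] * minutes * replicas


def hourly_rate(instance: str) -> Decimal:
    """Convert the assumed per-minute rate to an hourly figure for comparison."""
    return RATE_PER_MINUTE[instance] * 60


if __name__ == "__main__":
    # Two T4 replicas that are each active for 90 minutes.
    print("2x T4 for 90 min:", estimate_cost("T4", minutes=90, replicas=2))
    # Hourly equivalent of the H100 rate.
    print("H100 per hour:", hourly_rate("H100"))
```

Because billing follows active compute time, minutes when a deployment has scaled to zero replicas would not count toward `minutes` in this estimate.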

Training

Get 20% of Training spend back as credits for Dedicated Deployments.

GPU Instances   Memory         Price
T4              16 GiB VRAM
L4              24 GiB VRAM
A10G            24 GiB VRAM
A100            80 GiB VRAM
H100 MIG        40 GiB VRAM
H100            80 GiB VRAM
B200            180 GiB VRAM

Talk to Sales about compute in other countries and regions.
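
The 20% Training credit mentioned above can be worked through with a small calculation. This is a minimal sketch: the 20% rate comes from the text, while the way credits are applied to a later Dedicated Deployments bill (deducted up to the bill amount, with the remainder carried forward) is an assumption made for illustration. Both function names are hypothetical.

```python
from decimal import Decimal

# 20% of Training spend is returned as Dedicated Deployments credits (from the text).
CREDIT_RATE = Decimal("0.20")


def training_credits(training_spend: Decimal) -> Decimal:
    """Credits earned from a given Training spend, in USD."""
    return training_spend * CREDIT_RATE


def apply_credits(deployment_spend: Decimal, credits: Decimal) -> tuple[Decimal, Decimal]:
    """Assumed behavior: deduct credits from the bill, never going below zero.

    Returns (amount still owed, credits left over).
    """
    applied = min(deployment_spend, credits)
    return deployment_spend - applied, credits - applied


if __name__ == "__main__":
    credits = training_credits(Decimal("500"))        # hypothetical $500 Training spend
    owed, remaining = apply_credits(Decimal("60"), credits)
    print("credits earned:", credits)
    print("owed after credits:", owed, "| credits remaining:", remaining)
```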
