Platform
Platform
Solutions
Solutions
Resources
Resources
Pricing
Pricing
Docs
Docs
Log in
Get started
Rachel Rapp
Product
Infrastructure
How we built Multi-cloud Capacity Management (MCM)
William Lau
3 others
Infrastructure
How Baseten multi-cloud capacity management (MCM) unifies deployments
Rachel Rapp
1 other
News
Introducing Baseten Embeddings Inference: The fastest embeddings solution available
Michael Feil
1 other
News
Baseten Chains is now GA for production compound AI systems
Marius Killinger
2 others
News
New observability features: activity logging, LLM metrics, and metrics dashboard customization
Suren Atoyan
4 others
News
Introducing our Speculative Decoding Engine Builder integration for ultra-low-latency LLM inference
Justin Yi
3 others
Model performance
Generally Available: The fastest, most accurate and cost-efficient Whisper transcription
William Gao
3 others
News
Introducing Custom Servers: Deploy production-ready model servers from Docker images
Tianshu Cheng
2 others
News
Create custom environments for deployments on Baseten
Samiksha Pal
3 others
1
2
3
Explore Baseten today
Start deploying
Talk to an engineer
Rachel Rapp - Product | Baseten Author