Pankaj Gupta
Co-Founder
News
Announcing Baseten's $300M Series E
Tuhin Srivastava
3 others
Model performance
Wan 2.2 video generation in less than 60 seconds
Faraz Shahsavan
3 others
Infrastructure
Testing Llama 3.3 70B inference performance on NVIDIA GH200 in Lambda Cloud
Pankaj Gupta
1 other
Model performance
Driving model performance optimization: 2024 highlights
Pankaj Gupta
Model performance
How we built production-ready speculative decoding with TensorRT-LLM
Pankaj Gupta
2 others
Model performance
A quick introduction to speculative decoding
Pankaj Gupta
2 others
Infrastructure
Evaluating NVIDIA H200 Tensor Core GPUs for LLM inference
Pankaj Gupta
1 other
Model performance
How to serve 10,000 fine-tuned LLMs from a single GPU
Pankaj Gupta
1 other
Infrastructure
Using fractional H100 GPUs for efficient model serving
Matt Howard
3 others