Our Series E: we raised $300M at a $5B valuation to power a multi-model future.
Tri Dao
Model performance
Kimi K2 Thinking at 140+ TPS on NVIDIA Blackwell
Abu Qader and 2 others
Model performance
How we made the fastest GPT-OSS on NVIDIA GPUs 60% faster
Tri Dao and 2 others
Model performance
How we run GPT OSS 120B at 500+ tokens per second on NVIDIA GPUs
Amir Haghighat and 4 others