Our Series E: we raised $300M at a $5B valuation to power a multi-model future.
READ
Product
Product
Platform
Platform
Developer
Developer
Resources
Resources
Research
Research
Customers
Customers
Pricing
Pricing
Log in
Get started
Pankaj Gupta
Co-Founder
Model performance
Benchmarking fast Mistral 7B inference
Abu Qader
3 others
Model performance
33% faster LLM inference with FP8 quantization
Pankaj Gupta
1 other
Model performance
FP8: Efficient model inference with 8-bit floating point numbers
Pankaj Gupta
1 other
Model performance
40% faster Stable Diffusion XL inference with NVIDIA TensorRT
Pankaj Gupta
2 others
Model performance
Unlocking the full power of NVIDIA H100 GPUs for ML inference with TensorRT
Pankaj Gupta
1 other
Model performance
Faster Mixtral inference with TensorRT-LLM and quantization
Pankaj Gupta
2 others
Infrastructure
Technical deep dive: Truss live reload
Pankaj Gupta
1
2
Explore Baseten today
Start deploying
Talk to an engineer
Pankaj Gupta - Co-Founder