Our Series E: we raised $300M at a $5B valuation to power a multi-model future. READ

Model library

Browse our library of open source models that are ready to deploy behind an API endpoint in seconds.

🔥 Trending models

large language models

See all
Kimi
LLM

Kimi K2.5

2.5
DeepSeek Logo
LLM

DeepSeek V3.2

V3.2 - B200
Z AI
Model API
LLM

GLM 4.7

4.7
Z AI
LLM

GLM-4.6V

4.6 - Vision

text to speech models

See all
Canopy Labs Logo
Text to speech

Orpheus 3B Websockets

TRT-LLM - H100 MIG 40GB
Canopy Labs Logo
Text to speech

Orpheus TTS

TRT-LLM - H100 MIG 40GB
three triangles with the bottom edge missing inside each other
Text to speech

MARS6

V6 - L4
Coqui
Text to speech

XTTS V2

T4

transcription models

See all
OpenAI logo
Transcription

Whisper Large V3

V3 - H100 MIG 40GB
OpenAI logo
Transcription

Whisper Large V3 Turbo

V3 - Turbo - H100 MIG 40GB
OpenAI logo
Transcription

Whisper Large V2

V2 - H100 MIG 40GB
Fixie Logo
Transcription

Ultravox v0.6 70B

v0.6 - H100
Mistral AI logo
Transcription

Voxtral Small 24B

2507 - Small - H100

image generation models

See all
Stability AI logo
Image generation

Stable Diffusion XL

XL 1.0 - L4
Qwen Logo
Image generation

Qwen Image

Text-to-Image
Fotographer AI
Image generation

ZenCtrl

Custom Server - H100
ByteDance logo
Image generation

SDXL Lightning

1.0 - Lightning - A100
Stability AI logo
Image generation

Stable Diffusion 3 Medium

3 - A100
Stability AI logo
Image generation

SDXL ControlNet

XL 1.0 - Controlnet - L4

embedding models

See all
google logo
Embedding

EmbeddingGemma

Embedding
Qwen Logo
Embedding

Qwen3 8B Reranker

BEI - H100 MIG 40GB
Qwen Logo
Embedding

Qwen3 8B Embedding

BEI - H100 MIG 40GB
Allen AI
Embedding

Tulu 3 8B Reward

V3 - Reward - BEI - H100 MIG 40GB
BAAI
Embedding

BGE Reranker M3

BEI - H100
BAAI
Embedding

BGE Embedding ICL

BEI - H100

DeepSeek models

See all
DeepSeek Logo
LLM

DeepSeek V3.2

V3.2 - B200
DeepSeek Logo
Model API
LLM

DeepSeek V3.1

V3.1 - B200
DeepSeek Logo
LLM

DeepSeek-R1 Llama 70B

R1 - Llama - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 32B

R1 - Qwen - TRT-LLM - H100
DeepSeek Logo
LLM

DeepSeek-R1 Qwen 7B

R1 - Qwen - TRT-LLM - H100 MIG 40GB
DeepSeek Logo
LLM

DeepSeek R1 0528

R1 - 0528 - B200

Qwen models

See all
Qwen Logo
Model API
LLM

Qwen3 Coder 480B

3 - Coder
Qwen Logo
LLM

Qwen 3 32B

V3 - TRT-LLM - H100
Qwen Logo
LLM

Qwen3 VL 235B

3 - Vision Language
Qwen Logo
LLM

Qwen3 Coder 30B

3 - Coder
Qwen Logo
Image generation

Qwen Image

Text-to-Image

Meta models

See all
Meta logo
LLM

Llama 3.3 70B Instruct

3.3 - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 8B Instruct

3.1 - Instruct - TRT-LLM - H100
Meta logo
LLM

Llama 3.1 405B Instruct

3.1 - Instruct - H100
Meta logo
LLM

Llama 3.2 11B Vision Instruct

3.2 - Vision - A100
Meta logo
LLM

Llama 4 Scout

V4.0 - Instruct - vLLM - H100
Meta logo
LLM

Llama 4 Maverick

V4.0 - Instruct - vLLM - B200