
Kimi K2 Thinking

A 1 trillion parameter reasoning model for agents, coding, and writing

The smartest model in the world is now open source.

Baseten offers Dedicated Deployments for Kimi K2 Thinking powered by the Baseten Inference Stack.

Kimi K2 Thinking rivals GPT-5 and Claude Sonnet 4.5 on agentic, coding, and reasoning benchmarks. Deployments are OpenAI-compatible, so any OpenAI client works once the API key and base URL point at Baseten.

Kimi K2 Thinking rivals the top closed-source models on the market.

Input
# You can use this model with any of the OpenAI clients in any language!
# Simply change the API Key to get started

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://inference.baseten.co/v1"
)

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Thinking",
    messages=[
        {
            "role": "user",
            "content": "Implement Hello World in Python"
        }
    ],
    stream=True,
    stream_options={
        "include_usage": True,
        "continuous_usage_stats": True
    },
    top_p=1,
    max_tokens=1000,
    temperature=1,
    presence_penalty=0,
    frequency_penalty=0
)

# Usage-only chunks can arrive with an empty `choices` list, so check before indexing.
for chunk in response:
    if chunk.choices and chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="", flush=True)
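The request above streams its output and asks for usage statistics, so a caller will often want the full text and the token counts once the stream ends. The following is a minimal sketch of one way to do that, not part of the Baseten example: `collect_stream` and `_fake_chunk` are hypothetical names, and the stand-in chunks only mimic the attributes the loop above reads (`choices`, `delta.content`, `usage`) so the sketch runs without an API key.

```python
from types import SimpleNamespace


def collect_stream(chunks):
    """Concatenate streamed text and keep the last usage report seen."""
    parts = []
    usage = None
    for chunk in chunks:
        # Usage-only chunks can arrive with an empty `choices` list.
        if chunk.choices:
            delta = chunk.choices[0].delta
            if delta.content is not None:
                parts.append(delta.content)
        # Keep the most recent usage object; earlier ones are superseded.
        if getattr(chunk, "usage", None) is not None:
            usage = chunk.usage
    return "".join(parts), usage


def _fake_chunk(content=None, usage=None, with_choice=True):
    # Hypothetical stand-in for a streamed chunk, for offline demonstration only.
    choices = (
        [SimpleNamespace(delta=SimpleNamespace(content=content))]
        if with_choice
        else []
    )
    return SimpleNamespace(choices=choices, usage=usage)


demo_chunks = [
    _fake_chunk("print("),
    _fake_chunk('"Hello, World!")'),
    _fake_chunk(with_choice=False, usage=SimpleNamespace(total_tokens=183)),
]

text, usage = collect_stream(demo_chunks)
print(text)
print(usage.total_tokens)
```

With a live deployment, the `response` iterator from the request above could be passed in place of `demo_chunks`.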
JSON output
{
    "id": "143",
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "logprobs": null,
            "message": {
                "content": "[Model output here]",
                "role": "assistant",
                "audio": null,
                "function_call": null,
                "tool_calls": null
            }
        }
    ],
    "created": 1741224586,
    "model": "",
    "object": "chat.completion",
    "service_tier": null,
    "system_fingerprint": null,
    "usage": {
        "completion_tokens": 145,
        "prompt_tokens": 38,
        "total_tokens": 183,
        "completion_tokens_details": null,
        "prompt_tokens_details": null
    }
}
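The sample output has the shape of a complete `chat.completion` object, which is what a non-streaming request would be expected to return. As a rough sketch of reading its main fields, the snippet below parses a trimmed copy of that JSON with the standard library. The `summarize` helper is a hypothetical name introduced for illustration; the field names and values are taken from the sample.

```python
import json

# Trimmed copy of the sample response shown above.
raw = """
{
    "id": "143",
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "message": {"content": "[Model output here]", "role": "assistant"}
        }
    ],
    "object": "chat.completion",
    "usage": {"completion_tokens": 145, "prompt_tokens": 38, "total_tokens": 183}
}
"""


def summarize(response):
    """Pull the reply text, stop reason, and token counts out of a response dict."""
    first = response["choices"][0]
    usage = response.get("usage") or {}
    return {
        "content": first["message"]["content"],
        "finish_reason": first["finish_reason"],
        "prompt_tokens": usage.get("prompt_tokens"),
        "completion_tokens": usage.get("completion_tokens"),
        "total_tokens": usage.get("total_tokens"),
    }


summary = summarize(json.loads(raw))
print(summary["content"])
print(summary["total_tokens"])
```

A `finish_reason` of `"stop"` indicates the model ended its reply on its own. A value of `"length"` would suggest the `max_tokens` limit cut it off.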
