One API.
100+ LLMs unified.
Connect to OpenAI, Claude, Gemini, DeepSeek, Qwen and 40+ providers through a single endpoint. Auto retry, quota control, cost analytics—all in one place.
50+
Providers
100+
Billable models
50+
API routes
99.9%
Uptime
Connected
All major LLMs · One gateway
Core features
Built for production AI workloads
Smart routing · auto retry
Weighted multi-channel routing with automatic failover, rate limiting, graceful degradation and quota isolation.
Transparent billing
Per-token billing in real time. Cache-hit discounts detected automatically. One line per call.
Zero migration cost
Use the OpenAI / Claude / Gemini SDKs you already have. Just point baseURL at us.
baseURL: hivellm.io/v1
Enterprise-grade security
Token groups, IP allow-list, model black/white list, full audit log.
High concurrency
Auto load balancing. 8000+ QPS per instance.
Observable
Tokens, requests, latency and errors in one dashboard.
Multi-tenant
Team / project / user — three-tier permission isolation.
Self-hostable
Run on your own infra, fully under your control. No vendor lock-in.
Transparent pricing
All models · up to 45% off
Every LLM, billed per token in real time. Per-vendor discount shown below; prices follow upstream list.
Price per 1M tokens — CNY (USD reference)
Claude
15 models 25% off| Model | Type | Input | Output | Discount |
|---|---|---|---|---|
| claude-opus-4-7 | Text | ¥35.5 $5 | ¥177.5 $25 | 25% off |
| claude-opus-4-6 | Text | ¥35.5 $5 | ¥177.5 $25 | 25% off |
| claude-sonnet-4-6 | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-opus-4-5-20251101 | Text | ¥35.5 $5 | ¥177.5 $25 | 25% off |
| claude-haiku-4-5-20251001 | Text | ¥7.1 $1 | ¥35.5 $5 | 25% off |
| claude-haiku-4-5-20251001-thinking | Text | ¥7.1 $1 | ¥35.5 $5 | 25% off |
| claude-3-7-sonnet-20250219-thinking | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-opus-4-1-20250805 | Text | ¥106.5 $15 | ¥532.5 $75 | 25% off |
| claude-3-7-sonnet-20250219 | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-opus-4-20250514 | Text | ¥106.5 $15 | ¥532.5 $75 | 25% off |
| claude-sonnet-4-20250514-thinking | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-sonnet-4-5-20250929-thinking | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-3-haiku-20240307 | Text | ¥1.78 $0.25 | ¥8.88 $1.25 | 25% off |
| claude-sonnet-4-5-20250929 | Text | ¥21.3 $3 | ¥106.5 $15 | 25% off |
| claude-opus-4-1-20250805-thinking | Text | ¥106.5 $15 | ¥532.5 $75 | 25% off |
Claude
15 models 25% off
-
claude-opus-4-7Text in ¥35.5 $5 out ¥177.5 $25 -
claude-opus-4-6Text in ¥35.5 $5 out ¥177.5 $25 -
claude-sonnet-4-6Text in ¥21.3 $3 out ¥106.5 $15 -
claude-opus-4-5-20251101Text in ¥35.5 $5 out ¥177.5 $25 -
claude-haiku-4-5-20251001Text in ¥7.1 $1 out ¥35.5 $5 -
claude-haiku-4-5-20251001-thinkingText in ¥7.1 $1 out ¥35.5 $5 -
claude-3-7-sonnet-20250219-thinkingText in ¥21.3 $3 out ¥106.5 $15 -
claude-opus-4-1-20250805Text in ¥106.5 $15 out ¥532.5 $75 -
claude-3-7-sonnet-20250219Text in ¥21.3 $3 out ¥106.5 $15 -
claude-opus-4-20250514Text in ¥106.5 $15 out ¥532.5 $75 -
claude-sonnet-4-20250514-thinkingText in ¥21.3 $3 out ¥106.5 $15 -
claude-sonnet-4-5-20250929-thinkingText in ¥21.3 $3 out ¥106.5 $15 -
claude-3-haiku-20240307Text in ¥1.78 $0.25 out ¥8.88 $1.25 -
claude-sonnet-4-5-20250929Text in ¥21.3 $3 out ¥106.5 $15 -
claude-opus-4-1-20250805-thinkingText in ¥106.5 $15 out ¥532.5 $75
Gemini
13 models 25% off| Model | Type | Input | Output | Discount |
|---|---|---|---|---|
| gemini-3.1-flash-image-preview | Text | ¥3.55 $0.5 | ¥21.3 $3 | 25% off |
| Image | — — | ¥426 $60 | 25% off | |
| gemini-3.1-pro-preview | Text · ≤200K tokens | ¥14.2 $2 | ¥85.2 $12 | 25% off |
| Text · >200K tokens | ¥28.4 $4 | ¥127.8 $18 | 25% off | |
| gemini-3-flash-preview | Text | ¥3.55 $0.5 | ¥21.3 $3 | 25% off |
| gemini-3-pro-image-preview | Text | ¥14.2 $2 | ¥85.2 $12 | 25% off |
| Image | — — | ¥852 $120 | 25% off | |
| gemini-2.5-flash | Text | ¥2.13 $0.3 | ¥17.75 $2.5 | 25% off |
| gemini-2.5-flash-preview-09-2025 | Text | ¥2.13 $0.3 | ¥17.75 $2.5 | 25% off |
| gemini-3-pro-preview | Text · ≤200K tokens | ¥14.2 $2 | ¥85.2 $12 | 25% off |
| Text · >200K tokens | ¥28.4 $4 | ¥127.8 $18 | 25% off | |
| gemini-2.0-flash | Text | ¥0.71 $0.1 | ¥2.84 $0.4 | 25% off |
| gemini-2.5-flash-lite | Text | ¥0.71 $0.1 | ¥2.84 $0.4 | 25% off |
| gemini-2.5-flash-image | Image | ¥2.13 $0.3 | ¥213 $30 | 25% off |
| gemini-2.5-pro | Text · ≤200K tokens | ¥8.88 $1.25 | ¥71 $10 | 25% off |
| Text · >200K tokens | ¥17.75 $2.5 | ¥106.5 $15 | 25% off | |
| gemini-2.5-flash-lite-preview-09-2025 | Text | ¥0.71 $0.1 | ¥2.84 $0.4 | 25% off |
| gemini-2.5-flash-image-preview | Image | ¥2.13 $0.3 | ¥213 $30 | 25% off |
Gemini
13 models 25% off
-
gemini-3.1-flash-image-previewText in ¥3.55 $0.5 out ¥21.3 $3Image in — — out ¥426 $60 -
gemini-3.1-pro-previewText · ≤200K tokens in ¥14.2 $2 out ¥85.2 $12Text · >200K tokens in ¥28.4 $4 out ¥127.8 $18 -
gemini-3-flash-previewText in ¥3.55 $0.5 out ¥21.3 $3 -
gemini-3-pro-image-previewText in ¥14.2 $2 out ¥85.2 $12Image in — — out ¥852 $120 -
gemini-2.5-flashText in ¥2.13 $0.3 out ¥17.75 $2.5 -
gemini-2.5-flash-preview-09-2025Text in ¥2.13 $0.3 out ¥17.75 $2.5 -
gemini-3-pro-previewText · ≤200K tokens in ¥14.2 $2 out ¥85.2 $12Text · >200K tokens in ¥28.4 $4 out ¥127.8 $18 -
gemini-2.0-flashText in ¥0.71 $0.1 out ¥2.84 $0.4 -
gemini-2.5-flash-liteText in ¥0.71 $0.1 out ¥2.84 $0.4 -
gemini-2.5-flash-imageImage in ¥2.13 $0.3 out ¥213 $30 -
gemini-2.5-proText · ≤200K tokens in ¥8.88 $1.25 out ¥71 $10Text · >200K tokens in ¥17.75 $2.5 out ¥106.5 $15 -
gemini-2.5-flash-lite-preview-09-2025Text in ¥0.71 $0.1 out ¥2.84 $0.4 -
gemini-2.5-flash-image-previewImage in ¥2.13 $0.3 out ¥213 $30
OpenAI
15 models 45% off| Model | Type | Input | Output | Discount |
|---|---|---|---|---|
| gpt-5.5-pro | Text | ¥213 $30 | ¥1278 $180 | 45% off |
| gpt-5.5 | Text | ¥35.5 $5 | ¥213 $30 | 45% off |
| gpt-5.4-pro | Text | ¥213 $30 | ¥1278 $180 | 45% off |
| gpt-5.4 | Text | ¥17.75 $2.5 | ¥106.5 $15 | 45% off |
| gpt-5.2 | Text | ¥12.43 $1.75 | ¥99.4 $14 | 45% off |
| gpt-5-pro | Text | ¥106.5 $15 | ¥852 $120 | 45% off |
| gpt-5.1 | Text | ¥8.88 $1.25 | ¥71 $10 | 45% off |
| gpt-5 | Text | ¥8.88 $1.25 | ¥71 $10 | 45% off |
| gpt-5-mini | Text | ¥1.77 $0.25 | ¥14.2 $2 | 45% off |
| gpt-5-nano | Text | ¥0.355 $0.05 | ¥2.84 $0.4 | 45% off |
| gpt-4.1-mini | Text | ¥2.84 $0.4 | ¥11.36 $1.6 | 45% off |
| gpt-4o-2024-08-06 | Text | ¥17.75 $2.5 | ¥71 $10 | 45% off |
| gpt-4.1-2025-04-14 | Text | ¥14.2 $2 | ¥56.8 $8 | 45% off |
| gpt-4.1-nano | Text | ¥0.71 $0.1 | ¥2.84 $0.4 | 45% off |
| gpt-4o-mini | Text | ¥2.84 $0.4 | ¥10.65 $1.5 | 45% off |
OpenAI
15 models 45% off
-
gpt-5.5-proText in ¥213 $30 out ¥1278 $180 -
gpt-5.5Text in ¥35.5 $5 out ¥213 $30 -
gpt-5.4-proText in ¥213 $30 out ¥1278 $180 -
gpt-5.4Text in ¥17.75 $2.5 out ¥106.5 $15 -
gpt-5.2Text in ¥12.43 $1.75 out ¥99.4 $14 -
gpt-5-proText in ¥106.5 $15 out ¥852 $120 -
gpt-5.1Text in ¥8.88 $1.25 out ¥71 $10 -
gpt-5Text in ¥8.88 $1.25 out ¥71 $10 -
gpt-5-miniText in ¥1.77 $0.25 out ¥14.2 $2 -
gpt-5-nanoText in ¥0.355 $0.05 out ¥2.84 $0.4 -
gpt-4.1-miniText in ¥2.84 $0.4 out ¥11.36 $1.6 -
gpt-4o-2024-08-06Text in ¥17.75 $2.5 out ¥71 $10 -
gpt-4.1-2025-04-14Text in ¥14.2 $2 out ¥56.8 $8 -
gpt-4.1-nanoText in ¥0.71 $0.1 out ¥2.84 $0.4 -
gpt-4o-miniText in ¥2.84 $0.4 out ¥10.65 $1.5
Prices follow upstream list. Final pricing as shown in the console at top-up time.
Plug-and-play
Change one line. Ship.
Drop-in replacement for OpenAI, Claude and Gemini — SDKs and REST.
from openai import OpenAI
client = OpenAI(
base_url="https://api.hivellm.io/v1", # ← only line that changes
api_key="hivellm-xxxxx",
)
resp = client.chat.completions.create(
model="gpt-4o", # or any of 100+ models
messages=[{"role": "user", "content": "hi"}],
)
print(resp.choices[0].message.content)