all systems normal · 99.99% uptime

Every AI. One API.

A single OpenAI-compatible endpoint that fans out to Claude, GPT, Gemini, Veo, Sora, Suno — and 100+ more. Production-grade routing, transparent Credits pricing, zero vendor lock-in.

Get an API key $ read the docs· 200 Credits free on signup · no card
100+
models
99.99%
routing SLA
24 / 7
failover
多种 AI 模型通过 Orux AI 网关汇聚为统一 API
fig. 00 · unified gatewaylive topology
many models · one gatewayroute active
orux-quickstart.shbash · v1
$ curl https://orux.top/api/v1/chat/completions
# 1. Drop-in OpenAI base URL
curl https://orux.top/api/v1/chat/completions \
  -H "Authorization: Bearer sk-app-…" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-opus-4.7",
    "messages": [{ "role":"user","content":"Hello, world." }] 
  }'
POST /api/v1/chat/completions200 OK · 240ms
  1. 1{
  2. 2 "id": "chatcmpl-9aA…",
  3. 3 "model": "claude-opus-4.7",
  4. 4 "choices": [{ "message": { "content": "Hello, world." } }],
  5. 5 "usage": { "total_credits": 3.6, "latency_ms": 240 },
  6. 6 "x_orux": { "route": "auto", "fallback": "none" }
  7. 7}
credits: 3.6 Cr$0.018 auto-routed
routed through Orux AI100+ models
AnthropicClaude Opus 4.7Chat
OpenAIGPT-5.5Chat
GoogleGemini 3 ProChat
DeepSeekDeepSeek V4Chat
GoogleVeo 3.1Video
OpenAISora 2Video
SunoSuno v5Audio
MidjourneyMJ FastImage
how it works

One request. Every upstream.

Orux AI sits between your app and every model provider. Pick a model id — Orux AI resolves it to the cheapest healthy upstream, retries on failure, and bills in a single unit (Credits).

  • OpenAI-compatible protocol · drop in as a base URL
  • Sub-200ms median routing overhead
  • Multi-channel failover — redundant upstreams behind every model
  • HMAC-signed webhooks for async tasks (video / music / Midjourney)
fig. 01 · request routingstateless · ~200ms overhead
CLIENTyour-appORUX/v1gatewayClaudeAnthropicGPTOpenAIGeminiGoogleVeo / SoraVideoSunoAudio
engineering

Built for production.

Everything you need to ship AI features fast — without re-architecting every quarter.

fn.01

Drop-in OpenAI

Same protocol. Same SDKs. Swap base URL and ship.

protocol · v1
fn.02

100+ models

Claude Opus 4.7 · GPT-5.5 · Gemini 3 Pro · Veo · Sora · Suno · Midjourney.

catalog
fn.03

Multi-channel failover

Routes around upstream outages. One provider dark ≠ you down.

resilience
fn.04

Transparent Credits

Fixed 1 USD = 200 Cr. No expiry. No monthly minimum.

billing
fn.05

Streaming + tools

SSE, function calling, JSON mode — 1:1 with OpenAI.

protocol · v1
fn.06

Real observability

Per-model usage, cost, error rate, p95 latency — out of the box.

telemetry
product

Operate it like infrastructure.

Dashboard, docs and playground — engineered for the same humans who read RFCs.

block.01

Real-time dashboard

Live cost, calls, latency and error rate — per key, per model.

  • Credits balance + burn rate
  • 24h call & p95 latency charts
  • Per-key credit quota + per-user concurrency
https://orux.top/dashboard
Credits
12,480Cr
24h calls
38,214+12%
p95 latency
142ms
Latency · last 24hP50 / P95
050100150200-24h-16h-8h-1h
Recent callslast 5
timemodelcrmsstatus
12:04claude-opus-4.73.6240200
12:03gpt-5.52.1180200
12:02gemini-3-pro1.8210200
12:01veo-3.1-quality180.0asyncqueued
11:59deepseek-v3.10.4160200
block.02

Docs that respect your time

Endpoint reference, code samples in 4 languages, and a real OpenAPI spec you can fork.

  • cURL · Node · Python · Go side-by-side
  • Live request inspector
  • OpenAPI 3.1 — downloadable
https://orux.top/docs/api/chat
POST

/api/v1/chat/completions

OpenAI-compatible
cURLnodepythongo
curl https://orux.top/api/v1/chat/completions \
  -H "Authorization: Bearer sk-app-…" \
  -H "Content-Type: application/json" \
  -d '{ "model": "claude-opus-4.7",
        "messages": [{"role":"user","content":"hi"}] }'
modelstring
required
messagesarray
required
streamboolean
default: false
toolsarray
default: —
block.03

Playground on your own key

Try any model with your real sk — what you see is exactly what your app gets.

  • Streaming responses with first-byte timing
  • Image, video and music generation
  • Templates: 50+ curated prompts
https://orux.top/playground
Response · streaminglive
youBuild me a rate-limited fetch wrapper.
claudeSure — here is a minimal token-bucket implementation in TypeScript:
// 200 req/min, burstable to 50
const limiter = tokenBucket(200, 50);
await limiter.acquire();
return fetch(url, init);
312 tokens · 1.2 Cr240ms first byte
pricing

Pay only for what you use.

Top up Credits when you need them. No subscription, no expiration, no surprise invoices.

view full pricing
tiercreditscost (USD)note
Free
evaluation
200 Cr$0on signup · no card
Starter
side-project
2,000 Cr$10one-time top-up
Growth
most popular
20,000 Cr$100covers ~5k chat calls
Scale
production
200,000 Cr$1,000priority routing
1 USD = 200 Credits · 1 CNY = 28 Credits · rate is fixed — what you see is what you pay
free tier · 200 Credits on signup

Ship faster. Outgrow vendor lock-in.

Email sign-up, no card. From signup to first response in under five minutes.