all systems normal · 99.99% uptime

Every AI. One API.

A single OpenAI-compatible endpoint that fans out to Claude, GPT, Gemini, Veo, Sora, Suno — and 100+ more. Production-grade routing, transparent Credits pricing, zero vendor lock-in.

Get an API key $ read the docs· 200 Credits free on signup · no card

100+

models

99.99%

routing SLA

24 / 7

failover

fig. 00 · unified gatewaylive topology

many models · one gatewayroute active

orux-quickstart.shbash · v1

$ curl https://orux.top/api/v1/chat/completions

# 1. Drop-in OpenAI base URL
curl https://orux.top/api/v1/chat/completions \
  -H "Authorization: Bearer sk-app-…" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-opus-4.7",
    "messages": [{ "role":"user","content":"Hello, world." }] 
  }'

POST /api/v1/chat/completions200 OK · 240ms

1{
2 "id": "chatcmpl-9aA…",
3 "model": "claude-opus-4.7",
4 "choices": [{ "message": { "content": "Hello, world." } }],
5 "usage": { "total_credits": 3.6, "latency_ms": 240 },
6 "x_orux": { "route": "auto", "fallback": "none" }
7}

credits: 3.6 Cr≈$0.018 auto-routed

routed through Orux AI100+ models

AnthropicClaude Opus 4.7Chat

OpenAIGPT-5.5Chat

GoogleGemini 3 ProChat

DeepSeekDeepSeek V4Chat

GoogleVeo 3.1Video

OpenAISora 2Video

SunoSuno v5Audio

MidjourneyMJ FastImage

how it works

One request. Every upstream.

Orux AI sits between your app and every model provider. Pick a model id — Orux AI resolves it to the cheapest healthy upstream, retries on failure, and bills in a single unit (Credits).

OpenAI-compatible protocol · drop in as a base URL
Sub-200ms median routing overhead
Multi-channel failover — redundant upstreams behind every model
HMAC-signed webhooks for async tasks (video / music / Midjourney)

fig. 01 · request routingstateless · ~200ms overhead

engineering

Built for production.

Everything you need to ship AI features fast — without re-architecting every quarter.

fn.01●

Drop-in OpenAI

Same protocol. Same SDKs. Swap base URL and ship.

protocol · v1

fn.02●

100+ models

Claude Opus 4.7 · GPT-5.5 · Gemini 3 Pro · Veo · Sora · Suno · Midjourney.

catalog

fn.03●

Multi-channel failover

Routes around upstream outages. One provider dark ≠ you down.

resilience

fn.04●

Transparent Credits

Fixed 1 USD = 200 Cr. No expiry. No monthly minimum.

billing

fn.05●

Streaming + tools

SSE, function calling, JSON mode — 1:1 with OpenAI.

protocol · v1

fn.06●

Real observability

Per-model usage, cost, error rate, p95 latency — out of the box.

telemetry

product

Operate it like infrastructure.

Dashboard, docs and playground — engineered for the same humans who read RFCs.

block.01

Real-time dashboard

Live cost, calls, latency and error rate — per key, per model.

Credits balance + burn rate
24h call & p95 latency charts
Per-key credit quota + per-user concurrency

●https://orux.top/dashboard

Credits

12,480Cr

24h calls

38,214+12%

p95 latency

142ms

Latency · last 24hP50 / P95

Recent callslast 5

time	model	cr	ms	status
12:04	claude-opus-4.7	3.6	240	200
12:03	gpt-5.5	2.1	180	200
12:02	gemini-3-pro	1.8	210	200
12:01	veo-3.1-quality	180.0	async	queued
11:59	deepseek-v3.1	0.4	160	200

block.02

Docs that respect your time

Endpoint reference, code samples in 4 languages, and a real OpenAPI spec you can fork.

cURL · Node · Python · Go side-by-side
Live request inspector
OpenAPI 3.1 — downloadable

●https://orux.top/docs/api/chat

POST

/api/v1/chat/completions

OpenAI-compatible

cURLnodepythongo

curl https://orux.top/api/v1/chat/completions \
  -H "Authorization: Bearer sk-app-…" \
  -H "Content-Type: application/json" \
  -d '{ "model": "claude-opus-4.7",
        "messages": [{"role":"user","content":"hi"}] }'

modelstring

required

messagesarray

required

streamboolean

default: false

toolsarray

default: —

block.03

Playground on your own key

Try any model with your real sk — what you see is exactly what your app gets.

Streaming responses with first-byte timing
Image, video and music generation
Templates: 50+ curated prompts

●https://orux.top/playground

Response · streaminglive

youBuild me a rate-limited fetch wrapper.

claudeSure — here is a minimal token-bucket implementation in TypeScript:

// 200 req/min, burstable to 50
const limiter = tokenBucket(200, 50);
await limiter.acquire();
return fetch(url, init);

312 tokens · 1.2 Cr240ms first byte

pricing

Pay only for what you use.

Top up Credits when you need them. No subscription, no expiration, no surprise invoices.

view full pricing

tier	credits	cost (USD)	note
Free evaluation	200 Cr	$0	on signup · no card
Starter side-project	2,000 Cr	$10	one-time top-up
Growth most popular	20,000 Cr	$100	covers ~5k chat calls
Scale production	200,000 Cr	$1,000	priority routing

1 USD = 200 Credits · 1 CNY = 28 Credits · rate is fixed — what you see is what you pay

free tier · 200 Credits on signup

Ship faster. Outgrow vendor lock-in.

Email sign-up, no card. From signup to first response in under five minutes.

create your account $ quickstart