One gateway · every major model · UPI billing
One API key for every AI model.
Pay with UPI.
Yantra is an OpenAI-compatible gateway. Point your code at one endpoint, use a single key, and our routing engine sends each request to the best provider — OpenAI, Anthropic, Gemini, DeepSeek, Mistral or a local model — with automatic failover. Billed per token in rupees.
$ curl https://your-gateway/v1/chat/completions -H "Authorization: Bearer yk_live_…"
Intelligent Request Routing
LIVE
Idle — send a request in the playground to watch a live route.
16
Models, one key
6
Providers routed
99.9%
Failover uptime target
₹10
Minimum top-up (UPI)
How it works
Three steps from zero to your first completion.
01
Generate an API key
Create a key in seconds. One yk_live_… key works for every model on the platform — no per-model credentials.
02
Add credits with UPI
Top up in rupees by scanning a UPI QR or opening any UPI app. Balance is metered per token — pay only for what you use.
03
Call one endpoint
Point the OpenAI SDK at the gateway. Pick a model or say auto and the routing engine chooses the best one, with failover.
Why Yantra
A single, billed, OpenAI-compatible front door to the whole model ecosystem.
One key, every model
Stop juggling a dozen provider dashboards and secrets. Generate one key and reach GPT, Claude, Gemini, DeepSeek, Mistral and local models alike.
Intelligent routing + failover
Ask for a capability — auto:cheap, auto:reasoning, auto:fast — and the engine routes to the right model, retrying on a sibling if a provider errors.
OpenAI-compatible
Drop-in /v1/chat/completions, streaming included. Change two lines — base URL and key — and your existing code just works.
Rupee billing over UPI
No cards, no forex. Recharge with UPI and get transparent per-token pricing in INR, metered to the paisa.
Cost-saving auto mode
Let Yantra pick the cheapest model that satisfies the request, so routine calls cost a fraction of always-on flagship pricing.
Local & private models
Route sensitive workloads to self-hosted Llama or Qwen on your own hardware — same key, same API, zero per-token cost.