Control API access across your team. Set per-key budgets, track every request, and manage tokens — all in one dashboard.
# Set up in 30 seconds
$ export ANTHROPIC_BASE_URL=https://api.claudegateway.shop
$ export ANTHROPIC_API_KEY=cg_live_xxxx
$ python demo.py
Connected! Using Claude Opus 4.6
Budget remaining: 98,420 / 100,000 tokens
Window resets in: 4h 32m
Built for teams, freelancers, and anyone who wants real control over their Claude API usage.
Set spending limits on every API key. 5-hour rolling windows auto-reset so your costs stay predictable.
Track every request — model used, tokens consumed, latency, and status. All in a clean dashboard.
Each key gets its own rate limits and expiry. Perfect for multi-tenant apps and team workflows.
Server-sent events flow straight through. First token arrives in milliseconds, no buffering.
Swap your Anthropic base URL and you're done. Works with Claude Code, Python SDK, and TypeScript SDK.
Repeated context is cached automatically. Cache tokens don't eat into your budget.
No complex setup. No infrastructure to manage. Just your API keys and you're live.
Pick your plan, set your token budget, and create a key in seconds from the dashboard.
Change the base URL to our gateway and paste your key. No other code changes needed.
Start building with the tools you already know. Monitor usage and budgets from the dashboard.
Three models, one gateway. Switch between them without changing your code.
The sweet spot for everyday tasks — fast, reliable, and great value.
Top-tier reasoning and coding. Use when quality can't be compromised.
Lightning-quick responses for classification, extraction, and high-volume jobs.