AI API Gateway

Access powerful AI models through a simple OpenAI-compatible API. Pay per token or subscribe for included usage.

Why CutaZo?

🔌

OpenAI Compatible

Drop-in replacement. Works with any OpenAI SDK or client.

Streaming Support

Real-time streaming responses for interactive applications.

💰

Flexible Pricing

Pay per token or subscribe for included usage with rate limits.

🔑

API Key Management

Create multiple keys, track usage, rotate anytime.

📊

Usage Dashboard

Monitor usage, costs, and performance in real-time.

🛡️

Rate Limiting

Built-in rate limiting and input validation for safety.

Models

cuta-max

Most capable model. $5 / $25 per 1M tokens (input/output).

cuta-opus

Premium reasoning model. $5 / $25 per 1M tokens.

claude-opus-4-6

Direct Claude access. $5 / $25 per 1M tokens.

cuta-sonnet

Fast & affordable. $3 / $15 per 1M tokens.

Pricing

Pay As You Go

$0/mo
  • Pay per token
  • $1 welcome bonus
  • All models
  • 60 req/min
Get Started

Pro

$20/mo
  • 18K tokens / 5h
  • Then pay per token
  • All models
  • Priority
Subscribe

Max 20x

$200/mo
  • 220K tokens / 5h
  • Then pay per token
  • All models
  • Priority
Subscribe

Quick Start

curl https://cutazo.com/v1/chat/completions \
  -H "Authorization: Bearer cz-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "cuta-sonnet",
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 1024
  }'