Overview
ModelStack provides access to models from three leading AI providers through a single API. All models use the OpenAI-compatible chat completions format.
Versioned model variants (e.g., claude-sonnet-4-5-20250929) are also supported. Any model with a supported prefix (claude-, gpt-, o1-, o3-, o4-, gemini-) is accepted.
Anthropic
Model ID Input / 1M tokens Output / 1M tokens claude-opus-4-6$5.00 $25.00 claude-opus-4-5$5.00 $25.00 claude-opus-4-1$15.00 $75.00 claude-sonnet-4-5$3.00 $15.00 claude-sonnet-4-0$3.00 $15.00 claude-opus-4$15.00 $75.00 claude-haiku-4-5$1.00 $5.00 claude-3-7-sonnet-20250219$3.00 $15.00 claude-3-5-sonnet-20241022$3.00 $15.00 claude-3-5-haiku-20241022$0.80 $4.00 claude-3-haiku-20240307$0.25 $1.25
Versioned variants also supported:
claude-opus-4-5-20251101, claude-opus-4-1-20250805, claude-sonnet-4-5-20250929, claude-sonnet-4-20250514, claude-opus-4-20250514, claude-haiku-4-5-20251001
OpenAI
Model ID Input / 1M tokens Output / 1M tokens gpt-4o$2.50 $10.00 gpt-4o-mini$0.15 $0.60 gpt-4-turbo$10.00 $30.00 o1$15.00 $60.00 o1-mini$1.10 $4.40 o3$10.00 $40.00 o3-mini$1.10 $4.40 o4-mini$1.10 $4.40
Versioned variants also supported:
gpt-4o-2024-11-20, gpt-4o-mini-2024-07-18
Google
Model ID Input / 1M tokens Output / 1M tokens gemini-2.5-pro$1.25 $10.00 gemini-2.5-flash$0.15 $0.60 gemini-2.0-flash$0.10 $0.40 gemini-2.0-flash-lite$0.075 $0.30 gemini-1.5-pro$1.25 $5.00 gemini-1.5-flash$0.075 $0.30
Model Selection Guide
Best Quality claude-opus-4-1 or o1 — Most capable models for complex reasoning, code generation, and nuanced tasks.
Best Balance claude-sonnet-4-5 or gpt-4o — Excellent quality at moderate cost. Great for production workloads.
Best Speed gemini-2.5-flash or gpt-4o-mini — Fast responses at low cost. Ideal for high-volume, latency-sensitive tasks.
Best Value gemini-2.0-flash-lite or claude-3-haiku-20240307 — Lowest cost per token for budget-conscious applications.