Documentation Index
Fetch the complete documentation index at: https://docs.modelstack.cc/llms.txt
Use this file to discover all available pages before exploring further.
Overview
ModelStack provides access to models from three leading AI providers through a single API. All models use the OpenAI-compatible chat completions format.Versioned model variants (e.g.,
claude-sonnet-4-5-20250929) are also supported. Any model with a supported prefix (claude-, gpt-, o1-, o3-, o4-, gemini-) is accepted.Anthropic
| Model ID | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
claude-opus-4-6 | $5.00 | $25.00 |
claude-opus-4-5 | $5.00 | $25.00 |
claude-opus-4-1 | $15.00 | $75.00 |
claude-sonnet-4-5 | $3.00 | $15.00 |
claude-sonnet-4-0 | $3.00 | $15.00 |
claude-opus-4 | $15.00 | $75.00 |
claude-haiku-4-5 | $1.00 | $5.00 |
claude-3-7-sonnet-20250219 | $3.00 | $15.00 |
claude-3-5-sonnet-20241022 | $3.00 | $15.00 |
claude-3-5-haiku-20241022 | $0.80 | $4.00 |
claude-3-haiku-20240307 | $0.25 | $1.25 |
claude-opus-4-5-20251101, claude-opus-4-1-20250805, claude-sonnet-4-5-20250929, claude-sonnet-4-20250514, claude-opus-4-20250514, claude-haiku-4-5-20251001
OpenAI
| Model ID | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
gpt-4o | $2.50 | $10.00 |
gpt-4o-mini | $0.15 | $0.60 |
gpt-4-turbo | $10.00 | $30.00 |
o1 | $15.00 | $60.00 |
o1-mini | $1.10 | $4.40 |
o3 | $10.00 | $40.00 |
o3-mini | $1.10 | $4.40 |
o4-mini | $1.10 | $4.40 |
gpt-4o-2024-11-20, gpt-4o-mini-2024-07-18
| Model ID | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
gemini-2.5-pro | $1.25 | $10.00 |
gemini-2.5-flash | $0.15 | $0.60 |
gemini-2.0-flash | $0.10 | $0.40 |
gemini-2.0-flash-lite | $0.075 | $0.30 |
gemini-1.5-pro | $1.25 | $5.00 |
gemini-1.5-flash | $0.075 | $0.30 |
Model Selection Guide
Best Quality
claude-opus-4-1 or o1 — Most capable models for complex reasoning, code generation, and nuanced tasks.Best Balance
claude-sonnet-4-5 or gpt-4o — Excellent quality at moderate cost. Great for production workloads.Best Speed
gemini-2.5-flash or gpt-4o-mini — Fast responses at low cost. Ideal for high-volume, latency-sensitive tasks.Best Value
gemini-2.0-flash-lite or claude-3-haiku-20240307 — Lowest cost per token for budget-conscious applications.