AI Resources

Managed LLM API Pricing & Model Specification Database

This database lists current API token pricing, context lengths, and caching support for major language models across OpenAI, Anthropic, Google, and DeepSeek.

Interactive LLM Pricing Specs Explorer

Model NameProviderContext SizeInput / MTokOutput / MTokCached / MTokAction
DeepSeek V3DeepSeek128k$0.140$0.28$0.014
GPT-4o-miniOpenAI128k$0.150$0.60$0.075
Gemini 1.5 FlashGoogle2M$0.075$0.30$0.037
GPT-4oOpenAI128k$2.500$10.00$1.250
Claude 3.5 SonnetAnthropic200k$3.000$15.00$0.300
Gemini 1.5 ProGoogle2M$1.250$5.00$0.625
Claude 3.5 HaikuAnthropic200k$0.800$4.00$0.080
Mistral Large 2Mistral128k$2.000$6.00$2.000
Llama 3.3 70B (Serverless)Meta128k$0.350$0.40$0.350

Estimate Your Billing on: GPT-4o

Cost Per Query
$0.0075
Daily Running Bill
$0.75
Monthly API Expense
$22.50

1. Model Selection Criteria

When selecting a model, balance reasoning capability, latency, and token cost. Use this database to compare specifications and identify cost-effective alternatives.

2. Comparative Model Grid

Below is a consolidated list of model specifications and input/output pricing.

Frequently Asked Questions

How is prompt caching billed?

Providers offer discounts on input tokens that match cached prefixes. Anthropic discounts reads by 90%, Google and OpenAI discount reads by 50%.

What is DeepSeek pricing?

DeepSeek V3 costs $0.14 / MTok input (cached is $0.014 / MTok) and $0.28 / MTok output, making it the cheapest high-capability API available.