AI Resources

Managed LLM API Pricing & Model Specification Database

This database lists current API token pricing, context lengths, and caching support for major language models across OpenAI, Anthropic, Google, and DeepSeek.

Interactive LLM Pricing Specs Explorer

Model Name	Provider	Context Size	Input / MTok	Output / MTok	Cached / MTok
DeepSeek V3	DeepSeek	128k	$0.140	$0.28	$0.014
GPT-4o-mini	OpenAI	128k	$0.150	$0.60	$0.075
Gemini 1.5 Flash	Google	2M	$0.075	$0.30	$0.037
GPT-4o	OpenAI	128k	$2.500	$10.00	$1.250
Claude 3.5 Sonnet	Anthropic	200k	$3.000	$15.00	$0.300
Gemini 1.5 Pro	Google	2M	$1.250	$5.00	$0.625
Claude 3.5 Haiku	Anthropic	200k	$0.800	$4.00	$0.080
Mistral Large 2	Mistral	128k	$2.000	$6.00	$2.000
Llama 3.3 70B (Serverless)	Meta	128k	$0.350	$0.40	$0.350

Estimate Your Billing on: GPT-4o

Input Tokens / Query

Output Tokens / Query

Queries / Day

Cost Per Query

$0.0075

Daily Running Bill

$0.75

Monthly API Expense

$22.50

1. Model Selection Criteria

When selecting a model, balance reasoning capability, latency, and token cost. Use this database to compare specifications and identify cost-effective alternatives.

2. Comparative Model Grid

Below is a consolidated list of model specifications and input/output pricing.

Frequently Asked Questions

How is prompt caching billed?

Providers offer discounts on input tokens that match cached prefixes. Anthropic discounts reads by 90%, Google and OpenAI discount reads by 50%.

What is DeepSeek pricing?

DeepSeek V3 costs $0.14 / MTok input (cached is $0.014 / MTok) and $0.28 / MTok output, making it the cheapest high-capability API available.