Zero Markup Policy - We pass through exact provider pricing with no hidden fees. LLMLayer only charges $0.004 per search for our infrastructure. You get the same rates as going direct to providers, plus our powerful search capabilities.

Why LLMLayer Pricing?

  • ✅ No markup on models - Pay exactly what providers charge
  • ✅ One API, all models - No need for multiple provider accounts
  • ✅ $0.004 per search - Predictable infrastructure cost
  • ✅ Real-time web search - Built into every request
  • ✅ Instant switching - Change models with a single line of code
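
The one-line model switch above can be sketched as a plain HTTP request. Note that the endpoint URL and request fields below are hypothetical assumptions for illustration, not the documented LLMLayer API; check the API reference for the real shapes.

```python
import json
import urllib.request

# Hypothetical endpoint -- an assumption for illustration only.
API_URL = "https://api.llmlayer.ai/v1/search"

def search(query: str, model: str, api_key: str) -> dict:
    """Run one search-augmented request; switching models is just
    changing the `model` string."""
    body = json.dumps({"query": query, "model": model}).encode()
    req = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read())

# Same call, different model -- only one argument changes:
# search("latest LLM news", "openai/gpt-4o-mini", key)
# search("latest LLM news", "groq/llama-3.3-70b-versatile", key)
```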

Model Pricing

All prices are per million tokens. We update these automatically when providers change their rates.

OpenAI Models

| Model | Input ($/M tokens) | Output ($/M tokens) | Best For |
| --- | --- | --- | --- |
| openai/gpt-5 | $1.25 | $10.00 | Complex reasoning & analysis |
| openai/gpt-5-mini | $0.25 | $2.00 | Cost-effective reasoning |
| openai/gpt-5-nano | $0.05 | $0.40 | Balanced performance |
| openai/o3 | $2.00 | $8.00 | Complex reasoning & analysis |
| openai/o3-mini | $1.10 | $4.40 | Cost-effective reasoning |
| openai/o4-mini | $1.10 | $4.40 | Balanced performance |
| openai/gpt-4.1 | $2.00 | $8.00 | Advanced tasks |
| openai/gpt-4.1-mini | $0.40 | $1.60 | Efficient advanced tasks |
| openai/gpt-4o | $2.50 | $10.00 | Multimodal & complex queries |
| openai/gpt-4o-mini | $0.15 | $0.60 | Fast, affordable searches |

Groq Models

| Model | Input ($/M tokens) | Output ($/M tokens) | Best For |
| --- | --- | --- | --- |
| groq/openai-gpt-oss-120b | $0.15 | $0.75 | High-performance search |
| groq/openai-gpt-oss-20b | $0.10 | $0.50 | Budget-friendly quality |
| groq/kimi-k2 | $1.00 | $3.00 | High-performance search |
| groq/qwen3-32b | $0.29 | $0.59 | Budget-friendly quality |
| groq/llama-3.3-70b-versatile | $0.59 | $0.79 | Versatile applications |
| groq/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | Deep reasoning tasks |
| groq/llama-4-maverick-17b-128e-instruct | $0.20 | $0.60 | Fast, efficient searches |

Anthropic Models

| Model | Input ($/M tokens) | Output ($/M tokens) | Best For |
| --- | --- | --- | --- |
| anthropic/claude-sonnet-4 | $3.00 | $15.00 | Creative writing & high-intelligence tasks |

DeepSeek Models

| Model | Input ($/M tokens) | Output ($/M tokens) | Best For |
| --- | --- | --- | --- |
| deepseek/deepseek-chat | $0.27 | $1.10 | General purpose chat |
| deepseek/deepseek-reasoner | $0.55 | $2.19 | Complex reasoning |

Coming Soon - We're constantly adding new models. Google Gemini and more providers are on the way. Pricing updates are reflected immediately when providers change their rates.

The LLMLayer Advantage

🚀 Production-Ready Infrastructure

  • Automatic retries - Handle failures gracefully
  • Global CDN - Low latency worldwide

💰 Unbeatable Value

  • No API key juggling - One key for 15+ models
  • Pay-as-you-go - No minimums or commitments

Pricing Calculator

Want to estimate your costs? Here's a simple formula (token prices are per million tokens):
Total Cost = $0.004 (per search) + (Input Tokens ÷ 1,000,000 × Input Price) + (Output Tokens ÷ 1,000,000 × Output Price)
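
The formula translates directly into code. This is a minimal sketch using the openai/gpt-4o-mini rates from the table above; swap in the prices for whichever model you use.

```python
# Per-request cost estimate for LLMLayer.
# Prices are per million tokens (see the pricing tables above).

SEARCH_FEE = 0.004     # flat LLMLayer infrastructure fee per search
INPUT_PRICE = 0.15     # $/M input tokens (openai/gpt-4o-mini)
OUTPUT_PRICE = 0.60    # $/M output tokens (openai/gpt-4o-mini)

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Search fee plus pro-rated token costs."""
    return (
        SEARCH_FEE
        + input_tokens / 1_000_000 * INPUT_PRICE
        + output_tokens / 1_000_000 * OUTPUT_PRICE
    )

# Example: 2,000 input tokens and 500 output tokens
cost = estimate_cost(2_000, 500)
print(f"${cost:.6f}")  # $0.004600
```

For small requests the $0.004 search fee dominates; token costs only become significant with long prompts or large outputs.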

Start Building Today


Questions? Join our Discord or email support@llmlayer.ai