March 3, 2026

57 AI and ML Tools Pricing Compared (March 2026)

Q: What is the vector databases?

Self-hosting math: Qdrant self-hosted on a $20/mo VPS costs $20/mo vs. $45/mo for Qdrant Cloud. The breakeven depends on your engineering time cost, not the raw compute cost. Weaviate's 14-day sandbox deadline is easy to miss — no credit card at signup, but you'll be asked before day 14 ends.

LLM API prices dropped 50-75% between 2024 and 2026. Current pricing for 57 AI and ML tools across 10 categories, all verified March 2026.

LLM API prices dropped 50–75% between 2024 and 2026. The tools you budgeted for last year cost materially less today — if you're still working from old numbers, you're either overspending or over-provisioning.

We compiled current pricing for 57 AI and ML tools across 10 categories, all verified March 2026. This covers LLM APIs, AI coding assistants, image generation, vector databases, ML platforms, AI agents, speech/audio, AI search/RAG, computer vision, and AI observability.

LLM APIs — Current Pricing

The token price wars continued through 2025 and into 2026. Here's where pricing stands:

Model	Input (per 1M tokens)	Output (per 1M tokens)	Notes
Mistral Small 3.1	$0.03	$0.11	Cheapest production-grade option; open weights
GPT-4o Mini	$0.15	$0.60	Best for high-volume, cost-sensitive tasks
Google Gemini 2.5 Flash	$0.30	$2.50	Cheapest Google flagship; 50% batch discount
Claude Haiku 4.5	$1.00	$5.00	Anthropic's fastest, cheapest model
GPT-4o	$2.50	$10.00	50% cut from 2024 ($5.00 input)
Claude Sonnet 4.6	$3.00	$15.00	Same price as Sonnet 4.5
Gemini 2.5 Pro	$1.25	$10.00	—
Claude Opus 4.6	$5.00	$25.00	—
Legacy Opus 4.1	$15.00	$75.00	3x more expensive than current Opus — check your config

What changed significantly:

GPT-4o input price dropped from $5.00 to $2.50/1M tokens — a 50% reduction.
Mistral Large 3 dropped from $8/$24 to $2/$6 per million tokens — a 75% cut on input.
Perplexity Sonar: citation tokens no longer billed as of early 2026. Reduces per-response cost for high-volume search augmentation.
Google Gemini context caching now available — reduces costs up to 75% on long-context tasks.

Free tiers on LLM APIs:

Google Gemini is the only major frontier model with a genuinely usable free tier: 1.5M tokens/day via AI Studio. Mistral's free tier via La Plateforme is rate-limited but sufficient for testing.

AI Coding Assistants

Tool	Free Tier	Paid Tier	Notes
GitHub Copilot	2,000 completions + 50 chats/mo	$10/mo (Pro)	GPT-4o and Claude Sonnet on free tier
Cursor	2,000 completions + 50 slow requests/mo	$20/mo (Pro)	Pro+ $60/mo; Ultra $200/mo
Amazon Q Developer	50 agentic req + unlimited inline/mo	$19/user/mo (Pro)	Formerly CodeWhisperer
Codeium (Windsurf)	Unlimited completions	$15/mo (Pro)	Rebranded to Windsurf in 2025

GitHub Copilot's late-2024 free tier launch was significant — it provides GPT-4o and Claude Sonnet access at zero cost. The 50 chat messages/month limit is the binding constraint for most users.

Cursor's tier structure expanded in 2025: Pro ($20/mo), Pro+ ($60/mo with higher request limits), and Ultra ($200/mo for power users). Business plans at $40/user/mo.

Vector Databases

Tool	Free Tier	Starter Price	Self-Host Option
Pinecone	2GB, 5 indexes, 2M write units/mo	PAYG (no monthly minimum)	No
Qdrant Cloud	1GB RAM cluster (permanent, not trial)	~$45/mo (Starter)	Yes — open source, free
Weaviate	14-day sandbox only	$45/mo (Flex)	Yes — BSD-3 license
Chroma	No cloud free tier	Beta pricing	Yes — MIT license
Milvus/Zilliz	Serverless free tier	From $0 (serverless)	Yes — open source

Self-hosting math: Qdrant self-hosted on a $20/mo VPS costs $20/mo vs. $45/mo for Qdrant Cloud. The breakeven depends on your engineering time cost, not the raw compute cost. Weaviate's 14-day sandbox deadline is easy to miss — no credit card at signup, but you'll be asked before day 14 ends.

ML Platforms and Experiment Tracking

Tool	Free Tier	Paid Tier	Notes
Weights & Biases	Unlimited experiments, 100GB storage	$35/user/mo (Team)	No expiry; no seat limit on free
Comet ML	1 project, limited compute	$179/mo (Team)	—
MLflow	Free (open source)	Self-hosted only	No managed cloud free tier
Neptune.ai	200 hours of runs free	$29/mo (Individual)	—

Weights & Biases' free tier is one of the most generous in ML tooling — unlimited experiment tracking with 100GB storage and no time limit. 90%+ of teams use it on the free tier.

Speech and Audio

Tool	Free Tier	Starter Price	Notes
ElevenLabs	10,000 characters/mo (~10 min TTS)	$5/mo (Starter)	Expanded to Conversational AI in 2025
Deepgram	$200 credit on signup	$0.0077/min (Nova-3 PAYG)	Streaming costs 79% more than batch
OpenAI Whisper	Free (open source)	API: $0.006/min	Batch is cheapest; API adds latency
AssemblyAI	100 hours free	$0.007/min (Nano)	Strongest speaker diarization

Deepgram's streaming vs. batch cost difference matters at scale: streaming (real-time transcription) costs 79% more per minute than batch (async) on the Nova-3 model. If your use case allows async processing, the cost difference is significant.

AI Observability

Tool	Free Tier	Paid Tier	Notes
Langfuse	1M trace spans/mo, unlimited users	$29/mo (Hobby Cloud)	Self-hosted version: fully free, no limits
Helicone	10K requests/mo	$79/mo	Proxy-based — zero code change to instrument
Arize AI	None	$50,000/yr minimum	Enterprise-only
WhyLabs	None	$799/mo minimum	Enterprise-only

Langfuse is MIT licensed — the self-hosted version is completely free with no limits on traces, users, or retention. For high-volume production LLM apps, self-hosted Langfuse avoids the $29+/month cloud cost.

What's in the Full Dataset

57 tools, 10 categories, 15 columns per row including:

Exact free tier limits with no vague descriptions ("generous free tier" is not useful — we use specific numbers)
Starter and mid-tier pricing, monthly and annual
Enterprise pricing notes and ranges from Vendr/G2 buyer data where available
Self-hosting availability and license type
2025–2026 pricing changes, rebrands, and acquisitions for every tool
Direct pricing URLs, all verified March 2026

All pricing verified March 2026 from public pricing pages and published API documentation. Token prices in particular change frequently — verify before budgeting.

Full Dataset Available

AI & ML Tools Pricing Dataset 2026

Get the complete structured dataset as an instant-download CSV. Filter, sort, and use it in Excel, Google Sheets, or any data tool.

Get the full dataset — $49Instant download. No subscription.

Get notified when we publish new datasets

We publish 1-2 new data products per month. No spam — just a short email when something new ships.