March 3, 2026

57 AI and ML Tools Pricing Compared (March 2026)

LLM API prices dropped 50-75% between 2024 and 2026. Current pricing for 57 AI and ML tools across 10 categories, all verified March 2026.

LLM API prices dropped 50–75% between 2024 and 2026. The tools you budgeted for last year cost materially less today — if you're still working from old numbers, you're either overspending or over-provisioning.

We compiled current pricing for 57 AI and ML tools across 10 categories, all verified March 2026. This covers LLM APIs, AI coding assistants, image generation, vector databases, ML platforms, AI agents, speech/audio, AI search/RAG, computer vision, and AI observability.


LLM APIs — Current Pricing

The token price wars continued through 2025 and into 2026. Here's where pricing stands:

Model Input (per 1M tokens) Output (per 1M tokens) Notes
Mistral Small 3.1 $0.03 $0.11 Cheapest production-grade option; open weights
GPT-4o Mini $0.15 $0.60 Best for high-volume, cost-sensitive tasks
Google Gemini 2.5 Flash $0.30 $2.50 Cheapest Google flagship; 50% batch discount
Claude Haiku 4.5 $1.00 $5.00 Anthropic's fastest, cheapest model
GPT-4o $2.50 $10.00 50% cut from 2024 ($5.00 input)
Claude Sonnet 4.6 $3.00 $15.00 Same price as Sonnet 4.5
Gemini 2.5 Pro $1.25 $10.00
Claude Opus 4.6 $5.00 $25.00
Legacy Opus 4.1 $15.00 $75.00 3x more expensive than current Opus — check your config

What changed significantly:

  • GPT-4o input price dropped from $5.00 to $2.50/1M tokens — a 50% reduction.
  • Mistral Large 3 dropped from $8/$24 to $2/$6 per million tokens — a 75% cut on input.
  • Perplexity Sonar: citation tokens no longer billed as of early 2026. Reduces per-response cost for high-volume search augmentation.
  • Google Gemini context caching now available — reduces costs up to 75% on long-context tasks.

Free tiers on LLM APIs:

Google Gemini is the only major frontier model with a genuinely usable free tier: 1.5M tokens/day via AI Studio. Mistral's free tier via La Plateforme is rate-limited but sufficient for testing.


AI Coding Assistants

Tool Free Tier Paid Tier Notes
GitHub Copilot 2,000 completions + 50 chats/mo $10/mo (Pro) GPT-4o and Claude Sonnet on free tier
Cursor 2,000 completions + 50 slow requests/mo $20/mo (Pro) Pro+ $60/mo; Ultra $200/mo
Amazon Q Developer 50 agentic req + unlimited inline/mo $19/user/mo (Pro) Formerly CodeWhisperer
Codeium (Windsurf) Unlimited completions $15/mo (Pro) Rebranded to Windsurf in 2025

GitHub Copilot's late-2024 free tier launch was significant — it provides GPT-4o and Claude Sonnet access at zero cost. The 50 chat messages/month limit is the binding constraint for most users.

Cursor's tier structure expanded in 2025: Pro ($20/mo), Pro+ ($60/mo with higher request limits), and Ultra ($200/mo for power users). Business plans at $40/user/mo.


Vector Databases

Tool Free Tier Starter Price Self-Host Option
Pinecone 2GB, 5 indexes, 2M write units/mo PAYG (no monthly minimum) No
Qdrant Cloud 1GB RAM cluster (permanent, not trial) ~$45/mo (Starter) Yes — open source, free
Weaviate 14-day sandbox only $45/mo (Flex) Yes — BSD-3 license
Chroma No cloud free tier Beta pricing Yes — MIT license
Milvus/Zilliz Serverless free tier From $0 (serverless) Yes — open source

Self-hosting math: Qdrant self-hosted on a $20/mo VPS costs $20/mo vs. $45/mo for Qdrant Cloud. The breakeven depends on your engineering time cost, not the raw compute cost. Weaviate's 14-day sandbox deadline is easy to miss — no credit card at signup, but you'll be asked before day 14 ends.


ML Platforms and Experiment Tracking

Tool Free Tier Paid Tier Notes
Weights & Biases Unlimited experiments, 100GB storage $35/user/mo (Team) No expiry; no seat limit on free
Comet ML 1 project, limited compute $179/mo (Team)
MLflow Free (open source) Self-hosted only No managed cloud free tier
Neptune.ai 200 hours of runs free $29/mo (Individual)

Weights & Biases' free tier is one of the most generous in ML tooling — unlimited experiment tracking with 100GB storage and no time limit. 90%+ of teams use it on the free tier.


Speech and Audio

Tool Free Tier Starter Price Notes
ElevenLabs 10,000 characters/mo (~10 min TTS) $5/mo (Starter) Expanded to Conversational AI in 2025
Deepgram $200 credit on signup $0.0077/min (Nova-3 PAYG) Streaming costs 79% more than batch
OpenAI Whisper Free (open source) API: $0.006/min Batch is cheapest; API adds latency
AssemblyAI 100 hours free $0.007/min (Nano) Strongest speaker diarization

Deepgram's streaming vs. batch cost difference matters at scale: streaming (real-time transcription) costs 79% more per minute than batch (async) on the Nova-3 model. If your use case allows async processing, the cost difference is significant.


AI Observability

Tool Free Tier Paid Tier Notes
Langfuse 1M trace spans/mo, unlimited users $29/mo (Hobby Cloud) Self-hosted version: fully free, no limits
Helicone 10K requests/mo $79/mo Proxy-based — zero code change to instrument
Arize AI None $50,000/yr minimum Enterprise-only
WhyLabs None $799/mo minimum Enterprise-only

Langfuse is MIT licensed — the self-hosted version is completely free with no limits on traces, users, or retention. For high-volume production LLM apps, self-hosted Langfuse avoids the $29+/month cloud cost.


What's in the Full Dataset

57 tools, 10 categories, 15 columns per row including:

  • Exact free tier limits with no vague descriptions ("generous free tier" is not useful — we use specific numbers)
  • Starter and mid-tier pricing, monthly and annual
  • Enterprise pricing notes and ranges from Vendr/G2 buyer data where available
  • Self-hosting availability and license type
  • 2025–2026 pricing changes, rebrands, and acquisitions for every tool
  • Direct pricing URLs, all verified March 2026

All pricing verified March 2026 from public pricing pages and published API documentation. Token prices in particular change frequently — verify before budgeting.

Full Dataset Available

AI & ML Tools Pricing Dataset 2026

Get the complete structured dataset as an instant-download CSV. Filter, sort, and use it in Excel, Google Sheets, or any data tool.

Get the full dataset — $49Instant download. No subscription.

Get notified when we publish new datasets

We publish 1-2 new data products per month. No spam — just a short email when something new ships.