
Frontier Intelligence Directory · Updated May 20, 2026

LLM Provider Hub 2026

The decision layer on top of the raw data. Every frontier provider, model, and agentic platform — categorized by capability, priced live, and paired with a verdict. Built for humans and agents.

We cite OpenRouter, Artificial Analysis, and LMArena as sources, and add what they don’t: task-first navigation, the agentic-platform comparison, curated verdicts, and a creator-stack lens.

5 Providers tracked · 8 Frontier models · 0 Agentic platforms · 6 Live-priced

Start here: pick your constraint

The fastest path from “which model?” to an answer. One dominant constraint → a recommendation.

| If you need… | Pick | Runner-up | Why |
|---|---|---|---|
| Lowest cost (closed) | gemini-3-5-flash | Claude Haiku 4.5 | Frontier agentic performance at less than half the cost of comparable flagships. |
| Lowest cost (open weights) | deepseek-v3-2 | qwen3-coder-next | Frontier-class reasoning at fraction-of-a-cent economics under MIT license. |
| Hardest reasoning | Claude Opus 4.6 | GPT-5.2 Pro | #1 ARC-AGI-2 (68.8%) and OSWorld (72.7%). |
| Agentic coding | gemini-3-5-flash | Claude Opus 4.6 | 76.2% Terminal-Bench 2.1 and 83.6% MCP Atlas at low cost. |
| Longest context | Grok 4.1 | Gemini 3 Pro | 2M-token native context with aggressive pricing. |
| Native voice | GPT-5.2 Pro | - | Native audio modality with no close runner-up. |
| Multimodal understanding | Gemini 3 Pro | GPT-5.2 Pro | Widest modality support; 81% MMMU-Pro. |
| Video generation | gemini-omni | - | Native frontier video gen with natural-language editing. |
| EU data sovereignty | mistral-large-3 | command-a-reasoning | Paris-based, full EU residency; Apache-licensed small-model pairing. |
| Self-host / own the weights | Llama 4 Maverick | deepseek-v3-2 | Open-weight MoE (400B/17B) that runs on a single H100. |
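
For agents that would rather treat the matrix as data, the constraint-to-pick mapping could be sketched as a plain lookup. This is illustrative only: `Recommendation`, `MATRIX`, and `recommend` are hypothetical names, not part of any published hub interface, and the entries are copied from the rows above.

```python
from typing import NamedTuple, Optional


class Recommendation(NamedTuple):
    pick: str
    runner_up: Optional[str]  # None where the matrix lists no runner-up


# Entries copied from the decision matrix; the structure itself is hypothetical.
MATRIX = {
    "lowest cost (closed)": Recommendation("gemini-3-5-flash", "Claude Haiku 4.5"),
    "lowest cost (open weights)": Recommendation("deepseek-v3-2", "qwen3-coder-next"),
    "hardest reasoning": Recommendation("Claude Opus 4.6", "GPT-5.2 Pro"),
    "agentic coding": Recommendation("gemini-3-5-flash", "Claude Opus 4.6"),
    "longest context": Recommendation("Grok 4.1", "Gemini 3 Pro"),
    "native voice": Recommendation("GPT-5.2 Pro", None),
    "multimodal understanding": Recommendation("Gemini 3 Pro", "GPT-5.2 Pro"),
    "video generation": Recommendation("gemini-omni", None),
    "eu data sovereignty": Recommendation("mistral-large-3", "command-a-reasoning"),
    "self-host / own the weights": Recommendation("Llama 4 Maverick", "deepseek-v3-2"),
}


def recommend(constraint: str) -> Recommendation:
    """Return the pick for one dominant constraint (case-insensitive)."""
    key = constraint.strip().lower()
    if key not in MATRIX:
        raise KeyError(f"unknown constraint: {constraint!r}")
    return MATRIX[key]
```

Calling `recommend("Agentic coding")` returns the pick and runner-up for that row; an unknown constraint raises `KeyError` rather than guessing.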

Browse by capability

Pick the job, jump to the providers that lead.

Reasoning & Analysis

Complex problem-solving, math, abstract reasoning, long-horizon planning

No tracked providers yet

Multimodal Understanding

Vision, document, chart, and cross-modal reasoning across text/image/audio

No tracked providers yet

Video Generation

Generative video models, text-to-video, image-to-video, editing

No tracked providers yet

Coding & Engineering

Agentic coding, terminal use, debugging, multi-file refactors

No tracked providers yet

Agentic Infrastructure

Tool use, function calling, agent SDKs, computer use, long-horizon execution

No tracked providers yet

Voice & Audio

Native speech in/out, real-time conversation, audio understanding

No tracked providers yet

Image Generation

Text-to-image, editing, in-painting, brand-consistent generation

No tracked providers yet

Model explorer

Sort and filter every tracked model. Live pricing via OpenRouter where available. Click a model for the full breakdown.

8 models · live pricing via OpenRouter

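| Model | Provider | Released | Context | Input ($/1M tokens) | Output ($/1M tokens) |
|---|---|---|---|---|---|
| Claude Opus 4.6 | Anthropic | 2026-02-05 | 1M | $5.00 | $25.00 |
| GPT-5.2 Pro | OpenAI | 2026-01-01 | 400K | $21.00 | $168.00 |
| Gemini 3 Pro | Google DeepMind | 2025-12-01 | 2M | - | - |
| Llama 4 Maverick | Meta AI | 2025-12-01 | 1M | $0.15 | $0.60 |
| Claude Opus 4.5 | Anthropic | 2025-11-01 | 200K | $5.00 | $25.00 |
| Grok 4.1 | xAI | 2025-11-01 | 2M | - | - |
| Claude Haiku 4.5 | Anthropic | 2025-10-01 | 200K | $1.00 | $5.00 |
| Claude Sonnet 4.5 | Anthropic | 2025-09-29 | 1M | $3.00 | $15.00 |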
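
To turn the listed prices into a per-request figure, the arithmetic can be sketched as below. It assumes the two dollar amounts in each row are input and output prices per 1M tokens (the FAQ quotes prices "per 1M tokens"); `request_cost_usd` is a hypothetical helper name, and the token counts are made up for illustration.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Cost in USD given token counts and prices quoted per 1M tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Illustrative workload: 200K tokens in, 10K tokens out.
# Claude Opus 4.6 is listed at $5.00 / $25.00.
opus = request_cost_usd(200_000, 10_000, 5.00, 25.00)
# Claude Haiku 4.5 is listed at $1.00 / $5.00.
haiku = request_cost_usd(200_000, 10_000, 1.00, 5.00)

print(f"Opus 4.6: ${opus:.2f}, Haiku 4.5: ${haiku:.2f}")
# prints "Opus 4.6: $1.25, Haiku 4.5: $0.25"
```

Output-heavy workloads shift the comparison, since output tokens are priced at several times the input rate for every priced model listed.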

Creator stacks

Creators don’t pick a model — they assemble a stack. Here’s what to use across each modality, and the workflow.

Provider directory

Flagship model, capability focus, agentic platforms, and notable tech for every tracked provider.

Agentic platforms

Where the models actually do work — IDEs, CLIs, desktops, agent platforms, managed runtimes. The layer the data sites skip.

Frequently asked

What is the best LLM in 2026?

There is no single winner. Claude Opus 4.6 leads reasoning (68.8% ARC-AGI-2) and agentic coding. GPT-5.2 Pro dominates broad multimodal and voice work. Gemini 3.5 Flash (Google I/O ’26) sets a new cost/intelligence frontier at less than half the cost of comparable flagships. Gemini 3.5 Pro is slated to ship in June 2026 for the highest-tier reasoning. Pick by task, using the decision matrix above.

How is this different from OpenRouter or Artificial Analysis?

Those are the raw-data sources — OpenRouter for live pricing and routing, Artificial Analysis for independent benchmarks, LMArena for human preference. We cite all three. The FrankX LLM Hub adds the decision layer they don’t: task-first navigation, the agentic-platform comparison (Claude Code vs Antigravity vs Cursor vs Codex), curated verdicts, and a creator-stack lens — for humans and agents.

Which is the cheapest frontier reasoning model?

DeepSeek V3.2 leads on pure cost ($0.27 / $1.10 per 1M tokens, MIT license). Gemini 3.5 Flash is the cheapest closed-frontier option at $0.30 / $2.50. Both deliver frontier-class reasoning for production agentic workloads.

What is the best agentic LLM in 2026?

By category: coding agents — Gemini 3.5 Flash (76.2% Terminal-Bench 2.1) and Claude Opus 4.6; long-horizon enterprise — Gemini Spark and Claude Agent Teams; computer-use — GPT-5.2 Operator and Claude Opus 4.6 (72.7% OSWorld).

Is the pricing live?

Where a model maps to OpenRouter, pricing is fetched live (hourly) and marked with a ⚡ icon and "via OpenRouter." Otherwise it comes from our curated registry. Always verify against the provider before relying on it for billing.
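
The live-then-curated fallback described here might look roughly like the sketch below. It is not the hub's actual pipeline: `resolve_price`, the `live` and `registry` dictionaries, and the `model-a` / `model-b` ids are all hypothetical. It also assumes live quotes arrive as per-token USD strings under `prompt` and `completion` keys, which is how OpenRouter's model listing is understood to report pricing; check the provider's documentation before relying on that shape.

```python
from decimal import Decimal
from typing import Optional


def per_million(per_token: str) -> float:
    """Convert a per-token USD price string to USD per 1M tokens."""
    return float(Decimal(per_token) * 1_000_000)


def resolve_price(model_id: str, live: dict, registry: dict) -> Optional[dict]:
    """Prefer a live quote; fall back to the curated registry; else None."""
    quote = live.get(model_id)
    if quote is not None:
        return {
            "input": per_million(quote["prompt"]),
            "output": per_million(quote["completion"]),
            "source": "via OpenRouter",
        }
    curated = registry.get(model_id)
    if curated is not None:
        # Registry entries are assumed to be (input, output) in USD per 1M tokens.
        return {"input": curated[0], "output": curated[1], "source": "curated registry"}
    return None


# Hypothetical sample data, for illustration only.
live = {"model-a": {"prompt": "0.000005", "completion": "0.000025"}}
registry = {"model-b": (0.15, 0.60)}

print(resolve_price("model-a", live, registry))
print(resolve_price("model-b", live, registry))
print(resolve_price("model-c", live, registry))
```

Carrying the `source` label alongside the numbers lets a consumer show which prices are live and which are curated, matching the marking the answer above describes.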

Can AI agents consume this hub?

Yes. The full curated dataset — models, pricing, verdicts, decision matrix, comparisons — is available as clean JSON at /llm-hub.json, plus JSON-LD structured data on every page and deep links in /llms.txt.
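
As a rough idea of what consuming that JSON could look like, the sketch below filters models by input price. The schema of /llm-hub.json is not documented in this page, so the `models`, `name`, `input_price`, and `output_price` fields are assumptions, and an inline sample stands in for a real fetch; an agent would replace `SAMPLE` with the response body from /llm-hub.json on the hub's host.

```python
import json

# Hypothetical payload shape; the real /llm-hub.json schema may differ.
SAMPLE = """
{"models": [
  {"name": "Claude Opus 4.6", "input_price": 5.00, "output_price": 25.00},
  {"name": "Claude Haiku 4.5", "input_price": 1.00, "output_price": 5.00},
  {"name": "Gemini 3 Pro", "input_price": null, "output_price": null}
]}
"""


def models_under(hub: dict, max_input_price: float) -> list:
    """Names of models whose input price is known and at or below the cap."""
    return [
        m["name"]
        for m in hub.get("models", [])
        if m.get("input_price") is not None and m["input_price"] <= max_input_price
    ]


hub = json.loads(SAMPLE)
print(models_under(hub, 3.00))
# prints "['Claude Haiku 4.5']"
```

Models with no listed price are skipped rather than treated as free, which avoids ranking an unpriced model as the cheapest.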

FrankX Intelligence Pipeline · Last refreshed May 20, 2026

Source of truth: data/model-registry.json · Agent surface: /llm-hub.json · Add a model via /new-model