Weekly ranking · weighted by market momentum
Every model in AIDB is ranked by a blend of its AIDB score, frontier-firm pedigree, recency, and current launch momentum — so the spotlight reflects what's actually moving the market right now.
Last week, we released a major update to Gemini 3 Deep Think to solve modern challenges across science, research and engineering.
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
Anthropic's language model tracked by Epoch, focused on question answering.
Cohere's open-source W4A4-quantized vision-language reasoning model for agentic, multilingual, tool-use enterprise tasks.
Google DeepMind's closed-source natively multimodal reasoning model for fast, high-capability agentic and coding tasks.
Anthropic's best coding and agentic model, strong at long autonomous tasks.
Frontier agentic coding model for long-horizon software engineering inside Codex.
We present Qwen3-Max-Thinking, our latest flagship reasoning model.