Category · 142 models
Models that reason step by step — for analysis, math, and tough decisions.
These models slow down and 'think out loud' before answering. They're built for problems where the right answer matters more than a fast one — financial analysis, legal research, scientific reasoning.
OpenAI
OpenAI's flagship multimodal reasoning model with long-context tool use.
OpenAI
Frontier reasoning model tuned for math, science and coding workflows.
Anthropic
Anthropic's best coding and agentic model, strong at long autonomous tasks.
Anthropic
Top-tier reasoning model for research, analysis and complex writing.
Google DeepMind
Long-context multimodal model with native tool use and 1M+ token window.
xAI
xAI's flagship reasoning model with real-time X knowledge and tool use.
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Alibaba
Open multilingual model family with hybrid thinking modes.
Microsoft
Small language model punching above its weight on reasoning benchmarks.
Google DeepMind / Isomorphic
Predicts the structure and interactions of life's molecules.
Medical LLM achieving expert-level performance on USMLE-style questions.
Arc Institute
Genome-scale foundation model spanning DNA, RNA and proteins.
Bloomberg
Finance-domain LLM trained on decades of market and news data.
OpenAI
Improved GPT-4 series model with stronger coding and instruction following.
OpenAI
Compact reasoning model balancing cost and quality.
OpenAI
OpenAI's first open-weights reasoning models since GPT-2.
Anthropic
Hybrid reasoning model with extended thinking mode.
xAI
Multi-agent variant of Grok 4 for the hardest problems.
Zhipu AI
Open agentic foundation model from Zhipu's GLM family.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
MiniMax
Open reasoning model with 1M-token context.
01.AI
Fast, low-cost frontier-tier LLM from 01.AI.
Baidu
Baidu's flagship multimodal foundation model.
EvolutionaryScale
Frontier protein language model for biology design.
MIT / Recursion
Open AlphaFold3-class biomolecular structure prediction model.
Google DeepMind
Best-in-class medium-range global weather forecasting AI.
Google DeepMind
Probabilistic AI weather forecasting beating ENS.
Reka
Multimodal frontier model with open weights.
NVIDIA
NVIDIA's open LLM family for synthetic data and reasoning.
Dell Technologies
End-to-end AI infrastructure stack combining PowerEdge servers, storage, networking and NVIDIA AI Enterprise.
AMD
Datacenter accelerator (CDNA 4) for training and inference of frontier models.
NVIDIA
Managed AI training service on dedicated NVIDIA Hopper/Blackwell clusters.
NVIDIA
Flagship datacenter GPU for trillion-parameter AI training and inference.
Intel
Datacenter AI accelerator targeting price/performance vs H100.
Custom AI accelerators powering Gemini training and Google Cloud AI.
Hewlett Packard Enterprise
Turnkey on-prem AI cloud built with NVIDIA, co-engineered for enterprises.
Pure Storage
AI-ready storage stack co-engineered with NVIDIA for training pipelines.
NetApp
Converged AI infrastructure with ONTAP storage and NVIDIA compute.
Supermicro
Liquid-cooled GPU SuperClusters for trillion-parameter LLM training.
Cerebras
Wafer-scale AI processor delivering record-breaking inference throughput.
Groq
Language Processing Unit delivering ultra-low-latency LLM inference.
SambaNova Systems
Full-stack AI platform with Reconfigurable Dataflow Units (RDUs).
Tenstorrent
Open RISC-V based AI accelerator from Jim Keller's team.
AWS
Custom AWS chip purpose-built for training large language models.
AWS
Cost-optimized AWS chip for high-throughput LLM inference.
Palantir
AI Platform deploying LLMs against classified and government datasets with audit and policy controls.
Scale AI
LLM-powered decision-making platform for defense and intelligence analysts.
Vannevar Labs
AI for open-source intelligence and non-traditional collection.
ServiceNow
Domain-specific large language models tuned for the Now Platform.
Salesforce / Tableau
Generative analytics and natural-language insights inside Tableau.
Google Cloud
In-warehouse ML and GenAI directly on BigQuery data via SQL.
Snowflake
Natural-language SQL and analytics assistant inside Snowflake.
Databricks
Conversational analytics over governed lakehouse data.
Qlik
Generative analytics service delivering trusted answers from unstructured data.
SAS
Analytics and AI platform with embedded LLM orchestration and copilots.
Teradata
In-database analytics and GenAI orchestration on Teradata VantageCloud.
Anthropic
Anthropic's most intelligent model — state-of-the-art on coding, agents and computer use.
OpenAI
Frontier agentic coding model for long-horizon software engineering inside Codex.
OpenAI
Updated GPT-5 with warmer tone, adaptive reasoning and stronger instruction following.
Google DeepMind
Leads LMArena Text, WebDev and Vision — Google's flagship multimodal reasoning model.
Google DeepMind
Extended-thinking variant of Gemini 3 for hardest math, science and research problems.
Ai2
Fully open model flow with training data, checkpoints and recipes for reproducible AI.
xAI
Refresh of Grok 4 with stronger reasoning, lower hallucination and faster tool use.
Mistral AI
Cost-efficient enterprise model with frontier-class performance for business workloads.
Alibaba
Alibaba's trillion-parameter flagship multilingual reasoning model.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
Eigen AI
Agentic RL vision-language model for tool-integrated visual reasoning.
Zendesk
AutoQA AI that scores 100% of support conversations across voice and chat.
Twilio
Speech-to-text, summaries and language operators that analyze every call in real time.
Stripe
ML-based fraud detection trained on the global Stripe payments network.
Robinhood
AI investing companion delivering market insights to Robinhood Gold customers.
Synopsys
AI suite (DSO.ai, VSO.ai, TSO.ai) optimizing chip design across the EDA flow.
Cadence
Generative AI for digital chip implementation and verification across the Cadence flow.
Ansys
Cloud generative-AI app delivering near-instant simulation predictions for engineers.
Tencent
Tencent's deep-reasoning model, mamba-based and tuned for complex multi-step problems.
Google DeepMind
Google DeepMind's mathematics model tracked by Epoch, focused on geometry.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Reka AI
We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka.
Zhipu AI
We introduce ChatGLM, an evolving family of large language models that we have been developing over time.
SenseTime
SenseTime's multimodal, language, vision model tracked by Epoch, focused on vision-language generation.
Mistral AI
We're contributing Mathstral to the science community to bolster efforts in advanced mathematical problems requiring complex, multi-step logical reasoning.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
Moonshot AI
Artificial general intelligence start-up Kimi, owned by Chinese AI start-up Moonshot AI, on Saturday launched its first reasoning AI model k0-math.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Prime Intellect
INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning.
Moonshot AI
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
OpenAI
We’re releasing OpenAI o3-mini, the newest, most cost-efficient model in our reasoning series, available in both ChatGPT and the API today.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
OpenAI
We advance AI capabilities by scaling two complementary paradigms: unsupervised learning and reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
Tencent
As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
Anthropic
Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Kunlun Inc.
We introduce MindLink, a new family of large language models developed by Kunlun Inc.
Google DeepMind
To advance Gemini’s capabilities towards solving hard reasoning problems, we developed a novel reasoning approach, called Deep Think, that naturally blends in parallel thinking techniques during response generation.
Anthropic
Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
Alibaba
We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
xAI
Today, we’re excited to launch two powerful new additions to the xAI API: Grok 4.1 Fast, our best tool-calling model with a 2M context window.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on mathematical reasoning.
SK Telecom
SK Telecom's language model tracked by Epoch, focused on code generation.
LG AI Research
K-EXAONE is a large-scale multilingual language model developed by LG AI Research.
Liquid AI
Liquid AI's second-generation efficient foundation models built on liquid neural networks for on-device use.
Sakana AI
LLM-discovered preference optimization algorithm from Sakana's evolutionary research line.
Magic.dev
100M-token context model purpose-built for whole-repository software synthesis.
Poolside
Code-first foundation model trained with reinforcement learning from code-execution feedback.
Perplexity
Perplexity's in-house search-grounded LLM powering the Perplexity answer engine.
Imbue
Imbue's research model trained from scratch for robust agentic reasoning and code.
Nous Research
Open-source aligned LLM family known for steerable, uncensored research use.
Alibaba
Alibaba Cloud's closed-source trillion-parameter flagship LLM for coding, reasoning, and enterprise agentic workflows.
Cohere
Cohere's open-source W4A4-quantized vision-language reasoning model for agentic, multilingual, tool-use enterprise tasks.
Google DeepMind
Google DeepMind's closed-source natively multimodal reasoning model for fast, high-capability agentic and coding tasks.