Category · 132 models
AI that writes, explains, and fixes software code.
Even if you don't write code yourself, your engineering team almost certainly uses these. They speed up development, catch bugs, and help non-developers automate small tasks.
OpenAI
Frontier reasoning model tuned for math, science and coding workflows.
Anthropic
Anthropic's best coding and agentic model, strong at long autonomous tasks.
Google DeepMind
Long-context multimodal model with native tool use and 1M+ token window.
Meta
Meta's open-weights multimodal MoE family (Scout & Maverick).
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Mistral AI
European frontier LLM strong at code, math and multilingual tasks.
Alibaba
Open multilingual model family with hybrid thinking modes.
Anthropic
Agentic coding tool that lives in your terminal and edits real codebases.
GitHub / OpenAI
In-IDE pair programmer powering code completion and chat.
Anysphere
AI-first code editor with multi-file edits and background agents.
Cognition
Autonomous software engineer that plans, codes and ships PRs.
Mistral AI
Code-specialised open model covering 80+ programming languages.
OpenAI
Improved GPT-4 series model with stronger coding and instruction following.
OpenAI
Smaller, faster GPT-4.1 for production workloads.
OpenAI
Compact reasoning model balancing cost and quality.
Anthropic
Anthropic's fastest and cheapest frontier-class small model.
Anthropic
Hybrid reasoning model with extended thinking mode.
Meta
Open-weights instruct model competitive with much larger LLMs.
xAI
Speed-optimised code model for agentic IDE workflows.
Mistral AI
Frontier-class performance at a fraction of the cost.
Mistral AI
Sparse mixture-of-experts open model.
Mistral AI
Open agentic coding model built with All Hands AI.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
Lovable
AI fullstack builder that ships production web apps from prompts.
StackBlitz
Browser-based AI agent that builds and runs full-stack apps.
Replit
Agent that creates, edits and deploys apps inside Replit.
AWS
Coding & cloud assistant deeply integrated with AWS.
Sourcegraph
Code AI with deep codebase context across repos.
Snowflake
Open enterprise LLM optimised for SQL and coding.
AMD
Open software stack for GPU compute and AI on Instinct & Radeon hardware.
Intel
Open toolkit to optimize and deploy AI inference across Intel CPUs, GPUs and NPUs.
Qualcomm
Library of optimized AI models ready to deploy on Snapdragon devices.
Samsung
Samsung's in-house generative model family for Galaxy products.
Siemens
Generative AI copilot for engineers across PLC, design and operations.
IBM
Open enterprise LLM family for code, language and time series.
Databricks
Open MoE LLM by Databricks for enterprise customization.
Oracle
GenAI coding companion optimized for Java, SQL and OCI.
Microsoft / GitHub
AI pair-programmer for code completion, chat, reviews and agent mode.
Microsoft
GenAI copilots for data engineering, science and Power BI inside Fabric.
AWS
AI coding and operations assistant across the developer lifecycle on AWS.
AI coding assistant with enterprise context across IDEs and Google Cloud.
Snowflake
Natural-language SQL and analytics assistant inside Snowflake.
GitLab
AI assistant across the GitLab DevSecOps platform with code, chat and security.
Anthropic
Anthropic's most intelligent model — state-of-the-art on coding, agents and computer use.
OpenAI
Frontier agentic coding model for long-horizon software engineering inside Codex.
Google DeepMind
Leads LMArena Text, WebDev and Vision — Google's flagship multimodal reasoning model.
Anthropic
Fast, cheap Claude tier matching prior Sonnet-class quality for high-volume agents.
Mistral AI
Cost-efficient enterprise model with frontier-class performance for business workloads.
Alibaba
Alibaba's trillion-parameter flagship multilingual reasoning model.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
GitHub / Microsoft
Agentic dev environment that plans, edits and tests entire features from a GitHub issue.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Reka AI
We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka.
Meta
Meta's language model tracked by Epoch, focused on chat.
Zhipu AI
We introduce ChatGLM, an evolving family of large language models that we have been developing over time.
DeepSeek
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
Anthropic
This addendum to our Claude 3 Model Card describes Claude 3.5 Sonnet, a new model which outperforms our previous most capable model, Claude 3 Opus, while operating faster and at a lower cost.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
xAI
Grok-2 is our frontier language model with state-of-the-art reasoning capabilities.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
Writer
Palmyra X4 boasts state-of-the-art reasoning through novel training techniques.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
Amazon
A highly capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Google DeepMind
Today, we’re releasing an experimental version of Gemini 2.0 Pro that responds to that feedback.
Moonshot AI
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
OpenAI
We’re releasing OpenAI o3-mini, the newest, most cost-efficient model in our reasoning series, available in both ChatGPT and the API today.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
xAI
We are pleased to introduce Grok 3, our most advanced model yet: blending strong reasoning with extensive pretraining knowledge.
Inception Labs
Today, we’re excited to announce that Mercury, our first general chat model, is available to support a wider range of text generation applications.
OpenAI
We advance AI capabilities by scaling two complementary paradigms: unsupervised learning and reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
Tencent
As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Huawei
We present Pangu Ultra, a Large Language Model (LLM) with 135 billion parameters and dense Transformer modules trained on Ascend Neural Processing Units (NPUs).
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
Anthropic
Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we're announcing Qwen3-Coder, our most agentic code model to date.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Kunlun Inc.
We introduce MindLink, a new family of large language models developed by Kunlun Inc.
Google DeepMind
To advance Gemini’s capabilities towards solving hard reasoning problems, we developed a novel reasoning approach, called Deep Think, that naturally blends in parallel thinking techniques during response generation.
Anthropic
Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
MiniMax
Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
xAI
Today, we’re excited to launch two powerful new additions to the xAI API: Grok 4.1 Fast, our best tool-calling model with a 2M context window.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
MiniMax
MiniMax's language model tracked by Epoch, focused on chat.
SK Telecom
SK Telecom's language model tracked by Epoch, focused on code generation.
LG AI Research
K-EXAONE is a large-scale multilingual language model developed by LG AI Research.
Alibaba
Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development.
Cognition
We are sharing an early preview of our ongoing SWE-1.6 training run.
NVIDIA
NVIDIA's language model tracked by Epoch, focused on language modeling/generation.
Anysphere
Composer 2 is a specialized model designed for agentic software engineering.
Anysphere
Anysphere's language model tracked by Epoch, focused on coding.
Magic.dev
100M-token context model purpose-built for whole-repository software synthesis.
Poolside
Code-first foundation model trained with reinforcement learning from code-execution feedback.
Imbue
Imbue's research model trained from scratch for robust agentic reasoning and code.
Inception Labs
Diffusion-based LLM that generates code in parallel for order-of-magnitude latency gains.