Category · 341 models
AI you talk to like a person — writes emails, summarizes, answers questions.
Think of these as your always-on writing partner. You type a request in plain English, and the AI drafts an email, rewrites a paragraph, summarizes a long document, or chats back and forth to help you think.
OpenAI
OpenAI's flagship multimodal reasoning model with long-context tool use.
OpenAI
Real-time omni-model handling text, vision and voice in a single network.
OpenAI
Frontier reasoning model tuned for math, science and coding workflows.
Anthropic
Anthropic's best coding and agentic model, strong at long autonomous tasks.
Anthropic
Top-tier reasoning model for research, analysis and complex writing.
Google DeepMind
Long-context multimodal model with native tool use and 1M+ token window.
Google DeepMind
Fast, cheap multimodal model optimised for high-volume production use.
xAI
xAI's flagship reasoning model with real-time X knowledge and tool use.
Meta
Meta's open-weights multimodal MoE family (Scout & Maverick).
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Mistral AI
European frontier LLM strong at code, math and multilingual tasks.
Alibaba
Open multilingual model family with hybrid thinking modes.
Cohere
Enterprise-grade RAG and tool-use model for business workloads.
Microsoft
Small language model punching above its weight on reasoning benchmarks.
Anthropic
Agentic coding tool that lives in your terminal and edits real codebases.
GitHub / OpenAI
In-IDE pair programmer powering code completion and chat.
Anysphere
AI-first code editor with multi-file edits and background agents.
Cognition
Autonomous software engineer that plans, codes and ships PRs.
Mistral AI
Code-specialised open model covering 80+ programming languages.
Medical LLM achieving expert-level performance on USMLE-style questions.
Bloomberg
Finance-domain LLM trained on decades of market and news data.
Harvey
Generative AI platform purpose-built for elite law firms.
Khan Academy
AI tutor that guides students with Socratic questioning.
Perplexity
Answer engine combining LLMs with cited live web search.
Source-grounded research assistant with audio overviews.
OpenAI
High-dimensional embeddings for search, RAG and clustering.
Voyage AI
Top-ranked retrieval embeddings, optimised for RAG quality.
Jasper
Marketing copilot for brand-aware content at enterprise scale.
OpenAI
Improved GPT-4 series model with stronger coding and instruction following.
OpenAI
Smaller, faster GPT-4.1 for production workloads.
OpenAI
Cheapest, fastest GPT-4.1 tier for high-volume tasks.
OpenAI
Compact reasoning model balancing cost and quality.
OpenAI
Cost-efficient multimodal small model.
OpenAI
OpenAI's first open-weights reasoning models since GPT-2.
Anthropic
Anthropic's fastest and cheapest frontier-class small model.
Anthropic
Hybrid reasoning model with extended thinking mode.
Anthropic
Fast, low-cost model for everyday tasks.
Google DeepMind
Smallest, cheapest Gemini for high-volume tasks.
Google DeepMind
Multimodal model with native tool use and live API.
Open-weights model family in 1B–27B sizes for on-device & server.
Open vision-language models for fine-tuning.
Meta
Open-weights instruct model competitive with much larger LLMs.
Meta
Open multimodal model in 11B and 90B sizes.
Meta
Multilingual speech-to-speech and speech-to-text translation.
xAI
Multi-agent variant of Grok 4 for the hardest problems.
xAI
Speed-optimised code model for agentic IDE workflows.
Mistral AI
Frontier-class performance at a fraction of the cost.
Mistral AI
Fast open-weights small model with strong reasoning.
Mistral AI
124B multimodal model with state-of-the-art image understanding.
Mistral AI
Sparse mixture-of-experts open model.
Mistral AI
Open agentic coding model built with All Hands AI.
Alibaba
Open vision-language model with strong document understanding.
Zhipu AI
Open agentic foundation model from Zhipu's GLM family.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
MiniMax
Open reasoning model with 1M-token context.
01.AI
Fast, low-cost frontier-tier LLM from 01.AI.
Tencent
Open MoE model with 389B params from Tencent.
Baidu
Baidu's flagship multimodal foundation model.
ByteDance
ByteDance's flagship LLM, widely deployed in China.
Lovable
AI fullstack builder that ships production web apps from prompts.
StackBlitz
Browser-based AI agent that builds and runs full-stack apps.
Replit
Agent that creates, edits and deploys apps inside Replit.
AWS
Coding & cloud assistant deeply integrated with AWS.
Sourcegraph
Code AI with deep codebase context across repos.
Conversational AI search experience in Google.
Brave
Privacy-respecting AI assistant built into the Brave browser.
Cohere
Multimodal multilingual embeddings for enterprise RAG.
Thomson Reuters
Generative AI legal assistant for lawyers.
Hebbia
Agentic research platform for finance and legal teams.
Glean
Enterprise AI assistant grounded in company knowledge.
Microsoft
AI assistant across Word, Excel, Outlook, PowerPoint and Teams.
AI in Gmail, Docs, Sheets, Slides and Meet.
Notion
AI for writing, search and meeting notes inside Notion.
Copy.ai
GTM AI platform for marketing and sales workflows.
Writer
Enterprise LLM family powering Writer's generative platform.
Inflection
Personal AI focused on emotionally intelligent conversation.
Character.AI
Platform for creating and chatting with AI characters.
Duolingo
AI-powered language tutoring features.
Allen Institute (AI2)
Fully open language model with training data and code released.
TII
Open LLM family from the Technology Innovation Institute.
Hugging Face
Compact open model strong in its size class.
Reka
Multimodal frontier model with open weights.
NVIDIA
NVIDIA's open LLM family for synthetic data and reasoning.
Snowflake
Open enterprise LLM optimised for SQL and coding.
Databricks
Open MoE LLM tuned for enterprise tasks.
Lenovo
On-device AI assistant running locally on Lenovo AI PCs for private productivity.
Smallest Gemini model running fully on-device on Pixel and Android.
Samsung
Samsung's in-house generative model family for Galaxy products.
HP
Local AI assistant bundled with HP AI PCs for private document Q&A.
IBM
Enterprise AI & data platform with model studio, governance and runtime.
AWS
Managed service to build agents using foundation models from many vendors.
AWS
Amazon's foundation model family (text, image, video) on Bedrock.
ServiceNow
Generative AI built into the ServiceNow workflow platform.
SAP
Generative AI copilot embedded across SAP business applications.
Microsoft
Azure OpenAI Service in Azure Government for IL5 / FedRAMP High workloads.
AWS
Amazon Bedrock foundation models in AWS GovCloud for US public-sector workloads.
Booking.com
Conversational trip planner suggesting destinations, hotels and itineraries.
TripAdvisor
AI-generated personalized itineraries from reviews and traveler data.
IBM
Enterprise studio for building, training and deploying foundation models.
IBM
Open enterprise LLM family for code, language and time series.
Snowflake
Managed LLMs and agents running directly inside Snowflake.
Databricks
Open MoE LLM by Databricks for enterprise customization.
Notion
Built-in AI for writing, search and Q&A across Notion workspaces.
Canva
Suite of AI design tools for image, video, copy and presentations.
Writer
Enterprise LLM family and generative-AI platform for regulated industries.
Jasper
AI marketing platform for brand-aligned content generation.
ServiceNow
GenAI assistant embedded across ITSM, CSM, HRSD and creator workflows.
ServiceNow
Domain-specific large language models tuned for the Now Platform.
Oracle
Managed LLM service on OCI featuring Cohere and Meta Llama models.
SAP
Generative-AI copilot embedded across the SAP application portfolio.
Salesforce
Generative-AI layer across Sales, Service, Marketing and Commerce Clouds.
Salesforce / Slack
AI summaries, search and recap built into Slack channels and DMs.
Microsoft
Generative-AI assistant across Word, Excel, PowerPoint, Outlook and Teams.
Microsoft
Enterprise access to GPT, o-series and DALL·E models on Azure.
AWS
HIPAA-eligible service that generates clinical notes from patient conversations.
AI assistance across Gmail, Docs, Sheets, Slides, Meet and Drive.
IBM
Studio to train, tune and deploy foundation models including Granite.
Adobe
Conversational AI to summarize, query and draft from PDF documents.
Box
AI for content Q&A, summarization and metadata extraction in Box.
Smartsheet
GenAI formulas, summaries and content generation in Smartsheet.
Anthropic
Anthropic's most intelligent model — state-of-the-art on coding, agents and computer use.
OpenAI
Frontier agentic coding model for long-horizon software engineering inside Codex.
OpenAI
Updated GPT-5 with warmer tone, adaptive reasoning and stronger instruction following.
Google DeepMind
Leads LMArena Text, WebDev and Vision — Google's flagship multimodal reasoning model.
Google DeepMind
Extended-thinking variant of Gemini 3 for hardest math, science and research problems.
Ai2
Fully open model flow with training data, checkpoints and recipes for reproducible AI.
xAI
Refresh of Grok 4 with stronger reasoning, lower hallucination and faster tool use.
Anthropic
Fast, cheap Claude tier matching prior Sonnet-class quality for high-volume agents.
Mistral AI
Cost-efficient enterprise model with frontier-class performance for business workloads.
Alibaba
Alibaba's trillion-parameter flagship multilingual reasoning model.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
Eigen AI
Agentic RL vision-language model for tool-integrated visual reasoning.
Shopify
Generative AI across the Shopify admin — product descriptions, emails, blog posts and image edits.
Zendesk
Agent-side AI copilot suggesting replies, summaries and next actions in real time.
Broadcom
GenAI for agile planning — story generation, sprint summaries and risk forecasting.
Microsoft
Role-based Copilot inside Outlook & Teams pulling CRM context from Dynamics 365 and Salesforce.
Microsoft
Frontline copilot for contact center agents inside Dynamics 365 Customer Service.
Microsoft / Nuance
Ambient AI scribe for clinicians that drafts notes and orders from doctor-patient conversations.
Cloudflare
Serverless GPU inference platform running open models at the edge.
Conversational search that synthesizes answers from authentic Reddit discussions.
DoorDash
Real-time AI moderation that detects harassment across Dasher-customer chats in 99 languages.
Tencent
Tencent's deep-reasoning model, mamba-based and tuned for complex multi-step problems.
Writer
Palmyra X 003, is a top-performing instruct model, built specifically for structured text completion rather than conversational use.
Moonshot AI
Moonshot AI's language model tracked by Epoch, focused on language modeling/generation.
Google DeepMind
Google DeepMind's mathematics model tracked by Epoch, focused on geometry.
Alibaba
Alibaba's multimodal, language, vision model tracked by Epoch, focused on chat.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Cohere for AI
Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages.
Google DeepMind
Google DeepMind's language, multimodal model tracked by Epoch, focused on language modeling.
ByteDance
We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs.
Mistral AI
Mistral AI's language model tracked by Epoch, focused on chat.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Saudi Aramco
Saudi Aramco's language model tracked by Epoch, focused on language modeling/generation.
Inflection AI
At Inflection, our mission is to create a personal AI for everyone.
Apple
In this work, we discuss building performant Multimodal Large Language Models (MLLMs).
Apple
Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds.
OpenAI
Today, we shared dozens of new additions and improvements, and reduced pricing across many parts of our platform.
Reka AI
We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka.
Meta
Meta's language model tracked by Epoch, focused on chat.
NVIDIA
Visual language models (VLMs) rapidly progressed with the recent success of large language models.
01.AI
01.AI's language model tracked by Epoch, focused on chat.
Zhipu AI
We introduce ChatGLM, an evolving family of large language models that we have been developing over time.
Saudi Data and Artificial Intelligence Authority
We present ALLaM: Arabic Large Language Model, a series of large language models to support the ecosystem of Arabic Language Technologies (ALT).
Alibaba
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2.
NVIDIA
High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences.
Stanford University
Stanford University's robotics, vision, language model tracked by Epoch, focused on robotic manipulation.
NVIDIA
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4- 340B-Instruct, and Nemotron-4-340B-Reward.
DeepSeek
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
Anthropic
This addendum to our Claude 3 Model Card describes Claude 3.5 Sonnet, a new model which outperforms our previous most capable model, Claude 3 Opus, while operating faster and at a lower cost.
New York University (NYU)
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach.
EvolutionaryScale
More than three billion years of evolution have produced an image of biology encoded into the space of natural proteins.
Baidu
Baidu's multimodal, language, vision model tracked by Epoch, focused on vision-language generation.
SenseTime
SenseTime's multimodal, language, vision model tracked by Epoch, focused on vision-language generation.
Mistral AI
We're contributing Mathstral to the science community to bolster efforts in advanced mathematical problems requiring complex, multi-step logical reasoning.
DeepL
DeepL's language model tracked by Epoch, focused on translation.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
Apple
Apple's language model tracked by Epoch, focused on language modeling/generation.
Apple
Apple's language model tracked by Epoch, focused on language modeling/generation.
ByteDance
We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the LLaVA-NeXT blog series.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
xAI
Grok-2 is our frontier language model with state-of-the-art reasoning capabilities.
AI21 Labs
We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture.
Inspur
Inspur's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
At the KDD International Conference on Data Mining and Knowledge Discovery, the Zhipu GLM team unveiled the new generation of base large model—GLM-4-Plus.
Tencent
Tencent's language model tracked by Epoch, focused on language modeling/generation.
Harrison.ai
Harrison.ai's vision, medicine, language, multimodal model tracked by Epoch, focused on visual question answering.
Google DeepMind
Computational design of protein-binding proteins is a fundamental capability with broad utility in biomedical research and biotechnology.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
Tsinghua University
Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
Meta
Meta's multimodal, vision, language model tracked by Epoch, focused on visual question answering.
Writer
Palmyra X4 boasts state-of-the-art reasoning through novel training techniques.
Chai discovery
We introduce Chai-1, a multi-modal foundation model for molecular structure prediction that performs at the state-of-the-art across a variety of tasks relevant to drug discovery.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
ByteDance
A professional-grade, self-developed LLM supporting up to 128k tokens, enabling fine-tuning across the entire series.
Google DeepMind
Google DeepMind's language model tracked by Epoch, focused on language modeling.
Moonshot AI
Artificial general intelligence start-up Kimi, owned by Chinese AI start-up Moonshot AI, on Saturday launched its first reasoning AI model k0-math.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
NVIDIA
Fugatto is a versatile audio synthesis and transformation model capable of following free-form text instructions with optional audio inputs.
Amazon
A highly capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
NVIDIA
Visual language models (VLMs) have made significant advances in accuracy in recent years.
LG AI Research
This technical report introduces the EXAONE 3.5 instruction-tuned language models, developed and released by LG AI Research.
Google DeepMind
Today, we’re releasing an experimental version of Gemini 2.0 Pro that responds to that feedback.
Meta AI
Despite the rapid integration of video perception capabilities into Large Multimodal Models (LMMs), the underlying mechanisms driving their video understanding remain poorly understood.
Prime Intellect
INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning.
NVIDIA
Recently, promising progress has been made by open-source vision-language models (VLMs) in bringing their capabilities closer to those of proprietary frontier models.
Moonshot AI
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
OpenAI
Today we introduced a research preview of Operator(opens in a new window), an agent that can go to the web to perform tasks for you.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
OpenAI
We’re releasing OpenAI o3-mini, the newest, most cost-efficient model in our reasoning series, available in both ChatGPT and the API today.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
xAI
We are pleased to introduce Grok 3, our most advanced model yet: blending strong reasoning with extensive pretraining knowledge.
Inception Labs
Today, we’re excited to announce that Mercury, our first general chat model, is available to support a wider range of text generation applications.
OpenAI
We advance AI capabilities by scaling two complementary paradigms: unsupervised learning and reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
Mistral AI
Mistral OCR is an Optical Character Recognition API that sets a new standard in document understanding.
Tencent
As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Huawei
We present Pangu Ultra, a Large Language Model (LLM) with 135 billion parameters and dense Transformer modules trained on Ascend Neural Processing Units (NPUs).
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
ByteDance
We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning.
Anthropic
Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Alibaba
In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
ByteDance
Seed1.6 is the latest general-purpose model series unveiled by the ByteDance Seed team.
Google DeepMind
Google DeepMind's earth science model tracked by Epoch, focused on weather forecasting.
LG AI Research
LG AI Research's vision, medicine model tracked by Epoch, focused on cancer diagnosis.
Google DeepMind
In this report, we introduce Gemini Embedding, a state-of-the-art embedding model leveraging the power of Gemini, Google's most capable large language model.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we're announcing Qwen3-Coder, our most agentic code model to date.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Kunlun Inc.
We introduce MindLink, a new family of large language models developed by Kunlun Inc.
Google DeepMind
To advance Gemini’s capabilities towards solving hard reasoning problems, we developed a novel reasoning approach, called Deep Think, that naturally blends in parallel thinking techniques during response generation.
Sapient Intelligence
Reasoning, the process of devising and executing complex goal-oriented action sequences, remains a critical challenge in AI.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Anthropic
Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Alibaba
We present Qwen3-Omni, a single multimodal model that, for the first time, maintains state-of-the-art performance across text, image, audio, and video without any degradation relative to single-modal counterparts.
Google DeepMind
Our most capable vision-language model (VLM) reasons about the physical world, natively calls digital tools and creates detailed, multi-step plans to complete a mission.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
MiniMax
Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.
Alibaba
We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
Meta
Meta's recommendation model tracked by Epoch, focused on recommender system.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
"Today we’re upgrading the GPT‑5 series with the release of: GPT‑5.1 Instant: our most-used model, now warmer, more intelligent, and better at following your instructions.
Shanghai AI Lab
Recent progress in large language models (LLMs) has moved the frontier from puzzle-solving to science-grade reasoning-the kind needed to tackle problems whose answers must stand against nature, not merely fit a rubric.
xAI
Today, we’re excited to launch two powerful new additions to the xAI API: Grok 4.1 Fast, our best tool-calling model with a 2M context window.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on mathematical reasoning.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's model tracked by Epoch, focused on language modeling/generation.
NVIDIA
We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid MambaTransformer language model.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
MiniMax
MiniMax's language model tracked by Epoch, focused on chat.
NAVER
Developed by Naver, South Korea’s leading AI research lab, this cutting-edge language model supports multimodal inputs and advanced reasoning.
NC AI
NC AI's language model tracked by Epoch, focused on language modeling/generation.
SK Telecom
SK Telecom's language model tracked by Epoch, focused on code generation.
Upstage
Solar Open is Upstage's flagship 102B-parameter large language model, trained entirely from scratch and released under the Solar-Apache License 2.0 (see LICENSE).
LG AI Research
K-EXAONE is a large-scale multilingual language model developed by LG AI Research.
Alibaba
We present Qwen3-Max-Thinking, our latest flagship reasoning model.
Alibaba
Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development.
Moonshot AI
We introduce Kimi K2.5, an open-source multimodal agentic model designed to advance general agentic intelligence.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Alibaba
We are delighted to announce the official release of Qwen3.5, introducing the open-weight of the first model in the Qwen3.5 series, namely Qwen3.5-397B-A17B.
xAI
xAI's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering.
Google DeepMind
Last week, we released a major update to Gemini 3 Deep Think to solve modern challenges across science, research and engineering.
Alibaba
Over recent months, we have intensified our focus on developing foundation models that deliver exceptional utility and performance.
Cognition
We are sharing an early preview of our ongoing SWE-1.6 training run.
Google DeepMind
Today, we're introducing Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's language model tracked by Epoch, focused on language modeling/generation.
Anysphere
Composer 2 is a specialized model designed for agentic software engineering.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
Anthropic
Anthropic's language model tracked by Epoch, focused on question answering.
Moonshot AI
Moonshot AI's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
Anysphere
Anysphere's language model tracked by Epoch, focused on coding.
Liquid AI
Liquid AI's second-generation efficient foundation models built on liquid neural networks for on-device use.
Sakana AI
LLM-discovered preference optimization algorithm from Sakana's evolutionary research line.
Magic.dev
100M-token context model purpose-built for whole-repository software synthesis.
Poolside
Code-first foundation model trained with reinforcement learning from code-execution feedback.
Perplexity
Perplexity's in-house search-grounded LLM powering the Perplexity answer engine.
Inflection AI
Inflection's empathetic conversational assistant tuned for personal, supportive dialogue.
Imbue
Imbue's research model trained from scratch for robust agentic reasoning and code.
Sarvam AI
First Indic-first foundation model optimized for 10 Indian languages and English.
Cohere
Massively multilingual open-weights model covering 23 languages from Cohere For AI.
Inception Labs
Diffusion-based LLM that generates code in parallel for order-of-magnitude latency gains.
Nous Research
Open-source aligned LLM family known for steerable, uncensored research use.
Tencent
Tencent Hunyuan's open-source multilingual translation family for fast, instruction-following translation across 33 languages.
Alibaba
Alibaba Cloud's closed-source trillion-parameter flagship LLM for coding, reasoning, and enterprise agentic workflows.
Cohere
Cohere's open-source W4A4-quantized vision-language reasoning model for agentic, multilingual, tool-use enterprise tasks.
Google DeepMind
Google DeepMind's closed-source multimodal video creation and editing model that generates or edits video from text, image, video, and audio references.
Google DeepMind
Google DeepMind's closed-source natively multimodal reasoning model for fast, high-capability agentic and coding tasks.
Alibaba
Alibaba's vision-enhanced real-time audio/video translation model for live multilingual interpretation across 60 languages.
NVIDIA
NVIDIA's open 14B text-generation LM supporting autoregressive, diffusion-style parallel, and self-speculative decoding.