Category · 50 models
Write code on your laptop or VPC without sending source to a cloud API.
Open-weights coding models that run on a single GPU or Apple-silicon laptop — for regulated codebases, air-gapped environments, or teams that simply don't want their repo leaving the firewall.
Meta
Meta's open-weights multimodal MoE family (Scout & Maverick).
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Mistral AI
European frontier LLM strong at code, math and multilingual tasks.
Alibaba
Open multilingual model family with hybrid thinking modes.
Mistral AI
Code-specialised open model covering 80+ programming languages.
Meta
Open-weights instruct model competitive with much larger LLMs.
Mistral AI
Sparse mixture-of-experts open model.
Mistral AI
Open agentic coding model built with All Hands AI.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
Snowflake
Open enterprise LLM optimised for SQL and coding.
AMD
Open software stack for GPU compute and AI on Instinct & Radeon hardware.
Intel
Open toolkit to optimize and deploy AI inference across Intel CPUs, GPUs and NPUs.
IBM
Open enterprise LLM family for code, language and time series.
Databricks
Open MoE LLM by Databricks for enterprise customization.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Meta
Meta's language model tracked by Epoch, focused on chat.
DeepSeek
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision-language model tracked by Epoch, focused on language modeling/generation.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we're announcing Qwen3-Coder, our most agentic code model to date.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
MiniMax
Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.