Category · 159 models
Open weights you can run inside your VPC, your data center, or air-gapped.
For regulated industries — health, defense, finance, government — that need frontier-class capability but cannot send data to a hosted API. Bring your own infrastructure, own your weights.
Meta
Meta's open-weights multimodal MoE family (Scout & Maverick).
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Mistral AI
European frontier LLM strong at code, math and multilingual tasks.
Alibaba
Open multilingual model family with hybrid thinking modes.
Cohere
Enterprise-grade RAG and tool-use model for business workloads.
Microsoft
Small language model punching above its weight on reasoning benchmarks.
Stability AI
Open-weights image generator with strong fine-tuning ecosystem.
Black Forest Labs
State-of-the-art open image model from ex-Stable Diffusion researchers.
OpenAI
Open multilingual speech recognition and translation model.
Mistral AI
Code-specialised open model covering 80+ programming languages.
Arc Institute
Genome-scale foundation model spanning DNA, RNA and proteins.
Meta
Segment Anything for images and video, in real time.
Genesis Embodied AI
Generative physics platform for robotics simulation and 4D worlds.
OpenAI
OpenAI's first open-weights reasoning models since GPT-2.
Open-weights model family in 1B–27B sizes for on-device & server.
Open vision-language models for fine-tuning.
Meta
Open-weights instruct model competitive with much larger LLMs.
Meta
Open multimodal model in 11B and 90B sizes.
Meta
Self-supervised vision foundation model for image features.
Meta
Multilingual speech-to-speech and speech-to-text translation.
Mistral AI
Fast open-weights small model with strong reasoning.
Mistral AI
124B multimodal model with state-of-the-art image understanding.
Mistral AI
Sparse mixture-of-experts open model.
Mistral AI
Open agentic coding model built with All Hands AI.
Alibaba
Open vision-language model with strong document understanding.
Zhipu AI
Open agentic foundation model from Zhipu's GLM family.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
MiniMax
Open reasoning model with 1M-token context.
Tencent
Open MoE model with 389B params from Tencent.
Black Forest Labs
Image editing model with character & style consistency.
HiDream
Open 17B image generation model topping benchmarks.
Useful Sensors
Open ASR model optimised for real-time edge inference.
EvolutionaryScale
Frontier protein language model for biology design.
MIT / Recursion
Open AlphaFold3-class biomolecular structure prediction model.
Google DeepMind
Best-in-class medium-range global weather forecasting AI.
Google DeepMind
Probabilistic AI weather forecasting beating ENS.
Physical Intelligence
Generalist vision-language-action model for robots.
NVIDIA
Open foundation model for humanoid robots.
Allen Institute (AI2)
Fully open language model with training data and code released.
TII
Open LLM family from the Technology Innovation Institute.
Hugging Face
Compact open model strong in its size class.
Reka
Multimodal frontier model with open weights.
NVIDIA
NVIDIA's open LLM family for synthetic data and reasoning.
Snowflake
Open enterprise LLM optimised for SQL and coding.
Databricks
Open MoE LLM tuned for enterprise tasks.
AMD
Open software stack for GPU compute and AI on Instinct & Radeon hardware.
Intel
Open toolkit to optimize and deploy AI inference across Intel CPUs, GPUs and NPUs.
Tenstorrent
Open RISC-V based AI accelerator from Jim Keller's team.
IBM
Open enterprise LLM family for code, language and time series.
Databricks
Open MoE LLM by Databricks for enterprise customization.
Weaviate
Open-source vector database with hybrid search and modules.
Meta
Segment Anything 3D — reconstructs objects, scenes and human bodies from a single image.
Ai2
Fully open model flow with training data, checkpoints and recipes for reproducible AI.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
Eigen AI
Agentic RL vision-language model for tool-integrated visual reasoning.
Coinbase
Toolkit letting AI agents transact on-chain with wallets, USDC and smart contracts.
Google DeepMind
Google DeepMind's mathematics model tracked by Epoch, focused on geometry.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Cohere for AI
Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages.
Tsinghua University
Performing language-conditioned robotic manipulation tasks in unstructured environments is highly demanded for general intelligent robots.
Meta
Meta's language model tracked by Epoch, focused on chat.
NVIDIA
Visual language models (VLMs) rapidly progressed with the recent success of large language models.
University of California (UC) Berkeley
University of California (UC) Berkeley's robotics model tracked by Epoch, focused on robotic manipulation.
Alibaba
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2.
NVIDIA
High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences.
Stanford University
Stanford University's robotics, vision, language model tracked by Epoch, focused on robotic manipulation.
NVIDIA
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4- 340B-Instruct, and Nemotron-4-340B-Reward.
DeepSeek
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
New York University (NYU)
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach.
Mistral AI
We're contributing Mathstral to the science community to bolster efforts in advanced mathematical problems requiring complex, multi-step logical reasoning.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
ByteDance
We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the LLaVA-NeXT blog series.
AI21 Labs
We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
Tsinghua University
Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
Meta
Meta's multimodal, vision, language model tracked by Epoch, focused on visual question answering.
Tsinghua University
Tsinghua University's robotics model tracked by Epoch, focused on robotic manipulation.
Chai discovery
We introduce Chai-1, a multi-modal foundation model for molecular structure prediction that performs at the state-of-the-art across a variety of tasks relevant to drug discovery.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
Visual language models (VLMs) have made significant advances in accuracy in recent years.
ByteDance
We present Infinity, a Bitwise Visual AutoRegressive Modeling capable of generating high-resolution, photorealistic images following language instruction.
LG AI Research
This technical report introduces the EXAONE 3.5 instruction-tuned language models, developed and released by LG AI Research.
Stability AI
We study the problem of single-image 3D object reconstruction.
Prime Intellect
INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning.
NVIDIA
Recently, promising progress has been made by open-source vision-language models (VLMs) in bringing their capabilities closer to those of proprietary frontier models.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
NVIDIA
Understanding and modeling lighting effects are fundamental tasks in computer vision and graphics.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Alibaba
In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models.
LG AI Research
LG AI Research's vision, medicine model tracked by Epoch, focused on cancer diagnosis.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we're announcing Qwen3-Coder, our most agentic code model to date.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
We present Qwen-Image, an image generation foundation model in the Qwen series that achieves significant advances in complex text rendering and precise image editing.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Alibaba
We present Qwen3-Omni, a single multimodal model that, for the first time, maintains state-of-the-art performance across text, image, audio, and video without any degradation relative to single-modal counterparts.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
MiniMax
Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.
Alibaba
We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
Shanghai AI Lab
Recent progress in large language models (LLMs) has moved the frontier from puzzle-solving to science-grade reasoning-the kind needed to tackle problems whose answers must stand against nature, not merely fit a rubric.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on mathematical reasoning.
NVIDIA
We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid MambaTransformer language model.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
MiniMax
MiniMax's language model tracked by Epoch, focused on chat.
NC AI
NC AI's language model tracked by Epoch, focused on language modeling/generation.
Alibaba
Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development.
Moonshot AI
We introduce Kimi K2.5, an open-source multimodal agentic model designed to advance general agentic intelligence.
Alibaba
We are delighted to announce the official release of Qwen3.5, introducing the open-weight of the first model in the Qwen3.5 series, namely Qwen3.5-397B-A17B.
Zhipu AI
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering.
Alibaba
Over recent months, we have intensified our focus on developing foundation models that deliver exceptional utility and performance.
Moonshot AI
Moonshot AI's language model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
Liquid AI
Liquid AI's second-generation efficient foundation models built on liquid neural networks for on-device use.
Sakana AI
LLM-discovered preference optimization algorithm from Sakana's evolutionary research line.
Sarvam AI
First Indic-first foundation model optimized for 10 Indian languages and English.
Cohere
Massively multilingual open-weights model covering 23 languages from Cohere For AI.
Nous Research
Open-source aligned LLM family known for steerable, uncensored research use.
Meituan
Meituan LongCat's open-source audio-driven avatar video model for single- and multi-character human video generation.
Tencent
Tencent Hunyuan's open-source multilingual translation family for fast, instruction-following translation across 33 languages.
Microsoft
Microsoft's open-source 3.8B text-to-image model focused on efficient training, fast high-res generation, and strong prompt adherence.
Cohere
Cohere's open-source W4A4-quantized vision-language reasoning model for agentic, multilingual, tool-use enterprise tasks.
NVIDIA
NVIDIA's open 14B text-generation LM supporting autoregressive, diffusion-style parallel, and self-speculative decoding.
Meta
Meta's audio generation model focused on high-fidelity waveform synthesis and speech-music co-generation.
Sapient Intelligence
Sapient's 1B Hierarchical Reasoning Model for compact, structured chain-of-thought text generation.