Catalog Search · 705 models indexed
Find any AI model in AIDB by name, maker, capability, industry or modality.
OpenAI
OpenAI's flagship multimodal reasoning model with long-context tool use.
OpenAI
Real-time omni-model handling text, vision and voice in a single network.
OpenAI
Frontier reasoning model tuned for math, science and coding workflows.
Anthropic
Anthropic's best coding and agentic model, strong at long autonomous tasks.
Anthropic
Top-tier reasoning model for research, analysis and complex writing.
Google DeepMind
Long-context multimodal model with native tool use and 1M+ token window.
Google DeepMind
Fast, cheap multimodal model optimised for high-volume production use.
xAI
xAI's flagship reasoning model with real-time X knowledge and tool use.
Meta
Meta's open-weights multimodal MoE family (Scout & Maverick).
DeepSeek
High-performance open MoE LLM rivaling closed frontier models on benchmarks.
DeepSeek
Open reasoning model trained with RL, competitive with o1-class systems.
Mistral AI
European frontier LLM strong at code, math and multilingual tasks.
Alibaba
Open multilingual model family with hybrid thinking modes.
Cohere
Enterprise-grade RAG and tool-use model for business workloads.
Microsoft
Small language model punching above its weight on reasoning benchmarks.
OpenAI
Production-grade image generation API with strong text rendering.
OpenAI
Prompt-faithful image generator integrated across ChatGPT.
Google DeepMind
Photoreal image model with sharp typography and detail.
Midjourney
Aesthetic-first image model beloved by designers and concept artists.
Stability AI
Open-weights image generator with strong fine-tuning ecosystem.
Black Forest Labs
State-of-the-art open image model from ex-Stable Diffusion researchers.
Ideogram
Image model specialised in legible in-image typography and logos.
OpenAI
Text-to-video model producing minute-long cinematic clips.
Google DeepMind
High-fidelity video generation with native synchronised audio.
Runway
Pro video generation with consistent characters and worlds.
Kuaishou
Chinese text-to-video model with strong physical realism.
Pika Labs
Creative video generator with scene ingredients and edits.
OpenAI
Open multilingual speech recognition and translation model.
ElevenLabs
Best-in-class expressive TTS and voice cloning across 70+ languages.
Suno
Generates full songs with vocals from a text prompt.
Udio
Text-to-music model focused on production-quality tracks.
Anthropic
Agentic coding tool that lives in your terminal and edits real codebases.
GitHub / OpenAI
In-IDE pair programmer powering code completion and chat.
Anysphere
AI-first code editor with multi-file edits and background agents.
Cognition
Autonomous software engineer that plans, codes and ships PRs.
Mistral AI
Code-specialised open model covering 80+ programming languages.
Google DeepMind / Isomorphic
Predicts the structure and interactions of life's molecules.
Medical LLM achieving expert-level performance on USMLE-style questions.
Arc Institute
Genome-scale foundation model spanning DNA, RNA and proteins.
Bloomberg
Finance-domain LLM trained on decades of market and news data.
Harvey
Generative AI platform purpose-built for elite law firms.
Khan Academy
AI tutor that guides students with Socratic questioning.
Perplexity
Answer engine combining LLMs with cited live web search.
Source-grounded research assistant with audio overviews.
OpenAI
High-dimensional embeddings for search, RAG and clustering.
Voyage AI
Top-ranked retrieval embeddings, optimised for RAG quality.
Meta
Segment Anything for images and video, in real time.
Google DeepMind
Vision-language-action model that controls robots from web knowledge.
Genesis Embodied AI
Generative physics platform for robotics simulation and 4D worlds.
Stability AI / Tripo
Fast single-image to 3D mesh reconstruction model.
Jasper
Marketing copilot for brand-aware content at enterprise scale.
OpenAI
Browser-using agent that performs tasks on the open web.
OpenAI
Improved GPT-4 series model with stronger coding and instruction following.
OpenAI
Smaller, faster GPT-4.1 for production workloads.
OpenAI
Cheapest, fastest GPT-4.1 tier for high-volume tasks.
OpenAI
Compact reasoning model balancing cost and quality.
OpenAI
Cost-efficient multimodal small model.
OpenAI
OpenAI's first open-weights models since GPT-2, built for reasoning.
OpenAI
OpenAI text-to-speech voices via the audio API.
Anthropic
Anthropic's fastest and cheapest frontier-class small model.
Anthropic
Hybrid reasoning model with extended thinking mode.
Anthropic
Fast, low-cost model for everyday tasks.
Google DeepMind
Smallest, cheapest Gemini for high-volume tasks.
Google DeepMind
Multimodal model with native tool use and live API.
Open-weights model family in 1B–27B sizes for on-device & server.
Open vision-language models for fine-tuning.
Google DeepMind
Google's professional music generation model.
High-fidelity expressive TTS voices on Google Cloud.
Meta
Open-weights instruct model competitive with much larger LLMs.
Meta
Open multimodal model in 11B and 90B sizes.
Meta
Self-supervised vision foundation model for image features.
Meta
Multilingual speech-to-speech and speech-to-text translation.
Meta
Open text-to-music model from AudioCraft.
xAI
Multi-agent variant of Grok 4 for the hardest problems.
xAI
Speed-optimised code model for agentic IDE workflows.
xAI
Photoreal autoregressive image generation model.
Mistral AI
Frontier-class performance at a fraction of the cost.
Mistral AI
Fast open-weights small model with strong reasoning.
Mistral AI
124B multimodal model with state-of-the-art image understanding.
Mistral AI
Sparse mixture-of-experts open model.
Mistral AI
Open agentic coding model built with All Hands AI.
Alibaba
Open agentic coding model in the Qwen3 family.
Alibaba
Open vision-language model with strong document understanding.
Alibaba
Open text-to-video and image-to-video model.
Zhipu AI
Open agentic foundation model from Zhipu's GLM family.
Moonshot AI
Trillion-parameter open MoE model with strong agentic skills.
MiniMax
Open reasoning model with 1M-token context.
MiniMax
Cinematic text-to-video generator.
01.AI
Fast, low-cost frontier-tier LLM from 01.AI.
Tencent
Open MoE model with 389B params from Tencent.
Baidu
Baidu's flagship multimodal foundation model.
ByteDance
ByteDance's flagship LLM, widely deployed in China.
ByteDance
ByteDance Seed video generation model.
Recraft
Image model designed for brand & vector-style design assets.
Adobe
Commercially safe image model trained on licensed data.
Leonardo.Ai
In-house foundation model with strong prompt adherence.
Playground
Image model focused on graphic design and typography.
Black Forest Labs
Image editing model with character & style consistency.
HiDream
Open 17B image generation model topping benchmarks.
Luma AI
Large video generative model with realistic motion.
HeyGen
AI avatar video generator for marketing and training.
Synthesia
Enterprise AI video platform with realistic avatars.
Tencent
Open 13B text-to-video model.
Genmo
Open-source video generation model.
Lightricks
Real-time open video generation model.
Cartesia
Ultra-low-latency state-space TTS model.
PlayHT
Conversational TTS optimised for AI agents.
Resemble AI
Voice cloning and real-time speech synthesis platform.
Deepgram
Production-grade streaming speech-to-text model.
AssemblyAI
Highly accurate speech recognition with rich audio intelligence.
Useful Sensors
Open ASR model optimised for real-time edge inference.
Stability AI
Generates full-length audio tracks from text.
Riffusion
AI music generation with vocal & instrumental control.
Lovable
AI full-stack builder that ships production web apps from prompts.
StackBlitz
Browser-based AI agent that builds and runs full-stack apps.
Vercel
Generative UI tool that produces React + Tailwind components.
Replit
Agent that creates, edits and deploys apps inside Replit.
Codeium
Agentic IDE with deep multi-file flows.
Tabnine
Privacy-first AI code assistant for the enterprise.
AWS
Coding & cloud assistant deeply integrated with AWS.
Sourcegraph
Code AI with deep codebase context across repos.
Aider
Open-source CLI pair-programmer using your favorite LLM.
Continue
Open-source AI code assistant for VS Code & JetBrains.
OpenAI
ChatGPT mode that browses, codes and acts on your behalf.
Butterfly Effect
General-purpose autonomous agent that executes long workflows.
Conversational AI search experience in Google.
You.com
AI assistant combining web search with multi-model chat.
Brave
Privacy-respecting AI assistant built into the Brave browser.
Cohere
Multimodal multilingual embeddings for enterprise RAG.
Jina AI
Multilingual long-context embedding model.
BAAI
Open multi-functional, multilingual embedding model.
Nomic
Open MoE multilingual text embeddings.
EvolutionaryScale
Frontier protein language model for biology design.
Baker Lab
Open diffusion model for de novo protein design.
MIT / Recursion
Open AlphaFold3-class biomolecular structure prediction model.
Google DeepMind
Best-in-class medium-range global weather forecasting AI.
Google DeepMind
Probabilistic AI weather forecasting that outperforms ECMWF's ENS ensemble.
LLM tuned for therapeutic and drug development tasks.
Physical Intelligence
Generalist vision-language-action model for robots.
Figure
Vision-language-action model for humanoid robot control.
NVIDIA
Open foundation model for humanoid robots.
NVIDIA
World foundation models for physical AI simulation.
Google DeepMind
Real-time interactive world model from a text prompt.
Meshy
Text & image to 3D model generator for creators.
Hyper3D
High-fidelity 3D asset generation with PBR textures.
Tripo AI
Production-quality text and image to 3D.
Thomson Reuters
Generative AI legal assistant for lawyers.
Hebbia
Agentic research platform for finance and legal teams.
Glean
Enterprise AI assistant grounded in company knowledge.
Microsoft
AI assistant across Word, Excel, Outlook, PowerPoint and Teams.
AI in Gmail, Docs, Sheets, Slides and Meet.
Notion
AI for writing, search and meeting notes inside Notion.
Copy.ai
GTM AI platform for marketing and sales workflows.
Writer
Enterprise LLM family powering Writer's generative platform.
Inflection
Personal AI focused on emotionally intelligent conversation.
Character.AI
Platform for creating and chatting with AI characters.
Duolingo
AI-powered language tutoring features.
Allen Institute (AI2)
Fully open language model with training data and code released.
TII
Open LLM family from the Technology Innovation Institute.
Hugging Face
Compact open model strong in its size class.
Reka
Multimodal frontier model with open weights.
NVIDIA
NVIDIA's open LLM family for synthetic data and reasoning.
IBM
Open enterprise-ready foundation models.
Snowflake
Open enterprise LLM optimised for SQL and coding.
Databricks
Open MoE LLM tuned for enterprise tasks.
Dell Technologies
End-to-end AI infrastructure stack combining PowerEdge servers, storage, networking and NVIDIA AI Enterprise.
Dell Technologies
Toolkit for deploying on-device AI models to Dell AI PCs at scale.
Lenovo
On-device AI assistant running locally on Lenovo AI PCs for private productivity.
Lenovo
Hybrid AI platform spanning ThinkSystem servers, ThinkEdge devices and managed services.
AMD
Datacenter accelerator (CDNA 4) for training and inference of frontier models.
AMD
Laptop CPU with XDNA 2 NPU delivering 50+ TOPS for Copilot+ AI PCs.
AMD
Open software stack for GPU compute and AI on Instinct & Radeon hardware.
NVIDIA
Managed AI training service on dedicated NVIDIA Hopper/Blackwell clusters.
NVIDIA
Flagship datacenter GPU for trillion-parameter AI training and inference.
NVIDIA
Containerized inference microservices for deploying optimized AI models anywhere.
NVIDIA
Edge robotics platform built on Blackwell for humanoid and physical AI.
NVIDIA
Centralized car computer for AV, cockpit AI and infotainment.
Intel
Datacenter AI accelerator targeting better price/performance than NVIDIA's H100.
Intel
Laptop CPU with integrated NPU powering Copilot+ on-device AI.
Intel
Open toolkit to optimize and deploy AI inference across Intel CPUs, GPUs and NPUs.
Qualcomm
Arm laptop SoC with 45 TOPS Hexagon NPU for Copilot+ PCs.
Qualcomm
Library of optimized AI models ready to deploy on Snapdragon devices.
Apple
On-device + private cloud generative AI across iPhone, iPad and Mac.
Apple
38 TOPS NPU integrated in Apple Silicon for on-device ML workloads.
Pixel SoC powering on-device Gemini Nano features.
Custom AI accelerators powering Gemini training and Google Cloud AI.
Smallest Gemini model running fully on-device on Pixel and Android.
Samsung
Suite of on-device + cloud AI features for Galaxy phones (translate, edit, summarize).
Samsung
Samsung's in-house generative model family for Galaxy products.
Hewlett Packard Enterprise
Turnkey on-prem AI cloud built with NVIDIA, co-engineered for enterprises.
HP
Local AI assistant bundled with HP AI PCs for private document Q&A.
IBM
Enterprise AI & data platform with model studio, governance and runtime.
Cisco
Security platform that protects AI applications from misuse and attacks.
Cisco
Pre-validated infrastructure stacks for inference at the edge of the enterprise.
Pure Storage
AI-ready storage stack co-engineered with NVIDIA for training pipelines.
NetApp
Converged AI infrastructure with ONTAP storage and NVIDIA compute.
Supermicro
Liquid-cooled GPU SuperClusters for trillion-parameter LLM training.
Cerebras
Wafer-scale AI processor delivering record-breaking inference throughput.
Groq
Language Processing Unit delivering ultra-low-latency LLM inference.
SambaNova Systems
Full-stack AI platform with Reconfigurable Dataflow Units (RDUs).
Tenstorrent
Open RISC-V-based AI accelerator from Jim Keller's team.
AWS
Custom AWS chip purpose-built for training large language models.
AWS
Cost-optimized AWS chip for high-throughput LLM inference.
AWS
Managed service to build agents using foundation models from many vendors.
AWS
Amazon's foundation model family (text, image, video) on Bedrock.
Microsoft
Unified platform to design, customize and operate enterprise AI agents.
Microsoft
Windows AI PC category with NPU-powered features like Recall and Live Captions.
Tesla
End-to-end neural network for autonomous driving on Tesla vehicles.
Mercedes-Benz
In-car operating system with conversational AI assistant powered by Google Cloud.
BMW
Voice-first in-car AI assistant integrating Alexa LLM features.
Mobileye
Eyes-off AV system combining EyeQ chips, surround sensing and REM mapping.
Waymo
Full-stack autonomous driving system deployed in robotaxis.
John Deere
Computer vision system that targets herbicide only at weeds in real time.
Boston Dynamics
Software platform managing Spot robots with AI-driven inspection routines.
Figure
Humanoid robot powered by the Helix VLA model for general-purpose work.
Tesla
General-purpose humanoid robot using Tesla's autonomy stack.
Unitree
Affordable humanoid robot platform with onboard AI.
Rabbit
Pocket AI device built around the Large Action Model paradigm.
Humane
Wearable AI assistant with laser projection and voice-first UI.
Meta
Smart glasses with multimodal Meta AI for live look-and-ask.
CrowdStrike
Generative AI security analyst built into the Falcon platform.
Microsoft
Generative AI assistant for SOC analysts and IT admins.
Palo Alto Networks
Discovery and protection for employee use of generative AI apps.
Salesforce
Platform for building autonomous AI agents on top of CRM data.
ServiceNow
Generative AI built into the ServiceNow workflow platform.
Oracle
AI agents embedded across Oracle Fusion Cloud apps and OCI.
SAP
Generative AI copilot embedded across SAP business applications.
John Deere
Computer-vision sprayer that targets weeds in real time, cutting herbicide use up to 60%.
Bayer / Climate
Digital agronomy platform with AI-driven yield, planting and nitrogen recommendations.
PEAT
Mobile crop-disease diagnosis from a single leaf photo, used by 30M+ smallholders.
John Deere
Robotics + computer vision for precision weeding and crop care.
Taranis
Aerial leaf-level imagery with AI for early pest, disease and nutrient detection.
CropX
Soil-sensor + AI agronomic platform for irrigation and nitrogen optimization.
AGCO
Connected farm AI platform for fleets of Massey Ferguson and Fendt machinery.
Carbon Robotics
AI-guided lasers identify and zap weeds in row crops without chemicals.
Indigo Ag
AI-powered soil carbon and regenerative-ag marketplace.
Palantir
AI Platform deploying LLMs against classified and government datasets with audit and policy controls.
Microsoft
Azure OpenAI Service in Azure Government for IL5 / FedRAMP High workloads.
AWS
Amazon Bedrock foundation models in AWS GovCloud for US public-sector workloads.
Gemini and Vertex AI tailored for federal, state and local government.
Oracle
Generative AI in Oracle Cloud for Government across HCM, ERP and citizen services.
ServiceNow
AI-powered citizen-services workflows for federal and local agencies.
Veritone
AI for evidence redaction, transcription and investigations for law enforcement.
Anduril
AI command-and-control mesh fusing sensors, drones and effectors across the battlespace.
Shield AI
Autonomy stack flying GPS- and comms-denied missions on V-BAT and F-16.
Helsing
European defense AI for sensor fusion, electronic warfare and autonomous strike.
Palantir
AI-driven targeting and ISR fusion deployed across US combatant commands.
Scale AI
LLM-powered decision-making platform for defense and intelligence analysts.
Lockheed Martin
Defense-grade AI infrastructure subsidiary supporting national security missions.
BAE Systems
Autonomous and AI/ML systems for ISR, EW and mission systems.
Saab
AI-enabled command-and-control suite for joint and combined operations.
Rebellion Defense
AI products for ISR, mission planning and computer vision in defense.
Vannevar Labs
AI for open-source intelligence and non-traditional collection.
Booking.com
Conversational trip planner suggesting destinations, hotels and itineraries.
Expedia
Group-travel AI assistant integrated in chat for planning and booking.
Kayak
ChatGPT-powered travel concierge for searching flights, hotels and cars.
TripAdvisor
AI-generated personalized itineraries from reviews and traveler data.
Hopper
Conversational price-prediction and booking assistant for flights and hotels.
Hilton
AI-driven family-room matching and stay personalization across Hilton brands.
Marriott
Generative-AI experiences for guest service and personalization.
Airbnb
AI-powered search, photo-tour categorization and host customer-service agent.
General Motors
Hands-free driver-assistance system using AI perception and HD-map fusion.
Ford
Hands-free highway driving with AI driver monitoring.
Hyundai Motor
AI-powered software-defined vehicle OS with voice and personalization.
Stellantis
Level-3 autonomous-driving stack across Stellantis brands.
Cruise
Driverless robotaxi platform built on GM vehicles.
Amazon
Purpose-built autonomous robotaxi with end-to-end AI driving stack.
Wayve
Generative world model for end-to-end embodied driving.
Siemens
Generative AI copilot for engineers across PLC, design and operations.
Rockwell Automation
AI-enabled HMI and analytics for industrial automation.
GE Vernova
AI for grid orchestration, wind-turbine optimization and power generation.
Schneider Electric
AI for energy management and industrial automation across EcoStruxure.
ABB
Industrial analytics and AI platform for process and discrete manufacturing.
Honeywell
Industrial AI for buildings, aerospace and process industries.
SLB (Schlumberger)
Generative-AI platform for energy operations across exploration and production.
Baker Hughes
Autonomous field-operations AI for oil & gas production.
Alphabet X
AI-driven virtualization platform for the electric grid.
Octopus Energy
AI-native customer and grid platform powering 60M+ energy accounts.
Ericsson
AI/ML for autonomous 5G network operations and energy savings.
Nokia
Edge AI platform for private 5G and industrial automation.
AT&T
Internal generative-AI assistant built on Azure OpenAI for 80k+ employees.
Verizon
Generative-AI agent for customer-service and field operations.
Vodafone
AI customer-service chatbot serving hundreds of millions of subscribers.
Amazon
Generative-AI shopping assistant inside the Amazon app.
Shopify
AI commerce assistant that runs the merchant's store via natural language.
Walmart
Generative-AI shopping assistant in the Walmart app.
Klarna
OpenAI-powered customer-service agent handling two-thirds of Klarna chats.
Instacart
ChatGPT-powered grocery search and meal planning.
Maersk
AI-powered remote container monitoring across reefer fleets.
FedEx
AI logistics intelligence platform built with Microsoft for shipment visibility.
UPS
Machine-learning system scoring delivery-success likelihood for shippers.
Project44
Generative-AI supply-chain assistant on top of real-time visibility data.
Blue Yonder
AI/ML supply-chain planning, forecasting and execution.
Zillow
Neural-network home-value estimation across 100M+ US homes.
Procore
Generative-AI assistant for construction project management.
Autodesk
Generative design and AI across AutoCAD, Revit, Forma and Fusion.
HPE Aruba Networking
AIOps for wired, wireless and SD-WAN with predictive issue resolution.
HPE Aruba Networking
Conversational AI inside Aruba Central for network troubleshooting.
Cisco
Cross-portfolio AI assistant for security, networking and collaboration.
Cisco
AI-native distributed security fabric for data centers and clouds.
Juniper Networks
AI-driven networking and Marvis virtual network assistant.
Arista Networks
Autonomous Virtual Assist AI for network operations and security.
Extreme Networks
GenAI assistant for network operations across the Extreme platform.
F5
Application-delivery and security AI gateway for LLM apps.
Fortinet
GenAI security analyst across the Fortinet Security Fabric.
Zscaler
Generative-AI copilot for digital experience and zero-trust operations.
Palo Alto Networks
GenAI copilot for network security across the Strata portfolio.
Palo Alto Networks
AI-driven SOC platform unifying SIEM, EDR and SOAR.
Check Point
Generative-AI assistant for security administration and threat analysis.
SentinelOne
Generative-AI threat-hunting analyst across the Singularity platform.
Darktrace
Self-learning AI platform for autonomous response across email, network and cloud.
Vectra AI
AI-driven threat detection and response across hybrid cloud.
Veeam
AI-powered data resilience, anomaly detection and recovery analytics.
Commvault
GenAI assistant for cyber resilience, recovery and data protection.
Rubrik
Generative-AI assistant for cyber recovery investigations and remediation.
Cohesity
RAG-based AI search and insights over enterprise backup data.
Pure Storage
AI-ready infrastructure (with NVIDIA DGX) for training and inference at scale.
NetApp
Converged AI infrastructure with NVIDIA for enterprise model training.
Dell Technologies
Scale-out file storage tuned for large-scale AI training and RAG.
VAST Data
Unified data platform for AI with embedded vector database and compute.
WEKA
High-performance data platform for GPU-accelerated AI pipelines.
DDN
Reference AI storage architecture co-engineered with NVIDIA DGX SuperPOD.
Hitachi Vantara
Industry-tailored generative AI solutions on Hitachi infrastructure.
IBM
Enterprise studio for building, training and deploying foundation models.
IBM
Open enterprise LLM family for code, language and time series.
Snowflake
Managed LLMs and agents running directly inside Snowflake.
Databricks
End-to-end platform for building and serving custom AI agents on the lakehouse.
Databricks
Open MoE LLM by Databricks for enterprise customization.
Pinecone
Managed vector database powering RAG and semantic-search apps.
Weaviate
Open-source vector database with hybrid search and modules.
Elastic
GenAI assistant across Elastic Search, Observability and Security.
Splunk (Cisco)
GenAI assistant for SPL, observability and security operations.
New Relic
Generative-AI observability assistant for engineers.
Datadog
Generative-AI assistant across Datadog observability and security.
Dynatrace
Hypermodal AI combining causal, predictive and generative AI for observability.
Workday
AI agents for HR, finance and planning across Workday.
Atlassian
Enterprise search and AI agents across Jira, Confluence and third-party SaaS.
Notion
Built-in AI for writing, search and Q&A across Notion workspaces.
Zoom
AI assistant for meeting summaries, chat and email across Zoom.
Cisco
GenAI assistant for meetings, contact center and collaboration.
Intuit
GenAI financial assistant across TurboTax, QuickBooks, Credit Karma and Mailchimp.
Adobe
Commercially safe generative-AI models for image, vector and video.
Adobe
Enterprise generative-AI platform for marketing content production.
Canva
Suite of AI design tools for image, video, copy and presentations.
HubSpot
AI agents and copilots across marketing, sales and service.
Glean
Enterprise-search and work-AI assistant across SaaS data.
Writer
Enterprise LLM family and generative-AI platform for regulated industries.
Jasper
AI marketing platform for brand-aligned content generation.
ServiceNow
GenAI assistant embedded across ITSM, CSM, HRSD and creator workflows.
ServiceNow
Autonomous AI agents for IT, HR, customer service and security operations.
ServiceNow
Domain-specific large language models tuned for the Now Platform.
Workday
Role-based AI agents for recruiting, payroll, expenses and contracts.
Workday
Central system to manage, govern and orchestrate AI agents across the enterprise.
Oracle
Prebuilt AI agents across Oracle Fusion Cloud HCM, ERP, SCM and CX.
Oracle
Managed LLM service on OCI featuring Cohere and Meta Llama models.
Oracle
Conversational AI platform for building enterprise assistants.
Oracle
GenAI coding companion optimized for Java, SQL and OCI.
Oracle
Voice-enabled clinical documentation agent for clinicians.
SAP
Generative-AI copilot embedded across the SAP application portfolio.
SAP
Portfolio of AI capabilities and agents across SAP business processes.
SAP
Runtime and lifecycle management for AI workloads on SAP BTP.
Salesforce
Platform for building, deploying and governing autonomous AI agents.
Salesforce
Generative-AI layer across Sales, Service, Marketing and Commerce Clouds.
Salesforce
Unified customer data foundation powering Einstein and Agentforce.
Salesforce / Slack
AI summaries, search and recap built into Slack channels and DMs.
Salesforce / Tableau
Generative analytics and natural-language insights inside Tableau.
Microsoft
Generative-AI assistant across Word, Excel, PowerPoint, Outlook and Teams.
Microsoft
Low-code platform for building and orchestrating custom AI agents.
Microsoft
Role-based AI copilots for sales, service, finance, supply chain and HR.
Microsoft
Unified platform to build, evaluate and deploy AI agents and models on Azure.
Microsoft
Enterprise access to GPT, o-series and DALL·E models on Azure.
Microsoft
Vector and hybrid retrieval engine for grounding LLMs on enterprise data.
Microsoft / GitHub
AI pair-programmer for code completion, chat, reviews and agent mode.
Microsoft
GenAI copilots for data engineering, science and Power BI inside Fabric.
Microsoft
AI models and prebuilt skills for Power Apps and Power Automate.
AWS
Generative-AI assistant grounded on enterprise data and SaaS connectors.
AWS
AI coding and operations assistant across the developer lifecycle on AWS.
AWS
Secure runtime for deploying and scaling production AI agents on Bedrock.
AWS
End-to-end platform to build, train and deploy ML and foundation models.
AWS
GenAI for contact-center agents, self-service and analytics.
AWS
HIPAA-eligible service that generates clinical notes from patient conversations.
Google Cloud
Unified platform for Gemini, Model Garden, agents and ML on GCP.
Google Cloud
Build, deploy and manage multi-agent systems grounded on enterprise data.
AI assistance across Gmail, Docs, Sheets, Slides, Meet and Drive.
AI coding assistant with enterprise context across IDEs and Google Cloud.
Google Cloud
Generative contact-center AI for virtual agents, agent assist and insights.
Google Cloud
In-warehouse ML and GenAI directly on BigQuery data via SQL.
IBM
Studio to train, tune and deploy foundation models including Granite.
IBM
Build and orchestrate AI agents across HR, procurement and sales workflows.
IBM
Open data lakehouse optimized for AI workloads and RAG.
IBM
AI governance, risk and compliance for foundation models and agents.
Informatica
Generative-AI assistant for data management, integration and governance.
Informatica
AI agents across the Intelligent Data Management Cloud for pipelines and quality.
Adobe
GenAI assistant for marketers across Adobe Experience Cloud applications.
Adobe
Conversational AI to summarize, query and draft from PDF documents.
Snowflake
Build agentic apps grounded on governed Snowflake data with hosted LLMs.
Snowflake
Natural-language SQL and analytics assistant inside Snowflake.
Databricks
Tooling to build, evaluate and govern compound AI agents on the lakehouse.
Databricks
Conversational analytics over governed lakehouse data.
Broadcom / VMware
On-prem GenAI reference architecture co-engineered with NVIDIA and IBM.
Box
AI for content Q&A, summarization and metadata extraction in Box.
Dropbox
Universal search and AI assistant across SaaS apps and content.
DocuSign
AI-powered Intelligent Agreement Management for contract data and workflows.
Zendesk
Autonomous and copilot AI agents for customer service.
Freshworks
Generative-AI assistants and agents across CX, ITSM and CRM.
ZoomInfo
GenAI go-to-market copilot for sellers, grounded on B2B data.
Gong
Revenue AI for call insights, forecasting and deal execution.
Pegasystems
GenAI Blueprint and agents for case management and CRM workflows.
UiPath
Agentic automation copilot across the UiPath platform for citizen developers and professional developers.
Automation Anywhere
Build and govern AI agents that combine LLMs with enterprise automation.
Qlik / Talend
AI-assisted data integration, quality and governance.
Qlik
Generative analytics service delivering trusted answers from unstructured data.
SAS
Analytics and AI platform with embedded LLM orchestration and copilots.
Cloud Software Group
AI across Spotfire, integration and data virtualization products.
Teradata
In-database analytics and GenAI orchestration on Teradata VantageCloud.
MongoDB
Native vector search in MongoDB Atlas for RAG and semantic apps.
Redis
Low-latency vector database and semantic cache for GenAI apps.
Twilio
GenAI and predictive AI across Twilio messaging, voice and Segment.
Asana
AI teammates and copilots for work management and goals.
monday.com
AI assistant and blocks for automating Work OS workflows.
Smartsheet
GenAI formulas, summaries and content generation in Smartsheet.
Coupa
Community-powered AI and agents for spend management.
GitLab
AI assistant across the GitLab DevSecOps platform with code, chat and security.
Atlassian
AI features and agents across Jira, Confluence, Bitbucket and Loom.
Anthropic
Anthropic's most intelligent model — state-of-the-art on coding, agents and computer use.
OpenAI
Frontier agentic coding model for long-horizon software engineering inside Codex.
OpenAI
Updated GPT-5 with warmer tone, adaptive reasoning and stronger instruction following.
Google DeepMind
Google's flagship multimodal reasoning model, leading the LMArena Text, WebDev and Vision leaderboards.
Google DeepMind
Extended-thinking variant of Gemini 3 for hardest math, science and research problems.
Google DeepMind
Gemini-powered flagship image generation and editing model with best-in-class text.
Meta
Segment Anything 3D — reconstructs objects, scenes and human bodies from a single image.
Ai2
Fully open model flow with training data, checkpoints and recipes for reproducible AI.
xAI
Refresh of Grok 4 with stronger reasoning, lower hallucination and faster tool use.
Anthropic
Fast, cheap Claude tier matching prior Sonnet-class quality for high-volume agents.
Mistral AI
Cost-efficient enterprise model with frontier-class performance for business workloads.
Alibaba
Alibaba's trillion-parameter flagship multilingual reasoning model.
Moonshot AI
Long-context open agentic model from Moonshot, strong on tool use and coding.
Zhipu AI
Open bilingual frontier model from Zhipu, competitive on coding and reasoning.
Eigen AI
Agentic RL vision-language model for tool-integrated visual reasoning.
Shopify
Generative AI across the Shopify admin — product descriptions, emails, blog posts and image edits.
Shopify
Embeddings-based product search powering natural-language storefront discovery.
Shopify
Personal shopping assistant in the Shop app, recommending and tracking orders across merchants.
Zendesk
Agent-side AI copilot suggesting replies, summaries and next actions in real time.
Zendesk
Agentic CX platform (post-Ultimate.ai) for end-to-end automated customer resolutions.
Zendesk
AutoQA AI that scores 100% of support conversations across voice and chat.
Twilio
Speech-to-text, summaries and language operators that analyze every call in real time.
Twilio
Build conversational AI agents over SMS, voice and WhatsApp grounded in Segment data.
Twilio Segment
AI-powered CDP predictions joining warehouse data to real-time activation.
Broadcom / Symantec
AI-driven data loss prevention classifying sensitive content across cloud, email and endpoints.
Broadcom
GenAI for agile planning — story generation, sprint summaries and risk forecasting.
Broadcom / VMware
Private AI services for VCF — model serving, RAG and vector DB on-prem.
Microsoft
Role-based Copilot inside Outlook & Teams pulling CRM context from Dynamics 365 and Salesforce.
Microsoft
Frontline copilot for contact center agents inside Dynamics 365 Customer Service.
Microsoft / Nuance
Ambient AI scribe for clinicians that drafts notes and orders from doctor-patient conversations.
GitHub / Microsoft
Agentic dev environment that plans, edits and tests entire features from a GitHub issue.
Palo Alto Networks
AI Runtime Security — protects models, agents and data across enterprise AI deployments.
Palo Alto Networks
Unified AI-driven CNAPP + CDR converging Prisma Cloud and Cortex into one platform.
Cloudflare
Serverless GPU inference platform running open models at the edge.
Cloudflare
Observability, caching and rate-limiting proxy for any LLM provider.
Akamai
Distributed-edge inference platform built on the Akamai Connected Cloud.
Stripe
ML-based fraud detection trained on the global Stripe payments network.
PayPal
Personalized AI recommendations and cashback on merchant receipts.
Block
AI assistant for sellers — answers business questions from Square sales data.
Coinbase
Toolkit letting AI agents transact on-chain with wallets, USDC and smart contracts.
Robinhood
AI investing companion delivering market insights to Robinhood Gold customers.
Spotify
Personalized AI DJ that curates and narrates listening sessions in a realistic voice.
Conversational search that synthesizes answers from authentic Reddit discussions.
Snap
GPT-powered chatbot inside Snapchat with vision and Snap Map awareness.
GenAI ads platform that builds creative and optimizes targeting automatically.
Uber
In-app GenAI assistant guiding riders and drivers through Uber and Uber Eats workflows.
DoorDash
Real-time AI moderation that detects harassment across Dasher-customer chats in 99 languages.
Synopsys
AI suite (DSO.ai, VSO.ai, TSO.ai) optimizing chip design across the EDA flow.
Cadence
Generative AI for digital chip implementation and verification across the Cadence flow.
Ansys
Cloud generative-AI app delivering near-instant simulation predictions for engineers.
Veeva Systems
Embedded AI agents and shortcuts across Veeva Vault and Commercial Cloud for life sciences.
Tencent
Tencent's deep-reasoning model, mamba-based and tuned for complex multi-step problems.
Baidu
Autonomous Driving Foundation Model powering Apollo Go robotaxis across China.
ByteDance
No-code bot platform for building, publishing and monetizing AI agents.
Writer
Palmyra X 003 is a top-performing instruct model built specifically for structured text completion rather than conversational use.
Moonshot AI
Moonshot AI's language model tracked by Epoch, focused on language modeling/generation.
Google DeepMind
Google DeepMind's mathematics model tracked by Epoch, focused on geometry.
Alibaba
Alibaba's multimodal, language, vision model tracked by Epoch, focused on chat.
Alibaba
In recent months, our focus has been on developing a “good” model while optimizing the developer experience.
Cohere for AI
Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages.
Google DeepMind
Google DeepMind's language, multimodal model tracked by Epoch, focused on language modeling.
Stability AI
Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos.
ByteDance
We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs.
Mistral AI
Mistral AI's language model tracked by Epoch, focused on chat.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Anthropic
Anthropic's multimodal, language, vision model tracked by Epoch, focused on chat.
Saudi Aramco
Saudi Aramco's language model tracked by Epoch, focused on language modeling/generation.
Inflection AI
At Inflection, our mission is to create a personal AI for everyone.
Tsinghua University
Performing language-conditioned robotic manipulation tasks in unstructured environments is highly demanded for general intelligent robots.
Apple
In this work, we discuss building performant Multimodal Large Language Models (MLLMs).
Apple
Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds.
OpenAI
Today, we shared dozens of new additions and improvements, and reduced pricing across many parts of our platform.
Reka AI
We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka.
Meta
Meta's language model tracked by Epoch, focused on chat.
NVIDIA
Visual language models (VLMs) rapidly progressed with the recent success of large language models.
01.AI
01.AI's language model tracked by Epoch, focused on chat.
University of California (UC) Berkeley
University of California (UC) Berkeley's robotics model tracked by Epoch, focused on robotic manipulation.
Zhipu AI
We introduce ChatGLM, an evolving family of large language models that we have been developing over time.
Saudi Data and Artificial Intelligence Authority
We present ALLaM: Arabic Large Language Model, a series of large language models to support the ecosystem of Arabic Language Technologies (ALT).
Alibaba
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2.
NVIDIA
High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences.
Stanford University
Stanford University's robotics, vision, language model tracked by Epoch, focused on robotic manipulation.
NVIDIA
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward.
DeepSeek
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
Anthropic
This addendum to our Claude 3 Model Card describes Claude 3.5 Sonnet, a new model which outperforms our previous most capable model, Claude 3 Opus, while operating faster and at a lower cost.
New York University (NYU)
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach.
EvolutionaryScale
More than three billion years of evolution have produced an image of biology encoded into the space of natural proteins.
Baidu
Baidu's multimodal, language, vision model tracked by Epoch, focused on vision-language generation.
SenseTime
SenseTime's multimodal, language, vision model tracked by Epoch, focused on vision-language generation.
Mistral AI
We're contributing Mathstral to the science community to bolster efforts in advanced mathematical problems requiring complex, multi-step logical reasoning.
DeepL
DeepL's language model tracked by Epoch, focused on translation.
Meta
Modern artificial intelligence (AI) systems are powered by foundation models.
Apple
Apple's language model tracked by Epoch, focused on language modeling/generation.
Apple
Apple's language model tracked by Epoch, focused on language modeling/generation.
ByteDance
We present LLaVA-OneVision, a family of open large multimodal models (LMMs) developed by consolidating our insights into data, models, and visual representations in the LLaVA-NeXT blog series.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
Google DeepMind
Achieving human-level speed and performance on real world tasks is a north star for the robotics research community.
xAI
Grok-2 is our frontier language model with state-of-the-art reasoning capabilities.
AI21 Labs
We present Jamba-1.5, new instruction-tuned large language models based on our Jamba architecture.
Inspur
Inspur's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
At the KDD International Conference on Data Mining and Knowledge Discovery, the Zhipu GLM team unveiled the new generation of base large model—GLM-4-Plus.
Tencent
Tencent's language model tracked by Epoch, focused on language modeling/generation.
Harrison.ai
Harrison.ai's vision, medicine, language, multimodal model tracked by Epoch, focused on visual question answering.
Google DeepMind
Computational design of protein-binding proteins is a fundamental capability with broad utility in biomedical research and biotechnology.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
In the past three months since Qwen2’s release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback.
Alibaba
Qwen2.5 is the latest series of Qwen large language models.
Tsinghua University
Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours.
China Telecom
China Telecom's language model tracked by Epoch, focused on language modeling/generation.
ByteDance
PixelDance V1.4 is a video generation model developed by the ByteDance Research team, using the DiT structure.
Meta
Meta's multimodal, vision, language model tracked by Epoch, focused on visual question answering.
Meta
We present Movie Gen, a cast of foundation models that generates high-quality, 1080p HD videos with different aspect ratios and synchronized audio.
ByteDance
We present GR-2, a state-of-the-art generalist robot agent for versatile and generalizable robot manipulation.
Writer
Palmyra X4 boasts state-of-the-art reasoning through novel training techniques.
Tsinghua University
Tsinghua University's robotics model tracked by Epoch, focused on robotic manipulation.
Chai Discovery
We introduce Chai-1, a multi-modal foundation model for molecular structure prediction that performs at the state-of-the-art across a variety of tasks relevant to drug discovery.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
ByteDance
A professional-grade, self-developed LLM supporting up to 128k tokens, enabling fine-tuning across the entire series.
ByteDance
We introduce SeedEdit, a diffusion model that is able to revise a given image with any text prompts.
Google DeepMind
Google DeepMind's language model tracked by Epoch, focused on language modeling.
Moonshot AI
Artificial general intelligence start-up Kimi, owned by Chinese AI start-up Moonshot AI, on Saturday launched its first reasoning AI model k0-math.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
NVIDIA
Fugatto is a versatile audio synthesis and transformation model capable of following free-form text instructions with optional audio inputs.
Amazon
A highly capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
NVIDIA
Visual language models (VLMs) have made significant advances in accuracy in recent years.
ByteDance
We present Infinity, a bitwise visual autoregressive model capable of generating high-resolution, photorealistic images following language instructions.
OpenAI
Our video generation model is rolling out at sora.com.
LG AI Research
This technical report introduces the EXAONE 3.5 instruction-tuned language models, developed and released by LG AI Research.
Google DeepMind
Today, we’re releasing an experimental version of Gemini 2.0 Pro that responds to that feedback.
Meta AI
Despite the rapid integration of video perception capabilities into Large Multimodal Models (LMMs), the underlying mechanisms driving their video understanding remain poorly understood.
Google DeepMind
Google DeepMind's video, vision model tracked by Epoch, focused on video generation.
University of Southern California
We present STORM, a spatio-temporal reconstruction model designed for reconstructing dynamic outdoor scenes from sparse observations.
Stability AI
We study the problem of single-image 3D object reconstruction.
Prime Intellect
INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning.
NVIDIA
Recently, promising progress has been made by open-source vision-language models (VLMs) in bringing their capabilities closer to those of proprietary frontier models.
Moonshot AI
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
OpenAI
Today we introduced a research preview of Operator, an agent that can go to the web to perform tasks for you.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
OpenAI
We’re releasing OpenAI o3-mini, the newest, most cost-efficient model in our reasoning series, available in both ChatGPT and the API today.
Tsinghua University
Tsinghua University's mathematics model tracked by Epoch, focused on mathematical reasoning.
xAI
We are pleased to introduce Grok 3, our most advanced model yet: blending strong reasoning with extensive pretraining knowledge.
Inception Labs
Today, we’re excited to announce that Mercury, our first general chat model, is available to support a wider range of text generation applications.
OpenAI
We advance AI capabilities by scaling two complementary paradigms: unsupervised learning and reasoning.
Alibaba
QwQ is the reasoning model of the Qwen series.
Mistral AI
Mistral OCR is an Optical Character Recognition API that sets a new standard in document understanding.
Tencent
As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model.
LG AI Research
We present EXAONE Deep series, which exhibits superior capabilities in various reasoning tasks, including math and coding benchmarks.
Baidu
In this report, we introduce ERNIE 4.5, a new family of large-scale multimodal models comprising 10 distinct variants.
OpenAI
We've developed a new series of AI models designed to spend more time thinking before they respond.
NVIDIA
Understanding and modeling lighting effects are fundamental tasks in computer vision and graphics.
DeepSeek
We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
OpenAI
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Huawei
We present Pangu Ultra, a Large Language Model (LLM) with 135 billion parameters and dense Transformer modules trained on Ascend Neural Processing Units (NPUs).
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
ByteDance
We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning.
Anthropic
Claude Sonnet 4 can understand nuanced instructions and context, recognize and correct its own mistakes, and create sophisticated analysis and insights from complex data.
DeepSeek
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
Alibaba
In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models.
Google DeepMind
Gemini 2.5 Pro Experimental is our most advanced model for complex tasks.
ByteDance
Seed1.6 is the latest general-purpose model series unveiled by the ByteDance Seed team.
Google DeepMind
Google DeepMind's earth science model tracked by Epoch, focused on weather forecasting.
LG AI Research
LG AI Research's vision, medicine model tracked by Epoch, focused on cancer diagnosis.
Google DeepMind
In this report, we introduce Gemini Embedding, a state-of-the-art embedding model leveraging the power of Gemini, Google's most capable large language model.
LG AI Research
This technical report introduces EXAONE 4.0, which integrates a Non-reasoning mode and a Reasoning mode to achieve both the excellent usability of EXAONE 3.5 and the advanced reasoning abilities of EXAONE Deep.
Alibaba
Today, we're announcing Qwen3-Coder, our most agentic code model to date.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Alibaba
Today, we are excited to announce the release of Qwen3, the latest addition to the Qwen family of large language models.
Kunlun Inc.
We introduce MindLink, a new family of large language models developed by Kunlun Inc.
Google DeepMind
To advance Gemini’s capabilities towards solving hard reasoning problems, we developed a novel reasoning approach, called Deep Think, that naturally blends in parallel thinking techniques during response generation.
Alibaba
We present Qwen-Image, an image generation foundation model in the Qwen series that achieves significant advances in complex text rendering and precise image editing.
Sapient Intelligence
Reasoning, the process of devising and executing complex goal-oriented action sequences, remains a critical challenge in AI.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Anthropic
Today we're releasing Claude Opus 4.1, an upgrade to Claude Opus 4 on agentic tasks, real-world coding, and reasoning.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
Text-to-Image: Generate high-quality images from simple or complex text descriptions.
Meituan Inc
We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities.
Alibaba
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving.
Alibaba
We present Qwen3-Omni, a single multimodal model that, for the first time, maintains state-of-the-art performance across text, image, audio, and video without any degradation relative to single-modal counterparts.
Google DeepMind
Our most capable vision-language model (VLM) reasons about the physical world, natively calls digital tools and creates detailed, multi-step plans to complete a mission.
OpenAI
Our latest video generation model is more physically accurate, realistic, and more controllable than prior systems.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
Ant Group
Ling-1T is the first flagship non-thinking model in the Ling 2.0 series, featuring 1 trillion total parameters with ≈ 50 billion active parameters per token.
Google DeepMind
We’re also introducing Veo 3.1, which brings richer audio, more narrative control, and enhanced realism that captures true-to-life textures.
MiniMax
Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.
Alibaba
We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks.
Moonshot AI
Today, we are introducing Kimi K2 Thinking, our best open-source thinking model.
Meta
Meta's recommendation model tracked by Epoch, focused on recommender systems.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
Today we’re upgrading the GPT‑5 series with the release of GPT‑5.1 Instant, our most-used model, now warmer, more intelligent, and better at following your instructions.
Physical Intelligence
We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL).
Shanghai AI Lab
Recent progress in large language models (LLMs) has moved the frontier from puzzle-solving to science-grade reasoning, the kind needed to tackle problems whose answers must stand against nature, not merely fit a rubric.
xAI
Today, we’re excited to launch two powerful new additions to the xAI API: Grok 4.1 Fast, our best tool-calling model with a 2M context window.
Google DeepMind
Today, we’re introducing Nano Banana Pro (Gemini 3 Pro Image), our new state-of-the-art image generation and editing model.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on mathematical reasoning.
Google DeepMind
We introduce SIMA 2, a generalist embodied agent that understands and acts in a wide variety of 3D virtual worlds.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's model tracked by Epoch, focused on language modeling/generation.
NVIDIA
We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid Mamba-Transformer language model.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
MiniMax
MiniMax's language model tracked by Epoch, focused on chat.
NAVER
Developed by Naver, South Korea’s leading AI research lab, this cutting-edge language model supports multimodal inputs and advanced reasoning.
NC AI
NC AI's language model tracked by Epoch, focused on language modeling/generation.
SK Telecom
SK Telecom's language model tracked by Epoch, focused on code generation.
Upstage
Solar Open is Upstage's flagship 102B-parameter large language model, trained entirely from scratch and released under the Solar-Apache License 2.0.
LG AI Research
K-EXAONE is a large-scale multilingual language model developed by LG AI Research.
Alibaba
We present Qwen3-Max-Thinking, our latest flagship reasoning model.
Alibaba
Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development.
Moonshot AI
We introduce Kimi K2.5, an open-source multimodal agentic model designed to advance general agentic intelligence.
OpenAI
OpenAI's language model tracked by Epoch, focused on language modeling/generation.
ByteDance
ByteDance's image generation, video, audio model tracked by Epoch, focused on video generation.
Alibaba
We are delighted to announce the official release of Qwen3.5, introducing the open weights of the first model in the Qwen3.5 series, namely Qwen3.5-397B-A17B.
xAI
xAI's language model tracked by Epoch, focused on language modeling/generation.
Zhipu AI
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering.
Google DeepMind
Last week, we released a major update to Gemini 3 Deep Think to solve modern challenges across science, research and engineering.
Alibaba
Over recent months, we have intensified our focus on developing foundation models that deliver exceptional utility and performance.
Cognition
We are sharing an early preview of our ongoing SWE-1.6 training run.
Google DeepMind
Today, we're introducing Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
NVIDIA
NVIDIA's language model tracked by Epoch, focused on language modeling/generation.
Anysphere
Composer 2 is a specialized model designed for agentic software engineering.
Zhipu AI
Zhipu AI's language model tracked by Epoch, focused on language modeling/generation.
Google DeepMind
Google DeepMind's audio model tracked by Epoch, focused on audio generation.
Anthropic
Anthropic's language model tracked by Epoch, focused on question answering.
Moonshot AI
Moonshot AI's language model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's image generation model tracked by Epoch, focused on image generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
OpenAI
OpenAI's multimodal, language, vision model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
DeepSeek
DeepSeek's language model tracked by Epoch, focused on language modeling/generation.
Anysphere
Anysphere's language model tracked by Epoch, focused on coding.
Hume AI
Empathic voice interface that perceives and generates emotional speech in real time.
Krea AI
Krea's in-house image model tuned for aesthetic control and real-time iteration.
World Labs
Fei-Fei Li's World Labs spatial intelligence model that generates explorable 3D worlds from a single image.
Liquid AI
Liquid AI's second-generation efficient foundation models built on liquid neural networks for on-device use.
Sakana AI
LLM-discovered preference optimization algorithm from Sakana's evolutionary research line.
Magic.dev
100M-token context model purpose-built for whole-repository software synthesis.
Poolside
Code-first foundation model trained with reinforcement learning from code-execution feedback.
Decart
Real-time generative world model that re-skins live video streams with text prompts.
Perplexity
Perplexity's in-house search-grounded LLM powering the Perplexity answer engine.
Inflection AI
Inflection's empathetic conversational assistant tuned for personal, supportive dialogue.
Kuaishou
Kuaishou's flagship text-to-video model with strong motion coherence and 1080p output.
Imbue
Imbue's research model trained from scratch for robust agentic reasoning and code.
Sarvam AI
First Indic-first foundation model optimized for 10 Indian languages and English.
Cohere
Massively multilingual open-weights model covering 23 languages from Cohere For AI.
Inception Labs
Diffusion-based LLM that generates code in parallel for order-of-magnitude latency gains.
Nous Research
Open-source aligned LLM family known for steerable, uncensored research use.
Runway
Runway's closed-source in-context video editing model that modifies existing videos while preserving untouched regions.
Meituan
Meituan LongCat's open-source audio-driven avatar video model for single- and multi-character human video generation.
Tencent
Tencent Hunyuan's open-source multilingual translation family for fast, instruction-following translation across 33 languages.
Alibaba
Alibaba Cloud's closed-source trillion-parameter flagship LLM for coding, reasoning, and enterprise agentic workflows.
Microsoft
Microsoft's open-source 3.8B text-to-image model focused on efficient training, fast high-res generation, and strong prompt adherence.
Stability AI
Stability AI's 2B text-to-audio diffusion model for higher-capacity music, sound-effect generation, and audio editing.
Cohere
Cohere's open-source W4A4-quantized vision-language reasoning model for agentic, multilingual, tool-use enterprise tasks.
Google DeepMind
Google DeepMind's closed-source multimodal video creation and editing model that generates or edits video from text, image, video, and audio references.
Deemos Technologies
Hyper3D OmniCraft Texture generates photorealistic, seamless, tileable PBR textures for 3D assets and design pipelines.
Google DeepMind
Google DeepMind's closed-source natively multimodal reasoning model for fast, high-capability agentic and coding tasks.
Alibaba
Alibaba's vision-enhanced real-time audio/video translation model for live multilingual interpretation across 60 languages.
NVIDIA
NVIDIA's open 14B text-generation LM supporting autoregressive, diffusion-style parallel, and self-speculative decoding.
Mirelo AI
Mirelo's text-to-sound-effects model for production-ready Foley, ambience, and SFX generation.
Meta
Meta's audio generation model focused on high-fidelity waveform synthesis and speech-music co-generation.
ByteDance
ByteDance's foundation model for fast multimodal content creation across short-form video pipelines.
Odyssey
Odyssey's interactive world model for real-time AI-generated explorable video environments.
Sapient Intelligence
Sapient's 1B Hierarchical Reasoning Model for compact, structured chain-of-thought text generation.
Resemble AI
Resemble AI's expressive multi-character voice acting model for long-form dramatic dialogue and narration.
Stability AI
Stability AI's compact text-to-sound-effects diffusion model optimized for low-latency on-device SFX generation.
Stability AI
Stability AI's compact text-to-music diffusion model tuned for short, license-friendly musical loops and stems.