Rank #4 on AIDB
21 models in the AIDB database. Average AIDB score 90, top score 93, momentum index 161.5.
Containerized inference microservices for deploying optimized AI models anywhere.
Centralized car computer for AV, cockpit AI and infotainment.
NVIDIA's language model tracked by Epoch, focused on language modeling/generation.
World foundation models for physical AI simulation.
Edge robotics platform built on Blackwell for humanoid and physical AI.
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
Recently, promising progress has been made by open-source vision-language models (VLMs) in bringing their capabilities closer to those of proprietary frontier models.
We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4- 340B-Instruct, and Nemotron-4-340B-Reward.
Fugatto is a versatile audio synthesis and transformation model capable of following free-form text instructions with optional audio inputs.
Visual language models (VLMs) have made significant advances in accuracy in recent years.
We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid MambaTransformer language model.
NVIDIA's open LLM family for synthetic data and reasoning.
Managed AI training service on dedicated NVIDIA Hopper/Blackwell clusters.
Flagship datacenter GPU for trillion-parameter AI training and inference.
Visual language models (VLMs) rapidly progressed with the recent success of large language models.
NVIDIA's open 14B text-generation LM supporting autoregressive, diffusion-style parallel, and self-speculative decoding.
Open foundation model for humanoid robots.
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
Understanding and modeling lighting effects are fundamental tasks in computer vision and graphics.
NVIDIA's vision, language model tracked by Epoch, focused on language modeling/generation.
High-quality preference datasets are essential for training reward models that can effectively guide large language models (LLMs) in generating high-quality responses aligned with human preferences.