NVIDIA

Enterprise +

company

Verified

https://www.nvidia.com/

nvidia


Recent Activity

mingyuliutw updated a model about 1 hour ago

nvidia/Cosmos3-Nano-Policy-DROID

mingyuliutw updated a model about 1 hour ago

nvidia/Cosmos3-Super-Image2Video

mingyuliutw updated a model about 1 hour ago

nvidia/Cosmos3-Super-Text2Image


Papers

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching


Articles

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

1 day ago

• 2

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

1 day ago

• 35

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

1 day ago

• 11

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

5 days ago

• 70

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

14 days ago

• 29

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

18 days ago

• 21

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Apr 28

• 61

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Mar 19

• 47

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Mar 17

• 65

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

Mar 16

• 30

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

Mar 13

• 40

Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation

Mar 13

• 18

How NVIDIA AI-Q Reached #1 on DeepResearch Bench I and II

Mar 12

• 33

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

Mar 11

• 6

How NVIDIA Builds Open Data for AI

Mar 10

• 16

Deploying Open Source Vision Language Models (VLM) on Jetson

Feb 24

• 37

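「データ不足」の壁を越える：合成ペルソナが日本のAI開発を加速 (Japanese edition: "Breaking Through the 'Data Scarcity' Wall: Synthetic Personas Accelerate Japan's AI Development")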

Feb 19

• 2

From Scarcity to Scale: How Synthetic Personas Can Bootstrap Japanese AI Development

Feb 19

• 2

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル (Japanese edition: "A State-of-the-Art Small Language Model Supporting Japan's Sovereign AI")

Feb 17

• 25

NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI

Feb 17

• 3

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Feb 4

• 28

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Jan 29

• 48

Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI

Jan 28

• 11

Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI

Jan 27

• 10

NVIDIA Earth-2 Open Models Span the Whole Weather Stack

Jan 26

• 36

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Jan 6

• 28

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

Jan 5

• 24

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Jan 5

• 64

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

• 87

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Dec 17, 2025

• 50

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

• 111

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

Dec 2, 2025

• 26

How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare

Oct 28, 2025

• 20

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

Oct 28, 2025

• 35

Nemotron-Personas-USA: Synthesized Data for Sovereign AI

Oct 28, 2025

• 12

NVIDIA Isaac GR00T in LeRobot

Oct 28, 2025

• 29

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

• 21

NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks

Oct 28, 2025

• 17

Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI

Oct 28, 2025

• 21

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

Oct 22, 2025

• 11

Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard

Oct 21, 2025

• 14

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Oct 20, 2025

• 19

Nemotron-Personas-India: Synthesized Data for Sovereign AI

Oct 13, 2025

• 14

Nemotron-Personas-Japan: ソブリン AI のための合成データセット (Japanese edition: "Synthetic Dataset for Sovereign AI")

Sep 26, 2025

• 10

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

Sep 23, 2025

• 27

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Aug 20, 2025

• 19

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Aug 18, 2025

• 32

📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models

Aug 18, 2025

• 5

NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual

Aug 18, 2025

• 4

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11, 2025

• 76

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

Aug 4, 2025

• 5

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

Jul 21, 2025

• 5

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18, 2025

• 51

Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval

Jul 9, 2025

• 4

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Jun 27, 2025

• 31

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

Jun 17, 2025

• 9

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Jun 11, 2025

• 133

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

Jun 10, 2025

• 7

Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions

Jun 10, 2025

• 25

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Jun 4, 2025

• 23

Mastering Long Contexts in LLMs with KVPress

Jan 23, 2025

• 77


Organization Card


This is NVIDIA's home for open model weights, datasets, and interactive demos. Everything here is designed to give developers and researchers production-ready starting points for Generative AI, Physical AI, and agentic workflows – backed by the same research that powers NVIDIA's enterprise AI platform.

The Nemotron Family: Digital Intelligence

The Nemotron family is NVIDIA's lineup of purpose-built foundation models spanning language, reasoning, vision, retrieval, speech, and safety. Each model targets a specific performance profile, from ultra-efficient edge inference to heavyweight multi-turn agent orchestration, and ships with open weights, open datasets, and reproducible training recipes.

Language & Reasoning (Nemotron 3)

The core language model lineup, engineered for advanced reasoning and agentic tasks across a range of model sizes and deployment targets.

Nemotron 3 Nano: A highly efficient small language model (SLM) built on a hybrid Mamba-2 + Transformer MoE architecture (30B total / 3B active parameters), optimized for on-device agentic tasks. Features a 1M-token context window, reasoning ON/OFF modes with configurable thinking budgets, and up to 4× faster inference than its predecessor. Served via vLLM and SGLang.
Nemotron 3 Super: A hybrid Mamba-Transformer Mixture-of-Experts (MoE) model with 120B total / 12B active parameters per forward pass. To maximize compute efficiency and accuracy, it incorporates LatentMoE, Multi-Token Prediction (MTP) layers, and native NVFP4 pretraining. Designed for complex multi-agent applications, it maintains a 1M-token context window and delivers up to 5× higher throughput than the previous Nemotron Super.
Nemotron 3 Ultra: A frontier-scale large language model (550B total / 55B active parameters) utilizing a hybrid LatentMoE architecture with MTP layers. Designed for the most demanding workloads, it delivers strong agentic, reasoning, and conversational capabilities for complex multi-step agents, long-context analysis, and high-stakes reasoning over code, math, and science.
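The total / active parameter pairs quoted for the three models can be compared with a little arithmetic. The sketch below is illustrative only: the class and function names are hypothetical, and the figures are copied from the descriptions above. In a Mixture-of-Experts model, per-token compute tends to track the active parameters while memory footprint tracks the total, so the ratio is a rough indicator of sparsity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoEModel:
    name: str
    total_params_b: float   # total parameters, in billions (from the card)
    active_params_b: float  # parameters active per forward pass, in billions

    @property
    def active_fraction(self) -> float:
        """Share of the weights that participate in a single forward pass."""
        return self.active_params_b / self.total_params_b


# Figures copied from the model descriptions above.
NEMOTRON_3 = [
    MoEModel("Nemotron 3 Nano", 30, 3),
    MoEModel("Nemotron 3 Super", 120, 12),
    MoEModel("Nemotron 3 Ultra", 550, 55),
]


def summarize(models):
    """Map each model name to its active-parameter fraction, rounded to 3 places."""
    return {m.name: round(m.active_fraction, 3) for m in models}


if __name__ == "__main__":
    for name, frac in summarize(NEMOTRON_3).items():
        print(f"{name}: {frac:.0%} of parameters active per forward pass")
```

All three models land on the same ratio: one tenth of the weights are active per forward pass.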

Safety & Content Moderation

Models purpose-built for enterprise-grade safety and alignment workflows.

Nemotron 3.5 Content Safety: A multimodal, multilingual small language model (4B parameters) designed to act as a robust content-safety moderator. It supports standard taxonomy safety classification as well as custom-policy enforcement with reasoning traces for text and image inputs.

Speech & Multimodal

NVIDIA provides a broad set of specialized multimodal foundations that integrate seamlessly with the Nemotron ecosystem — spanning speech recognition, multilingual translation, vision-language understanding, and real-time voice AI. These models are optimized for both cloud and edge deployment.

Speech Recognition & Translation (Nemotron Speech)

State-of-the-art, production-ready speech models from the NVIDIA NeMo Speech research team for ASR, TTS, speaker diarization, and speech-to-speech.

Parakeet: A family of FastConformer-based ASR (Automatic Speech Recognition) models achieving state-of-the-art WER (Word Error Rate). The latest parakeet-tdt-0.6b-v3 extends support to 25 European languages with automatic language detection, while the 1.1B English variant delivers maximum transcription accuracy.
Canary: Multilingual and multitask speech models capable of simultaneous translation and transcription across 25 languages, trained on NVIDIA's Granary dataset.
Nemotron Speech Streaming: A cache-aware streaming ASR model with native punctuation and capitalization, offering configurable chunk sizes (80ms–1120ms) for low-latency voice agent pipelines.
Nemotron 3.5 ASR Streaming 0.6B: A highly accurate, low-latency streaming automatic speech recognition (ASR) model capable of fast, real-time transcription across multiple languages, optimized for efficient voice AI pipelines.
Parakeet Realtime EOU: A lightweight 120M-parameter streaming ASR model with built-in end-of-utterance detection at 80–160ms latency, purpose-built for voice AI agent turn-taking.
Multitalker Parakeet: Streaming multi-speaker ASR using speaker kernel injection. It handles fully overlapped speech without requiring speaker enrollment audio.
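To make the chunk-size range quoted for Nemotron Speech Streaming concrete, here is a minimal sketch of how a chunk duration translates into samples and into the number of model invocations per second of audio. It does not call any NVIDIA model. The 16 kHz sample rate and both function names are assumptions made for illustration, not details from the card.

```python
from typing import Iterator, Sequence

SAMPLE_RATE_HZ = 16_000  # assumption: a common ASR input rate, not stated in the card


def samples_per_chunk(chunk_ms: int, sample_rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Number of audio samples covered by one chunk of chunk_ms milliseconds."""
    return sample_rate_hz * chunk_ms // 1000


def iter_chunks(audio: Sequence[float], chunk_ms: int,
                sample_rate_hz: int = SAMPLE_RATE_HZ) -> Iterator[Sequence[float]]:
    """Yield consecutive chunks of audio; the final chunk may be shorter."""
    if not 80 <= chunk_ms <= 1120:
        raise ValueError("chunk_ms is outside the 80-1120 ms range quoted above")
    step = samples_per_chunk(chunk_ms, sample_rate_hz)
    for start in range(0, len(audio), step):
        yield audio[start:start + step]


if __name__ == "__main__":
    one_second = [0.0] * SAMPLE_RATE_HZ  # one second of silence
    for ms in (80, 160, 1120):
        chunks = list(iter_chunks(one_second, ms))
        print(f"{ms} ms -> {samples_per_chunk(ms)} samples/chunk, {len(chunks)} chunks")
```

Smaller chunks give the recognizer audio sooner, at the cost of more invocations per second; larger chunks reduce the call count but delay the first partial transcript.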

Vision & Document Intelligence

Vision-language models that bring multimodal understanding to documents, images, and video, from OCR and chart parsing to visual Q&A.

Nemotron Parse: A general-purpose document parsing model that overcomes the shortcomings of traditional OCR technologies by deeply understanding complex layouts. It seamlessly extracts structured formatting, tables (as Markdown/LaTeX), bounding-boxes, and semantic classes from highly unstructured PDFs and images.
Nemotron 3 Omni: A unified multimodal large language model that seamlessly ingests video, audio, image, and text to support enterprise-grade Q&A, transcription, and document intelligence workflows, effectively handling complex reasoning across rich media.

Nemotron RAG

A complete, modular retrieval-augmented generation stack — from document ingestion through semantic search to precision reranking — designed for production pipelines that handle text, images, and complex multimodal documents.

Extract: Models for ingesting and extracting structured information from multimodal sources, including charts, tables, and scanned documents (e.g. Nemotron Parse v1.2, Nemotron OCR and Object Detection).
Embed: Multimodal embedding models that map text, images, or audio into shared semantic vector spaces for search and retrieval (e.g. Llama Nemotron Embed VL 1B v2).
Rerank: Cross-encoder reranking models that rescore retrieved candidates using deeper relevance modeling (e.g. Llama Nemotron Rerank VL 1B v2).
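The three stages above compose into a pipeline: extract passages, embed them for a fast first-pass search, then rerank the shortlist with a more careful score. The toy sketch below shows only that control flow. Every function is a hypothetical stand-in built from the standard library (a bag-of-words vector instead of a learned embedding, term overlap instead of a cross-encoder), so it says nothing about how the Nemotron RAG models work internally.

```python
import math
import re
from collections import Counter


def extract(raw: str) -> list[str]:
    """Stand-in for the Extract stage: split raw text into passages on blank lines."""
    return [p.strip() for p in re.split(r"\n\s*\n", raw) if p.strip()]


def embed(text: str) -> Counter:
    """Stand-in for the Embed stage: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def retrieve(query: str, passages: list[str], k: int) -> list[str]:
    """First pass: rank every passage by embedding similarity and keep the top k."""
    q = embed(query)
    return sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)[:k]


def rerank(query: str, candidates: list[str]) -> list[str]:
    """Stand-in for the Rerank stage: score query and passage jointly.

    The joint score here is the fraction of query terms found in the passage.
    """
    q_terms = set(embed(query))

    def score(passage: str) -> float:
        return len(q_terms & set(embed(passage))) / max(len(q_terms), 1)

    return sorted(candidates, key=score, reverse=True)


if __name__ == "__main__":
    doc = ("Parakeet is a speech recognition model.\n\n"
           "Cosmos generates video for robot simulation.\n\n"
           "The reranker rescores retrieved candidates.")
    shortlist = retrieve("robot video generation", extract(doc), k=2)
    print(rerank("robot video generation", shortlist)[0])
```

The split matters because the first pass has to touch every passage and so must be cheap, while the reranker only sees the shortlist and can afford a costlier comparison.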

Community Collaborations

NVIDIA releases optimized and aligned versions of leading community architectures, leveraging proprietary alignment techniques (SteerLM, RLHF, RPO) and open datasets like HelpSteer2 to push open models further.

Llama-3.1-Nemotron: A diverse family of models where Llama 3.1 architectures are fine-tuned using NVIDIA's HelpSteer2 datasets to improve helpfulness and instruction adherence. Includes Ultra (253B), Super (49B), and Nano (8B) variants.
Mistral-NeMo: A 12B parameter model built in collaboration with Mistral AI, offering a high performance-to-size ratio and an expanded 128k context window.

Physical AI

NVIDIA Cosmos

NVIDIA Cosmos is a platform of generative World Foundation Models (WFMs), tokenizers, and data curation tools — purpose-built to model and simulate physical interactions for robotics and autonomous systems.

Cosmos Tokenizer: A suite of high-compression visual tokenizers for images and videos, achieving up to 2048× total compression at up to 12× faster than prior SOTA. Available in Continuous and Discrete variants, enabling efficient encoding/decoding for both diffusion and autoregressive modeling.
Cosmos Predict 2.5: Diffusion-transformer based models for generating high-fidelity, physics-aware images and videos from text, image, or video inputs. Available in 2B and 14B variants.
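As a worked example of what a 2048× total compression figure can mean for a video tokenizer, the sketch below assumes the factor decomposes into 8× along time and 16× along each spatial axis (8 × 16 × 16 = 2048). That decomposition, the function names, and the clip dimensions are assumptions for illustration; the card states only the total. The ratio counts grid positions, not bytes, and real tokenizers may treat the first frame differently.

```python
def total_compression(t_factor: int = 8, s_factor: int = 16) -> int:
    """Ratio of input positions (frames x pixels) to latent positions."""
    return t_factor * s_factor * s_factor


def latent_shape(frames: int, height: int, width: int,
                 t_factor: int = 8, s_factor: int = 16) -> tuple[int, int, int]:
    """Latent grid size after dividing each axis by its compression factor.

    Assumes every dimension is an exact multiple of its factor.
    """
    if frames % t_factor or height % s_factor or width % s_factor:
        raise ValueError("dimensions must be multiples of the compression factors")
    return frames // t_factor, height // s_factor, width // s_factor


if __name__ == "__main__":
    frames, height, width = 32, 720, 1280  # hypothetical 720p clip
    t, h, w = latent_shape(frames, height, width)
    print("latent grid:", (t, h, w))
    print("ratio:", (frames * height * width) // (t * h * w))
```

Under these assumptions a 32-frame 720p clip maps to a 4 × 45 × 80 latent grid, which is the 2048-fold reduction in positions.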

NVIDIA GR00T

GR00T N1.5 VLA: NVIDIA's open foundation model for humanoid robot reasoning and control. Combines an Eagle-based vision-language backbone with a diffusion transformer (DiT) action head for language-conditioned manipulation across diverse embodiments. We have integrated GR00T N1.5 into LeRobot for policy post-training and deployment.

IsaacLab-Arena

IsaacLab-Arena: An open-source framework for large-scale, GPU-accelerated robot policy evaluation in simulation, built on top of IsaacLab. Provides modular APIs for task curation, automated diversification, and parallel benchmarking across embodiments and environments. We have integrated IsaacLab-Arena into LeRobot for scalable closed-loop policy evaluation and benchmarking, along with datasets and 250+ scenes from our partner Lightwheel AI, on the Hugging Face Hub.

Nemotron Datasets

Every model NVIDIA ships rests on a data layer — and that data shapes how the model reasons, what it knows, and where it can be safely deployed. Nemotron Datasets are the open version of that foundation: web-scale pretraining corpora, alignment and reasoning data, multimodal grounding, and embodied AI simulation, released under permissive licenses with the training recipes and evaluation frameworks that produced them. Beyond Nemotron, NVIDIA's broader open data catalog spans 200+ releases across Physical AI and robotics, autonomous vehicles, biology and drug discovery, retrieval and evaluation benchmarks, and sovereign AI. Use the table below to find the right starting point for what you're trying to build.

Which dataset collection should I use?

If you want to...	Use this Collection	Start with these datasets
FOUNDATION
Pre-train a base model	Nemotron Pre-Training Collection	Nemotron-Pretraining-Legal-v1, Nemotron-Pretraining-Specialized-v1.2, Nemotron-Pretraining-Code-v3
BUILD A CAPABILITY
Math reasoning, proofs, and quantitative problem-solving	Nemotron Math & Reasoning Collection	Nemotron-SFT-Math-v3, Nemotron-Math-v2, AceReason-Math, Nemotron-CC-Math-v1
Code generation, debugging, and SWE workflows	Nemotron Code & SWE Collection	Nemotron-SFT-Competitive-Programming-v2, Nemotron-SFT-SWE-v2, Nemotron-CC-Code-v1
Helpful, multi-turn, instruction-following chat	Nemotron Chat & Instruction Collection	Nemotron-SFT-Instruction-Following-Chat-v2, Nemotron-RL-instruction_following
Agentic and tool-use behavior	Nemotron Agentic Collection	Nemotron-SFT-Agentic-v2, Nemotron-RL-Agentic-Function-Calling-Pivot-v1
Safety, refusals, and content moderation	Nemotron Safety Collection	Aegis-AI-Content-Safety-Dataset-2.0, Nemotron-Safety-Guard-Dataset-v3, Nemotron-PII
Image and document understanding	Nemotron Vision-Language Collection	Nemotron-VLM-Dataset-v2, Llama-Nemotron-VLM-Dataset-v1
TRAINING STAGES
RL data with verifiable rewards (math, code, agentic, instruction)	Nemotron Reinforcement Learning Collection	Nemotron-RL-math-OpenMathReasoning, Nemotron-RL-coding-competitive_coding, Nemotron-RL-Agentic-Function-Calling-Pivot-v1
Train a reward model	Nemotron Reward Modeling Collection	HelpSteer3, HelpSteer2, Nemotron-RLHF-GenRM-v1
Full post-training recipe (SFT + RL blend)	Nemotron Post-Training Blends Collection	Llama-Nemotron-Post-Training-Dataset, Nemotron-Post-Training-Dataset-v2, Nemotron-Cascade-2-SFT-Data
Evaluate model performance	Nemotron Eval & Benchmark Collection	SPEED-bench
SPECIALIZED & SOVEREIGN
Multilingual or domain-specific (e.g. finance) capability	Nemotron Supervised Fine-Tuning Collection	Nemotron-SFT-Multilingual-v1, Nemotron-SpecializedDomains-Finance-v1
Diverse synthetic personas grounded in real population distributions	Nemotron Personas Collection	Nemotron-Personas-USA / India / Japan / Brazil / France / Singapore / El Salvador / Vietnam
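The table can also be treated as a lookup from goal to starting datasets. The sketch below transcribes three of its rows as data and builds repository identifiers from them. The helper name is hypothetical, and the `nvidia/` prefix is an assumption based on how dataset names appear elsewhere on this page; an individual dataset may be published under a different identifier.

```python
# Three rows of the table above, transcribed as data:
# goal -> (collection name, starting datasets).
COLLECTIONS = {
    "pre-train a base model": (
        "Nemotron Pre-Training Collection",
        [
            "Nemotron-Pretraining-Legal-v1",
            "Nemotron-Pretraining-Specialized-v1.2",
            "Nemotron-Pretraining-Code-v3",
        ],
    ),
    "train a reward model": (
        "Nemotron Reward Modeling Collection",
        ["HelpSteer3", "HelpSteer2", "Nemotron-RLHF-GenRM-v1"],
    ),
    "evaluate model performance": (
        "Nemotron Eval & Benchmark Collection",
        ["SPEED-bench"],
    ),
}

ORG = "nvidia"  # assumption: each dataset lives under this Hub namespace


def starting_points(goal: str) -> list[str]:
    """Return assumed Hub repo IDs for the datasets listed against a goal."""
    _collection, names = COLLECTIONS[goal.strip().lower()]
    return [f"{ORG}/{name}" for name in names]


if __name__ == "__main__":
    for repo_id in starting_points("Train a reward model"):
        print(repo_id)
```

The resulting identifiers are the kind of string a Hub client takes when downloading a dataset; check each one against the collection page before relying on it.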

Collections 114


Spaces 60

NV-Generate Synthetic Medical Imaging

🧠

Synthetic 3D CT and MR generation with NVIDIA NV-Generate.

185

Music Flamingo

🎵

Analyze music and answer questions from audio or YouTube links

VoMP

🚀

Volumetric physics materials for interactive worlds

LLM RTL Coding Errors Explainer

🥇

NVR - How LLMs Fail and Generalize in RTL Coding

257

Kimodo

🏃

Generate high-quality motions from text prompts

KVPress Leaderboard

🥇

KVPress leaderboard: benchmark KV Cache compression methods


Models 832

nvidia/nemotron-3.5-asr-streaming-0.6b

Automatic Speech Recognition • Updated about 5 hours ago • 597 • 197

nvidia/omni-dreams-models

Image-to-Video • Updated about 5 hours ago • 17

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated about 6 hours ago • 1.6k • 4

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Text Generation • 561B • Updated about 7 hours ago • 9.13k • 118

nvidia/DeepSeek-V4-Pro-NVFP4

Text Generation • 910B • Updated about 8 hours ago • 7.67k • 58

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Text Generation • 335B • Updated about 9 hours ago • 7.42k • 109

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-GenRM

Text Generation • 561B • Updated about 9 hours ago • 55 • 9

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-Base-BF16

Text Generation • 561B • Updated about 9 hours ago • 557 • 20

nvidia/ArtiFixer

Updated 1 day ago • 8

nvidia/Nemotron-3.5-Content-Safety

4B • Updated 2 days ago • 111 • 14


Datasets 283

nvidia/Nemotron-RL-Ultra-Training-Blends

Viewer • Updated about 2 hours ago • 54.2k • 52 • 1

nvidia/Nemotron-RL-litmus-bench-v0.1

Viewer • Updated about 2 hours ago • 5.71k • 58 • 1

nvidia/Nemotron-SFT-SWE-v3

Viewer • Updated about 3 hours ago • 238k • 8

nvidia/Anchor-Lab

Viewer • Updated about 3 hours ago • 992k • 5

nvidia/simready-catalog

Updated about 6 hours ago • 1

nvidia/earth2studio-assets

Viewer • Updated about 6 hours ago • 14 • 2.11k • 3

nvidia/Nemotron-SFT-Multilingual-v2

Viewer • Updated about 11 hours ago • 370k • 45 • 1

nvidia/Nemotron-3.5-Content-Safety-Dataset

Viewer • Updated about 11 hours ago • 98.3k • 52 • 2

nvidia/Nemotron-Personas-Vietnam

Viewer • Updated about 14 hours ago • 100k • 101 • 17

nvidia/Nemotron-Personas-El-Salvador

Viewer • Updated about 16 hours ago • 148k • 1.43k • 31



Articles


Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI

Gemma 4 VLA Demo on Jetson Orin Nano Super

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

Building a Fast Multilingual OCR Model with Synthetic Data

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

Build a Domain-Specific Embedding Model in Under a Day

Nemotron 3 Content Safety 4B: Multimodal, Multilingual Content Moderation


Team members 4,316
