Back to Models

NVIDIA: Nemotron 3 Ultra

nemotron-3-ultra-550b-a55b

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Modalities

Input

text

Output

text
Pricing
Cost per 1 million tokens
Input
$0.6
Output
$3
Model Specs
Context Window
262,144
Max Output
16,384
Release Date
2026-06-04
Knowledge Cutoff
Capabilities
Reasoning
Tool Calling
Vision

Last Updated: 2026-06-04

Provider: