Nemotron 3 Super

262K context · $0.090/M input tokens · $0.450/M output tokens · 120B total parameters, 12B active

NVIDIA Nemotron 3 Super is a 120B-parameter hybrid Mixture-of-Experts (MoE) model designed for compute efficiency and accuracy in complex multi-agent applications. It features a hybrid Mamba-Transformer architecture with multi-token prediction (MTP) and a 1M token context window for long-term coherence, cross-document reasoning, and multi-step task planning. The model leverages latent MoE to activate only 12B parameters during inference, enabling high intelligence and generalization at reduced computational cost. It is trained across 10+ environments using multi-environment reinforcement learning and achieves leading accuracy on benchmarks such as AIME 2025, TerminalBench, and SWE-Bench Verified.
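As a rough illustration of the sparse-activation saving, the sketch below compares per-token compute for the 12B-active configuration against a hypothetical dense 120B forward pass. It uses the common ~2 FLOPs per parameter per token approximation and ignores routing and attention costs, so treat the resulting ~10x figure as indicative only.

    # Back-of-the-envelope compute comparison: sparse MoE vs. a dense
    # model of the same total size, at ~2 FLOPs per parameter per token.
    # Routing overhead and attention costs are ignored (illustrative only).
    TOTAL_PARAMS = 120e9    # 120B total parameters
    ACTIVE_PARAMS = 12e9    # ~12B parameters activated per token

    flops_dense = 2 * TOTAL_PARAMS   # hypothetical dense 120B forward pass
    flops_moe = 2 * ACTIVE_PARAMS    # sparse forward pass, active experts only

    print(f"Dense 120B:        ~{flops_dense:.1e} FLOPs/token")
    print(f"MoE (12B active):  ~{flops_moe:.1e} FLOPs/token")
    print(f"Compute reduction: ~{flops_dense / flops_moe:.0f}x")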

What is Nemotron 3 Super?

Nemotron 3 Super is an AI model from NVIDIA that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Nemotron 3 Super against other models for agent workflows and production deployments.
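At the listed OpenRouter rates ($0.090 per million input tokens, $0.450 per million output tokens), per-request cost works out as in the short sketch below; the token counts are illustrative, not measurements.

    # Per-request cost at the listed rates (USD per 1M tokens).
    INPUT_PRICE_PER_M = 0.090
    OUTPUT_PRICE_PER_M = 0.450

    def request_cost(input_tokens: int, output_tokens: int) -> float:
        """Return the USD cost of one request at the listed rates."""
        return (
            input_tokens / 1e6 * INPUT_PRICE_PER_M
            + output_tokens / 1e6 * OUTPUT_PRICE_PER_M
        )

    # Example: a long-context agent turn with 200K input and 4K output tokens.
    print(f"${request_cost(200_000, 4_000):.4f}")  # -> $0.0198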

Architecture & Specifications
Architecture: Hybrid Mamba-Transformer Mixture-of-Experts (MoE)
Parameters: 120B total, 12B active
Tokenizer: Other
License: NVIDIA Open License
Released: 2026-03-11

Modalities
Input: text
Output: text
Supported Parameters
frequency_penalty, include_reasoning, logit_bias, max_tokens, min_p, presence_penalty, reasoning, repetition_penalty, response_format, seed, stop, temperature, tool_choice, tools, top_k, top_p
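
Since this page sources pricing from the OpenRouter API, the model is presumably reachable through OpenRouter's OpenAI-compatible chat completions endpoint. The minimal sketch below exercises a few of the supported sampling parameters; the model slug nvidia/nemotron-3-super is an assumed placeholder, so verify the exact ID in the provider catalog before use.

    # Minimal chat-completion call through OpenRouter's OpenAI-compatible
    # endpoint, using several of the parameters listed above.
    import os
    import requests

    response = requests.post(
        "https://openrouter.ai/api/v1/chat/completions",
        headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
        json={
            "model": "nvidia/nemotron-3-super",  # assumed slug; verify before use
            "messages": [
                {"role": "user", "content": "Summarize the tradeoffs of MoE models."}
            ],
            "temperature": 0.7,
            "top_p": 0.95,
            "max_tokens": 512,
            "seed": 42,
        },
        timeout=60,
    )
    response.raise_for_status()
    print(response.json()["choices"][0]["message"]["content"])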
Strengths
  • Efficient compute with latent MoE activating only 12B parameters
  • 1M token context window for long-term coherence and reasoning
  • Multi-environment RL training for high accuracy across benchmarks
  • Multi-token prediction for faster token generation
  • Open customization and deployment under NVIDIA Open License
Limitations
  • Limited public information on training data sources and knowledge cutoff date
  • Relatively weak performance on some benchmarks, such as CritPt
  • High computational requirements when deploying the full 120B parameters
  • License trial-use restrictions may complicate some production deployments
  • Relatively low GDPval-AA score, indicating limited coverage of economically valuable tasks
Recommended Use Cases
  • Multi-agent applications requiring long-term coherence
  • Cross-document reasoning and multi-step task planning
  • Scientific reasoning and graduate-level problem solving
  • Agentic coding and terminal use (see the tool-calling sketch below)
  • Customizable AI deployment across workstations and cloud environments
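
Because tools and tool_choice appear among the supported parameters, agentic-coding and terminal-use workflows can attach function definitions to a request. A sketch follows, again assuming the hypothetical OpenRouter slug above; the run_shell tool schema is purely illustrative.

    # Sketch of a tool-calling request for an agentic/terminal-use workflow.
    # The model slug and the run_shell tool are illustrative assumptions.
    import os
    import requests

    tools = [{
        "type": "function",
        "function": {
            "name": "run_shell",
            "description": "Execute a shell command and return its output.",
            "parameters": {
                "type": "object",
                "properties": {"command": {"type": "string"}},
                "required": ["command"],
            },
        },
    }]

    response = requests.post(
        "https://openrouter.ai/api/v1/chat/completions",
        headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
        json={
            "model": "nvidia/nemotron-3-super",  # assumed slug; verify before use
            "messages": [{"role": "user", "content": "List the files in the repo root."}],
            "tools": tools,
            "tool_choice": "auto",
        },
        timeout=60,
    )
    response.raise_for_status()
    message = response.json()["choices"][0]["message"]
    print(message.get("tool_calls") or message["content"])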


Data enriched Apr 24, 2026. Pricing from OpenRouter API.