Nemotron 3 Super (free)
NVIDIA · Nemotron · Free · Released 2026-03-11

262K context · 120B total, 12B active

NVIDIA Nemotron 3 Super is a 120B-parameter hybrid Mamba-Transformer Mixture-of-Experts (MoE) model designed for efficiency and accuracy in complex multi-agent applications. It offers a 1M-token context window for long-term coherence, cross-document reasoning, and multi-step task planning. A latent MoE design activates only 12B of the 120B parameters per inference step, delivering high intelligence and generalization at reduced computational cost. Multi-environment reinforcement learning across more than ten environments improves its accuracy on benchmarks such as AIME 2025, TerminalBench, and SWE-Bench Verified.
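NVIDIA has not published the routing details behind the "latent MoE" wording, but the active/total split follows the usual sparse-expert pattern: a router selects a few expert networks per token, so only a fraction of the weights participates in any forward pass. The following is a generic top-k MoE sketch in Python/NumPy with made-up dimensions; it illustrates the technique, not Nemotron's actual implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions; hypothetical stand-ins, not Nemotron's real config.
d_model, d_ff = 64, 256
n_experts, top_k = 16, 2  # only top_k experts run per token

# One feed-forward "expert" = an up-projection and a down-projection.
experts = [
    (rng.standard_normal((d_model, d_ff)) * 0.02,
     rng.standard_normal((d_ff, d_model)) * 0.02)
    for _ in range(n_experts)
]
router = rng.standard_normal((d_model, n_experts)) * 0.02

def moe_layer(x: np.ndarray) -> np.ndarray:
    """Route each token to its top_k experts and mix their outputs."""
    logits = x @ router                            # (tokens, n_experts)
    top = np.argsort(logits, axis=-1)[:, -top_k:]  # chosen expert ids
    # Softmax over the selected experts' logits only.
    sel = np.take_along_axis(logits, top, axis=-1)
    gates = np.exp(sel - sel.max(-1, keepdims=True))
    gates /= gates.sum(-1, keepdims=True)

    out = np.zeros_like(x)
    for t in range(x.shape[0]):                    # per-token dispatch
        for k in range(top_k):
            w_in, w_out = experts[top[t, k]]
            h = np.maximum(x[t] @ w_in, 0.0)       # ReLU expert MLP
            out[t] += gates[t, k] * (h @ w_out)
    return out

tokens = rng.standard_normal((4, d_model))
y = moe_layer(tokens)
print(y.shape, f"active experts per token: {top_k}/{n_experts}")
```

With top_k = 2 of 16 experts, each token touches only an eighth of the expert weights, loosely mirroring the 12B-of-120B active ratio that keeps per-token compute well below that of a dense 120B model.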

What is Nemotron 3 Super (free)?

Nemotron 3 Super (free) is an AI model from NVIDIA that Agent Mag tracks for pricing, context window, modalities, benchmarks, and API compatibility. Builders can use this page to compare Nemotron 3 Super (free) against other models for agent workflows and production deployments.
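Pricing for this page comes from the OpenRouter API (see the note at the bottom), so the model is presumably reachable through OpenRouter's OpenAI-compatible endpoint. A minimal sketch, assuming the slug nvidia/nemotron-3-super:free (hypothetical, since this page does not list the actual model ID) and an OPENROUTER_API_KEY environment variable:

```python
import os
from openai import OpenAI  # pip install openai

# OpenRouter exposes an OpenAI-compatible API surface.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

resp = client.chat.completions.create(
    model="nvidia/nemotron-3-super:free",  # hypothetical slug; verify before use
    messages=[
        {"role": "system", "content": "You are a planning agent."},
        {"role": "user", "content": "Outline a three-step plan to triage a failing CI build."},
    ],
)
print(resp.choices[0].message.content)
```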

Architecture & Specifications
Architecture: Hybrid Mamba-Transformer Mixture-of-Experts (MoE)
Parameters: 120B total, 12B active
Tokenizer: Other
License: NVIDIA Open License
Released: 2026-03-11

Modalities
Input: text
Output: text
Supported Parameters
include_reasoning, max_tokens, reasoning, response_format, seed, structured_outputs, temperature, tool_choice, tools, top_p
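A short sketch exercising a few of these parameters together (temperature, max_tokens, seed, and a JSON response_format) against OpenRouter's chat completions endpoint; the field names follow OpenRouter's documented request schema, while the model slug remains the hypothetical one used above:

```python
import os
import requests

payload = {
    "model": "nvidia/nemotron-3-super:free",  # hypothetical slug; verify before use
    "messages": [{"role": "user", "content": 'Return {"status": "ok"} as JSON.'}],
    "temperature": 0.2,       # low randomness
    "max_tokens": 256,        # cap completion length
    "seed": 42,               # best-effort reproducibility
    "response_format": {"type": "json_object"},  # uses the structured-output support
}
r = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
    json=payload,
    timeout=60,
)
r.raise_for_status()
print(r.json()["choices"][0]["message"]["content"])
```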
Strengths
  • Efficient activation of 12B parameters for inference
  • 1M token context window for long-term coherence
  • Multi-environment reinforcement learning for improved accuracy
  • Latent MoE enabling cost-effective expert activation
  • High token generation rate compared to leading open models
Limitations
  • Not suitable for production or business-critical systems
  • Prompts and outputs are logged, raising privacy concerns
  • Limited accuracy in research-level physics reasoning (CritPt: 3.1%)
  • Relatively high hallucination rate (13.0%) in knowledge tasks
  • Lower performance in economically valuable tasks (GDPval-AA: 25.3%)
Recommended Use Cases
Multi-agent applications (see the tool-calling sketch after this list)
Cross-document reasoning
Long-term task planning
Scientific computing and coding
Interactive roleplay and storytelling
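For the multi-agent use case, the tools and tool_choice parameters listed above are the relevant hooks. A sketch of a single tool-call round trip, with a made-up search_docs tool and the same hypothetical model slug:

```python
import json
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

# One illustrative tool; the name and schema are invented for this sketch.
tools = [{
    "type": "function",
    "function": {
        "name": "search_docs",
        "description": "Search an internal document store.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

resp = client.chat.completions.create(
    model="nvidia/nemotron-3-super:free",  # hypothetical slug; verify before use
    messages=[{"role": "user", "content": "Find our retry-policy guidelines."}],
    tools=tools,
    tool_choice="auto",  # let the model decide whether to call the tool
)

msg = resp.choices[0].message
if msg.tool_calls:  # the model elected to call a tool
    call = msg.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:               # the model answered directly instead
    print(msg.content)
```

In a real agent loop you would execute the returned call, append a tool-role message with the result, and query the model again.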

Data enriched Apr 24, 2026. Pricing from OpenRouter API.