Nvidia Releases Nemotron 3 Super, a 120B Open AI Model Built for Agentic Workloads

Image source: Nvidia blog.

The model's Multi-Token Prediction layers, built from two shared-weight prediction heads, speed up chain-of-thought generation and enable native speculative decoding. On structured tasks, Nvidia reports up to three times faster generation.
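The draft-and-verify idea behind speculative decoding can be illustrated with a toy sketch. This is not Nvidia's MTP implementation; the draft and target "models" below are hypothetical stand-in functions that share a trivial rule, so proposals are usually accepted.

```python
# Toy sketch of speculative decoding; draft_propose and target_next are
# hypothetical stand-ins, not Nvidia's MTP heads.

def draft_propose(prefix, k=2):
    """Cheap draft step: propose k candidate tokens ahead."""
    out = []
    cur = prefix[-1]
    for _ in range(k):
        cur = cur + 1          # stand-in for a small model's prediction
        out.append(cur)
    return out

def target_next(prefix):
    """Expensive target model: the authoritative next token (same toy
    rule here, so the draft usually agrees)."""
    return prefix[-1] + 1

def speculative_step(prefix, k=2):
    """Verify k drafted tokens against the target model, keep the longest
    accepted prefix, then append one token from the target model."""
    cur = list(prefix)
    for tok in draft_propose(prefix, k):
        if target_next(cur) == tok:
            cur.append(tok)    # drafted token confirmed, accept it
        else:
            break              # first mismatch ends acceptance
    cur.append(target_next(cur))  # target always contributes one token
    return cur

seq = speculative_step([1, 2, 3], k=2)
print(seq)  # -> [1, 2, 3, 4, 5, 6]: three tokens for one verify pass
```

When the draft agrees with the target, each verification pass yields several tokens instead of one, which is where the reported speedups on structured output come from.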

The model was pre-trained on 25 trillion tokens across two phases: the first used 20 trillion tokens of broad data, and the second used five trillion high-quality tokens tuned for benchmark performance. A final long-context phase on 51 billion tokens extended the native context window to one million tokens. Post-training included supervised fine-tuning on roughly seven million samples and reinforcement learning across 21 environments with more than 1.2 million rollouts.

In benchmarks, Nemotron 3 Super scored 83.73 on MMLU-Pro, 90.21 on AIME25, and 60.47 on SWE-Bench using OpenHands. On PinchBench, it reached 85.6 percent, the highest reported score among open models in its class. On long-context evaluation, it scored 91.64 on RULER 1M.

Compared to GPT-OSS-120B, Nemotron 3 Super delivers 2.2 times the throughput at 8k input and 64k output. Against Qwen3.5-122B-A10B, that figure reaches 7.5 times. Nvidia also reports more than five times the throughput and up to two times the accuracy over the prior Nemotron Super generation.

Nvidia trained the model end-to-end in its NVFP4 four-bit floating-point format, optimized for Blackwell GPUs. On B200 hardware, Nvidia says inference runs up to four times faster compared to FP8 on H100 with no reported accuracy loss. Quantized FP8 and NVFP4 checkpoints retain 99.8 percent or more of full-precision accuracy.

The model also powers the Nvidia AI-Q research agent, which reached the top position on the DeepResearch Bench leaderboard.

Nemotron 3 Super is fully open under the Nvidia Nemotron Open Model License. Checkpoints in BF16, FP8, and NVFP4 formats, along with pre-training data, post-training samples, and reinforcement learning environments, are available on Hugging Face. Inference is supported through Nvidia NIM, build.nvidia.com, Perplexity, OpenRouter, Together AI, Google Cloud, AWS, Azure, and CoreWeave, with on-premises options via Dell Enterprise Hub and HPE.

Developers can access training recipes, fine-tuning guides, and inference cookbooks through the NeMo platform, with inference support via vLLM, SGLang, and TensorRT-LLM.
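For the vLLM path, serving a checkpoint typically looks like the sketch below. The Hugging Face model id is a placeholder, not the published one, and the flag values are illustrative; consult the official model card and cookbook for the actual id and recommended settings.

```shell
# Hypothetical sketch: serving a Nemotron checkpoint with vLLM's
# OpenAI-compatible server. "nvidia/nemotron-3-super" is a placeholder
# model id -- check Nvidia's Hugging Face model card for the real one.
vllm serve nvidia/nemotron-3-super \
  --tensor-parallel-size 2 \
  --max-model-len 131072
```

Once running, the server exposes standard OpenAI-compatible chat and completion endpoints, so existing client code can point at it without changes.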
