Nvidia published validated blueprints for AI data centers across three tiers, from 16-node RTX PRO clusters to eight-rack NVL72 deployments. The Enterprise Reference Architectures target agentic AI, physical AI, and trillion-parameter model training with specific design points.
Key facts
- Three tiers: RTX PRO (16-32 nodes), HGX (32-128 nodes), NVL72 (4-8 racks).
- NVL72 targets trillion-parameter models with exascale compute in a single rack.
- HGX claims up to 15x higher token throughput via Spectrum-X networking.
- RTX PRO optimized for PCIe environments in power-constrained data centers.
- Nvidia did not disclose pricing or specific power consumption figures.
Nvidia's Enterprise Reference Architectures (Enterprise RAs) provide validated, repeatable infrastructure designs for deploying AI factories in enterprise data centers. The documentation covers three distinct configurations, each targeting specific workload scales and hardware tiers.
Three Tiers, Three Use Cases
The RTX PRO AI Factory targets space- and power-constrained data centers using PCIe-based NVIDIA RTX PRO Servers. It offers 16- and 32-node design points optimized for generative AI, agentic AI, data analytics, visual computing, and engineering simulation. This is the entry point for enterprises not ready for full-scale HGX deployments.
The HGX AI Factory scales to 32-, 64-, and 128-node configurations using NVIDIA HGX systems with Spectrum-X networking. The rail-optimized design claims up to 15x higher token throughput, targeting multi-node training and inference at scale.
The NVL72 AI Factory is the flagship: designed for trillion-parameter models, delivering exascale computing within a single rack. Deployment centers on four- and eight-rack configurations, built on a flexible, rail-optimized network architecture.
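To make the scale range concrete, here is a minimal sketch of the design points described above. The GPU counts per unit are assumptions for illustration only (8 PCIe GPUs per RTX PRO server, 8 GPUs per HGX node, 72 GPUs per NVL72 rack, matching the product name); Nvidia's reference documents define the actual bills of materials.

```python
# Hypothetical model of the three Enterprise RA tiers and their design points.
# Per-unit GPU counts are illustrative assumptions, not published specs.

from dataclasses import dataclass

@dataclass
class DesignPoint:
    tier: str
    units: int          # nodes (RTX PRO, HGX) or racks (NVL72)
    gpus_per_unit: int  # assumed GPU count per node or rack

    @property
    def total_gpus(self) -> int:
        return self.units * self.gpus_per_unit

DESIGN_POINTS = [
    DesignPoint("RTX PRO", 16, 8),   # entry point, PCIe servers
    DesignPoint("RTX PRO", 32, 8),
    DesignPoint("HGX", 32, 8),
    DesignPoint("HGX", 64, 8),
    DesignPoint("HGX", 128, 8),
    DesignPoint("NVL72", 4, 72),     # units here are racks, not nodes
    DesignPoint("NVL72", 8, 72),
]

for dp in DESIGN_POINTS:
    print(f"{dp.tier:8s} x {dp.units:3d} -> {dp.total_gpus:5d} GPUs")
```

Even under these rough assumptions, the design points span roughly 128 to 1,024 GPUs, which is the range the three tiers are meant to cover.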
The Unique Take: Nvidia Is Codifying the Data Center Playbook
Nvidia's move to publish these reference architectures is a structural shift. Previously, enterprises relied on systems integrators or cloud providers to design AI clusters. By releasing validated, repeatable blueprints covering networking, observability, and software stacks, Nvidia is commoditizing the design phase and making AI factory deployment a turnkey operation. This mirrors how VMware standardized virtualized infrastructure two decades ago. The goal is twofold: remove the integration friction that slows enterprise AI adoption, and lock customers into Nvidia's hardware ecosystem before AMD or custom ASIC alternatives mature.

Networking and Observability as Moat
The reference architectures include high-speed east-west and north-south networking specs and observability tools — not just compute. Nvidia's Spectrum-X networking is mandatory for the HGX and NVL72 designs, creating a full-stack dependency. Enterprises that follow these blueprints will find it costly to swap in non-Nvidia networking or storage components later.
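For readers unfamiliar with the term, "rail-optimized" describes a fabric layout in which GPU i of every node attaches to the same leaf ("rail") switch, so same-index GPUs on any two nodes are one switch hop apart. The sketch below illustrates the idea; the node, GPU, and link counts are illustrative assumptions, not Spectrum-X specifics.

```python
# Minimal sketch of a rail-optimized east-west fabric, the topology style
# the HGX and NVL72 reference architectures are built on. Counts here are
# assumptions for illustration, not published Spectrum-X parameters.

NODES = 32          # e.g., the smallest HGX design point
GPUS_PER_NODE = 8   # assumed: one NIC per GPU

def rail_of(node: int, gpu: int) -> int:
    """Rail-optimized placement: GPU i of every node lands on rail switch i."""
    return gpu

# Group every (node, gpu) NIC by the rail switch it connects to.
rails: dict[int, list[tuple[int, int]]] = {}
for node in range(NODES):
    for gpu in range(GPUS_PER_NODE):
        rails.setdefault(rail_of(node, gpu), []).append((node, gpu))

# Each rail carries exactly one link per node, so peer GPUs with the same
# index reach each other through a single leaf switch.
for rail, nics in rails.items():
    assert len(nics) == NODES
print(f"{len(rails)} rails x {NODES} links = {len(rails) * NODES} east-west links")
```

The layout matters because collectives such as all-reduce run between same-index GPUs across nodes; keeping those flows on a single leaf switch avoids oversubscribed spine links, which is where the claimed throughput gains come from.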

Context: Recent Nvidia Infrastructure Moves
This release follows Nvidia's May 2026 partnership with Invenergy and Emerald AI to build flexible AI factories [per the company's blog post], and the open-sourcing of MRC — the RDMA protocol powering OpenAI's Blackwell clusters — on May 6, 2026. The reference architectures complement these efforts by providing the deployment playbook for the hardware those protocols run on.

Limitations
Nvidia did not disclose pricing for any of the configurations, nor specific power consumption figures beyond a general efficiency claim. The 15x token throughput improvement for HGX lacks a stated baseline; it is unclear whether the comparison is against prior HGX generations or competitor systems. The reference architectures are design documents, not turnkey products; enterprises still need to source hardware from certified partners.
What to watch
Watch for partner certifications from Dell, HPE, and Supermicro — the first validated RTX PRO and HGX systems should ship within 90 days. Also track whether AMD or Intel announce competing reference architectures for their GPU lines, which would validate Nvidia's playbook strategy.