Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

A data center rack packed with TPUv8t chips, connected by thick cables, illustrating Google's Virgo network linking…
AI ResearchScore: 100

Google’s Virgo network interconnects 134K TPUv8t chips at 47 Pbps

Google's Virgo network interconnects 134,400 TPUv8t chips at 47 Pbps, targeting large-scale training clusters.

·1d ago·3 min read··23 views·AI-Generated·Report error
Share:
What is Google's Virgo network architecture for TPUv8t chips?

Google’s Virgo network architecture interconnects up to 134,400 TPUv8t training chips with 47 Pbps of non-blocking bisectional bandwidth, per [@SemiAnalysis_].

TL;DR

Google introduced TPUv8t and Virgo network. · Virgo interconnects up to 134,400 chips. · Non-blocking bisectional bandwidth reaches 47 Pbps.

Google’s Virgo network interconnects up to 134,400 TPUv8t chips at 47 Pbps. The scale-out architecture, disclosed by @SemiAnalysis_, targets training clusters for frontier models.

Key facts

  • Virgo interconnects up to 134,400 TPUv8t chips.
  • Non-blocking bisectional bandwidth: 47 Pbps.
  • TPUv8t is a training-focused variant of Google's TPU.
  • 47 Pbps equals ~47,000 Tbps.
  • Competes with NVIDIA NVLink and InfiniBand fabrics.

Google has introduced a new training-focused TPU, the TPUv8t, alongside a scale-out network architecture called Virgo. According to @SemiAnalysis_, Virgo can interconnect up to 134,400 chips with up to 47 Pbps of non-blocking bisectional bandwidth. That bandwidth figure dwarfs prior interconnects; by comparison, InfiniBand NDR 400 switches top out around 3.2 Tbps per port, and NVIDIA's NVLink Switch supports 900 GB/s per GPU directionally.

Why 47 Pbps matters

47 Pbps is roughly 47,000 Tbps — enough to transfer the entire printed Library of Congress in under a second. For training runs on clusters exceeding 100,000 accelerators, the interconnect often becomes the bottleneck: all-reduce gradients across 100K+ chips can stall if bandwidth is insufficient. Virgo’s non-blocking design means every chip can communicate with any other chip at full line rate simultaneously, eliminating the head-of-line blocking that plagues tree-based or fat-tree topologies.

TPUv8t: training-first design

Google did not disclose TPUv8t’s raw FLOPS or memory bandwidth, but the “t” suffix indicates a training-optimized variant of the eighth-generation TPU. Previous TPU generations (v4, v5p, v6) were general-purpose; the v8t appears to sacrifice some inference efficiency for higher sustained throughput on large-batch training jobs. The 134,400-chip ceiling suggests Google is targeting clusters comparable to the 100K-GPU scale used for Gemini and GPT-4-class models.

Competitive positioning

Virgo competes directly with NVIDIA’s NVLink and InfiniBand fabrics, and with AMD’s Infinity Fabric. However, no commercial interconnect currently advertises 47 Pbps aggregate bandwidth at that scale. Google’s custom silicon approach allows tighter integration between the TPU and the network topology, potentially reducing latency and power consumption compared to third-party switches. The architecture also aligns with Google’s strategy of vertical integration for AI infrastructure, similar to how AWS builds its Trainium/Nitro combination.

Unknowns

Google has not published Virgo’s per-port bandwidth, latency, or power draw. Nor has it disclosed the TPUv8t’s training performance on standard benchmarks like MLPerf. The 134,400-chip figure is a theoretical maximum; real-world clusters may be smaller. [@SemiAnalysis_] did not specify when Virgo or TPUv8t will enter production.

What to watch

Watch for TPUv8t MLPerf training submission results and Google Cloud’s pricing for Virgo-based TPU slices. Also track whether Google licenses Virgo to third-party hardware or keeps it proprietary for internal training runs.

Sources cited in this article

  1. GPU
Source: gentic.news · · author= · citation.json

AI-assisted reporting. Generated by gentic.news from 1 verified source, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

The 47 Pbps figure is the headline, but the more interesting structural read is Google’s shift toward a dedicated training-only TPU variant. Previous generations (v4, v5p) were general-purpose; the v8t suggests Google now sees enough differentiation between training and inference workloads to justify separate silicon. That mirrors the industry split between NVIDIA’s H100 (training) and L40S (inference), and AMD’s MI300X vs. MI350. The 134,400-chip ceiling is also notable: it’s slightly larger than the 100K-GPU clusters rumored for GPT-4 and Gemini Ultra, implying Google is future-proofing for models that require 1e26 FLOPs or more. Virgo’s non-blocking topology is a direct answer to the all-reduce bottleneck that plagued Microsoft’s 100K-GPU training runs, where InfiniBand congestion caused 10-20% utilization loss. If Virgo delivers on its bandwidth promise, Google could achieve higher effective throughput per chip than any competitor.
This story is part of
The AI Infrastructure War Shifts from Chips to Developer Tools
Nvidia's enterprise pivot and AWS's OpenAI bet collide with Cursor's quiet ascent

Mentioned in this article

Enjoyed this article?
Share:

AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Related Articles

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

More in AI Research

View all