
The worldâs most powerful GPU
NVIDIA® GB200 GPUs AVAILABLE SOON
GB200 NVL72 is a rack-scale, liquid-cooled solution connecting 36 Grace CPUs and 72 Blackwell GPUs, enabling a single 72-GPU NVLink domain that delivers 30X faster real-time trillion-parameter LLM inference.
With GB200 SXM you get:
- Real-Time Inference for Trillion-Parameter LLMs
- Massive LLM Training at High Speed
Top 4 Use Cases

Groundbreaking Blackwell Architecture
NVIDIA Blackwell architecture sets a new benchmark for accelerated computing with unparalleled performance, efficiency, and scalability, featuring 208 billion transistors on a custom TSMC 4NP process.

Breakthrough CPU Performance
NVIDIA Grace CPU revolutionizes data centre computing with outstanding performance and memory bandwidth, offering 2X energy efficiency and unprecedented speed for AI, cloud, and HPC applications.

Seamless Interconnectivity
The fifth-generation NVIDIA NVLink unlocks exascale computing and trillion-parameter AI models, enabling swift and seamless communication between every GPU in your server cluster for accelerated performance.

High-Performance Networking
NVIDIA Quantum-X800 InfiniBand, NVIDIA Spectrum X800 Ethernet, and NVIDIA BlueField®-3 DPUs enable efficient scalability across hundreds and thousands of Blackwell GPUs for optimal application performance.
Tech Specs
Form Factor | GB200 NVL72 |
---|---|
Configuration | 36 Grace CPU : 72 Blackwell GPUs |
FP4 Tensor Core | 1,440 PFLOPS |
FP8/FP6 Tensor Core | 720 PFLOPS |
INT8 Tensor Core | 720 POPS |
FP16/BF16 Tensor Core | 360 PFLOPS |
TF32 Tensor Core | 180 PFLOPS |
FP32 | 6,480 TFLOPS |
FP64 | 3,240 TFLOPS |
FP64 Tensor Core | 3,240 TFLOPS |
GPU Memory | Bandwidth | Up to 13.5 TB HBM3e | 576 TB/s |
NVLink Bandwidth | 130TB/s |
CPU Core Count | 2,592 Arm® Neoverse V2 cores |
CPU Memory | Bandwidth | Up to 17 TB LPDDR5X | Up to 18.4 TB/s |