100+ GPU Configurations

Supermicro AI Servers

Supermicro is a total IT solution provider with the industry's broadest GPU server portfolio. Buy Supermicro AI servers with NVIDIA H200, B200, and GB200 GPUs and AMD MI300X accelerators. The Building Block Solutions® architecture enables 100+ configurations, 4-6 week lead times, and DLC-2 liquid cooling innovation.

Flagship SuperCluster
GB200 NVL72
Rack-Scale Exascale AI
GPUs/Rack 72x B200
Performance 1+ ExaFLOP
Memory 13.8TB HBM3e
Cooling Full DLC
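As a quick sanity check, the 13.8TB rack-level HBM3e figure quoted above follows from the per-GPU capacity (192GB HBM3e per B200 is assumed here, matching the per-GPU figure quoted for MI300X-class comparisons elsewhere on this page):

```python
# Derive the GB200 NVL72 rack memory total from per-GPU capacity.
GPUS_PER_RACK = 72        # quoted above
HBM_PER_GPU_GB = 192      # assumed B200 HBM3e capacity

total_gb = GPUS_PER_RACK * HBM_PER_GPU_GB   # 13,824 GB
total_tb = total_gb / 1000                  # ~13.8 TB
print(f"{total_tb:.1f} TB HBM3e per rack")  # matches the 13.8TB spec above
```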
1993 Founded - 30+ Years Innovation
100+ GPU Server Configurations
4-6 Weeks Lead Time
5,000 Racks/Month Capacity
Modular Architecture

Building Block Solutions®

Modular architecture built from flexible, reusable components for any workload. Enables 100+ GPU configurations, versus the 10-20 typical of competitors, for the fastest time-to-market.

100+ Configurations
4-6 Week Lead Times
3 Global Sites
Product Portfolio

GPU Server Portfolio

Comprehensive H200, B200, MI300X, and MI350 platforms with industry-leading configurations and competitive pricing. Air-cooled and liquid-cooled options.

Air Cooled

SYS-821GE-TNHR

8U Air-Cooled H200/B200 Platform

Versatile air-cooled platform supporting both Intel and AMD CPUs, designed to fit existing air-cooled datacenter infrastructure.

GPUs 8x H200/B200
CPUs Intel/AMD
Form 8U
  • NVIDIA HGX H200 or B200 platform
  • Dual Intel Xeon OR AMD EPYC 9004
  • Up to 6TB DDR5 (AMD config)
  • 4-5 servers per rack (32-40 GPUs)
$320K - $420K Per server (8 GPUs) • 4-6 weeks
AMD Instinct

AS-8125GS-TNMR2

8U AMD Instinct MI300X Platform

High-memory AMD alternative with 1.54TB GPU memory per server. Excellent tokens-per-dollar for LLM inference.

GPUs 8x MI300X
Memory 1.54TB HBM3
Form 8U
  • 192GB HBM3 per GPU (1.54TB total)
  • Dual AMD EPYC 9004 (up to 256 cores)
  • Up to 6TB DDR5 system memory
  • AMD Infinity Fabric interconnect
$300K - $400K Per server (8 GPUs) • 4-6 weeks
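To see why the HBM totals above matter for LLM inference, here is a rough sizing sketch of how large a model fits in a single 8-GPU server. The bytes-per-parameter and KV-cache headroom values are illustrative assumptions, not vendor figures; only the HBM totals come from this page:

```python
# Rough model-size fit per 8-GPU server, from the HBM totals quoted above.
HBM_TOTAL_GB = {
    "MI300X (8x 192GB HBM3)": 1536,   # 1.54TB quoted above
    "MI355X (8x 288GB HBM3e)": 2304,  # 2.3TB quoted for the MI350 series
}
BYTES_PER_PARAM = 2      # assumed BF16/FP16 weights
KV_CACHE_FRACTION = 0.3  # assumed headroom for KV cache and activations

fits_b = {}  # platform -> max parameter count, in billions
for platform, hbm_gb in HBM_TOTAL_GB.items():
    usable_gb = hbm_gb * (1 - KV_CACHE_FRACTION)
    fits_b[platform] = usable_gb / BYTES_PER_PARAM  # GB / (bytes/param) = B params
    print(f"{platform}: ~{fits_b[platform]:.0f}B params in BF16 with KV headroom")
```

Under these assumptions an MI300X server holds a model of roughly 500B+ parameters without sharding across nodes, which is the "tokens-per-dollar" memory advantage the copy above refers to.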
Next-Gen

AMD MI350 Series

CDNA 4 Architecture Platform

Next-generation AMD Instinct with 288GB HBM3e per GPU. 40% more tokens-per-dollar for AI reasoning models.

GPUs 8x MI355X
Memory 2.3TB HBM3e
Options 4U/8U
  • 288GB HBM3e per GPU (2.3TB total)
  • FP4/FP6 precision support
  • 40% more tokens-per-dollar
  • 4U liquid OR 8U air-cooled
Available H2 2025 • 8-10 week lead time
Cooling Innovation

DLC-2 Direct Liquid Cooling

Second-generation liquid cooling technology with comprehensive component coverage delivering 40% power savings and 20% TCO reduction.

40%
Power Savings vs Air
20%
TCO Reduction
250kW
Heat Removal/Rack
4U
8-GPU Form Factor

Comprehensive Coverage

Cold plates on all GPUs, CPUs, DIMMs, VRMs, and PCIe switches for maximum thermal efficiency.

Warmer Inlet Temps

Higher coolant inlet temperatures reduce infrastructure requirements and enable free cooling.

4U Density

Industry-leading 8x B200 (1000W each) in 4U form factor. 10-12 servers per rack = 80-96 GPUs.
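A back-of-envelope check shows how the quoted rack densities fit inside the 250kW DLC-2 heat-removal budget. GPU power, server density, and the rack budget come from the figures above; the non-GPU overhead per server (CPUs, DIMMs, NICs, fans) is an assumed placeholder:

```python
# Does a fully populated DLC-2 rack fit the quoted 250kW heat-removal budget?
GPU_POWER_W = 1000          # B200 power quoted above
GPUS_PER_SERVER = 8
NON_GPU_OVERHEAD_W = 3000   # assumed CPUs/memory/NICs/fans per 4U server
SERVERS_PER_RACK = 12       # upper end of the 10-12 quoted above
RACK_BUDGET_W = 250_000     # DLC-2 heat removal per rack quoted above

server_w = GPU_POWER_W * GPUS_PER_SERVER + NON_GPU_OVERHEAD_W  # 11,000 W
rack_w = server_w * SERVERS_PER_RACK                           # 132,000 W
print(f"Rack IT load: {rack_w/1000:.0f} kW of {RACK_BUDGET_W/1000:.0f} kW budget")
```

Even at 12 servers (96 GPUs) per rack, the estimated ~132kW load sits well within the 250kW budget, leaving headroom for higher-TDP future GPUs.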

Comparison

Platform Comparison

Compare specifications across Supermicro GPU server platforms to find the right configuration for your workload.

| Platform | Form Factor | Max GPUs | GPU Options | Cooling | Ideal For |
|---|---|---|---|---|---|
| GB200 NVL72 | Rack | 72 | B200 + Grace | Liquid | Trillion-param training |
| SYS-421GE-TNHR2-LCC | 4U | 8 | H200, B200 | Liquid | High-density AI |
| SYS-821GE-TNHR | 8U | 8 | H200, B200 | Air | Versatile AI/HPC |
| AS-8125GS-TNMR2 | 8U | 8 | MI300X | Air | Memory-intensive LLM |
| SuperBlade | 52U | 120 nodes | Various | Air/Liquid | Scale-out clusters |
Partnerships

Strategic Technology Partnerships

Supermicro's partnerships with leading silicon vendors ensure access to the latest AI acceleration technologies with validated configurations.

Elite Partner

NVIDIA Partnership

Deep collaboration enabling first-to-market NVIDIA platforms including H200, B200, and GB200 NVL72 SuperCluster systems.

  • H200, B200, GB200 full platform support
  • HGX baseboard integration
  • NVLink and NVSwitch optimization
  • NVIDIA AI Enterprise validated
Strategic Partner

AMD Partnership

Comprehensive AMD Instinct accelerator support with EPYC processor optimization for high-memory AI workloads.

  • MI300X (192GB), MI325X (256GB) support
  • Next-gen MI350/MI355X ready
  • EPYC 9004/9005 optimization
  • ROCm software stack validated
Innovation Partner

Intel Partnership

Long-standing Intel collaboration for Xeon processors and Gaudi 3 accelerator platforms for silicon diversity.

  • Xeon Scalable 4th/5th/6th Gen
  • Intel Gaudi 3 accelerators
  • Open software ecosystem
  • Cost-effective alternative
Custom Configs
Rack Integration
Global Shipping
24/7 Support
FAQ

Frequently Asked Questions

Common questions about Supermicro AI servers and infrastructure.

What makes Supermicro unique for AI servers?
Supermicro offers 100+ GPU server configurations using Building Block Solutions® architecture, significantly more than competitors who typically offer 10-20. Combined with 4-6 week lead times (vs 12-16 weeks industry average), DLC-2 liquid cooling achieving 40% power savings, and support for NVIDIA, AMD, and Intel accelerators, Supermicro provides unmatched flexibility and speed-to-deployment.
What is the GB200 NVL72 SuperCluster?
The GB200 NVL72 SuperCluster is a complete rack-scale AI system with 72 NVIDIA B200/B300 GPUs and 36 Grace CPUs connected via NVLink 5.0. It delivers 1+ ExaFLOPS FP8 performance with 13.8TB+ HBM3e memory in a single liquid-cooled rack. Ideal for trillion-parameter model training, drug discovery, climate modeling, and financial risk analysis. Price range is $3M-$4M per rack system.
What is Supermicro DLC-2 liquid cooling?
DLC-2 is Supermicro's second-generation direct liquid cooling technology featuring cold plates on all GPUs, CPUs, DIMMs, VRMs, and PCIe switches. It achieves 40% power savings versus air cooling, 20% TCO reduction, supports 250kW heat removal per rack, and enables an industry-leading 4U form factor for 8x B200 (1000W each) GPUs.
How fast can Supermicro servers be delivered?
Supermicro offers industry-leading 4-6 week standard lead times thanks to Building Block Solutions® modular architecture and 5,000 racks/month manufacturing capacity across 3 global sites (US, Taiwan, Netherlands). This is significantly faster than typical 12-16 week enterprise server lead times, enabling faster AI project deployment.
Does Supermicro support AMD GPUs?
Yes, Supermicro offers comprehensive AMD Instinct support including MI300X (192GB HBM3), MI325X (256GB HBM3e), and upcoming MI350/MI355X (288GB HBM3e). AMD platforms paired with EPYC processors provide excellent memory capacity for LLM inference and a competitive alternative to NVIDIA for cost-conscious deployments.

Deploy Supermicro AI Infrastructure Today

Partner with SLYD to leverage Supermicro's Building Block Solutions® with 100+ configurations, fastest lead times, and advanced liquid cooling.

Compare OEM Partners

See how Supermicro compares to Dell, HPE, Lenovo, and Gigabyte.

View Comparison

GPU Cloud

Rent Supermicro servers on-demand in our cloud.

Explore Cloud

Resources

Download Supermicro datasheets and specifications.

Get Resources