Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

 
 

About this Document

This document describes the design requirements and implementation of an AI cluster infrastructure featuring Juniper RDMA-Aware Load Balancing (RLB) and Juniper BGP Deterministic Path Forwarding (BGP-DPF) in the GPU Backend Fabric. This fabric is built based on AI-optimized Juniper Data Center QFX5240 series switches and Nvidia H100 DGX GPU servers.

All validation tests were conducted in Juniper’s AI Innovation Lab in Sunnyvale, CA, USA. In this open lab, Juniper collaborates closely with customers and technology partners to develop AI solutions and test deployments for a range of AI applications and models.

The AI Innovation Lab allows customers to see AI training and inference in action. Juniper performs these tests running both customer-specific models as well as those from MLCommons for MLPerf performance benchmarking and comparisons.