Juniper Networks presented its latest Apstra functionality for AI data center network operations at AI Infrastructure Field Day. It focused on providing operators with the context and tools to manage complex AI networks efficiently. Jeremy Wallace, a Data Center/IP Fabric Architect, emphasized the importance of context in understanding the network's expected behavior to identify and resolve issues quickly. Juniper is leveraging existing Apstra capabilities, augmented with new features such as compute agents deployable on NVIDIA servers, and enhanced probes and dashboards, to monitor AI networks. This presentation aims to equip operators to maintain optimal performance and minimize downtime in critical infrastructure environments.
The presentation highlighted the evolution of network management for AI data centers, transitioning from traditional methods to a more proactive and data-driven approach. The core of Juniper's solution involves leveraging telemetry, including data collected from GPU NICs and switches, to provide real-time insights into network performance. This enables operators to monitor key metrics, such as GPU network utilization and traffic patterns, and respond to potential issues swiftly. The Honeycomb view, traffic dashboards, and integration with congestion control mechanisms (ECN and PFC) demonstrate how to provide visibility into the network's behavior. The goal is to provide context and the tools to diagnose and resolve problems faster.
Finally, Wallace demonstrated a live demo of the platform, showcasing features like real-time traffic analysis, heatmaps of GPU utilization, and auto-tuning load balancing. The auto-tuning functionality dynamically adjusts parameters like inactivity intervals to optimize performance and eliminate out-of-sequence packets, increasing the likelihood of successful job completion. These power packs are essentially Python scripts and are evolving, with Juniper actively working on creating more of these power packs. Juniper is also working on deeper integration with other vendors for their customers' environments and solutions.
Presented by Jeremy Wallace, Data Center/IP Fabric Architect, Apstra Product Specialist Team, Juniper Networks. Recorded live in Santa Clara, California, on April 23, 2025, as part of AI Infrastructure Field Day. Watch the entire presentation at https://techfieldday.com/appearance/juniper-networks-presents-at-ai-infrastructure-field-day-2/ or https://techfieldday.com/event/aiifd2/ for more information.
Up Next in AI Infrastructure Field Day 2
-
Getting to Know the Unsung Hero of AI...
Solidigm's presentation at AI Infrastructure Field Day emphasized the critical role of data storage in the evolving landscape of AI solutions, using the framework of the AI data pipeline. The amount of data used for training an AI model directly correlates with its accuracy, making storage a crit...
-
AI Data solutions are not One Size Fi...
Solidigm's presentation focuses on AI performance and efficiency, highlighting the role of high-performance SSDs in addressing the challenges of rapidly growing AI development and optimizing the total cost of ownership. Scott Shadley, Director of Leadership Narrative and Evangelist at Solidigm, b...
-
Why Storage Matters to AI in 2025 wit...
Solidigm focused on the evolving role of storage in AI, specifically highlighting its significance in the AI data pipeline through 2025. The presentation emphasizes that the AI workflow involves a series of distinct tasks, each with unique demands on hardware, and that it's often distributed amon...