Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

 
 

Device Health Analytics Report

This topic provides an overview of the device health reports you can generate in the Apstra GUI. To learn how to generate this report, see Generate an Analytics Report.

The Device Health report analyzes the health of the device. In this report, you can view the health of all Apstra managed devices in your network.

Inventory Overview Report

The Inventory Overview report shows the number of devices that Apstra manages in your network. This report (Figure 1) shows the device hardware model, system ID, label, network operating system (NOS), average/max memory usage, and average/max CPU usage.

Figure 1: Inventory Overview Inventory Overview

Memory Usage Analysis

The Memory Usage Analysis reports show detailed memory usage for all devices and includes charts used to identify memory leaks. This report is useful for capacity planning and identifying usage patterns to predict demand.

Device Memory Usage

The device memory usage chart (Figure 2) shows the memory usage for all devices.

Figure 2: Device Memory Usage Chart Device Memory Usage Chart

Memory Trending Charts

The Memory Trending charts show devices with the highest memory increments rate. A device might use more memory over time due to increased workload or new features.

For example, consider three devices. Device A's memory usage increases 5-fold from 10M to 50M. Device B's usage increases 10% from 100M to 110M. Device C's memory usage decreases from 500M to 490M. Accordingly, we rank device A before Device B and eliminate Device C.

The following examples show the memory trending charts for two leaf devices and one spine device.

Figure 3: Memory Trending Chart - Leaf1 Memory Trending Chart - Leaf1
Figure 4: Memory Tending Chart - Leaf2 Memory Tending Chart - Leaf2
Figure 5: Memory Trending Chart - Spine2 Memory Trending Chart - Spine2

CPU Usage Analysis

The CPU analysis section provides on-device CPU usage and Apstra telemetry collectors for each device.

Device CPU Usage

The Device CPU usage chart ( Figure 6) displays the CPU usage for all your devices.

Note:
Figure 6: Device CPU Usage Chart Device CPU Usage Chart

CPU Usage Analysis for all Devices

The measurement of some metrics, such as CPU usage, might exhibit periodical behaviors due to certain operations repeated at fixed intervals. For example, Apstra device telemetry collectors might issue CLI commands to collect traffic counters every few second, causing device CPU usage to increase. We use FFT (Fast Fourier Transformation) to convert a time domain measurement to the frequency domain. This helps identify associations between various frequencies and known periodical operations.

For example, in the frequency domain chart (Figure 6) for a leaf device, the x-axis represents frequency in number per hour. Value 0 means a constant signal, while value 120 means the signal repeats 120 times per hour, with a 30-second period.

The device CPU usage exhibits a periodic pattern at 34.285714285714285 seconds rate which correspond to following Apstra telemetry service collectors: interface_counters.

Figure 7: Device CPU Frequency Device CPU Frequency
Figure 8: Apstra Telemetry Service Collector Execution Time Apstra Telemetry Service Collector Execution Time