Troubleshoot the QFX5240 Switch
QFX5240 Troubleshooting Resources Overview
To troubleshoot a QFX5240 problem, you can use:
-
Junos Evolved OS CLI
The CLI is the primary tool for controlling and troubleshooting hardware, Junos OS Evolved, routing protocols, and network connectivity. CLI commands display information from routing tables, information specific to routing protocols, and information about network connectivity derived from the ping and traceroute utilities. For information about using the CLI to troubleshoot Junos OS Evolved, see the appropriate Junos OS Evolved configuration guide.
-
Alarms and LEDs on the network ports, management panel, and components
When the Routing Engine detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. In addition, you can also use component LEDs and network port LEDs to troubleshoot the QFX5240 switch. For more information, see QFX5240 Management Panel.
-
JTAC
If you need assistance during troubleshooting, you can contact the Juniper Networks Technical Assistance Center (JTAC) by using the Web or by telephone. If you encounter software problems, or problems with hardware components not discussed here, contact JTAC.
-
Knowledge Base articles–Knowledge Base.
QFX5240 Alarm Messages Overview
When a QFX5240 switch detects an alarm condition, it lights the red or yellow alarm
LED on the management panel as
appropriate. To view a more detailed description of
the alarm cause, issue the show system alarms
operational CLI
command.
user@host> show system alarms 2 alarms currently active Alarm time Class Description 2019-01-22 16:32:54 PST Major PEM 1 Absent 2019-01-22 16:31:04 PST Minor Host 0 Disk 2 Labelled incorrectly
For thermal problems, the show chassis temperature-thresholds
CLI
command shows the cutoff temperatures for each level of alarm:
user@host> show chassis temperature-thresholds Fan speed Yellow alarm Red alarm Fire Shutdown (degrees C) (degrees C) (degrees C) (degrees C) Item Normal High Normal Bad fan Normal Bad fan Normal Routing Engine 0 CPU Temperature 65 70 95 92 97 94 100 Routing Engine 0 Ch-0 DIMM-0 Temp 55 62 80 77 82 79 85 Routing Engine 0 Ch-1 DIMM-0 Temp 55 62 80 77 82 79 85 FPC 0 Sensor TH5 Max Reading 75 80 100 97 105 102 110 FPC 0 et-0/0/0 53 60 73 73 76 76 77 FPC 0 et-0/0/4 53 60 73 73 76 76 77 FPC 0 MB Middle Right Rear 45 50 80 77 85 82 90 FPC 0 MB Middle Left Rear 45 50 80 77 85 82 90 FPC 0 MB Left Rear 45 50 80 77 85 82 90 FPC 0 MB Left Front 45 50 80 77 85 82 90 FPC 0 MB Right Rear 45 50 80 77 85 82 90 FPC 0 MB Right Front 45 50 80 77 85 82 90 FPC 0 MB OSFP_GRP1_3V3 75 80 115 112 120 117 125 FPC 0 MB OSFP_GRP2_3V3 75 80 115 112 120 117 125 FPC 0 MB OSFP_GRP3_3V3 75 80 115 112 120 117 125 FPC 0 MB VDD_0P75 75 80 115 112 120 117 125 FPC 0 MB TRVDD1_0V9_0V75 75 80 115 112 120 117 125 FPC 0 MB TRVDD0_0V9_0V75 75 80 115 112 120 117 125 FPC 0 FB Exhaust Left 40 45 65 62 70 67 75 FPC 0 FB Exhaust Right 40 45 65 62 70 67 75
You
can also calculate the percentage of fan RPM, or duty
cycle.
Use
the following
command as
root
:
root@re0.~#i2cget -y -f 13 0x66 0x11 0x07
In this example, the system returned the hexadecimal value 0x07
.
If
you convert that value to decimal,
you
get
7.
Then use this formula to
calculate
the duty cycle:
Duty cycle = (value returned + 1)*6.25%
In this example, duty cycle = (7 +1)*6.25 = 50%
Chassis Alarm Messages
Chassis alarms indicate a failure on the device or one of its components. Chassis alarms are preset and cannot be modified.
Chassis alarms on QFX5240 switches have two severity levels:
-
Major (red)—Indicates a critical situation on the device that has resulted from one of the conditions described in Table 1. A red alarm condition requires immediate action.
-
Minor (yellow)—Indicates a noncritical condition on the device that, if left unchecked, might cause an interruption in service or degradation in performance. A yellow alarm condition requires monitoring or maintenance.
Table 1 describes the chassis alarm messages on a QFX5240.
Junos OS Evolved systems, such as QFX5240, are based on a new alarm infrastructure, which does not support all power supplies and fan alarms. Table 1 shows these alarms.
Component |
Alarm Type |
CLI Message |
Recommended Action |
---|---|---|---|
Fans |
Red (major) |
|
Install fan modules in the slots where they are absent. |
|
Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, replace the fan module. |
||
sensor-location Temp Sensor Too Hot |
Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down. |
||
Yellow (minor) |
|
Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, check the system log for the message related to the sensor and report the message to customer service. |
|
sensor-location Temp Sensor Too Warm |
Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. |
||
Power Supplies |
Red (major) |
|
Install a power supply into the empty slot and ensure the power supply is powered. |
Temperature sensors |
Major (red) |
|
Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down. |
Minor (yellow) |
|
Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. |
|
|
Check the system log for the following error message and report the message to customer support: |
||
Routing Engine |
Major (red) |
|
File storage is at capacity. Reduce unnecessary files to free space. |
Minor (yellow) |
|
File storage is reaching capacity. Reduce unnecessary files to free space. |
|
Management Ethernet interface |
Major (red) |
|
Check whether a cable is connected to the management Ethernet interface, or whether the cable is defective. Replace the cable, if required. |