Troubleshoot the QFX5241-64OD and QFX5241-64QD Switches
QFX5241-64OD and QFX5241-64QD Troubleshooting Resources Overview
To troubleshoot a problem on the QFX5241-64OD and QFX5241-64QD switches, you can use:
-
Junos Evolved OS CLI
The CLI is the primary tool for controlling and troubleshooting hardware, Junos OS Evolved, routing protocols, and network connectivity. CLI commands display information from routing tables, information specific to routing protocols, and information about network connectivity derived from the ping and traceroute utilities. For information about using the CLI to troubleshoot Junos OS Evolved, see the appropriate Junos OS Evolved configuration guide.
-
Alarms and LEDs on the network ports, management panel, and components
When the Routing Engine detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. In addition, you can also use FRU LEDs and network port LEDs to troubleshoot the QFX5241-64OD and QFX5241-64QD switches. For more information, see QFX5241-64OD and QFX5241-64QD Management Panel.
-
JTAC
If you need assistance during troubleshooting, you can contact the Juniper Networks Technical Assistance Center (JTAC) by using the Web or by telephone. If you encounter software problems, or problems with hardware components not discussed here, contact JTAC.
-
Knowledge Base articles—Knowledge Base.
QFX5241-64OD and QFX5241-64QD Alarm Messages Overview
When a QFX5241-64OD or QFX5241-64QD switch detects an alarm condition, it lights the
red or amber alarm LED on the management panel as appropriate. To view a more
detailed description of the alarm cause, issue the
show system alarms
operational CLI command.
user@host> show system alarms 0 alarms currently active
For thermal problems, the show chassis temperature-thresholds
CLI
command shows the cutoff temperatures for each level of alarm:
user@host> show chassis temperature-thresholds Fan speed Yellow alarm Red alarm Fire Shutdown (degrees C) (degrees C) (degrees C) (degrees C) Item Normal High Normal Bad fan Normal Bad fan Normal Routing Engine 0 CPU Temperature 70 80 92 89 96 93 98 Routing Engine 0 Ch-0 DIMM-0 Temp 63 70 82 79 85 82 88 Routing Engine 0 Ch-1 DIMM-0 Temp 63 70 82 79 85 82 88 FPC 0 Sensor TH5 Max Reading 75 80 100 97 105 102 110 FPC 0 xcvr-0/0/0 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/1 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/3 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/4 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/7 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/8 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/9 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/10 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/11 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/12 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/13 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/14 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/15 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/16 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/17 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/18 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/19 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/20 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/21 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/22 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/23 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/24 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/25 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/26 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/27 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/28 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/29 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/30 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/31 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/32 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/33 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/34 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/35 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/36 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/37 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/38 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/39 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/40 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/41 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/42 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/43 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/44 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/45 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/46 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/47 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/48 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/49 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/50 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/51 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/52 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/53 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/54 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/55 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/56 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/57 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/58 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/59 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/60 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/61 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/62 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/63 53 60 73 73 76 76 77 FPC 0 xcvr-0/0/64 65 72 85 85 90 90 91 FPC 0 xcvr-0/0/65 65 72 85 85 90 90 91 FPC 0 MB Middle Right Rear 45 50 80 77 85 82 90 FPC 0 MB Middle Left Rear 45 50 80 77 85 82 90 FPC 0 MB Left Rear 45 50 80 77 85 82 90 FPC 0 MB Left Front 45 50 80 77 85 82 90 FPC 0 MB Right Rear 45 50 80 77 85 82 90 FPC 0 MB Right Front 45 50 80 77 85 82 90 FPC 0 MB OPTICS_GRP1_3V3 75 80 115 112 120 117 125 FPC 0 MB OPTICS_GRP2_3V3 75 80 115 112 120 117 125 FPC 0 MB OPTICS_GRP3_3V3 75 80 115 112 120 117 125 FPC 0 MB VDD_0P75 75 80 115 112 120 117 125 FPC 0 MB TRVDD1_0V9_0V75 75 80 115 112 120 117 125 FPC 0 MB TRVDD0_0V9_0V75 75 80 115 112 120 117 125 FPC 0 FB Exhaust Left 40 45 65 62 70 67 75 FPC 0 FB Exhaust Right 40 45 65 62 70 67 75
You can also calculate the percentage of fan RPM, or duty cycle. Use the following command to know the fan RPM percentage:
root@re0> show chassis fan Item Status % RPM Measurement Fan Tray 0 Fan 1 OK 69% 9450 RPM Fan Tray 0 Fan 2 OK 69% 10650 RPM Fan Tray 1 Fan 1 OK 69% 9450 RPM Fan Tray 1 Fan 2 OK 69% 10650 RPM Fan Tray 2 Fan 1 OK 69% 9450 RPM Fan Tray 2 Fan 2 OK 69% 10650 RPM Fan Tray 3 Fan 1 OK 69% 9450 RPM Fan Tray 3 Fan 2 OK 69% 10650 RPM
Chassis Alarm Messages
Chassis alarms indicate a failure on the device or one of its components. Chassis alarms are preset and cannot be modified.
Chassis alarms on QFX5241-64OD and QFX5241-64QD switches have two severity levels:
-
Major (red)—Indicates a critical situation on the device that has resulted from one of the conditions described in Table 1. A red alarm condition requires immediate action.
-
Minor (amber)—Indicates a noncritical condition on the device that, if left unchecked, might cause an interruption in service or degradation in performance. An amber alarm condition requires monitoring or maintenance.
Junos OS Evolved systems, such as QFX5241-64OD and QFX5241-64QD switches, are based on a new alarm infrastructure, which does not support all PSUs and fan alarms. Table 1 shows these alarms, the alarm messages, and the recommended action to take.
Component |
Alarm Type |
CLI Message |
Recommended Action |
---|---|---|---|
Fans |
Red (major) |
|
Install fan modules in the slots where they are absent. |
|
Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, replace the fan module. |
||
sensor-location Temp Sensor Too Hot |
Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down. |
||
Amber (minor) |
|
Remove and check the fan module for obstructions. Reinsert the fan module. If the problem persists, check the system log for the message related to the sensor and report the message to customer service. |
|
sensor-location Temp Sensor Too Warm |
Check the environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. |
||
PSUs |
Red (major) |
|
Install a PSU into the empty slot. Ensure that the PSU is receiving power. |
Temperature sensors |
Major (red) |
|
Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down. |
Minor (yellow) |
|
Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. |
|
|
Check the system log for an error message. Report the message to customer support. |
||
Routing Engine |
Major (red) |
|
File storage is at capacity. Reduce unnecessary files to free space. |
Minor (yellow) |
|
File storage is reaching capacity. Reduce unnecessary files to free space. |
|
Management Ethernet interface |
Major (red) |
|
Check whether a cable is connected to the management Ethernet interface, or whether the cable is defective. Replace the cable, if required. |