Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

 
 

Troubleshooting the QFX5200

QFX5200 Troubleshooting Resources Overview

To troubleshoot a QFX5200, you use the Junos OS CLI, alarms, and LEDs on the network ports, management panel, and components.

  • LEDs—When the Routing Engine detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. In addition, you can also use component LEDs and network port LEDs to troubleshoot the QFX5200. For more information, see the following topics:

  • CLI—The CLI is the primary tool for controlling and troubleshooting hardware, Junos OS, routing protocols, and network connectivity. CLI commands display information from routing tables, information specific to routing protocols, and information about network connectivity derived from the ping and traceroute utilities. For information about using the CLI to troubleshoot Junos OS, see the appropriate Junos OS configuration guide.

  • JTAC—If you need assistance during troubleshooting, you can contact the Juniper Networks Technical Assistance Center (JTAC) by using the Web or by telephone. If you encounter software problems, or problems with hardware components not discussed here, contact JTAC.

  • Knowledge Base articles–Knowledge Base.

QFX Series Alarm Messages Overview

When a QFX Series switch detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. To view a more detailed description of the alarm cause, issue the show chassis alarms CLI command:

For Junos OS Evolved systems, show system alarms CLI command indicates major and minor alarms on the system. In this example from a Junos OS Evolved system, a fan tray error is shown in slot 4.

Chassis Alarm Messages

Chassis alarms indicate a failure on the device or one of its components. Chassis alarms are preset and cannot be modified.

Chassis alarms on QFX5200 devices have two severity levels:

  • Major (red)—Indicates a critical situation on the device that has resulted from one of the conditions described in Table 1. A red alarm condition requires immediate action.

  • Minor (yellow)—Indicates a noncritical condition on the device that, if left unchecked, might cause an interruption in service or degradation in performance. A yellow alarm condition requires monitoring or maintenance.

Table 1 describes the chassis alarm messages on QFX5200-32C and QFX5200-48Y devices. For QFX5200-32C-L devices see Table 2.

Table 1: Chassis Alarm Messages for QFX5200-32C and QFX5200-48Y

Component

Alarm Type

CLI Message

Recommended Action

Fans

Major (red)

Fan Failure

Replace the fan module and report the failure to customer support.

Fan I2C Failure

Check the system log for one of the following error messages and report the message to customer support:

  • CM ENV Monitor: Get fan speed failed.

  • fan-number is NOT spinning @ correct speed, where fan-number can be 1, 2, 3, 4, or 5.

Fan fan-number Not Spinning

Remove and check the fan module for obstructions, and then reinsert the fan module. If the problem persists, replace the fan module.

Minor (yellow)

Fan/Blower Absent

Check the system log for the error message fan-number Absent, where fan-number can be can be 1, 2, 3, 4, or 5.

Install fan modules in the slots where they are absent.

Power supplies

Major (red)

PEM pem-number Airflow not matching Chassis Airflow

Replace the power supply with a power supply that supports the same airflow direction as supported by the chassis.

PEM pem-number I2C Failure

Check the system log for one of the following error messages and report the message to customer support:

  • I2C Read failed for device number, where number where number ranges from 123 through 125.

  • PS number: Transitioning from online to offline, where power supply number is 1 or 2.

PEM pem-number is not powered

Check the power cord connection and reconnect, if necessary.

PEM pem-number is not supported

Replace the power supply with a supported power supply.

PEM pem-number Not OK

Indicates a problem with the incoming AC power or outgoing DC power. Report the error to customer support.

Minor (yellow)

PEM pem-number Absent

Reboot the switch after removing one of the power supply. The switch can continue to operate with a single power supply.

OR

Replace the removed power supply and reboot the switch.

PEM pem-number Power Supply Type Mismatch

Check whether there is a mix of AC and DC power supplies in the same chassis. Reboot the switch with only AC or only DC power supplies.

PEM pem-number Removed

Replace the removed power supply or reboot the switch. The switch can continue to operate with a single power supply.

Temperature sensors

Major (red)

sensor-location Temp Sensor Fail

Check the system log for the following error message and report the message to customer support:

Temp sensor sensor-number failed, where sensor-number ranges from 1 through 10.

sensor-location Temp Sensor Too Hot

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. If the condition persists, the device might shut down.

Minor (yellow)

sensor-location Temp Sensor Too Warm

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor.

Routing Engine

Minor (yellow)

RE RE number /var partition usage is high

Clean up the system file storage space on the switch. For more information, see Cleaning Up the System File Storage Space.

Major (red)

RE RE number /var partition is full

Clean up the system file storage space on the switch. For more information, see Cleaning Up the System File Storage Space.

Minor (yellow)

Rescue configuration is not set

Use the request system configuration rescue save command to set the rescue configuration. For more information, see Setting or Deleting the Rescue Configuration.

Feature usage requires a license

or

License for feature expired

Install the required license for the feature specified in the alarm. For more information, see Software Features That Require Licenses on the QFX Series.

Management Ethernet interface

Major (red)

Management Ethernet 1 Link Down

Check whether a cable is connected to the management Ethernet interface, or whether the cable is defective. Replace the cable, if required.

On models that have both em0 and em1 management interfaces available, you must connect both interfaces. If both interfaces are not connected, the alarm is raised. However, the alarm has no service impact.

If you are unable to resolve the problem, open a support case by using the Case Manager link at https://www.juniper.net/support/ or call 1-888-314-5822 (tollfree, US or 1-408-745-9500 (from outside the United States).

Junos OS Evolved systems, such as QFX5200-32C-L are based on a new alarm infrastructure, not all power supplies and fan alarms are supported. Table 2 shows these alarms.

Table 2: Chassis Alarm Messages for QFX5200-32C-L

Component

Alarm Type

CLI Message

Recommended Action

Fans

Red (major)

Fan Tray fan-tray-number Absent

Install fan modules in the slots where they are absent.

Fan Tray fan-tray-number Failure

Remove and check fan module for obstructions. Reinsert the fan module. If the problem persists, replace the fan module.

Yellow (minor)

FAN fan-number Fan Sensor Fail

Remove and check fan module for obstructions. Reinsert the fan module. If the problem persists, check the system log for the message related to the sensor and report the message to customer service.

Power Supplies

Red (major)

PEM pem-number Not Powered

Install a power supply into the empty slot and ensure the power supply is powered.

Temperature sensors

Major (red)

FPC 0 Temperature Hot

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor. if the condition persists, the device might shut down.

Minor (yellow)

FPC 0 Temperature Warm

Check environmental conditions and alarms on other devices. Ensure that environmental factors (such as hot air blowing around the equipment) do not affect the temperature sensor.

FPC 0 Temp Sensor Fail

Check the system log for the following error message and report the message to customer support:

Management Ethernet interface

Major (red)

Management interface management-interface-name down on node

Check whether a cable is connected to the management Ethernet interface, or whether the cable is defective. Replace the cable, if required.