Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

Troubleshooting QFX10000 Modular Chassis

 

QFX10000 Troubleshooting Resources Overview

To troubleshoot a QFX10000 modular chassis, you use the Junos OS CLI, alarms, and LEDs on the network ports, management panel, and components.

  • LEDs—When the Routing Engine detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. In addition, you can also use component LEDs and network port LEDs to troubleshoot the QFX10000. For more information, see the following topics:

  • CLI—The CLI is the primary tool for controlling and troubleshooting hardware, Junos OS, routing protocols, and network connectivity. CLI commands display information from routing tables, information specific to routing protocols, and information about network connectivity derived from the ping and traceroute utilities. For information about using the CLI to troubleshoot Junos OS, see the appropriate Junos OS configuration guide.

  • JTAC—If you need assistance during troubleshooting, you can contact the Juniper Networks Technical Assistance Center (JTAC) by using the Web or by telephone. If you encounter software problems, or problems with hardware components not discussed here, contact JTAC.

  • Knowledge Base articles–Knowledge Base.

QFX Series Alarm Messages Overview

When a QFX Series switch detects an alarm condition, it lights the red or yellow alarm LED on the management panel as appropriate. To view a more detailed description of the alarm cause, issue the show chassis alarms CLI command:

user@host> show chassis alarms

For Junos OS Evolved systems, show system alarms CLI command indicates major and minor alarms on the system. In this example from a Junos OS Evolved system, a fan tray error is shown in slot 4.

user@host> show system alarms

Chassis Alarm Messages on QFX10008 and QFX10016 Modular Chassis Switches

Chassis alarms indicate a failure on the device or one of its components. Chassis alarms are preset and cannot be modified.

Chassis alarms on QFX10008 and QFX10016 modular chassis have two severity levels:

  • Major (red)—Indicates a critical situation on the device that has resulted from one of the conditions described in Table 1. A red alarm condition requires immediate action.

  • Minor (yellow or amber)—Indicates a noncritical condition on the device that, if left unchecked, might cause an interruption in service or degradation in performance. A yellow alarm condition requires monitoring or maintenance.

Table 1 describes the chassis alarm messages on QFX10000 modular chassis switches.

Table 1: Chassis Component Alarm Conditions on QFX10000 Modular Models

Chassis Component

Alarm Condition

Alarm Severity

Remedy

Control board

A Control board has failed.

Major (red)

Replace the failed Control board.

A Control board has been removed.

Minor (yellow)

Install a Control board in the empty slot.

Line Cards

A line card is offline.

Minor (yellow)

Check the line card. Remove and reinstall the line card. If this fails, replace the failed card.

A line card has failed.

Major (red)

Replace the failed line card.

A line card has been removed.

Major (red)

Install a line card in the empty slot.

Fan trays

A fan tray has been removed from the chassis.

Major (red)

Install the missing fan tray.

One fan in the chassis is not spinning or is spinning below required speed.

Major (red)

Replace the fan tray.

A fan is not receiving power from the fan tray controller.

Major (red)

Check and replace the failed fan tray controller if required.

Fan Tray Controller

A fan tray controller has failed.

Minor (yellow)

Check and replace the failed fan tray controller if required.

One of the fan tray controller in the chassis is not receiving enough power.

Major (red)

Check the power supply.

Switch Interface Boards (SIBs)

One of the SIB has failed.

Minor (yellow)

Check the below:

  • The SIB is not receiving power.

  • The fan tray controller is having a power problem.

SATA SSD slot

The SSD has failed or removed.

Major (red)

Install the SSD.

Ethernet

The Ethernet management interface on the Control board is down.

Minor (yellow)

  • Check the interface cable connection.

  • Reboot the system.

  • If the alarm recurs, open a support case using the Case Manager link at https://www.juniper.net/support/ or call 1-888-314-5822 (toll free, US & Canada) or 1-408-745-9500 (from outside the United States).

Hot swapping

Too many hot-swap interrupts are occurring.

Major (red)

Replace the failed components.

Power supplies

A power supply has been removed from the chassis.

Minor (yellow)

Install a power supply in the empty slot.

A power supply has a high temperature.

Major (red)

Replace the failed power supply.

A power supply input has failed.

Major (red)

Check power supply input connection and the power cord.

A power supply output has failed.

Major (red)

Check power supply output connection.

A power supply has failed.

Major (red)

Replace the failed power supply.

AC and DC power supplies are installed.

Major (red)

Do not mix AC and DC power supplies.

Inadequate number of power supplies.

Major (red)

Install an additional power supply.

Temperature

The chassis temperature has exceeded 131° F (55° C), the fans have been turned on to full speed, and one or more fans have failed.

Minor (yellow)

  • Check room temperature.

  • Check airflow.

  • Replace the fan tray.

The chassis temperature has exceeded 149° F (65° C), and the fans have been turned on to full speed.

Minor (yellow)

  • Check room temperature.

  • Check airflow.

  • Check the fans.

The chassis temperature has exceeded 149° F (65° C), and a fan has failed. If this condition persists for more than 4 minutes, the switch will shut down.

Major (red)

  • Check room temperature.

  • Check airflow.

  • Check the fan.

Chassis temperature has exceeded 167° F (75° C). If this condition persists for more than 4 minutes, the switch will shut down.

Major (red)

  • Check room temperature.

  • Check airflow.

  • Check fan.

The temperature sensor has failed.

Major (red)

Open a support case using the Case Manager link at https://www.juniper.net/support/ or call 1-888-314-5822 (toll free, US & Canada) or 1-408-745-9500 (from outside the United States).