IBM Support

Troubleshooting SCSI, temperature, fan, voltage, DASD, and bus error messages

Troubleshooting


Problem

Diagnosing problems with the IBM eServer xSeries 130, 135, 330, and IntelliStation R Pro

Resolving The Problem

Affected configurations

  • IBM eServer xSeries 130, 135, 330, and IBM IntelliStation R Pro

This document is intended for trained servicers who are familiar with IBM server and workstation products. Use this document along with advanced diagnostic tests to troubleshoot problems effectively. Before servicing an IBM product, be sure to review the safety information.

SCSI error messages
Error Code FRU/Action

One or more of the following might be causing the problem:   

  • A failing SCSI device (adapter, drive, controller)
  • An improper SCSI configuration or SCSI termination jumper setting
  • Duplicate SCSI IDs in the same SCSI chain
  • A missing or improperly installed SCSI terminator 
  • A defective SCSI terminator
  • An improperly installed cable
  • A defective cable
Verify the following:
  1. All external SCSI devices are powered on before you power on the server.
  2. The cables for all external SCSI devices are connected correctly.
  3. If an external SCSI device is attached to the server, the external SCSI termination is set to automatic.
  4. The last device in each SCSI chain is terminated correctly.
  5. The SCSI devices are configured correctly.
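
One of the causes listed above, duplicate SCSI IDs in the same chain, is easy to check mechanically once the device inventory is written down. The following is a minimal sketch in Python, assuming the inventory is recorded by hand as (chain, SCSI ID, device name) tuples; that layout and the function name are illustrative assumptions, not something the server produces.

```python
from collections import defaultdict


def find_duplicate_scsi_ids(devices):
    """Return {chain: {scsi_id: [device names]}} for IDs used more than once.

    `devices` is an iterable of (chain, scsi_id, name) tuples. This layout is
    an assumption for illustration, not a format produced by the server.
    """
    seen = defaultdict(lambda: defaultdict(list))
    for chain, scsi_id, name in devices:
        seen[chain][scsi_id].append(name)

    duplicates = {}
    for chain, ids in seen.items():
        clashes = {sid: names for sid, names in ids.items() if len(names) > 1}
        if clashes:
            duplicates[chain] = clashes
    return duplicates


inventory = [
    ("A", 0, "disk0"),
    ("A", 1, "disk1"),
    ("A", 1, "tape0"),  # clashes with disk1 on chain A
    ("B", 1, "disk2"),  # the same ID on a different chain is not a clash
]
print(find_duplicate_scsi_ids(inventory))
# {'A': {1: ['disk1', 'tape0']}}
```

An empty result only rules out duplicate IDs; termination and cabling still have to be inspected physically.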

Temperature error messages
Message Action
DASD over recommended temperature (sensor X) (level-warning; DASD bay "X" had over temperature condition)
  1. Ensure system is being properly cooled
    a). Each of the drive bays has either a drive or a filler panel installed
    b). The top cover is in place during normal operation
    c). There is at least 50 mm (2 inches) of ventilated space at the sides of the server and 100 mm (4 inches) at the rear of the server.
    d). The top cover is removed for no longer than 30 minutes while the server is operating.
    e). A removed hot-swap drive is replaced within two minutes of removal.
    f). Cables for optional adapters are routed according to the instructions provided with the adapters (ensure that cables are not restricting air flow)
    g). The fans are operating correctly and the air flow is good.
    h). A failed fan is replaced within 48 hours.
DASD under recommended temperature (sensor X) (level-warning; direct access storage device bay "X" had under temperature condition)
  1. Ambient temperature must be within normal operating specifications
DASD 1 over temperature (level-critical; sensor for DASD1 reported temperature over recommended range)
  1. Ensure system is being properly cooled
    a). Each of the drive bays has either a drive or a filler panel installed
    b). The top cover is in place during normal operation
    c). There is at least 50 mm (2 inches) of ventilated space at the sides of the server and 100 mm (4 inches) at the rear of the server.
    d). The top cover is removed for no longer than 30 minutes while the server is operating.
    e). A removed hot-swap drive is replaced within two minutes of removal.
    f). Cables for optional adapters are routed according to the instructions provided with the adapters (ensure that cables are not restricting air flow)
    g). The fans are operating correctly and the air flow is good.
    h). A failed fan is replaced within 48 hours.
Power supply "X" temperature Fault (level-critical; power supply "X" had over temperature condition)
  1. Ensure system is being properly cooled
  2. Replace Power Supply "X"
System board is over recommended temperature (level-warning; system board is over recommended temperature)
  1. Ensure system is being properly cooled
  2. Replace system board
System board is under recommended temperature (level-warning; system board is under recommended temperature)
  1. Ambient temperature must be within normal operating specifications
System over temperature for CPU "X" (level-warning; CPU "X" reporting over temperature condition)
  1. Ensure system is being properly cooled
    a). Each of the drive bays has either a drive or a filler panel installed
    b). The top cover is in place during normal operation
    c). There is at least 50 mm (2 inches) of ventilated space at the sides of the server and 100 mm (4 inches) at the rear of the server.
    d). The top cover is removed for no longer than 30 minutes while the server is operating.
    e). A removed hot-swap drive is replaced within two minutes of removal.
    f). Cables for optional adapters are routed according to the instructions provided with the adapters (ensure that cables are not restricting air flow)
    g). The fans are operating correctly and the air flow is good.
    h). A failed fan is replaced within 48 hours.
System under recommended CPU "X" temperature (level-warning; system reporting under temperature condition for CPU "X")
  1. Ambient temperature must be within normal operating specifications
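
The cooling checklist that recurs in this table contains several numeric limits (50 mm side clearance, 100 mm rear clearance, 30 minutes with the cover off, two minutes with a hot-swap drive removed, 48 hours for a failed fan). As a rough illustration, the sketch below encodes those limits and reports which items an inspection fails. The class, field names, and function are hypothetical; the values would come from a manual inspection, not from any server interface.

```python
from dataclasses import dataclass

# Limits transcribed from the cooling checklist above.
MIN_SIDE_CLEARANCE_MM = 50
MIN_REAR_CLEARANCE_MM = 100
MAX_COVER_OFF_MINUTES = 30
MAX_DRIVE_OUT_MINUTES = 2
MAX_FAILED_FAN_HOURS = 48


@dataclass
class CoolingObservation:
    # Hypothetical field names; values come from a manual inspection.
    side_clearance_mm: float
    rear_clearance_mm: float
    cover_off_minutes: float = 0.0
    drive_out_minutes: float = 0.0
    failed_fan_hours: float = 0.0
    open_bays_without_filler: int = 0


def cooling_violations(obs):
    """Return the checklist items that the observation fails."""
    problems = []
    if obs.open_bays_without_filler > 0:
        problems.append("drive bay without a drive or filler panel")
    if obs.side_clearance_mm < MIN_SIDE_CLEARANCE_MM:
        problems.append("less than 50 mm of space at the sides")
    if obs.rear_clearance_mm < MIN_REAR_CLEARANCE_MM:
        problems.append("less than 100 mm of space at the rear")
    if obs.cover_off_minutes > MAX_COVER_OFF_MINUTES:
        problems.append("top cover removed for more than 30 minutes")
    if obs.drive_out_minutes > MAX_DRIVE_OUT_MINUTES:
        problems.append("hot-swap drive out for more than two minutes")
    if obs.failed_fan_hours > MAX_FAILED_FAN_HOURS:
        problems.append("failed fan not replaced within 48 hours")
    return problems


obs = CoolingObservation(side_clearance_mm=40, rear_clearance_mm=120,
                         cover_off_minutes=45)
for problem in cooling_violations(obs):
    print(problem)
```

Cable routing and fan operation (items f and g) are not modeled here because they cannot be reduced to a single number.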

Fan error messages
Message Action
Fan "X" failure (level-critical; fan "X" had a failure)
  1. Check connections to fan "X"
  2. Replace fan "X"
Fan "X" fault (level-critical; fan "X" beyond recommended RPM range)
  1. Check connections to fan "X"
  2. Replace fan "X"
Fan "X" Outside Recommended Speed
  1. Replace fan "X"
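
Most entries in these tables follow the same shape: the message text, then a parenthesized severity level and description. When sorting a captured event log by severity, it can help to split entries along those lines. The sketch below assumes that shape, based only on the entries shown in this document; the function name and the regular expression are illustrative, not an IBM-defined format.

```python
import re

# Assumed shape: message text, then "(level-<severity>; <description>)".
# The semicolon is optional so slight variations still parse.
_PATTERN = re.compile(
    r'^(?P<message>.*?)\s*\(level-(?P<level>[a-z]+);?\s*(?P<detail>.*)\)\s*$'
)


def parse_message(line):
    """Split a table entry into (message, level, detail).

    Entries without a "(level-...)" part return (line, None, None).
    """
    text = line.strip()
    match = _PATTERN.match(text)
    if match is None:
        return text, None, None
    return match.group("message"), match.group("level"), match.group("detail")


print(parse_message('Fan "X" failure (level-critical; fan "X" had a failure)'))
# ('Fan "X" failure', 'critical', 'fan "X" had a failure')
```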

Voltage related system shutdown
Message Action
System shutoff due to "X" current over maximum value (level-critical; system drawing too much current on voltage "X" bus)
  1. Power off the system and disconnect the AC cord(s).
  2. Disconnect all external cables and remove server from the rack.
  3. Check for loose cables and short circuits in the power subsystem.
  4. Remove adapters and disconnect the cables and power connectors to all internal and external devices until system is at minimum configuration required for power-on.
  5. Reconnect the AC cord and power on the system.
  6. If the system powers up successfully, replace adapters and devices one at a time until the problem is isolated.
  7. If system does not power up from minimal configuration, replace FRUs of minimal configuration one at a time until the problem is isolated.
System shutoff due to "X" current under minimum value (level-critical; current on voltage bus "X" under minimum value)
  1. Power off the system and disconnect the AC cord(s).
  2. Disconnect all external cables and remove server from the rack.
  3. Check for loose cables and short circuits in the power subsystem.
  4. Remove adapters and disconnect the cables and power connectors to all internal and external devices until system is at minimum configuration required for power-on.
  5. Reconnect the AC cord and power on the system.
  6. If the system powers up successfully, replace adapters and devices one at a time until the problem is isolated.
  7. If system does not power up from minimal configuration, replace FRUs of minimal configuration one at a time until the problem is isolated.
System shutoff due to "X" V over voltage (level-critical; system shutoff due to "X" supply over voltage)
  1. Check power supply connectors
  2. Replace power supply
System shutoff due to "X" V under voltage (level-critical; system shutoff due to "X" supply under voltage)
  1. Check power supply connectors
  2. Replace power supply
System shutoff due to VRM "X" over voltage
  1. Replace power supply
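
The current-fault procedure in this table (strip the system to its minimum configuration, then put devices back one at a time) is a linear isolation search. The sketch below illustrates the logic only. `powers_on` is a hypothetical callback that stands in for the servicer physically configuring the system and attempting power-on, and the device names are made up for the example.

```python
def isolate_failing_device(optional_devices, powers_on):
    """Re-add devices one at a time and report where power-on first fails.

    Returns "minimal configuration" if the stripped-down system does not
    power on, the first device whose addition makes power-on fail, or None
    if every device can be re-added without a failure.

    `powers_on(installed)` is a hypothetical callback that takes the list of
    devices currently installed and returns True if the system powers on.
    """
    installed = []
    if not powers_on(list(installed)):
        # Step 7: the fault lies in a FRU of the minimal configuration.
        return "minimal configuration"
    for device in optional_devices:
        # Step 6: re-add one device, then attempt power-on again.
        installed.append(device)
        if not powers_on(list(installed)):
            return device
    return None


# Simulated example: the system fails whenever "drive 1" is installed.
devices = ["adapter 1", "drive 1", "adapter 2"]
print(isolate_failing_device(devices, lambda inst: "drive 1" not in inst))
# drive 1
```

A result of "minimal configuration" corresponds to step 7, where the FRUs of the minimal configuration are swapped one at a time.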

DASD error messages
Message Action
Hard drive "X" removal detected (level-critical; hard drive "X" has been removed)
  1. Information only, take action as appropriate.

Bus fault messages
Message Action
Failure reading I2C device. Check devices on bus 0.
  1. Replace system board
Failure reading I2C device. Check devices on bus 1.
  1. Replace front panel
  2. Replace system board
Failure reading I2C device. Check devices on bus 2.
  1. Replace DASD backplane
  2. Replace system board
Failure reading I2C device. Check devices on bus 3.
  1. Replace system board
Failure reading I2C device. Check devices on bus 4.
  1. Replace DIMM
  2. Replace system board
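
The bus fault table amounts to a lookup from I2C bus number to an ordered list of FRUs to replace. The sketch below transcribes that table into a dictionary and extracts the bus number from the message text. The function name is hypothetical, and the mapping reflects only the entries listed in this document.

```python
import re

# Replacement order per I2C bus, transcribed from the table above.
I2C_BUS_FRUS = {
    0: ["system board"],
    1: ["front panel", "system board"],
    2: ["DASD backplane", "system board"],
    3: ["system board"],
    4: ["DIMM", "system board"],
}


def frus_for_i2c_message(message):
    """Return the FRUs to replace, in order, for an I2C bus fault message.

    Returns an empty list if the message names no bus or an unlisted bus.
    """
    match = re.search(r"Check devices on bus (\d+)", message)
    if match is None:
        return []
    return I2C_BUS_FRUS.get(int(match.group(1)), [])


print(frus_for_i2c_message("Failure reading I2C device. Check devices on bus 2."))
# ['DASD backplane', 'system board']
```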


Document Location

Worldwide

Operating System

IntelliStation Pro: All operating systems listed

Older System x: Operating system independent / None


Document Information

Modified date:
28 January 2019

UID

ibm1MIGR-45211