subscribe iconSubscribe to this information

Verifying a high-speed link, system PCI bus, or a multi-adapter bridge repair

Use this procedure to verify a repair for the high-speed link, a system PCI bus, or for a multi-adapter bridge.

Within this procedure, the terms "system" and "logical partition" are interchangeable when used individually.
  1. Perform this procedure from the logical partition you were in when you were sent to this procedure, or from the HMC if this error was worked from the HMC.
  2. If you previously powered off a system or logical partition, or an expansion unit during this service action, then you need to power it off again.
  3. Install all cards, cables, and hardware, ensuring that all connections are tight. You can use the system configuration list to verify that the cards are installed correctly.
  4. Power on any expansion unit, logical partition or system unit that was powered off during the service action. Is one of the following true?
    • If the system or a logical partition was powered off during the service action, does the IPL complete successfully to the IPL or does Install the System display?
    • If an expansion tower was powered off during the service action, does the expansion tower power on complete successfully?
    • If any IOP or IOA card locations were powered off using concurrent maintenance during the service action, do the slots power on successfully?
    • If you exchanged a FRU that should appear as a resource or resources to the system, such as an IOA, or I/O bridge, does the new FRU's resource appear in HSM as operational?
      • Yes: Continue with the next step.
      • No: Verify that you have followed the power off, remove and replace, and power-on procedures correctly. When you are sure that you have followed the procedures correctly, then exchange the next FRU in the list. If there are no more FRUs to exchange, then contact your next level of support. This ends the procedure.
  5. Does the system or logical partition have mirrored protection? Select Yes if you are not sure.
    • No: Continue with the next step.
    • Yes: From the Dedicated Service Tools (DST) display, select Work with disk units, and resume mirrored protection for all units that have a suspended status.
  6. Choose from the following options:
    • If you are working from a partition, from the Start a Service Tool display, select Hardware service manager and look for the I/O processors that have a failed or missing status.
    • If you are working from a HMC, look at the CEC properties.
    1. Choose the I/O tab.
    2. Look for IOAs or IOPs that have a failed or missing status.

    Are all I/O processor cards operational?

    Note: Ignore any IOPs that are listed with a status of not connected.
    • Yes: Go to step 10.
    • No: Display the logical hardware resource information for the non-operational I/O processors. For all I/O processors and I/O adapters that are failing; record the bus number (BBBB), board (bb) and card information (Cc). Continue with the next step.
  7. Perform the following:
    1. Return to the Dedicated Service Tools (DST) display.
    2. Display the Product Activity Log.
    3. Select All logs and search for an entry with the same bus, board, and card address information as the non-operational I/O processor. Do not include informational or statistical entries in your search. Use only entries that occurred during the last IPL.
      Did you find an entry for the SRC that sent you to this procedure?
      • No: Continue with the next step.
      • Yes: Ask your next level of support for assistance. This ends the procedure.
  8. Did you find a B600 6944 SRC that occurred during the last IPL?
    • Yes: Continue with the next step.
    • No: A different SRC is associated with the non-operational I/O processor. Go to the System reference code list and look up the new SRC to correct the problem. This ends the procedure.
  9. Is there a B600 xxxx SRC that occurred during the last IPL other than the B600 6944 and informational SRCs?
    • Yes: Use the other B600 xxxx SRC to determine the problem. Go to the System reference code list and look up the new SRC to correct the problem. This ends the procedure.
    • No: You connected an I/O processor in the wrong card position. Use the system configuration list to compare the cards. When you have corrected the configuration, go to the start of this procedure to verify the bus repair. This ends the procedure.
  10. If in a partition, use the hardware service manager function to print the system configuration list.
    Are there any configuration mismatches?
    • No: Continue with the next step.
    • Yes: Ask your next level of support for assistance. This ends the procedure.
  11. You have verified the repair of the system bus.
    1. If for this service action only an expansion unit was powered off or only the concurrent maintenance function was used for an IOP or IOA, then continue with the next step.
    2. Otherwise, perform the following to return the system to the customer:
      1. Power off the system or logical partition. See Powering on and powering off for details.
      2. Select the operating mode with which the customer was originally running.
      3. Power on the system or logical partition.
  12. If the system has logical partitions and the entry point SRC was B600 xxxx, then check for related problems in other logical partitions that could have been caused by the failing part. This ends the procedure.

Send feedback | Rate this page

Last updated: Fri, Oct 30, 2009