Preparing the 9009-41A, 9009-41G, 9009-42A, 9009-42G, 9223-42H, or 9223-42S system to remove and replace a disk drive or solid-state drive

To prepare the system to remove and replace a disk drive or solid-state drive (SSD), complete the steps in this procedure.

Procedure

  1. Back up the data on the drive you plan to remove depending on the operating system you are using.
    • AIX®: Ensure that all data on that drive or the array that contains the drive is backed up and removed from the drive.
      • If the drive that is being replaced is protected by Raid-5, Raid-6, or Raid-10 and the array remains functional after removing the drive, it is not necessary to back up or remove data.
      • If the drive you are removing is a mirrored drive, you must suspend mirror protection on the drive before you remove the drive. Contact your system administrator for assistance.
      • If the drives are just a bunch of disks (JBOD) or a Raid array that will not remain functional when the drive is removed (that is, Raid-0), ensure that the disks are in the defined state by completing the following steps:
        1. Remove data access methods from the JBOD or failing array (that is, file systems, databases, and so on).
        2. Delete the array.
        3. Unconfigure the JBOD.
    • IBM® i: Ensure that all data on that drive or the array that contains the drive is backed up and removed from the drive. If the drive that is being replaced is protected by RAID-5, RAID-6, RAID-10, or mirroring, it is not necessary to back up or remove data.
    • Linux®: Ensure that all data on that drive or the array that contains the drive is backed up and removed from the drive.
      • If the drive that is being replaced is protected by RAID, it is not necessary to back up or remove data.
      • If the drive you are removing is mirrored, you need to break the mirror before you remove the drive. Contact your system administrator for assistance.
      • If the drives are just a bunch of disks (JBOD), ensure that the disks are in the defined state.
  2. Review the drive slot and service indicator locations. The disk drive and SSD slots are located in the front of the system.

    Figure 1 and Figure 2 show the disk drive or solid-state drive, and the service indicator locations for a rack-mounted and a stand-alone system.

    Figure 1. Disk drive or solid-state drive and service indicator locations in a rack-mounted 9009-41A, 9009-41G, 9009-42A, 9009-42G, 9223-42H, or 9223-42S system
    Disk drive or solid-state drive and service indicator locations in a rack-mounted 9009-41A, 9009-41G, 9009-42A, 9009-42G, 9223-42H, or 9223-42S system.
    Note: If you have an RDX drive, locations P2-D13 through P2-D18 are not available for disk drives or SSDs.
    Figure 2. Disk drive or solid-state drive and service indicator locations in a stand-alone 9009-41A or 9009-41G system
    Disk drive or solid-state drive and service indicator locations in a stand-alone 9009-41A or 9009-41G system.
    Note: If you have an RDX drive, locations P2-D13 through P2-D18 are not available for disk drives or SSDs.
  3. Choose from the following options:
    Note: You must use the power off option if the drive that you are removing is in the AIX or Linux root volume group (rootvg) or the IBM i configured unit, and it is not protected by either Redundant Array of Independent Disks (RAID) or mirroring.
    • If you want to remove a drive when the system power is turned off, continue with step 4.
    • If you want to remove a drive by using the AIX operating system when the system power is turned on, continue with step 6.
    • If you want to remove a drive by using the IBM i operating system when the system power is turned on, continue with step 8.
    • If you want to remove a drive by using the Linux operating system when the system power is turned on, continue with step 10.
  4. To prepare the system to remove a drive when the system is powered off, complete the following steps:
    1. Turn on the identify LED for the drive to be removed and for the system.
      For instructions, see Identifying a part.
      Use the blue identify LED on the enclosure to locate the system. Ensure that the serial number of the system matches the serial number to be serviced.
    2. Record the position where the disk drive or solid-state drive will be removed. For example, the drive slot might be P1-D3 or P2-D3.
    3. Stop the system. For instructions, see Stopping a system.
    4. If applicable, open the rack door at the rear of the system.
    5. Label and disconnect the power cords from the system unit.
      See Figure 3 or Figure 4.
      Notes:
      • This system might be equipped with two or more power supplies. If the removing and replacing procedures require the system power to be turned off, ensure that all the power sources to the system are disconnected.
      • The power cord (B) is fastened to the system with hook-and-loop fastener (A). If you are placing the system in a service position after you disconnect the power cords, ensure that you unstrap the fastener.
      Figure 3. Removing the power cords from a rack-mounted server
      Removing the power cords from a rack-mounted server
      Figure 4. Removing the power cords from a stand-alone server
      Removing the power cords from a stand-alone server
      (L003)
      L003 label image
      or
      L003 label image
      or
      L003 label image
      or
      L003 label image
      or
      L003 label image
      L003 label image
      L003 label image
      DANGER
      Multiple power cords. The product might be equipped with multiple AC power cords or multiple DC power cables. To remove all hazardous voltages, disconnect all power cords and power cables. (L003)
  5. Continue with step 11.
  6. To prepare the system to remove a drive by using the AIX operating system, complete the following steps:
    1. Log in as root user.
    2. Type diag at the command line and then press Enter.
    3. On the Diagnostic Operating Instructions display, press Enter to continue.
    4. On the Function Selection display, select Task Selection > RAID Array Manager > IBM SAS Disk Array Manager > Diagnostics and Recovery Options > SCSI and SCSI RAID Hot Plug Manager.
    5. Identify the location of the drive to be removed by selecting Identify a Device Attached to an SCSI Hot Swap Enclosure Device.
    6. Choose the slot corresponding to the drive, and then press Enter.
      Notes: If the device is in use and cannot be removed, complete the following actions:
      • Ensure that the device is not in use and that the device can be unconfigured.
      • If the device is in a Raid array that will go to a failed state when you remove the drive, the array must be deleted first. See Deleting a disk array.
      • If the drive you are removing is mirrored, you will need to break the mirror before you remove the drive. Contact your system administrator for assistance.
    7. Verify that the identify LED for the slot is flashing rapidly, and record the location of the drive.
    8. Press Enter to stop the LED from flashing and continue.
    9. Prepare to remove the drive by pressing F3 to return to the SCSI and SCSI RAID Hot Plug Manager display.
    10. Select Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device.
    11. Select the drive that you want to remove and press Enter.
      Follow the on-screen prompts.
  7. Continue with step 11.
  8. To prepare the system to remove a drive by using the IBM i operating system, complete the following steps:
    1. Determine the protection status of the drive that you are replacing by completing the following steps:
      1. Sign on with at least service level authority.
      2. Type strsst on the command line of the IBM i session, and then press Enter.
      3. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display, and then press Enter.

        The service tools password is case-sensitive.

      4. Select Work with disk units > Display Disk Configuration > Display Disk Configuration Status.
        A list of each auxiliary storage pool (ASP) displays, showing the drives that are members of the ASP. The Status column shows one of the following protection statuses for the ASP:
        • Mirrored. These ASPs are configured for mirrored protection. If the failing drive has mirrored protection, record the status of both drives in the mirrored pair. You need this information in the recovery procedure.
        • Unprotected. An ASP with a status of Unprotected might contain drives that are device-parity protected. Record the status of the failing drive as it is shown in the Status column of the Display Disk Configuration Status display.
        Tip: Use the F1 (Help) key for more information about the meaning of a status for any drive.
    2. Determine whether the drive to be removed is a non-configured drive by completing the following steps:
      1. Type strsst on the command line of the IBM i session, and then press Enter.
      2. Type your service tools user ID and service tools password, and then press Enter.

        The service tools password is case-sensitive.

      3. Select Work with disk units > Display Disk Configuration > Display non-configured units, and then press Enter.

        Is the failed drive listed as a non-configured drive in the display?

        • No: You must suspend the drive. Continue with step 8.c.
        • Yes: Determine whether the drive to be removed was replaced by a hot spare when it failed. Continue with step 8.e.
    3. Verify that the mirrored drive that you are removing has a status of Suspended.
      1. Sign on with service level authority.
      2. Type strsst on the command line of the IBM i session and then press Enter.
      3. Type your service tools user ID and service tools password in the Start Service Tools Sign On display. Press Enter.

        The service tools password is case-sensitive.

      4. Select Work with disk units > Display Disk Configuration > Display Disk Configuration Status.
        Does the drive that is mirrored to the drive that you are replacing (same drive number) have a status of Active?
        • No: A service provider must replace the drive. Contact your next level of support.
        • Yes: Continue with the next step.
    4. Does the drive that you are replacing have a status of Suspended?
      • Yes: Press F3 repeatedly to exit System Service Tools and return to the main menu. Then, continue with the next step.
      • No: To suspend mirrored protection on the drive that you are replacing, complete the following steps:
        1. Press F3 from the Display Disk Configuration display to return to the Work with Disk Units display.
        2. Select Work with Disk Unit Recovery > Suspend mirrored protection and select the option to suspend the drive that you are replacing.
        3. Press F3 repeatedly to exit System Service Tools and return to the main menu.
    5. Select the concurrent maintenance option within the Service Action Log. Record the drive location that is shown.
      Then, go to step 8.j.
    6. From the Hardware Service Manager display, navigate to the Select Device Concurrent Maintenance option. Press Enter.
    7. Choose from the following options:
      • If the drive location is shown on the Select Device Concurrent Maintenance display, continue with step 8.j.
      • If the drive location is not shown on the Select Device Concurrent Maintenance display, continue with the next step.
    8. If the physical location does not show on the Device Concurrent Maintenance display, you must type the physical location where you are replacing the drive.
      Notes:
      • The physical location has the following form: U78D3.001.AAAXXXX-P2-D3 where U78D3.001 is the system identifier, AAAXXXX is the sequence number, and P2-D3 is the drive slot location. Find the physical location U78D3.001.AAAXXXX information from the control panel display on the system.
      • If you cannot find the location information from the control panel, find the label that is located on the front of the system or enclosure. The system identifier is listed after FC and the sequence number is the last 7 digits after SN or SEQ.
        Figure 5. Example of a system or enclosure label
        Example of a system or enclosure label
      Use Table 1 to find the system machine type and model that corresponds to the system identifier.
      Table 1. System identifier for POWER9™ systems and enclosures
      System identifier System
      U78D3.001 5105-22E, 9008-22L, 9009-22A, 9009-22G, 9223-22H, or 9223-22S
      U78D2.001 9009-41A, 9009-41G, 9009-42A, 9009-42G, 9223-42H, or 9223-42S
      U78D4.001 9040-MR9
      U5887.001 5887 disk drive enclosure
      UESLL.001 ESLL storage enclosure
      UESLS.001 ESLS storage enclosure

      In the example, U5887.001.AAAXXXX-D1-D24, you would install a drive in the 5887 disk drive enclosure with type 5887, model 001, sequence number AAAXXXX, and drive position D1–D24.

    9. To set the time delay for removing the drive by using the IBM i operating system, complete the following steps:
      1. Select 1 (Remove device) for the action to be completed.
      2. Set the time delay in minutes. For example, for five minutes, enter: 05.
        Note: You can set an appropriate time delay of 01 - 19 minutes to allow adequate time to access the drive.
        Important: Do not press Enter now.
      3. Locate the service indicator that corresponds to the position of the drive that you are removing from a system. The service indicators are located above the drive latch.
  9. Continue with step 11.
  10. To prepare the system to remove a drive by using the Linux operating system, complete the following steps:
    1. Log in as root user.
    2. Type iprconfig on the command line of the Linux session, and then press Enter.
      The IBM Power RAID Configuration Utility display is shown.
    3. Select Analyze log > View most recent ipr error messages.
    4. Find the entry in the log file for the drive you want to replace.
    5. Record the location information for the drive.
      Note: The location information is in either of the following forms:
      0:0:5:0
      In this example, 0 is the SCSI host number, 0 is the SCSI bus, 5 is the SCSI target ID, and 0 is the logical unit number (LUN).
      0/00-0E-02
      In this example, 0 is the SCSI host number, 00 is the I/O adapter (IOA) SAS port, 0E is the expander port, and 02 is the device port.
    6. Type q twice.
      The IBM Power RAID Configuration Utility display is shown.
    7. Select Display hardware status. Press Enter.
    8. Look for the drive at the SCSI location you recorded.
    9. Type q to return to the IBM Power RAID Configuration Utility display.
    10. Select 3. Work with disk unit recovery > 2. Concurrent remove device. Press Enter.
    11. Type 1 (Select) next to the location for this drive (0:0:5:0 or 0/00-0E-02).
      The Verify Device Concurrent Remove display is shown. The service indicator flashes for that drive slot. Follow the on-screen prompts.
  11. Find the package that contains the new drive.
    Attention: Drives are fragile. Handle with care.
  12. Attach the electrostatic discharge (ESD) wrist strap.
    The ESD wrist strap must be connected to an unpainted metal surface until the service procedure is completed, and if applicable, until the service access cover is replaced.
    Attention:
    • Attach an electrostatic discharge (ESD) wrist strap to the front ESD jack, to the rear ESD jack, or to an unpainted metal surface of your hardware to prevent the electrostatic discharge from damaging your hardware.
    • When you use an ESD wrist strap, follow all electrical safety procedures. An ESD wrist strap is used for static control. It does not increase or decrease your risk of receiving electric shock when using or working on electrical equipment.
    • If you do not have an ESD wrist strap, just prior to removing the product from ESD packaging and installing or replacing hardware, touch an unpainted metal surface of the system for a minimum of 5 seconds. If at any point in this service process you move away from the system, it is important to again discharge yourself by touching an unpainted metal surface for at least 5 seconds before you continue with the service process.
  13. Remove the drive from the static-protective package and place it on an ESD mat.