IBM Support

IJ24885: USING NPIV WITH VIOS 3.1.1 CAN CAUSE A CRASH APPLIES TO AIX 7200-04

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • **************************************************************
    * USERS AFFECTED:
    * Systems running VIOS 3.1.1.x with
    * devices.vdevice.IBM.vfc-server.rte at
    * or below 7.2.4.1.
      **************************************************************
    * ERROR DESCRIPTION:
    * This is only an issue using NPIV with VIOS 3.1.1.
    * A VIOS partition with Virtual Fibre Channel (NPIV) adapters
    * can crash during LPM migration/validation, a Simplified
    * Remote Restart (SRR), a VM Recovery Manager HA/DR Operation,
    * a client LPAR boot, SAN changes, I/O timeouts, etc.
    * In a dual VIOS environment, client LPARs should maintain
    * network and storage connectivity through the alternate VIOS
    * during this crash.
    *
    * The stack of the crash will be similar to below:
    * ÝF1000000C0209A14¨vfc_host:npiv_dequeue+000014
    * ÝF1000000C01FDFA0¨vfc_host:npiv_release_cmd_list@AF81_26
    * ÝF1000000C01EFC58¨vfc_host:cmdh_send_messages+0004B8
    * ÝF1000000C01F0850¨vfc_host:npiv_crq_reader+0001F0
    * ÝF1000000C01F28D4¨vfc_host:npiv_cmd_handler+0000B4
    * ÝF1000000C01F3AD0¨vfc_host:npiv_cmdh_thr_sched+0000D0
    * ÝF1000000C01F2CD0¨vfc_host:npiv_thr_sched_run+000070
    * Ý00014D70¨.hkey_legacy_gate+00004C ()
    *
    * IBM recommends applying the fix to prevent hitting the issue.
    * Disabling src and dest lun level validation on the VIOS
    * during
    * LPM/validation, and avoiding the operations listed above may
    * reduce the chances of hitting the crash until the fix can be
    * applied.
    * You can disable src and dest lun level validation using the
    * following padmin commands on the VIOS. These changes are
    * dynamic:
    *
    * $ chdev -dev vioslpm0 -attr src_lun_val=off
    * $ chdev -dev vioslpm0 -attr dest_lun_val=restart_off
      **************************************************************
    * RECOMMENDATION:
    * Install APAR IJ24885.
    * Prior to fix availability, an interim fix is available from
    * either
    * ftp://aix.software.ibm.com/aix/ifixes/ij24885/
    * https://aix.software.ibm.com/aix/ifixes/ij24885/
    * The ifix can be installed using Live Update (LU).
    * If LU is not used, installation of the ifix requires a
    * reboot.
      **************************************************************
    

Local fix

Problem summary

  • The VIOS may crash in function npiv_dequeue.  This is most
    likely to occur during LPM or when the client NPIV driver
    performs error recovery (for example, if the fabric is in a bad
    state).
    

Problem conclusion

  • Hold proper lock when accessing command queue.
    

Temporary fix

  •   *********
      * HIPER *
      *********
    

Comments

  • 7200-04 - use AIX APAR IJ24885
    

APAR Information

  • APAR number

    IJ24885

  • Reported component name

    AIX V7.2

  • Reported component ID

    5765CD200

  • Reported release

    720

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Submitted date

    2020-05-12

  • Closed date

    2020-05-18

  • Last modified date

    2020-11-16

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

    IJ24886 U887730

Fix information

  • Fixed component name

    AIX V7.2

  • Fixed component ID

    5765CD200

Applicable component levels

  • R720 PSY U887730

       UP20/07/20 I 1000 Ž

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSVEF8","label":"AIX 7.2 Enterprise Edition"},"Platform":[{"code":"PF053","label":"Power Systems"}],"Version":"720","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}}]

Document Information

Modified date:
17 November 2020