A fix is available
APAR status
Closed as program error.
Error description
************************************************************** * USERS AFFECTED: * Systems running VIOS 3.1.1.x with * devices.vdevice.IBM.vfc-server.rte at * or below 7.2.4.1. ************************************************************** * ERROR DESCRIPTION: * This is only an issue using NPIV with VIOS 3.1.1. * A VIOS partition with Virtual Fibre Channel (NPIV) adapters * can crash during LPM migration/validation, a Simplified * Remote Restart (SRR), a VM Recovery Manager HA/DR Operation, * a client LPAR boot, SAN changes, I/O timeouts, etc. * In a dual VIOS environment, client LPARs should maintain * network and storage connectivity through the alternate VIOS * during this crash. * * The stack of the crash will be similar to below: * ÝF1000000C0209A14¨vfc_host:npiv_dequeue+000014 * ÝF1000000C01FDFA0¨vfc_host:npiv_release_cmd_list@AF81_26 * ÝF1000000C01EFC58¨vfc_host:cmdh_send_messages+0004B8 * ÝF1000000C01F0850¨vfc_host:npiv_crq_reader+0001F0 * ÝF1000000C01F28D4¨vfc_host:npiv_cmd_handler+0000B4 * ÝF1000000C01F3AD0¨vfc_host:npiv_cmdh_thr_sched+0000D0 * ÝF1000000C01F2CD0¨vfc_host:npiv_thr_sched_run+000070 * Ý00014D70¨.hkey_legacy_gate+00004C () * * IBM recommends applying the fix to prevent hitting the issue. * Disabling src and dest lun level validation on the VIOS * during * LPM/validation, and avoiding the operations listed above may * reduce the chances of hitting the crash until the fix can be * applied. * You can disable src and dest lun level validation using the * following padmin commands on the VIOS. These changes are * dynamic: * * $ chdev -dev vioslpm0 -attr src_lun_val=off * $ chdev -dev vioslpm0 -attr dest_lun_val=restart_off ************************************************************** * RECOMMENDATION: * Install APAR IJ24885. * Prior to fix availability, an interim fix is available from * either * ftp://aix.software.ibm.com/aix/ifixes/ij24885/ * https://aix.software.ibm.com/aix/ifixes/ij24885/ * The ifix can be installed using Live Update (LU). * If LU is not used, installation of the ifix requires a * reboot. **************************************************************
Local fix
Problem summary
The VIOS may crash in function npiv_dequeue. This is most likely to occur during LPM or when the client NPIV driver performs error recovery (for example, if the fabric is in a bad state).
Problem conclusion
Hold proper lock when accessing command queue.
Temporary fix
********* * HIPER * *********
Comments
7200-04 - use AIX APAR IJ24885
APAR Information
APAR number
IJ24885
Reported component name
AIX V7.2
Reported component ID
5765CD200
Reported release
720
Status
CLOSED PER
PE
NoPE
HIPER
YesHIPER
Submitted date
2020-05-12
Closed date
2020-05-18
Last modified date
2020-11-16
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
IJ24886 U887730
Fix information
Fixed component name
AIX V7.2
Fixed component ID
5765CD200
Applicable component levels
R720 PSY U887730
UP20/07/20 I 1000
[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSVEF8","label":"AIX 7.2 Enterprise Edition"},"Platform":[{"code":"PF053","label":"Power Systems"}],"Version":"720","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}}]
Document Information
Modified date:
17 November 2020