A fix is available
APAR status
Closed as program error.
Error description
VSWITCH response to a stalled OSA Express device may lead to PRG004 in HCPLCK+53A when HCPIQR tries to release QIOLOCK. When the OSA Express fails to process the VSWITCH outbound queue (inbound to the OSA card) in a timely fashion, the VSWITCH controller declares a "stalled queue" condition and resets the OSA interface device to recover. If VSWITCH logic in HCPIQRRD is processing a queue of packets intended for the uplink port, the call to the NIDRECV function may trigger the reset (HCPVQATV) which clears an important pointer. If this happens while the RD$REDRV flag is set, the next cycle cannot access the QIOBK and a residual RD$QIOBK flag leads to this attempt to release the QIOLOCK with an invalid pointer.
Local fix
Apply PTF
Problem summary
**************************************************************** * USERS AFFECTED: All customers using a z/VM VSwitch for guest * * network connectivity. * * * **************************************************************** * PROBLEM DESCRIPTION: * **************************************************************** * RECOMMENDATION: APPLY PTF * **************************************************************** This is a base problem related to a VSwitch's processing of network data transmitted to an OSA UpLink Port, while the port transitions from an operational to an inoperable state. In this specific case, the VSwitch Device Driver determines the OSA Adapter is no longer processing packets placed on its output queue for transmission and therefore terminates the network connection to recover. The ABEND may occur when the VSwitch's incoming data handler attempts to recover an OSA Stalled Queue situation. In this case the existing logic appropriately deletes all the pending packets waiting for transfer, issues message HCPSWU2832E "OSA Express device stall detected." and proceeds to terminate the stalled network connection. During this process, addressability to a key QDIO Control Block is lost. Unfortunately, there is a very small window where an additional packet can be queued by another task after we've terminated the network connection, but before we actually perform task termination logic. The new packet will be discovered by task termination, forcing the logic to redrive the incoming data handler instead of terminating the task. Since addressability to the key QDIO Control Block was lost, it attempts to acquire an invalid lock and ABENDs.
Problem conclusion
z/VM's recovery for a Stalled QDIO Queue, appropriately deletes all pending packets waiting for transfer, issue message HCPSWU2832E "OSA Express device stall detected." and proceeds to terminate the stalled network connection. During this process, addressability to a key QDIO Control Block is lost causing HCPIQRRD to ABEND. Applying this fix will insure all key control blocks are preserved when performing OSA Stalled Queue Recovery.
Temporary fix
********* * HIPER * ********* FOR RELEASE VM/ESA CP/ESA R640 : PREREQ: VM66357 CO-REQ: NONE IF-REQ: NONE FOR RELEASE VM/ESACP/ESAR710 : PREREQ: VM66357 CO-REQ: NONE IF-REQ: NONE
Comments
APAR Information
APAR number
VM66359
Reported component name
VM CP
Reported component ID
568411202
Reported release
710
Status
CLOSED PER
PE
NoPE
HIPER
YesHIPER
Special Attention
NoSpecatt / Xsystem
Submitted date
2020-01-30
Closed date
2020-03-11
Last modified date
2021-06-29
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
UM35612 UM35613
Modules/Macros
HCPIQR
Fix information
Fixed component name
VM CP
Fixed component ID
568411202
Applicable component levels
Fix is available
Select the PTF appropriate for your component level. You will be required to sign in. Distribution on physical media is not available in all countries.
[{"Business Unit":{"code":"BU011","label":"Systems - zSystems software"},"Product":{"code":"SG27M"},"Platform":[{"code":"PF054","label":"z\/OS"}],"Version":"710","Line of Business":{"code":"LOB16","label":"Mainframe HW"}}]
Document Information
Modified date:
30 June 2021