IBM Support

IJ21071: NMI WATCHDOG: BUG: SOFT LOCKUP - CPU STUCK [NFSD]

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • On RHEL 7 nodes (pre-Linux kernel v3.18), in the GPFS kernel NFS
    support
    
    environment, GPFS may try to acquire some mutex, while holding
    an inode
    
    spin lock, which may be detected as a soft lockup issue by the
    kernel NMI watchdog.
    

Local fix

Problem summary

  • On RHEL 7 nodes (pre-Linux kernel v3.18), in the GPFS kernel NFS
     support
    
    environment, GPFS may try to acquire some mutex, while holding
    an inode
    
    spin lock, which may be detected as a soft lockup issue by the
    kernel NMI watchdog.
    

Problem conclusion

  • Benefits of the solution, in customer terms:
    
    Avoid CPU stuck and performance impacts
    
    Work around:
    
    None
    
    Problem trigger:
    
    GPFS breaks some spin lock holding policy in NFS support
    environment
    
    Symptom:
    
    Performance Impact/CPU stuck
    
    Platforms affected:
    
    All RHEL 7.x
    
    Functional Area affected:
    
    All users of KNFS/CNFS
    
    Customer Impact:
    
    High Importance
    

Temporary fix

Comments

APAR Information

  • APAR number

    IJ21071

  • Reported component name

    GPFS NR POWER E

  • Reported component ID

    5725Q01NP

  • Reported release

    423

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-11-20

  • Closed date

    2019-11-20

  • Last modified date

    2019-11-20

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    GPFS NR POWER E

  • Fixed component ID

    5725Q01NP

Applicable component levels

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SS6JZK","label":"IBM Spectrum Scale RAID"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"423","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SS6JZK","label":"IBM Spectrum Scale RAID"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"423","Edition":"","Line of Business":{"code":"LOB26","label":"Storage"}}]

Document Information

Modified date:
08 March 2021