IBM Support

IT34283: RANDOM PERFORMANCE MONITOR SAMPLES SKIPPED, LOGGING HWNPM4302E

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • Storage Insights occasionally marks the monitoring status as
    ?degraded? due to periodic missed performance data
    samples.
    
    This can lead the user to believe that there?s a
    problem with the device, when in fact this degraded status is
    the result of the data collector agent?s scheduler function,
    polling for sample data at incorrect intervals.
    
    An example of
    this message from the performance monitoring logs visible in
    the Storage Insights user interface:
    
    HWNPM4302E New
    performance data is not yet available for the device.
    
    Error
    	Sep 15, 2020 	04:37:21.847	HWNPM4302E	New performance data is
    not yet available for the device. Statistics with time stamps
    later than 09/15/2020 04:32:22:000 CDT could not be
    found.
    Information	Sep 15, 2020 	04:37:21.907	HWNPM2145I	The
    data is being collected by the data collector:
    hostname.company.com.
    
    The Performance Monitoring scheduler
    which runs within the Data Collector agent has been observed to
    vary the Performance Monitoring interval, which should be a
    consistent 5 minute interval. It has been observed to poll at
    inconsistent length intervals - for example, polling at 4
    minutes, 13 seconds once, then 5 minute, 20 seconds the next
    time, etc.
    
    Confirmation of this issue requires Storage
    Insights support reps to view the performance monitor tracing
    logs, and observe incorrect polling interval timestamps, as
    illustrated here:
    
    sample #1 at 12:27:54
    2020-09-09
    12:27:54.261-0400 xxx-000000000000...XXX StatsCollectorSvcNi
    queryDateTimeInfo EXIT return value = Timestamp= 20200909
    122754 -4:00
    ...2020-09-09 12:27:54.671-0400
    xxx-000000000000...XXX StatsCollectorSvcNi selectFileSet EXIT
    return value = MoreCnt: 0, Selected:
    [Nn_stats_78E00MY-1_200909_122312,...
    
    sample #2 at 12:33:15 (
    5min 21 sec)
    2020-09-09 12:33:15.988-0400
    XXX-000000000000...XXX StatsCollectorSvcNi queryDateTimeInfo
    EXIT return value = Timestamp= 20200909 123315
    -4:00
    ....2020-09-09 12:33:16.394-0400 XXX-000000000000...XXX
    StatsCollectorSvcNi selectFileSet EXIT return value = MoreCnt:
    0, Selected: [Nn_stats_78E00MY-1_200909_123313,...
    
    sample #3
    at 12:37:58 ( 4min 43sec.)
    2020-09-09 12:37:58.813-0400
    XXX-000000000000...XXX StatsCollectorSvcNi queryDateTimeInfo
    EXIT return value = Timestamp= 20200909 123758
    -4:00
    ...2020-09-09 12:37:59.204-0400 XXX-000000000000...XXX
    StatsCollectorSvcNi selectFileSet EXIT return value = MoreCnt:
    0, Selected: [Nn_stats_78E00MY-1_200909_123313,...
    
    sample #4
    at 12:42:39 (4min 41sec)
    2020-09-09 12:42:39.301-0400
    XXX-000000000000...XXX StatsCollectorSvcNi queryDateTimeInfo
    EXIT return value = Timestamp= 20200909 124239
    -4:00
    ...2020-09-09 12:42:39.707-0400 XXX-000000000000...XXX
    StatsCollectorSvcNi selectFileSet EXIT return value = MoreCnt:
    0, Selected: [Nn_stats_78E00MY-1_200909_123814,...
    
    sample #5
    at 12:47:42 (5min 03sec)
    2020-09-09 12:47:42.221-0400
    XXX-000000000000...XXX StatsCollectorSvcNi queryDateTimeInfo
    EXIT return value = Timestamp= 20200909 124742
    -4:00
    ...2020-09-09 12:47:42.690-0400 XXX-000000000000...XXX
    StatsCollectorSvcNi selectFileSet EXIT return value = MoreCnt:
    0, Selected: [Nn_stats_78E00MY-1_200909_124315,...
    
    
    The result
    of this is that the Data Collector will poll the monitored
    storage device, before it is ready to provide the sample data
    based on the previous polling time. When samples are not
    collected, the data collection status changes to degraded,
    which can lead to undue concern and confusion.
    
    
    
    
    
    ____________________________________________________
    
    The
    defect is against component: 5608TPCSI
    Server/Manager
    build/release (TPC): 5.4.0
    
    Problem as described by customer:
    Degraded monitoring status in Storage Insights
    Initial customer
    impact (low/med/high): med
    

Local fix

  • Temporary relief can be achieved by restarting the monitoring
    tasks for affected storage devices.
    

Problem summary

  • ****************************************************************
    * USERS AFFECTED:                                              *
    * IBM Storage Insights users                                   *
    ****************************************************************
    * PROBLEM DESCRIPTION:                                         *
    * Storage Insights occasionally marks the monitoring status    *
    * as "Degraded" due to periodic missed performance data        *
    * samples.                                                     *
    *                                                              *
    * This can lead the user to believe that there is a            *
    * problem with the device, when in fact this degraded          *
    * status is the result of the data collector agent's scheduler *
    *                                                              *
    * function, polling for sample data at incorrect intervals.    *
    *                                                              *
    * The Performance Monitoring scheduler which runs within       *
    * the Data Collector agent has been observed to vary the       *
    * Performance Monitoring interval, which should be a           *
    * consistent 5 minute interval. It has been observed to poll   *
    * at inconsistent length intervals - for example, polling at   *
    * 4 minutes, 13 seconds once, then 5 minute, 20 seconds        *
    * the next time, etc.                                          *
    *                                                              *
    * The result of this is that the Data Collector will poll the  *
    * monitored storage device, before it is ready to provide the  *
    * sample data based on the previous polling time. When         *
    * samples are not collected, the data collection status        *
    * changes                                                      *
    * to Degraded, which can lead to undue concern and confusion.  *
    *                                                              *
    * Confirmation of this issue requires Storage Insights Support *
    *                                                              *
    * team to view the performance monitor tracing logs, and       *
    * observe incorrect polling interval timestamps to review and  *
    * confirm the issue.                                           *
    *                                                              *
    ****************************************************************
    * RECOMMENDATION:                                              *
    ****************************************************************
    

Problem conclusion

  • The fix for this APAR is targeted for the following release:
    
    IBM Storage Insights 2Q21  [ 54X-IBM-SI ]
    ( release target 2Q 2021 / June )
    
    The target dates for future releases do not represent a formal
    commitment by IBM. The dates are subject to change without
    notice.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IT34283

  • Reported component name

    STORAGE INSIGHT

  • Reported component ID

    5608TPCSI

  • Reported release

    540

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2020-09-18

  • Closed date

    2021-05-27

  • Last modified date

    2021-05-27

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    STORAGE INSIGHT

  • Fixed component ID

    5608TPCSI

Applicable component levels

[{"Line of Business":{"code":"LOB26","label":"Storage"},"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SSQRB8","label":"IBM Storage Insights"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"540"}]

Document Information

Modified date:
08 February 2022