ESS known issues
Known issues in ESS
For information about ESS 5.3.7.x known issues, see Known issues in ESS 5.3.7.x Quick Deployment Guide.
Issue | Resolution or action |
---|---|
After initial deployment, the EMS may show SERVER2U instead of 5105-22E as the
MTM. Product
|
|
The Ansible tool essrun cannot add more than one building block at a
time in a cluster. Product
|
If it is necessary to add more than one building block in a cluster, the following two options
are available:
|
During upgrade, if the container had an unintended loss of connection with the target
canister(s), there might be a timeout of up to 2 hours in the Ansible® update task. Product
|
Wait for the timeout and retry the essrun update task. |
When running essrun commands, you might see messages such as
these:
Product
|
This is a restriction in the Ansible timestamp
module. It shows timestamps even for the “skipped” tasks. If you want to remove timestamps from the
output, change the ansible.cfg file inside the container as follows:
|
After reboot of an ESS 5000 node, systemd could be loaded incorrectly. Users might see the
following error when trying to start
GPFS:
Product
|
Power off the system and then
power it on again.
|
In ESS 5000 SLx series, after pulling a hard drive out for a long time wherein the drive
has finished draining, when you re-insert the drive, the drive could not be
recovered. Product
|
Run the following command from EMS or IO node to revive the
drive:
Where RGName is the recovery group that the drive belongs to and PdiskName is the drive's pdisk name. |
After the deployment is complete, if firmware on the enclosure, drive, or HBA adapter does
not match the expected level, and if you run essinstallcheck, the following
mmvdisk settings related error message is
displayed:
Product
|
The error about mmvdisk settings can be ignored. The resolution is to update the mismatched firmware levels on enclosure, adapter, or HBA adapters to the correct levels. You can run the mmvdisk configuration check command to confirm. The mmvdisk settings do not match best practices. Run the mmvdisk server configure --verify --node-class <nodeclass> command. List the mmvdisk node classes: mmvdisk nc
list
Note: essinstallcheck detects inconsistencies from
mmvdisk best practices for all node classes in the cluster and stops immediately
if an issue is found.
|
When running essinstallcheck you might see an error message similar
to: Only one time in the
containerProduct
|
Run vpdupdate on each I/O node. Rerun essinstallcheck which should properly query the firmware level. |
During command-less disk replacement, there is a limit on how many disks can be replaced at
one time. Product
|
For command-less disk replacement using commands, only replace up to 2 disks at a time. If command-less disk replacement is enabled, and more than 2 disks are replaceable, replace the 1st 2 disks, and then use the commands to replace the 3rd and subsequent disks. |
Issue reported with command-less disk replacement warning LEDs. Product
|
The replaceable disk will have the amber led on, but not blinking. Disk replacement should still succeed. |
After upgrading an ESS node to
version, the pmsensors service needs
to be manually started.Product
|
After the ESS upgrade is complete, the
pmsensors service does not automatically start. You must manually start the service
for performance monitoring to be restored. On each ESS node, run the following
command:
For checking the
status of the service, run the following command:
|
The canister_failed event does not surface amber LED on the
canister or the enclosure LED front panel. Product
|
Root cause: The failed canister is not the master canister, and the other canister is not
up/running. Action required: No |
Migration from ESS Legacy releases (5.3.7.x) to the container version (ESS 6.1.x.x) might revert values in the mmvdisk to default settings. Product
|
For more information about this issue, see IBM Support. |
Node call home might not work for nodes that are designated as protocol nodes. If a power
supply (or any other Opal related node problem) is damaged or pulled, a call home will not be
available on the Salesforce system. Opal PRD might not log error from FSP that is caussing this issue. Product
|
Determine a power supply problem by manually inspecting the ASMI error/event logs by using FSP and open problem with support if required. |
If the essrun gui –configure command is run after the GUI and
performance monitoring is already set up, you might get an error prompting you to remove any
existing GUI config before continuing. Product
|
If the GUI is already set up, it is not required to remove the existing GUI config. Exit
the container.
|
The mmcallhome ticket list reports multiple tickets opened for the same issue. Product
|
On the EMS check if there are more duplicates events in the queue to be sent to IBM, issue the
following command:
If this directory contains more entries of duplicate call home events:
|
The mmcallhome ticket list still reports “New Case Opened” after the PMR
is closed by IBM. Product
|
Remove the
ticket.
|
After deploying the protocol VM on an ESS 3500 canister the Mellanox OFED driver is not installed. Example:
Product
|
|
Cannot create CES file system, if I/O nodes are deployed with versions prior to 6.1.2.0 by using the essrun command. Example of old naming convention:
Product
|
Ansible tries to gather the RG by using the new name format.
Example:
Create the CES file system by using the mmvdisk command directly in the EMS or
any I/O node in the cluster.
|
During the file system creation in Mixed environments (ESS 5000 and ESS 3500), the following
error can appear:
An exception occurred during task execution. To see the full traceback, use -vvv. The error was: ZeroDivisionError: division by zero Bad access to a specific variable of ESS 5000 I/O node causes this issue. Produce
|
|
BMC network may become unresponsive when configured with VLAN. VLAN configuration failed to properly activate in the BMC network stack. Product
|
|
Amber LED on the power supply may flash or turn solid without any amber LED in the front of
the enclosure. Power supply may incorrectly detect out of range operating parameters such as incoming voltage or power supply temperature. Product
|
|
The esscallhomeconf command may not be able to automatically create call home
group. It may present the following
message:
Product
|
From the EMS:
|