Installation and upgrade issues

Use the troubleshooting tips and tricks in IBM Storage Fusion installation and upgrade.

Update operator OOMKilled error

To resolve the OOMKilled issue for the update operator, do the following resolution steps:
Note: If the pod is in a CrashLoopBackOff state, delete the isf-update-operator-* pod. The pod comes back to running state for a couple of minutes. Do steps 1-4 in the following resolution steps when the update operator pod is in Running state.
  1. In the OpenShift® Container Platform console, go to Home > Search.
  2. Search for UpdateManager in the Resources drop-down list.
  3. In the UpdateManagers, open the version instance.
  4. Go to the YAML tab.
  5. Increase the memory limit in spec.resources.limits.memory.
  6. After a couple of minutes, check whether the IBM Storage Fusion clusterserviceversion object (Operators > Installed Operators > IBM Storage Fusion operator > YAML tab) reflects the updated limit set for the update operator:
    • Search for the deployment name of the update operator (isf-update-operator-controller-manager) from the list of deployments under spec.install.spec.deployments.
    • In the specified deployment object, search for the container name manager under the spec.template.spec.containers. Also, check whether the limits.memory is equal to the one in the UpdateManager CR. If not equal, change the memory under limits.memeory to the same limits value as mentioned in the UpdateManager CR in step 5.
    • Go to Workloads > Deployments > isf-update-operator-controller-manager > YAML tab and check whether the limits.memory is equal to the limit set in the previous step. If not equal, change the memory under limits.memory to the same limits value as mentioned in the previous steps.

x509: certificate signed by an unknown authority

The x509: certificate signed by an unknown authority error can occur when you trigger a service or firmware upgrade. A sample error is as follows:
Internal error occurred: failed calling webhook "mupdatemanager.kb.io": failed to call webhook: Post "https://isf-update-operator-controller-manager-service.ibm-spectrum-fusion-ns.svc:443/mutate-update-isf-ibm-com-v1-updatemanager?timeout=10s": x509: certificate signed by unknown authority
Do the following resolution steps:
  1. In the OpenShift Container Platform console, go to Home > Search.
  2. From the Resources drop-down list, select MutatingWebhookConfiguration.
  3. Select the Label drop-down list and change it to Name.
  4. Search for mupdatemanager. Check whether there are more than one instance of mupdatemanager.* webhook. If so, take a backup of the older one and delete it.
  5. Go back to Home > Search page.
  6. From the Resources, select ValidatingWebhookConfiguration.
  7. Search for vupdatemanager. Check whether there are more than one instance of vupdatemanager.* webhook. If so, take the backup of the older one and delete it.