IBM Support

Red Hat OpenShift Platform (OCP) Troubleshooting for IBM Cloud Paks

Troubleshooting


Problem

Cloud Pak users might run into Red Hat OpenShift Platform issues or have questions. The following list of OCP assets are frequently used by support teams when resolving OCP cases.

Resolving The Problem

Title

URL

Description

Consolidated Troubleshooting Article OpenShift Container Platform 4.x

https://access.redhat.com/articles/4217411

This Consolidated Document is an index for information related to Red Hat OpenShift Container Platform 4.x. It covers:

  • General Troubleshootins
  • Troubleshooting OpenShift Components
  • Basic OpenShift How-Tos
  • OpenShift FAQ

ETCD performance troubleshooting guide for OpenShift Container Platform

https://access.redhat.com/articles/6271341

Issues covered:

  • ETCD alerts from etcd-cluster-operator like:

    etcdHighFsyncDurations

    etcdInsufficientMembers

    etcdMembersDown

    etcdNoLeader

    etcdBackendQuotaLowSpace

    etcdGRPCRequestsSlow

  • Random timeouts in the cluster.
  • The cluster doesn't look stable.
  • oc login doesn't work every time.
  • ETCD members where RAFT indexes mismatch (which could mean that some members are not fast enough or have connection issues).
  • Problem collecting must-gather (RHOCP 4) due to timeouts.

How to graph etcd metrics using Prometheus to gauge Etcd performance in OpenShift

https://access.redhat.com/solutions/5489721

Issues covered:

  • How can I graph etcd etcd_disk_wal_fsync_duration, etcd_disk_backend_commit_duration and etcd_disk_backend_commit_duration using prometheus to gauge etcd performance in OCP?
  • How to graph the cpu iowait for the etcd members in OpenShift.
  • How to check the etcd leader changes in Prometheus.
  • How to see the etcd db size.

Installing OpenShift Container Platform clusters

https://www.ibm.com/docs/en/cloud-paks/1.0?topic=installing-openshift-container-platform-clusters

Installing OpenShift Container Platform clusters

Transfer ownership of an OCP 4 cluster

https://access.redhat.com/solutions/4661621

Transfer ownership of an OCP 4 cluster

  • Transfer ownership of an OpenShift 4 cluster to a different user in the same of different Organization
  • Cancel the transfer of ownership for an OCP cluster
  • Is there an expiry date for the cancellation of the cluster ownership transfer?

Red Hat OpenShift Container Platform Life Cycle Policy

https://access.redhat.com/support/policy/updates/openshift

Red Hat OpenShift Container Platform Life Cycle Policy

Updating a cluster using the CLI

https://docs.openshift.com/container-platform/4.9/updating/updating-cluster-cli.html

Updating a cluster using CLI

Pods using Persistent Volumes with high file counts fail to start or take an excessive amount of time in OpenShift

https://access.redhat.com/solutions/6221251

Pods using Persistent Volumes with high file counts fail to start or take an excessive amount of time in OpenShift. Issues covered:
 

  • Pod deployments are failing with the following message:

Error: Failed to create pod sandbox: rpc error: code = Unknown desc = Kubelet may be retrying requests that are timing out in CRI-O due to system load: context deadline exceeded

  • Pods not able to start falling into CreateContainerError status:

mypod-5-1111a           0/1     CreateContainerError   0          7m29s

  • When attaching volumes to pods in Red Hat OpenShift Container Platform, why do pods sometimes not start, or otherwise take an excessive amount of time to start?
  • The volumes themselves have very high file counts, measured often in tens of thousands of files and directories (or higher).
  • Starting the pods without the high file count volumes allows the pod to become Ready quickly (but without access to the data the volume provides).
  • It is possible that entire nodes sometimes are marked as NotReady due to this issue as the container runtime (docker or cri-o) is unresponsive (as seen with hung docker ps or crictl ps commands).
  • When using Persistent Volumes with high file counts in OpenShift, why do pods fail to start or take an excessive amount of time to achieve Ready state?

OpenShift Troubleshooting Resources

https://connect.redhat.com/en/blog/openshift-troubleshooting-resources

OpenShift Troubleshooting Resources

OpenShift Container Platform x86_64 4.x Tested Integrations for Older Versions

https://access.redhat.com/articles/7041739

OpenShift Container Platform x86_64 4.x

Tested Integrations for Older Versions

OpenShift Container Platform 4.x Tested Integrations (for x86_x64)

https://access.redhat.com/articles/4763741

OpenShift Container Platform 4.x

Tested Integrations (for x86_x64)

OpenShift Container Platform 4.x Tested Integrations

https://access.redhat.com/articles/4128421

OpenShift Container Platform 4.x

Tested Integrations – Master Document for all Architectures

OpenShift Container Platform (OCP) 4 upgrade paths

https://access.redhat.com/solutions/4583231

OpenShift Container Platform (OCP) 4 Upgrade Paths. Issues covered:

  • What are the upgrade paths in OpenShift 4?
  • How to upgrade to the next minor version of OpenShift 4?

Red Hat Customer Portal Labs

https://access.redhat.com/labs/?product=Red+Hat+OpenShift+Container+Platform

Red Hat Customer Portal Labs

Developed by Red Hat engineers to help you improve performance, troubleshoot issues, identify security problems, and optimize configuration

How to upgrade Red Hat Ansible Automation Platform

https://www.redhat.com/sysadmin/

How to upgrade Red Hat Ansible Automation Platform

Support for IBM Cloud Paks - Frequently Asked Questions (FAQ)

https://access.redhat.com/articles/5024951

Support for IBM Cloud Paks – Frequently Asked Questions(FAQ)

Red Hat Advanced Cluster Management for Kubernetes 2.8 Support Matrix

https://access.redhat.com/articles/7006295

Red Hat Advanced Cluster Management for Kubernetes 2.8 Support Matrix

Document Location

Worldwide

[{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSE9G0Q","label":"IBM Cloud Pak for AIOps"},"ARM Category":[{"code":"a8m0z0000001jFJAAY","label":"Watson AIOps"}],"ARM Case Number":"","Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSCSJL","label":"IBM Cloud Pak for Applications"},"ARM Category":[{"code":"a8m3p000000F83CAAS","label":"CP4Apps"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSBYVB","label":"IBM Cloud Pak for Business Automation"},"ARM Category":[{"code":"a8m0z0000001hwVAAQ","label":"Business Console-\u003EAPI"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SS8QTD","label":"IBM Cloud Pak for Integration"},"ARM Category":[{"code":"a8m0z0000001ho8AAA","label":"Application Integration (ACE)"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB10","label":"Data and AI"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSHGYS","label":"IBM Cloud Pak for Data"},"ARM Category":[{"code":"a8m3p000000UoQtAAK","label":"Administration"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB24","label":"Security Software"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSTDPP","label":"IBM Cloud Pak for Security"},"ARM Category":[{"code":"a8m3p000000F8yvAAC","label":"Cloud Pak for Security (CP4S)"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSFC4F","label":"IBM Cloud Pak for Multicloud Management"},"ARM Category":[{"code":"a8m0z0000001ipaAAA","label":"CloudPak4MCM"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"},{"Type":"MASTER","Line of Business":{"code":"LOB45","label":"Automation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSDSDC","label":"IBM Cloud Pak for Network Automation"},"ARM Category":[{"code":"a8m3p000000hAHpAAM","label":"AI Manager"}],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"All Versions"}]

Product Synonym

RHOCP

Document Information

Modified date:
12 December 2023

UID

ibm17095750