Cloud deployment environments
You can choose to deploy IBM Cloud Pak for Data in the environment that suits your business needs.
Cloud Pak for Data can be deployed in various private cloud and public cloud environments.
Deployment environment | Managed OpenShift® | Self-managed OpenShift |
---|---|---|
On-premises | ✓ | ✓ |
IBM Cloud | ✓ | ✓ |
Amazon Web Services (AWS) | ✓ | ✓ |
Microsoft Azure | ✓ | ✓ |
Google Cloud | ✓ | ✓ |
- Manual deployments
- You can deploy Cloud Pak for Data on any supported
cloud deployment environment by using the
cli
installation method. For more information, see Manual deployment options. - Automated deployments
- You can deploy Cloud Pak for Data on select cloud deployment environments by using a marketplace offering. The marketplace offerings enable you to install Cloud Pak for Data quickly and easily but have several limitations. For more information, see Automated deployment options.
Manual deployment options
With a manual installation of Cloud Pak for Data, you have complete control over how Cloud Pak for Data is installed on your cluster.
- How to install OpenShift on the environment
- The recommended cluster architecture
- How to install the Cloud Pak for Data platform and services on the environment
On-premises infrastructure
If you want to ensure that your environment is running securely behind your firewall, or you have an existing on-premises Red Hat OpenShift Container Platform cluster, you can deploy Cloud Pak for Data on your on-premises infrastructure.
You can deploy Cloud Pak for Data on bare-metal servers or on IBM Storage Fusion HCI System.
- Multicloud Object Gateway
-
- Watson Discovery
- Watson Speech services
- watsonx Assistant
- watsonx Orchestrate
- Graphical processing units (GPUs)
-
- IBM Knowledge Catalog Premium
- IBM Knowledge Catalog Standard
- Watson Machine Learning (required to use optional GPU features)
- Watson Machine Learning Accelerator
- Watson Studio Runtimes that require GPU
- watsonx.ai
- watsonx Assistant (required to use optional GPU features)
- watsonx Code Assistant for Red Hat Ansible® Lightspeed
- watsonx Code Assistant for Z
- watsonx.governance (the service does not require GPUs but does have a dependency on Red Hat OpenShift AI)
- watsonx Orchestrate
- Installing Red Hat OpenShift Container Platform
- The method for installing Red Hat
OpenShift Container Platform
depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type Deployment options Managed You can deploy a managed OpenShift cluster on your on-premises infrastructure with IBM Cloud Satellite. For details, see IBM Cloud Satellite in the IBM Cloud marketplace. Self-managed You can deploy a self-managed OpenShift cluster on your on-premises infrastructure by following the Red Hat OpenShift Container Platform documentation: You can optionally use hosted control planes.
Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation.
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster. - Recommended cluster architecture
- Cloud Pak for Data is deployed
on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of
concept environments, it is strongly recommended that you deploy your production environment on a
larger, highly available cluster with multiple dedicated control plane and worker nodes. This
configuration provides better performance, better cluster stability, and increased ease of scaling
the cluster to support workload growth. The specific requirements for a production-level cluster are
identified in Hardware
requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
-
Deployment environment Managed OpenShift Self-managed OpenShift On-premises IBM Cloud Satellite supports the following storage options with managed OpenShift: - OpenShift Data Foundation
- Portworx
The following storage options are supported on bare metal and VMware infrastructure with self-managed OpenShift: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- IBM Storage Fusion Global Data Platform
- IBM Storage Scale Container Native
- Portworx
- NFS
- NetApp Trident
- Prerequisites
- Ensure that you review the following information:
Topic What to look for Hardware requirements Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. Software requirements Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. Storage requirements The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment.
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
IBM Cloud infrastructure
If you already use IBM Cloud to run business-critical applications, or if you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on IBM Cloud.
- Installing Red Hat OpenShift Container Platform
-
The method for installing Red Hat OpenShift Container Platform depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type Deployment options Managed You can deploy a managed OpenShift cluster on IBM Cloud infrastructure from the IBM Cloud catalog. For details, see Red Hat OpenShift on IBM Cloud in the IBM Cloud catalog. Ensure that you select a supported version of Red Hat OpenShift.
You can deploy Cloud Pak for Data on Virtual Private Cloud (VPC) infrastructure. You can use either a single zone or multi zone deployment.
Self-managed You can deploy a self-managed OpenShift cluster on Virtual Private Cloud (VPC) IBM Cloud infrastructure by following the Red Hat OpenShift Container Platform documentation: Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation.
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster. - Recommended cluster architecture
- Cloud Pak for Data is deployed
on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of
concept environments, it is strongly recommended that you deploy your production environment on a
larger, highly available cluster with multiple dedicated control plane and worker nodes. This
configuration provides better performance, better cluster stability, and increased ease of scaling
the cluster to support workload growth. The specific requirements for a production-level cluster are
identified in Hardware
requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
-
Deployment environment Managed OpenShift Self-managed OpenShift IBM Cloud Red Hat OpenShift on IBM Cloud supports the following storage options on VPC infrastructure: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Portworx
The following storage options are supported on VPC IBM Cloud infrastructure with self-managed OpenShift: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Portworx
- NFS
- Prerequisites
- Ensure that you review the following information:
Topic What to look for Hardware requirements Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. Software requirements Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. Storage requirements The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment.
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
Amazon Web Services (AWS) infrastructure
If you already use AWS and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on AWS.
- Installing Red Hat OpenShift Container Platform
-
The method for installing Red Hat OpenShift Container Platform depends on whether you want a managed or self-managed OpenShift cluster:
OpenShift type Deployment options Managed - ROSA classic
- You can use the ROSA CLI to manually install ROSA. For details, see the Red Hat OpenShift Service on AWS (ROSA) documentation.
- ROSA with hosted control planes
- You can use the ROSA CLI to manually install ROSA with hosted control planes (ROSA with HCP). For details, see the Red Hat OpenShift Service on AWS (ROSA) documentation.
- AWS GovCloud (US)
- You can use the AWS CLI to manually install ROSA on AWS GovCloud (US). For details, see Red Hat OpenShift Service on AWS in the AWS GovCloud (US) documentation.
- IBM Cloud Satellite
- You can use IBM Cloud Satellite in the IBM Cloud marketplace to install Red Hat OpenShift Container Platform on AWS.
Self-managed You can deploy a self-managed OpenShift cluster on AWS infrastructure by following the Red Hat OpenShift Container Platform documentation:Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation.
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster.Restriction: Not all services can be installed on hosted control planes. Services with the following prerequisites cannot be installed on hosted control planes:- Multicloud Object Gateway
-
- Watson Discovery
- Watson Speech services
- watsonx Assistant
- watsonx Orchestrate
- Graphical processing units (GPUs)
-
- IBM Knowledge Catalog Premium
- IBM Knowledge Catalog Standard
- Watson Machine Learning (required to use optional GPU features)
- Watson Machine Learning Accelerator
- Watson Studio Runtimes that require GPU
- watsonx.ai
- watsonx Assistant (required to use optional GPU features)
- watsonx Code Assistant for Red Hat Ansible Lightspeed
- watsonx Code Assistant for Z
- watsonx.governance (the service does not require GPUs but does have a dependency on Red Hat OpenShift AI)
- watsonx Orchestrate
- Recommended cluster architecture
- Cloud Pak for Data is deployed
on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of
concept environments, it is strongly recommended that you deploy your production environment on a
larger, highly available cluster with multiple dedicated control plane and worker nodes. This
configuration provides better performance, better cluster stability, and increased ease of scaling
the cluster to support workload growth. The specific requirements for a production-level cluster are
identified in Hardware
requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
-
Deployment environment Managed OpenShift Self-managed OpenShift Amazon Web Services (AWS) Red Hat OpenShift Service on AWS (ROSA) supports the following storage options: - IBM Storage Fusion Global Data Platform
- Amazon Elastic Block Store (EBS)
- Amazon Elastic File System (EFS)
- NetApp Trident (includes Amazon FSx for NetApp ONTAP)
The following storage options are supported on AWS infrastructure with self-managed OpenShift: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Amazon Elastic Block Store (EBS)
- Amazon Elastic File System (EFS)
- Portworx
- NFS
- NetApp Trident (includes Amazon FSx for NetApp ONTAP)
- Prerequisites
- Ensure that you review the following information:
Topic What to look for Hardware requirements Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. Software requirements Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. Storage requirements The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment.
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
Microsoft Azure infrastructure
If you already use Microsoft Azure and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on Azure.
- Installing Red Hat OpenShift Container Platform
-
OpenShift type Deployment options Managed You can use theaz aro
CLI to manually install Azure Red Hat OpenShift 4 (ARO). For details, see the Azure Red Hat OpenShift 4 documentation.Self-managed You can deploy a self-managed OpenShift cluster on Azure infrastructure by following the Red Hat OpenShift Container Platform documentation:Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation.
Tip: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster. - Recommended cluster architecture
- Cloud Pak for Data is deployed
on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of
concept environments, it is strongly recommended that you deploy your production environment on a
larger, highly available cluster with multiple dedicated control plane and worker nodes. This
configuration provides better performance, better cluster stability, and increased ease of scaling
the cluster to support workload growth. The specific requirements for a production-level cluster are
identified in Hardware
requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
-
Deployment environment Managed OpenShift Self-managed OpenShift Microsoft Azure Azure Red Hat OpenShift (ARO) supports the following storage options: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Portworx
- NFS
The following storage options are supported on Microsoft Azure infrastructure with self-managed OpenShift: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Portworx
- NFS, specifically Microsoft Azure locally redundant Premium SSD storage
- Prerequisites
- Ensure that you review the following information:
Topic What to look for Hardware requirements Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. Software requirements Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. Storage requirements The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment.
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
Google Cloud infrastructure
If you already use Google Cloud and you don't want to set up and manage your own hardware, you can deploy Cloud Pak for Data on Google Cloud.
- Installing Red Hat OpenShift Container Platform
-
OpenShift type Deployment options Managed You can deploy Red Hat OpenShift Dedicated on Google Cloud using a variety of methods. For details, see Creating a cluster on GCP in the Red Hat OpenShift Dedicated documentation. Self-managed You can deploy a self-managed OpenShift cluster on Google Cloud infrastructure by following the Red Hat OpenShift Container Platform documentation:Additional guidance on setting up OpenShift is available in the IBM Cloud Paks documentation.
Remember: If your license includes entitlement to Red Hat OpenShift Container Platform, you can apply the entitlement to your self-managed OpenShift cluster. - Recommended cluster architecture
- Cloud Pak for Data is deployed
on a multi-node cluster. Although you can deploy Cloud Pak for Data on a 3-node cluster for development or proof of
concept environments, it is strongly recommended that you deploy your production environment on a
larger, highly available cluster with multiple dedicated control plane and worker nodes. This
configuration provides better performance, better cluster stability, and increased ease of scaling
the cluster to support workload growth. The specific requirements for a production-level cluster are
identified in Hardware
requirements.
For more information, see Architecture for IBM Cloud Pak for Data.
- Supported storage
-
Deployment environment Managed OpenShift Self-managed OpenShift Google Cloud Red Hat OpenShift Dedicated on Google Cloud supports the following storage options: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
The following storage options are supported on Google Cloud infrastructure with self-managed OpenShift: - OpenShift Data Foundation
- IBM Storage Fusion Data Foundation
- Portworx
- NFS
- Prerequisites
- Ensure that you review the following information:
Topic What to look for Hardware requirements Based on the services that you plan to install, determine whether you have sufficient hardware and whether you need AVX instructions or dedicated nodes. Software requirements Based on the services that you plan to install, determine whether you have the necessary external software and service dependencies. Storage requirements The storage that you use determines the services that you can run. Ensure that the services that you want to run are compatible with the storage in your environment.
- Installing Cloud Pak for Data and services
- See Installing IBM Cloud Pak for Data.
In addition to Cloud Pak for Data software, IBM offers IBM Cloud Pak for Data as a Service. IBM Cloud Pak for Data as a Service might be right for you if you already use IBM Cloud to run business-critical applications and you don't want to set up and manage your own deployment of Cloud Pak for Data. IBM Cloud Pak for Data as a Service differs from the Cloud Pak for Data software in several ways. For details, see the IBM Cloud Pak for Data as a Service documentation.
Automated deployment options
Depending on the environment , you might be able to install Cloud Pak for Data and some services from your cloud provider's marketplace.
The marketplace installation options are offered in addition to the default cli
installation method.
- Some marketplace installations automatically create a Red Hat OpenShift Container Platform cluster and set up storage for you. You cannot use the marketplace offering to install Cloud Pak for Data on an existing cluster.
- Market place installations support a subset of the storage that Cloud Pak for Data supports.
- Marketplace installations do not support installation from a private container
registry. If you want to use a private container registry, you must use the
cli
installation method. - Marketplace installations support only installation. If you want to upgrade an existing
installation, you must use the
cli
to upgrade. - Marketplace installations can be used to automatically install a subset of Cloud Pak for Data services. However, the services that are
available depend on the cloud deployment environment. If you want to install additional services,
you must use the
cli
installation method.
Each marketplace offering specifies the version of Cloud Pak for Data that it will install. If you want to install a
different version of Cloud Pak for Data, you must use the
cli
installation method.
IBM Cloud infrastructure
You can install Cloud Pak for Data on Red Hat OpenShift on IBM Cloud from the IBM Cloud Catalog.
- Red Hat OpenShift Container Platform
- Review the following information to ensure that the marketplace offering supports the type of
environment where you want to run Cloud Pak for Data.
Environment Details Managed OpenShift Supported. Self-managed OpenShift No marketplace offering is available for self-managed Red Hat OpenShift Container Platform on IBM Cloud. - Prerequisites
- Before you can install Cloud Pak for Data from the
IBM Cloud Catalog, you must:
- Install Red Hat
OpenShift on IBM Cloud.
For details, see Red Hat OpenShift on IBM Cloud in the IBM Cloud catalog.
Ensure that you select a supported version of Red Hat OpenShift.
You can deploy Cloud Pak for Data on Virtual Private Cloud (VPC) Gen2 single or multi zone infrastructure.
- Set up storage.
This marketplace offering supports OpenShift Data Foundation storage.
- Install Red Hat
OpenShift on IBM Cloud.
Amazon Web Services (AWS) infrastructure
You can install Cloud Pak for Data on AWS from the AWS marketplace.
- Red Hat OpenShift Container Platform
- Review the following information to ensure that the marketplace offering supports the type of
environment where you want to run Cloud Pak for Data.
Environment Details Managed OpenShift Supported. Important: Red Hat OpenShift Service on AWS (ROSA) is supported only for Data Fabric offerings.Self-managed OpenShift Supported. Important: Self-managed OpenShift is supported for BYOL offerings and cartridge offerings. - About this offering
-
The AWS marketplace offerings will complete the following tasks for you:
- Set up a Red Hat OpenShift Container Platform cluster.
- Set up storage.You can choose from the following options:
- Red Hat OpenShift Data Foundation storage
- Amazon Elastic File System (EFS) storage and Amazon Elastic Block Store (EBS)
- Amazon Elastic File System (EFS) storage only
- Install Cloud Pak for Data and selected services.
- Prerequisites
-
Before you can install Cloud Pak for Data from the AWS marketplace, you must have the following information:
- Your IBM Entitlement key
- Your OpenShift pull secret
- If you want to install Cloud Pak for Data on an existing VPC, you must have information about the VPC.
Microsoft Azure infrastructure
You can install Cloud Pak for Data on Azure from the Azure Marketplace.
- Red Hat OpenShift Container Platform
- Review the following information to ensure that the marketplace offering supports the type of
environment where you want to run Cloud Pak for Data.
Environment Details Managed OpenShift Supported. Important: To install on Azure Red Hat OpenShift (ARO), choose an offering with ARO in the name.Self-managed OpenShift Supported. - About this offering
-
The Azure marketplace offerings will complete the following tasks for you:
- Set up a Red Hat OpenShift Container Platform cluster.
- Set up Red Hat OpenShift Data Foundation storage.
- Install Cloud Pak for Data and selected services.
- Prerequisites
-
Before you can install Cloud Pak for Data from the Azure marketplace, you must have the following information:
- Your IBM Entitlement key
- Your OpenShift pull secret