What's new and changed in DataStage

DataStage updates can include new features, bug fixes, and security updates. Releases are listed in reverse chronological order so that the latest release is at the beginning of the topic.

You can see a list of the new features for the platform and all of the services at What's new in IBM Cloud Pak for Data.

Installing or upgrading DataStage

Ready to install or upgrade DataStage?

  • To install DataStage along with the other Cloud Pak for Data services, see Installing Cloud Pak for Data.
  • To upgrade DataStage along with the other Cloud Pak for Data services, see Upgrading Cloud Pak for Data.
  • To install or upgrade DataStage independently, see DataStage.
    Remember: All of the Cloud Pak for Data components associated with an instance of Cloud Pak for Data must be installed at the same version.

Cloud Pak for Data Version 5.0.0

A new version of DataStage was released in June 2024 with Cloud Pak for Data 5.0.0.

Operand version: 5.0.0

This release includes the following changes:

New features
This release of DataStage includes the following features and updates:
Run DataStage jobs in multiple locations with a remote data plane

You can now deploy on a remote data plane to run DataStage jobs in multiple locations, including in different geographies or cloud providers, without creating multipleDataStage instances. For more information, see Deploying on a remote data plane.

Import and export selected asset types

You can now select specific asset types to import or export from a .zip file that contains DataStage assets. By default, all asset types are selected.

Set up metrics storage at the project level for your DataStage flows

You can now use the metrics repository to store metrics in a database. For more information, see Storing and persisting DataStage metrics.

Name changes for DataStage connections and connectors
  • "Apache Cassandra (optimized)" is now "Apache Cassandra for DataStage."
  • "IBM Db2 (optimized") is now "IBM Db2 for DataStage."
  • "IBM Netezza Performance Server (optimized)" is now "IBM Netezza Performance Server for DataStage."
  • "IBM Watson Query" is now "IBM Data Virtualization."
  • "Oracle (optimized)" is now "Oracle Database for DataStage."
  • "Salesforce.com (optimized)" is now "Salesforce API for DataStage."
  • "Teradata (optimized)" is now "Teradata database for DataStage."

Your previous settings for the connections, connectors, and their associated jobs remain the same. Only the connection and connector names are changed.

Connect to more data sources in DataStage
You can now include data from these data sources in your DataStage flows:
  • IBM Planning Analytics
  • Microsoft Azure Databricks
  • MinIO
  • SAP BAPI

For the full list of connectors, see Supported data sources in DataStage.

Security issues fixed in this release
The following security issues were fixed in this release:

CVE-2019-11250

CVE-2020-8565

CVE-2021-32052

CVE-2022-0778

CVE-2023-25613, CVE-2023-26048, CVE-2023-26049, CVE-2023-32636, CVE-2023-35116, CVE-2023-44487, CVE-2023-46490

CVE-2024-23944, CVE-2024-29025, CVE-2024-29131, CVE-2024-29133