Db2 Big SQL on Cloud Pak for Data

Important: IBM Cloud Pak® for Data Version 4.8 will reach end of support (EOS) on 31 July, 2025. For more information, see the Discontinuance of service announcement for IBM Cloud Pak for Data Version 4.X.

Upgrade to IBM Software Hub Version 5.1 before IBM Cloud Pak for Data Version 4.8 reaches end of support. For more information, see Upgrading from IBM Cloud Pak for Data Version 4.8 to IBM Software Hub Version 5.1.

Version: 7.6.8 Included IBM

Thumbnail depiction of the interface of this service

Description

Db2® Big SQL is a cloud-native, elastic, scalable SQL engine optimized for workloads on data stored in object stores or HDFS.

Db2 Big SQL can query data stored on legacy Hadoop clusters, using the configurations of open source components, such as:

HDFS
Hive metastore
Ranger

Using Db2 Big SQL with IBM Cloud Pak for Data can be useful in the following situations:

You need to query large amounts of data residing on legacy Hadoop secured (Kerberized) or unsecured clusters and on private or public cloud object storage.
You need highly optimized queries for multiple open source data formats, including Parquet, ORC, Avro, and CSV.

Integrated services

Table 1. Related services. The following related services are often used with this service and provide complementary features, but they are not required.
Service	Capability
IBM® Db2 Data Management Console	Administer, monitor, manage, and optimize the performance of your IBM Db2 databases.
Runtime 22.2 on Python 3.10 for GPU	Access compute environments for Jupyter Notebooks that use GPU-accelerated Python 3.10 libraries.
Runtime 23.1 on Python 3.10 for GPU	Access compute environments for Jupyter Notebooks that use GPU-accelerated Python 3.10 libraries.
Watson Studio	Prepare, analyze, and model data in a collaborative environment with tools for data scientists, developers, and domain experts.

Db2 Big SQL on Cloud Pak for Data

Description

Quick links

Integrated services