Presto (C++) overview

Presto (C++) is a version of Presto workers that are implemented in C++ instead of Java by using the Velox library.

watsonx.data Developer edition

watsonx.data on Red Hat® OpenShift®

watsonx.data SaaS on AWS

Presto (C++) aims to enhance performance for data lakes without requiring a JVM on worker nodes. It supports several connectors, including Hive and Iceberg, and focuses on improved integration with data warehousing systems.

IBM® watsonx.data uses version 0.286 of Presto (C++).

Presto (C++) features

Task management
Presto C++ includes HTTP endpoints that allow users to monitor and manage tasks. This feature enhances operational oversight and makes it easier to track ongoing processes.
Remote function execution
Enables executing functions on remote nodes, which enhance scalability and distributed processing capabilities, making data processing more efficient across a network of nodes.
Authentication
Uses JSON Web Tokens (JWT) for secure internal communication between nodes, ensuring that data remains secure and tamper-proof during transmission.
Data caching
Implements asynchronous data caching with prefetching capabilities. This optimizes data retrieval and processing speed by anticipating data needs and caching it in advance.
Performance Tuning
Offers various session properties for performance tuning, including settings for spill thresholds and compression. This allows users to fine-tune performance parameters according to their specific needs, ensuring optimal performance of data processing tasks.

For more information about Presto (C++), see Presto C++ features.

For more information about provisioning a Presto (C++) engine, see Provisioning a Presto (C++) engine.

For more information about switching between Presto (Java) and Presto (C++) engines, see Switching between Presto (Java) and Presto (C++) engines.