IBM Cloud provides a cloud native data lake platform, which is unique in its commitment to a serverless consumption model.
You can ingest, store, prepare, optimize and analyze your data in a serverless manner that scales resources fully transparently and exposes a fair pay-as-you-go cost model end-to-end. We provided a good introduction at the SubSurface WINTER 2021 Cloud Data Lake conference a few weeks ago.
Dremio connector to IBM Cloud Data Lake
Dremio is a very popular and open data lake engine to interactively explore, curate and consume data lake data, so it makes sense to use Dremio with IBM Cloud Data LakeIBM’s Cloud Data Lake platform.
We are happy to announce the availability of a new Dremio connector to IBM Cloud Data Lake. It enables Dremio to connect to IBM Cloud Data Lake services, push down SQL operations and retrieve results for processing into the Dremio engine. According to our commitment to open stacks, we have also made the connector itself available as open source.
In the documentation of the connector, you’ll find few simple steps on how you can optionally deploy and run Dremio itself right in IBM Cloud on top of the IBM Cloud Kubernetes Service.
This way, you can build data lake solutions with an open, interactive user experience on top of a fully serverless data lake foundation that scales seamlessly and fairly along with your workload and data demands.