HDFS data lake

Processing jobs can store data in a Hadoop Distributed File System (HDFS) data lake. In a typical installation of Business Automation Insights, you set up this data lake as long-term storage for further downstream processing of your business data.

Because HDFS is long-term storage that is intended for historical archiving and for machine learning processes, active summaries are not stored in HDFS. They are exposed only in Elasticsearch or in Kibana dashboards.
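
As an illustration of downstream processing, the following minimal Python sketch lists the files in a data lake directory through the standard Hadoop WebHDFS REST API (the LISTSTATUS operation). It is not part of Business Automation Insights. The NameNode address, the port, the directory path, and the user name are hypothetical placeholders, and the sketch assumes that WebHDFS is enabled and reachable without Kerberos authentication.

```python
import json
from typing import List, Optional
from urllib.parse import quote, urlencode
from urllib.request import urlopen


def liststatus_url(namenode: str, path: str, user: Optional[str] = None) -> str:
    """Build a WebHDFS LISTSTATUS URL for a directory in the data lake."""
    params = {"op": "LISTSTATUS"}
    if user:
        # Simple (non-Kerberos) authentication passes the user as a query parameter.
        params["user.name"] = user
    return f"{namenode.rstrip('/')}/webhdfs/v1{quote(path)}?{urlencode(params)}"


def file_names(liststatus_response: dict) -> List[str]:
    """Return the names of regular files from a LISTSTATUS JSON response."""
    statuses = liststatus_response["FileStatuses"]["FileStatus"]
    return [s["pathSuffix"] for s in statuses if s["type"] == "FILE"]


def list_files(namenode: str, path: str, user: Optional[str] = None) -> List[str]:
    """Query the NameNode and list the files directly under the given directory."""
    with urlopen(liststatus_url(namenode, path, user)) as response:
        return file_names(json.load(response))


if __name__ == "__main__":
    # Hypothetical values: replace with your own NameNode address and directory.
    print(liststatus_url("http://namenode.example.com:9870", "/bai-data/events", "hdfs"))
```

Calling list_files with your own NameNode address and directory returns the file names that a downstream job could then read. The actual directory layout depends on how HDFS storage was configured for your installation.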

Learn how to prepare for HDFS storage in Preparing to use HDFS.

Restriction: The Developer Edition does not support HDFS data storage.