Flow Collector

A Flow is a sequence of packets with common characteristics such as same source and destination IP address, transport layer port information, and type of protocol. The Flow enabled devices or exporters collect Flow data from the network.

The Flow Collector Service in Network Performance Insight performs these basic functions:
  • Receives Flow records from Flow-exporters.
  • Parses, validates, and normalizes the various Flow record formats into a common format.
  • Enriches and filters Interfaces based on enable/disable flag set per network interface.
  • Limits the number of interfaces that are enabled in Network Performance Insight®
  • Stores the normalized and enriched Flow records in Apache Parquet files in HDFS.
  • Notifies the Storage Service of the availability of Flow RAW data files via Kafka.

The configuration settings are available on Ambari server from Services > NPI > Configs > NPI Settings under NPI Components > NPI Flow Collector .