Restarting from a checkpoint or savepoint from Business Automation Insights 18.0.2
New in 18.0.2 Starting from savepoints is mandatory when you upgrade Business Automation Insights and the new version is based on a new Apache Flink version.
About this task
For upgrades that include a new version of Flink, a savepoint is required for each processing job. You cannot use checkpoints in this case. You create savepoints as part of the upgrade procedure in Upgrading Business Automation Insights releases.
At any time, you can create savepoints by running the exec command as shown
below. If a job failure is preventing you from creating a savepoint, that is, if the
create-savepoints.sh script returns an error, use the latest successful
checkpoint.
Tip: Always prefer to use savepoints over checkpoints because savepoints are
always retained until you explicitly delete them. Use checkpoints when savepoint creation fails.
However, because only the 3 latest successful checkpoints are retained, and to prevent them from
being deleted while a new checkpoint is created, remember to first cancel the job. On cancellation,
the 3 latest checkpoints are retained.
You can then restart the processing by using the Recovery Path property of each job submitter in the Helm release properties, using either a savepoint or a checkpoint.
Procedure
Results
Jobs are restored from the savepoint or checkpoint using the
allowNonRestoredState flag that is described in the Restore a savepoint section of the Flink documentation. To
enable you to remove operators, such as HDFS storage or Kafka egress, that flag is removed.
Therefore, be careful when you restore from a savepoint or checkpoint and look up the logs. When a
state is ignored, the job manager log contains a message such as the following one. Similar messages
appear only when you disable operators, and only once for each operator.
2019-01-08 18:11:04,737 INFO org.apache.flink.runtime.checkpoint.Checkpoints - Skipping savepoint state for operator <operator-id>.
For more information, see Advanced updates.