Hi Kaushal,

Only the state handle pointer is stored in the ZooKeeper node. Since
ZooKeeper is built for
small data(KB level) storage. The real data will be persisted in the *"*
*high-availability.storageDir"*.
Note that it should be distributed storage(HDFS, S3, etc.).

The ZooKeeper HA service has the following usage, as well as the Kubernetes
HA service,
you could find more information here[1].
* Leader election/retrieval
* Running job registry
* Submitted job graph store
* Checkpoint store

[1].
https://cwiki.apache.org/confluence/display/FLINK/FLIP-144%3A+Native+Kubernetes+HA+for+Flink#FLIP144:NativeKubernetesHAforFlink-HAcomponents


Best,
Yang


Kaushal Raj <kaushal.raj142...@gmail.com> 于2020年12月3日周四 下午7:13写道:

> Hello,
>
> I am new to flink. Have few queries regarding the HA mode with zookeeper
> and checkpointing. When flink is configured in HA mode with zookeeper,
> where do the job checkpoints are stored? zookeeper only used for recovering
> the jobmanager or even checkpoints? what is the significance of 
> *"**high-availability.storageDir"
> *here.
>
> Thanks,
> Kaushal
>

Reply via email to