[ 
https://issues.apache.org/jira/browse/FLINK-17571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17106088#comment-17106088
 ] 

Congxian Qiu(klion26) commented on FLINK-17571:
-----------------------------------------------

[~pnowojski] checkpoints and savepoints do not use the same directory. The 
files of one savepoint always go into a single 
[directory|https://ci.apache.org/projects/flink/flink-docs-release-1.10/ops/state/savepoints.html#triggering-savepoints],
 so users can safely delete a whole savepoint if they want. Currently, only 
checkpoint files go into different directories (shared, taskowned, 
exclusive).

The command I proposed can be used for both checkpoints and savepoints. What I 
have in mind for the command is:
 # read the meta-file (this already exists in our codebase)
 # deserialize the meta-file (this already exists in our codebase)
 # output the files referenced in the meta-file

The reason I want to keep it as a command in Flink is that if we change 
something related to the meta-file, we can update the command as well.
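
To illustrate what the output of such a command would enable, here is a rough 
sketch (not Flink code). It assumes the command prints the referenced files as 
paths relative to the checkpoint root; that output format and the helper name 
{{unreferenced_shared_files}} are hypothetical. Given that list, everything 
else under the shared directory becomes a candidate for cleanup:

```python
import os
import tempfile


def unreferenced_shared_files(checkpoint_root, referenced):
    """Return files under <checkpoint_root>/shared that are not referenced.

    `referenced` holds paths relative to checkpoint_root. This is a
    hypothetical output format for the proposed command, not an
    existing Flink interface.
    """
    referenced = {os.path.normpath(p) for p in referenced}
    shared_dir = os.path.join(checkpoint_root, "shared")
    unused = []
    for dirpath, _dirnames, filenames in os.walk(shared_dir):
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            rel = os.path.normpath(os.path.relpath(full_path, checkpoint_root))
            if rel not in referenced:
                unused.append(rel)
    return sorted(unused)


if __name__ == "__main__":
    # Build a throwaway directory that mimics a checkpoint root.
    with tempfile.TemporaryDirectory() as root:
        os.makedirs(os.path.join(root, "shared"))
        for name in ("state-a", "state-b"):
            with open(os.path.join(root, "shared", name), "w") as f:
                f.write("dummy")
        # Pretend the command reported only state-a as still in use.
        in_use = [os.path.join("shared", "state-a")]
        for path in unreferenced_shared_files(root, in_use):
            print(path)
```

The sketch only reports candidates and deletes nothing, because files written 
by a checkpoint that is still in progress would not yet appear in any 
meta-file.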

> A better way to show the files used in currently checkpoints
> ------------------------------------------------------------
>
>                 Key: FLINK-17571
>                 URL: https://issues.apache.org/jira/browse/FLINK-17571
>             Project: Flink
>          Issue Type: New Feature
>          Components: Runtime / Checkpointing
>            Reporter: Congxian Qiu(klion26)
>            Priority: Major
>
> Inspired by the 
> [userMail|http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Shared-Checkpoint-Cleanup-and-S3-Lifecycle-Policy-tt34965.html]
> Currently, there are [three types of 
> directory|https://ci.apache.org/projects/flink/flink-docs-release-1.10/ops/state/checkpoints.html#directory-structure]
>  for a checkpoint. The files in the TASKOWNED and EXCLUSIVE directories can be 
> deleted safely, but users can't safely delete the files in the SHARED 
> directory (the files may have been created a long time ago).
> I think it's better to give users a way to know which files are 
> currently used (so the others are not used).
> Maybe a command-line command such as the one below is enough to support such 
> a feature.
> {{./bin/flink checkpoint list $checkpointDir  # list all the files used in 
> checkpoint}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
