fredia commented on code in PR #22973: URL: https://github.com/apache/flink/pull/22973#discussion_r1561957606
##########
docs/layouts/shortcodes/generated/checkpointing_configuration.html:
##########
@@ -44,6 +44,42 @@
 <td>String</td>
 <td>The default directory used for storing the data files and meta data of checkpoints in a Flink supported filesystem. The storage path must be accessible from all participating processes/nodes(i.e. all TaskManagers and JobManagers). If the 'state.checkpoint-storage' is set to 'jobmanager', only the meta data of checkpoints will be stored in this directory.</td>
 </tr>
+ <tr>
+ <td><h5>state.checkpoints.file-merging.across-checkpoint-boundary</h5></td>
+ <td style="word-wrap: break-word;">false</td>
+ <td>Boolean</td>
+ <td>Only relevant if <code class="highlighter-rouge">state.checkpoints.file-merging.enabled</code> is enabled.<br />Whether to allow merging data of multiple checkpoints into one physical file. If this option is set to false, only merge files within checkpoint boundaries will be merged. Otherwise, it is possible for the logical files of different checkpoints to share the same physical file.</td>
+ </tr>
+ <tr>
+ <td><h5>state.checkpoints.file-merging.enabled</h5></td>
+ <td style="word-wrap: break-word;">false</td>
+ <td>Boolean</td>
+ <td>Whether to enable merging multiple checkpoint files into one, which will greatly reduce the number of small checkpoint files.</td>
+ </tr>
+ <tr>
+ <td><h5>state.checkpoints.file-merging.max-file-size</h5></td>
+ <td style="word-wrap: break-word;">32 mb</td>
+ <td>MemorySize</td>
+ <td>Max size of a physical file for merged checkpoints.</td>
+ </tr>
+ <tr>
+ <td><h5>state.checkpoints.file-merging.max-space-amplification</h5></td>
+ <td style="word-wrap: break-word;">0.75</td>

Review Comment:
👍 I had read this option as the proportion of invalid data. The old description was: "A threshold that triggers a compaction (re-uploading) of one physical file. If the amount of invalid data in a physical file exceeds the threshold, a new physical file will be created and uploaded."
Changed it to 2 and modified the description.
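For readers comparing the old default of 0.75 with the new default of 2, the two readings of the threshold can be related with a little arithmetic. The sketch below assumes that "space amplification" means physical file size divided by the size of still-valid data, and that re-uploading is triggered when that ratio strictly exceeds the threshold. Both are assumptions for illustration, and the function names are hypothetical, not part of Flink's API.

```python
def amplification_from_invalid_ratio(invalid_ratio: float) -> float:
    """Convert a fraction of invalid bytes into physical size / valid size."""
    if not 0.0 <= invalid_ratio < 1.0:
        raise ValueError("invalid_ratio must be in [0, 1)")
    return 1.0 / (1.0 - invalid_ratio)


def invalid_ratio_from_amplification(amplification: float) -> float:
    """Convert physical size / valid size into the fraction of invalid bytes."""
    if amplification < 1.0:
        raise ValueError("amplification must be >= 1")
    return 1.0 - 1.0 / amplification


def needs_reupload(physical_bytes: int, valid_bytes: int,
                   max_amplification: float) -> bool:
    """Assumed trigger: re-upload once physical/valid strictly exceeds the limit."""
    if valid_bytes <= 0:
        # No valid data left: any remaining physical bytes are pure waste.
        return physical_bytes > 0
    return physical_bytes / valid_bytes > max_amplification


if __name__ == "__main__":
    # Old reading: 0.75 as a proportion of invalid data.
    print(amplification_from_invalid_ratio(0.75))
    # New reading: 2 as a ratio of physical size to valid size.
    print(invalid_ratio_from_amplification(2.0))
```

Under these assumptions, a threshold of 2 corresponds to half of a physical file being invalid, while 0.75 read as an invalid-data proportion would correspond to an amplification of 4.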