prashantwason commented on a change in pull request #4821: URL: https://github.com/apache/hudi/pull/4821#discussion_r816537071
########## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java ########## @@ -429,25 +428,21 @@ public void mergeArchiveFiles(List<FileStatus> compactCandidate) throws IOExcept .collect(Collectors.groupingBy(i -> Pair.of(i.getTimestamp(), HoodieInstant.getComparableAction(i.getAction())))); - // If metadata table is enabled, do not archive instants which are more recent than the last compaction on the - // metadata table. - if (config.isMetadataTableEnabled()) { - try (HoodieTableMetadata tableMetadata = HoodieTableMetadata.create(table.getContext(), config.getMetadataConfig(), - config.getBasePath(), FileSystemViewStorageConfig.SPILLABLE_DIR.defaultValue())) { - Option<String> latestCompactionTime = tableMetadata.getLatestCompactionTime(); Review comment: >> When the job restarts, the last commit would be rollbacked first right ? The last commit failed on the dataset but succeed on the metadata table. So yes it will be rolled back on the dataset eventually - depends on the settings (EAGER vs LAZY rollbacks). Also we need to support the readers - they need to ignore the deltacommit. There can be a delay between the failed job and the retry and readers should read consistent data during that time. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org