hudi-bot opened a new issue, #14738: URL: https://github.com/apache/hudi/issues/14738
If an HBase table becomes corrupted for any reason, we should provide some form of built-in disaster recovery to rebuild the index. Currently, the only way to do this without extra utilities is to rewrite the entire dataset.

Can we create a CLI command that rebuilds just the index?

Can we add checkpointing to the HBase table to reduce disaster recovery time, i.e. only rebuild the index for records written after the last checkpoint time?

## JIRA info

- Link: https://issues.apache.org/jira/browse/HUDI-1580
- Type: New Feature
- Affects version(s):
  - 0.9.0

---

## Comments

29/Mar/21 04:19; shivnarayan:
CC [~nishith29]: do we have any such tool within Uber that you can upstream? Or do we have any options at the moment until the tool is built?

---

30/Mar/21 06:49; nishith29:
[~shivnarayan] [~ryanpife] and I have chatted about this before, which resulted in this ticket. There are mechanisms at Uber, but unfortunately they are not upstream-able. [~ryanpife] I know you guys were working on some aspects of this, any progress?

---

13/Dec/21 14:40; shivnarayan:
[~ryanpife]: Can you respond to Nishith's comment above? If you have a working solution internally, it would be good to upstream it.
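To make the checkpointing idea from the issue description concrete, here is a minimal sketch in Python. It is not Hudi or HBase code: `RecordLocation`, `rebuild_index_since`, and the dict-based index are hypothetical stand-ins, and it assumes commit times are fixed-width timestamp strings so that string comparison matches time order. The point is only that a restored checkpoint plus a replay of later writes avoids re-indexing the whole dataset.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass(frozen=True)
class RecordLocation:
    """Hypothetical view of one written record and where it lives."""
    record_key: str
    partition_path: str
    file_id: str
    commit_time: str  # assumed fixed-width, e.g. "20210329041900"


# Hypothetical in-memory stand-in for the index table:
# record key -> (partition path, file id)
Index = Dict[str, Tuple[str, str]]


def rebuild_index_since(index: Index,
                        records: Iterable[RecordLocation],
                        checkpoint: str) -> int:
    """Re-insert index entries only for records written after `checkpoint`.

    `records` is assumed to be in commit order, so a later write of the
    same key overwrites an earlier one. Returns the number of entries
    that were rewritten.
    """
    rebuilt = 0
    for rec in records:
        if rec.commit_time > checkpoint:
            index[rec.record_key] = (rec.partition_path, rec.file_id)
            rebuilt += 1
    return rebuilt


# Index as restored from the last checkpoint (taken at 2021-03-29 00:00:00).
restored: Index = {"k1": ("2021/03/28", "f1")}

written = [
    RecordLocation("k1", "2021/03/28", "f1", "20210328120000"),
    RecordLocation("k2", "2021/03/29", "f2", "20210329041900"),
    RecordLocation("k3", "2021/03/30", "f3", "20210330064900"),
]

count = rebuild_index_since(restored, written, "20210329000000")
```

In this sketch only the two records committed after the checkpoint are replayed; the entry for `k1` comes from the restored checkpoint and is left untouched. A real implementation would need to read record locations from the table's files and write to HBase, which is outside what the issue specifies.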
