hudi-bot opened a new issue, #14738:
URL: https://github.com/apache/hudi/issues/14738

   If an HBase table becomes corrupted for any reason, we should provide 
built-in disaster recovery to rebuild the index. Currently, the only way to do 
this without extra utilities is to rewrite the entire dataset.
   
   Can we create a CLI command that rebuilds just the index?
   
   Can we add checkpointing to the HBase table to reduce disaster recovery 
time, i.e. rebuild the index only for records written after the last checkpoint 
time?
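
A minimal sketch of the checkpoint idea, to make the request concrete. Everything here is hypothetical: `Commit`, `rebuild_index`, and the in-memory dict standing in for the HBase table are illustrative names, not Hudi or HBase APIs, and instant times are assumed to be strings that sort chronologically.

```python
from dataclasses import dataclass


@dataclass
class Commit:
    # Hypothetical stand-in for one commit on the timeline.
    instant_time: str        # assumed to sort chronologically as a string
    record_locations: dict   # record key -> file id written by this commit


def rebuild_index(commits, index=None, checkpoint=None):
    """Replay commits newer than `checkpoint` into a copy of `index`.

    With no checkpoint, every commit is replayed (full rebuild).
    The index maps record key -> (instant_time, file_id).
    """
    rebuilt = dict(index or {})
    for commit in sorted(commits, key=lambda c: c.instant_time):
        # Skip everything already covered by the checkpointed index.
        if checkpoint is not None and commit.instant_time <= checkpoint:
            continue
        for key, file_id in commit.record_locations.items():
            rebuilt[key] = (commit.instant_time, file_id)
    return rebuilt
```

A full rebuild would call `rebuild_index(commits)`, while recovery from a checkpoint would pass the restored index snapshot and its checkpoint time, so only later commits are scanned.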
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-1580
   - Type: New Feature
   - Affects version(s):
     - 0.9.0
   
   
   ---
   
   
   ## Comments
   
   29/Mar/21 04:19;shivnarayan;CC [~nishith29]: Do we have any such tool within 
Uber that you can upstream? Or do we have any options at the moment until the 
tool is built?;;;
   
   ---
   
   30/Mar/21 06:49;nishith29;[~shivnarayan] [~ryanpife] and I have chatted 
about this before, which resulted in this ticket. There are mechanisms at Uber, 
but unfortunately they are not upstreamable. 
   
    
   
   [~ryanpife] I know you were working on some aspects of this. Any 
progress?;;;
   
   ---
   
   13/Dec/21 14:40;shivnarayan;[~ryanpife]: Can you respond to Nishith's 
comment above? If you have a working solution internally, it would be good to 
upstream it.;;;

