hudi-bot opened a new issue, #15115:
URL: https://github.com/apache/hudi/issues/15115

   Currently, block-merging is configurable to be either lazy or non-lazy. 
However non-lazy sequence is incorrect – it will be merging blocks before 
actually rolling back reverted ones. To make sure users do not accidentally hit 
this issue, we need to revisit MOR block merging sequence and make sure that 
following invariants are upheld
    # Blocks are merged in 2 passes:
    ## First we load all blocks, while dropping rolled back ones, then
    ## We merge them in another forward-pass
    # We should try to avoid having 2 merging sequences and instead consolidate 
on just one: right now we have "block + block", and "base + block", but we 
should be able to just get away with just the latter (this will simplify 
merging sequence quite substantially, for ex in respect to handling of 
deletions) 
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-3828
   - Type: Bug
   - Fix version(s):
     - 1.1.0
   
   
   ---
   
   
   ## Comments
   
   18/Jan/23 20:33;alexey.kudinkin;This is addressed in the new scanV2 
implementation;;;


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to