nsivabalan commented on issue #4031:
URL: https://github.com/apache/hudi/issues/4031#issuecomment-974820517


   Let me know if I understand your question correctly. 
   - You are seeing a behavior where when "_hoodie_is_deleted" is set to null 
or false, hudi persist this column on storage. And you are asking why do we 
need to do this and why not just drop the column altogether? 
   
   
   Guess its easier to have same schema across incoming dataset and whats in 
storage. Also, since this is a boolean column, and only non-deleted entries are 
persisted on storage, this will compress nicely. So, I don't think this will 
give us much benefit.
   
   Let me know if this makes sense. But happy to discuss more if you need more 
clarification or have suggestion to improve furhtere. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to