nsivabalan commented on issue #4031: URL: https://github.com/apache/hudi/issues/4031#issuecomment-974820517
Let me know if I understand your question correctly. - You are seeing a behavior where when "_hoodie_is_deleted" is set to null or false, hudi persist this column on storage. And you are asking why do we need to do this and why not just drop the column altogether? Guess its easier to have same schema across incoming dataset and whats in storage. Also, since this is a boolean column, and only non-deleted entries are persisted on storage, this will compress nicely. So, I don't think this will give us much benefit. Let me know if this makes sense. But happy to discuss more if you need more clarification or have suggestion to improve furhtere. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org