hudi-bot opened a new issue, #15471:
URL: https://github.com/apache/hudi/issues/15471

   When doing a simple computation of {{numInserts - numDeletes}} for all the 
commits, this leads to negative total records.  Need to check if number of 
inserts and deletes are accurate when both inserts and deletes exist in the 
same input batch for upsert.
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-4958
   - Type: Bug
   - Fix version(s):
     - 1.1.0
   
   
   ---
   
   
   ## Comments
   
   30/Sep/22 21:26;guoyihua;I check the commit data after insert, upsert 
(including deletes with "_hoodie_is_deleted"), and delete operations using 
Spark datasource in the Spark Guide.  The numInserts and numDeletes look 
accurate.  Also the logic for deriving numDeletes in HoodieMergeHandle looks 
OK.  We need to see if the inaccuracy comes from the custom payload 
implementation.;;;


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to