hudi-bot opened a new issue, #15471:
URL: https://github.com/apache/hudi/issues/15471
Summing {{numInserts - numDeletes}} across all commits yields a negative
total record count. We need to check whether the insert and delete counts
are accurate when inserts and deletes both appear in the same input batch
of an upsert.
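
For reference, the computation described above can be sketched roughly as follows. This is only an illustration, not code from Hudi: the helper name `net_record_count` is hypothetical, the sample data is hand-written, and the JSON field names (`partitionToWriteStats`, `numInserts`, `numDeletes`) are assumed to mirror the commit metadata layout, so check them against the commit files of your Hudi version.

```python
import json


def net_record_count(commits):
    """Sum (numInserts - numDeletes) over every write stat of every commit.

    `commits` is an iterable of dicts parsed from commit metadata JSON.
    The field names used here are assumptions about that layout.
    """
    total = 0
    for commit in commits:
        for stats in commit.get("partitionToWriteStats", {}).values():
            for stat in stats:
                total += stat.get("numInserts", 0) - stat.get("numDeletes", 0)
    return total


# Hypothetical, hand-written metadata for two commits (not real Hudi output).
sample_commits = [
    json.loads('{"partitionToWriteStats": {"p1": [{"numInserts": 10, "numDeletes": 0}]}}'),
    json.loads('{"partitionToWriteStats": {"p1": [{"numInserts": 2, "numDeletes": 3}]}}'),
]

print(net_record_count(sample_commits))
```

If the counts are accurate, this total should never drop below zero, since a table cannot delete more records than were ever inserted; a negative result is the symptom reported here.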
## JIRA info
- Link: https://issues.apache.org/jira/browse/HUDI-4958
- Type: Bug
- Fix version(s): 1.1.0
---
## Comments
30/Sep/22 21:26;guoyihua;I checked the commit data after the insert, upsert
(including deletes with "_hoodie_is_deleted"), and delete operations using
the Spark datasource, following the Spark Guide. The numInserts and
numDeletes values look accurate, and the logic for deriving numDeletes in
HoodieMergeHandle also looks OK. We need to see whether the inaccuracy
comes from the custom payload implementation.;;;