[GitHub] [hudi] xushiyan commented on issue #3709: [SUPPORT] insert operation does not consistently insert duplicate records

2021-09-28 Thread GitBox
xushiyan commented on issue #3709: URL: https://github.com/apache/hudi/issues/3709#issuecomment-928732681 JIRA filed https://issues.apache.org/jira/browse/HUDI-2496 and we'll prioritize a fix. Thanks again @helanto -- This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] xushiyan commented on issue #3709: [SUPPORT] insert operation does not consistently insert duplicate records

2021-09-27 Thread GitBox
xushiyan commented on issue #3709: URL: https://github.com/apache/hudi/issues/3709#issuecomment-928732681 JIRA filed https://issues.apache.org/jira/browse/HUDI-2496 and we'll prioritize a fix. Thanks again @helanto -- This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] xushiyan commented on issue #3709: [SUPPORT] insert operation does not consistently insert duplicate records

2021-09-26 Thread GitBox
xushiyan commented on issue #3709: URL: https://github.com/apache/hudi/issues/3709#issuecomment-927439267 @helanto I can reproduce this and I agree with you that the dedup behaviors should be consistent across the same options. Also `PARQUET_SMALL_FILE_LIMIT` should just be a workaround at