[GitHub] [hudi] root18039532923 commented on issue #2448: [SUPPORT] deltacommit for client 172.16.116.102 already exists

2021-02-22 Thread GitBox
root18039532923 commented on issue #2448: URL: https://github.com/apache/hudi/issues/2448#issuecomment-783972684 if you set "hoodie.compact.inline -> true",this means compaction changes inline, but async? This is an

[GitHub] [hudi] danny0405 commented on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
danny0405 commented on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-783951593 > Thanks @danny0405 for the base index patch, maybe there are some points to think about later,  Thanks for the new ideas if you have some and welcome the contribution ~

[GitHub] [hudi] lhjzmn commented on a change in pull request #2532: [HUDI-1534]HiveSyncTool-It is not necessary to use JDBC and MetaStoreClient at the same time

2021-02-22 Thread GitBox
lhjzmn commented on a change in pull request #2532: URL: https://github.com/apache/hudi/pull/2532#discussion_r580796528 ## File path: hudi-sync/hudi-hive-sync/src/test/java/org/apache/hudi/hive/TestHiveSyncTool.java ## @@ -87,30 +89,30 @@ public void testSchemaConvertArray()

[jira] [Assigned] (HUDI-1636) Support Builder Pattern To Build Table Properties For HoodieTableConfig

2021-02-22 Thread pengzhiwei (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pengzhiwei reassigned HUDI-1636: Assignee: pengzhiwei > Support Builder Pattern To Build Table Properties For HoodieTableConfig >

[jira] [Created] (HUDI-1636) Support Builder Pattern To Build Table Properties For HoodieTableConfig

2021-02-22 Thread pengzhiwei (Jira)
pengzhiwei created HUDI-1636: Summary: Support Builder Pattern To Build Table Properties For HoodieTableConfig Key: HUDI-1636 URL: https://issues.apache.org/jira/browse/HUDI-1636 Project: Apache Hudi

[GitHub] [hudi] lamber-ken commented on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
lamber-ken commented on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-783929502 Thanks @danny0405 for the base index patch, maybe there are some points to think about later,  This

[jira] [Closed] (HUDI-1624) The state based index should bootstrap from existing base files

2021-02-22 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang closed HUDI-1624. -- Fix Version/s: 0.8.0 Resolution: Fixed Fixed via master branch: 3ceb1b4c83d4e004c9b397bd05600c2aaf86196d

[jira] [Assigned] (HUDI-1624) The state based index should bootstrap from existing base files

2021-02-22 Thread vinoyang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vinoyang reassigned HUDI-1624: -- Assignee: Danny Chen > The state based index should bootstrap from existing base files >

[hudi] branch master updated (43a0776 -> 3ceb1b4)

2021-02-22 Thread vinoyang
This is an automated email from the ASF dual-hosted git repository. vinoyang pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 43a0776 [HUDI-1586] [Common Core] [Flink Integration] Reduce the coupling of hadoop. (#2540) add 3ceb1b4

[GitHub] [hudi] yanghua merged pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
yanghua merged pull request #2581: URL: https://github.com/apache/hudi/pull/2581 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [hudi] getniz commented on issue #2573: Rebuild a HUDI table using the Snapshot of HUDI table with its commit timeline metadata

2021-02-22 Thread GitBox
getniz commented on issue #2573: URL: https://github.com/apache/hudi/issues/2573#issuecomment-783919732 @bvaradar I m still on it. It will take sometime for me to validate this. I ll close for this now. thanks @bvaradar

[GitHub] [hudi] getniz closed issue #2573: Rebuild a HUDI table using the Snapshot of HUDI table with its commit timeline metadata

2021-02-22 Thread GitBox
getniz closed issue #2573: URL: https://github.com/apache/hudi/issues/2573 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] codejoyan opened a new issue #2592: [SUPPORT] Does latest versions of Hudi (0.7.0, 0.6.0) work with Spark 2.3.0 when reading orc files?

2021-02-22 Thread GitBox
codejoyan opened a new issue #2592: URL: https://github.com/apache/hudi/issues/2592 **Description:** The documentation says: Hudi works with Spark-2.x & Spark 3.x versions. (https://hudi.apache.org/docs/quick-start-guide.html) But I have not been able to use hudi-spark-bundle_2.11

[GitHub] [hudi] rubenssoto commented on issue #2563: [Feature Request] Full Schema Evolution

2021-02-22 Thread GitBox
rubenssoto commented on issue #2563: URL: https://github.com/apache/hudi/issues/2563#issuecomment-783862335 `Caused by: org.apache.hudi.exception.HoodieUpsertException: Failed to merge old record into new file for key 7176859 from old file

[GitHub] [hudi] leesf closed pull request #2560: [HUDI-1606]align BaseJavaCommitActionExecuto#execute method with BaseSparkCommitActionExecutor

2021-02-22 Thread GitBox
leesf closed pull request #2560: URL: https://github.com/apache/hudi/pull/2560 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [hudi] codecov-io edited a comment on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-781103554 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [hudi] lamber-ken commented on a change in pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
lamber-ken commented on a change in pull request #2581: URL: https://github.com/apache/hudi/pull/2581#discussion_r580736398 ## File path: hudi-flink/src/main/java/org/apache/hudi/operator/partitioner/BucketAssignFunction.java ## @@ -78,13 +131,22 @@ public

[GitHub] [hudi] satishkotha edited a comment on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-22 Thread GitBox
satishkotha edited a comment on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-783834813 From the error message: > missing required field description at org.apache.avro.io.ResolvingDecoder.doAction(ResolvingDecoder.java:292) at

[GitHub] [hudi] satishkotha commented on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-22 Thread GitBox
satishkotha commented on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-783834813 From the error message: > missing required field description at org.apache.avro.io.ResolvingDecoder.doAction(ResolvingDecoder.java:292) at

[GitHub] [hudi] teeyog commented on a change in pull request #2475: [HUDI-1527] automatically infer the data directory, users only need to specify the table directory

2021-02-22 Thread GitBox
teeyog commented on a change in pull request #2475: URL: https://github.com/apache/hudi/pull/2475#discussion_r580728819 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala ## @@ -74,6 +78,19 @@ class DefaultSource extends

[GitHub] [hudi] danny0405 commented on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
danny0405 commented on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-783819474 > @danny0405 please check the CI? Should not be caused by this PR, re-trigger to run the tests again.

[GitHub] [hudi] bvaradar commented on issue #2546: Whether to provide flink to read the api of hudi, or use flink sql to query hudi?

2021-02-22 Thread GitBox
bvaradar commented on issue #2546: URL: https://github.com/apache/hudi/issues/2546#issuecomment-783812882 @robin-su : Pinging again to see if you are still looking for answers. This is an automated message from the Apache

[GitHub] [hudi] bvaradar commented on issue #2564: Hoodie clean is not deleting old files

2021-02-22 Thread GitBox
bvaradar commented on issue #2564: URL: https://github.com/apache/hudi/issues/2564#issuecomment-783812545 @rswagatika : Running inline compaction is fine. But from the time, if compaction ran fine, there should have been a .commit (not .deltacommit) file in the timeline. But, I did not

[GitHub] [hudi] codecov-io edited a comment on pull request #2580: [HUDI 1623] Introduce start & end commit times to timeline

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2580: URL: https://github.com/apache/hudi/pull/2580#issuecomment-780218110 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2580?src=pr=h1) Report > Merging [#2580](https://codecov.io/gh/apache/hudi/pull/2580?src=pr=desc) (61c4a17) into

[GitHub] [hudi] bvaradar commented on issue #2573: Rebuild a HUDI table using the Snapshot of HUDI table with its commit timeline metadata

2021-02-22 Thread GitBox
bvaradar commented on issue #2573: URL: https://github.com/apache/hudi/issues/2573#issuecomment-783811753 @getniz : Let me know if you run into any problems This is an automated message from the Apache Git Service. To

[GitHub] [hudi] bvaradar commented on issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-22 Thread GitBox
bvaradar commented on issue #2589: URL: https://github.com/apache/hudi/issues/2589#issuecomment-783810812 @satishkotha: Can you please take a look at this ticket ? This is an automated message from the Apache Git Service. To

[GitHub] [hudi] codecov-io edited a comment on pull request #2475: [HUDI-1527] automatically infer the data directory, users only need to specify the table directory

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2475: URL: https://github.com/apache/hudi/pull/2475#issuecomment-765495259 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=h1) Report > Merging [#2475](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=desc) (62cff1e) into

[GitHub] [hudi] codecov-io edited a comment on pull request #2475: [HUDI-1527] automatically infer the data directory, users only need to specify the table directory

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2475: URL: https://github.com/apache/hudi/pull/2475#issuecomment-765495259 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=h1) Report > Merging [#2475](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=desc) (62cff1e) into

[GitHub] [hudi] codecov-io edited a comment on pull request #2580: [HUDI 1623] Introduce start & end commit times to timeline

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2580: URL: https://github.com/apache/hudi/pull/2580#issuecomment-780218110 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2580?src=pr=h1) Report > Merging [#2580](https://codecov.io/gh/apache/hudi/pull/2580?src=pr=desc) (61c4a17) into

[GitHub] [hudi] yanghua commented on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
yanghua commented on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-783806234 @danny0405 please check the CI? This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] satishkotha commented on pull request #2532: [HUDI-1534]HiveSyncTool-It is not necessary to use JDBC and MetaStoreClient at the same time

2021-02-22 Thread GitBox
satishkotha commented on pull request #2532: URL: https://github.com/apache/hudi/pull/2532#issuecomment-783799632 @lhjzmn Can you start a discuss thread in dev and users channel. I think removing JDBC support requires consensus in community

[jira] [Commented] (HUDI-1635) Improvements to Hudi Test Suite

2021-02-22 Thread Nishith Agarwal (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17288739#comment-17288739 ] Nishith Agarwal commented on HUDI-1635: --- # Correcting date partitions to start from 1970 #

[jira] [Created] (HUDI-1635) Improvements to Hudi Test Suite

2021-02-22 Thread Nishith Agarwal (Jira)
Nishith Agarwal created HUDI-1635: - Summary: Improvements to Hudi Test Suite Key: HUDI-1635 URL: https://issues.apache.org/jira/browse/HUDI-1635 Project: Apache Hudi Issue Type: New Feature

[GitHub] [hudi] codecov-io commented on pull request #2590: [WIP] [HUDI-1615] Avoid passing in null schema from row writing/deltastreamer

2021-02-22 Thread GitBox
codecov-io commented on pull request #2590: URL: https://github.com/apache/hudi/pull/2590#issuecomment-783707361 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2590?src=pr=h1) Report > Merging [#2590](https://codecov.io/gh/apache/hudi/pull/2590?src=pr=desc) (4b9cf33) into

[GitHub] [hudi] codecov-io edited a comment on pull request #1946: [HUDI-1176]Support log4j2 config

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #1946: URL: https://github.com/apache/hudi/pull/1946#issuecomment-774846457 # [Codecov](https://codecov.io/gh/apache/hudi/pull/1946?src=pr=h1) Report > Merging [#1946](https://codecov.io/gh/apache/hudi/pull/1946?src=pr=desc) (b7d1d47) into

[jira] [Created] (HUDI-1634) Handle the case metadata table cannot be synced due to instants being archived

2021-02-22 Thread Prashant Wason (Jira)
Prashant Wason created HUDI-1634: Summary: Handle the case metadata table cannot be synced due to instants being archived Key: HUDI-1634 URL: https://issues.apache.org/jira/browse/HUDI-1634 Project:

[GitHub] [hudi] akanungoz opened a new pull request #2591: [MINOR] check for table after creating db if auto create is true

2021-02-22 Thread GitBox
akanungoz opened a new pull request #2591: URL: https://github.com/apache/hudi/pull/2591 ## What is the purpose of the pull request *with glue client if db does not exist, db not found exception is thrown. This PR first check if database exists or autoCreateDatabase is true

[GitHub] [hudi] t0il3ts0ap edited a comment on issue #2515: [HUDI-1615] [SUPPORT] ERROR HoodieTimelineArchiveLog: Failed to archive commits

2021-02-22 Thread GitBox
t0il3ts0ap edited a comment on issue #2515: URL: https://github.com/apache/hudi/issues/2515#issuecomment-783587850 @vinothchandar Ya I am overriding just the target schema to null so as to use the `DataSet`'s schema. It is working fine for regular inserts and update.

[GitHub] [hudi] t0il3ts0ap commented on issue #2515: [HUDI-1615] [SUPPORT] ERROR HoodieTimelineArchiveLog: Failed to archive commits

2021-02-22 Thread GitBox
t0il3ts0ap commented on issue #2515: URL: https://github.com/apache/hudi/issues/2515#issuecomment-783587850 @vinothchandar Ya I am overriding just the target schema to null so as to use the DataSet's schema. It is working fine for regular inserts and update.

[jira] [Commented] (HUDI-1602) Corrupted Avro schema extracted from parquet file

2021-02-22 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17288559#comment-17288559 ] Vinoth Chandar commented on HUDI-1602: -- so, you have upgraded spark? to spark 3?  > Corrupted Avro

[GitHub] [hudi] nsivabalan commented on pull request #2443: [HUDI-1269] Make whether the failure of connect hive affects hudi ingest process configurable

2021-02-22 Thread GitBox
nsivabalan commented on pull request #2443: URL: https://github.com/apache/hudi/pull/2443#issuecomment-783558211 I have pushed a commit to add tests and address the roTableRename. But not sure if we can make the Config span any exception in general or just connect exception.

[GitHub] [hudi] nsivabalan commented on a change in pull request #2188: [HUDI-1347]fix Hbase index partition changes cause data duplication p…

2021-02-22 Thread GitBox
nsivabalan commented on a change in pull request #2188: URL: https://github.com/apache/hudi/pull/2188#discussion_r580441277 ## File path: hudi-client/hudi-spark-client/src/test/java/org/apache/hudi/index/hbase/TestHBaseIndex.java ## @@ -268,6 +268,66 @@ public void

[GitHub] [hudi] rubenssoto edited a comment on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-22 Thread GitBox
rubenssoto edited a comment on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-783534393 https://user-images.githubusercontent.com/36298331/108744715-07c64a80-7519-11eb-8b02-98261e74474d.png;> Sometimes take a while to show the error, these jobs normally run

[GitHub] [hudi] rubenssoto commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-22 Thread GitBox
rubenssoto commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-783534393 https://user-images.githubusercontent.com/36298331/108744715-07c64a80-7519-11eb-8b02-98261e74474d.png;> Sometimes take a while to show the error, these jobs run in 5 minutes,

[GitHub] [hudi] vinothchandar opened a new pull request #2590: [HUDI-1615] Avoid passing in null schema from row writing/deltastreamer

2021-02-22 Thread GitBox
vinothchandar opened a new pull request #2590: URL: https://github.com/apache/hudi/pull/2590 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of

[GitHub] [hudi] vinothchandar commented on issue #2515: [HUDI-1615] [SUPPORT] ERROR HoodieTimelineArchiveLog: Failed to archive commits

2021-02-22 Thread GitBox
vinothchandar commented on issue #2515: URL: https://github.com/apache/hudi/issues/2515#issuecomment-783468146 @t0il3ts0ap The only code path for deltastreamer I see, that can pass a null schema is when the schema provider returns `null`. `--schemaprovider-class

[GitHub] [hudi] bvaradar commented on issue #2588: [SUPPORT] Cannot create hive connection

2021-02-22 Thread GitBox
bvaradar commented on issue #2588: URL: https://github.com/apache/hudi/issues/2588#issuecomment-783449274 @umehrot2 : Can you check if there is anything that can be done in Hudi to fix this in EMR ecosystem? This is an

[GitHub] [hudi] pengzhiwei2018 commented on a change in pull request #2283: [HUDI-1415] Read Hoodie Table As Spark DataSource Table

2021-02-22 Thread GitBox
pengzhiwei2018 commented on a change in pull request #2283: URL: https://github.com/apache/hudi/pull/2283#discussion_r580215958 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/HoodieSparkSqlWriter.scala ## @@ -377,11 +388,71 @@ private[hudi]

[GitHub] [hudi] codecov-io edited a comment on pull request #2475: [HUDI-1527] automatically infer the data directory, users only need to specify the table directory

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2475: URL: https://github.com/apache/hudi/pull/2475#issuecomment-765495259 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=h1) Report > Merging [#2475](https://codecov.io/gh/apache/hudi/pull/2475?src=pr=desc) (0e4f1ee) into

[GitHub] [hudi] t0il3ts0ap opened a new issue #2589: [SUPPORT] Issue with adding column while running deltastreamer with kafka source.

2021-02-22 Thread GitBox
t0il3ts0ap opened a new issue #2589: URL: https://github.com/apache/hudi/issues/2589 **Describe the problem you faced** Schema evolution is not working when using deltastreamer with a kafka source and avro schema registry. In my usecase, I am trying to ingest cdc dumped to

[GitHub] [hudi] codecov-io edited a comment on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-781103554 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=h1) Report > Merging [#2581](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=desc) (ba652e8) into

[GitHub] [hudi] codecov-io edited a comment on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-781103554 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=h1) Report > Merging [#2581](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=desc) (ba652e8) into

[GitHub] [hudi] codecov-io edited a comment on pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
codecov-io edited a comment on pull request #2581: URL: https://github.com/apache/hudi/pull/2581#issuecomment-781103554 # [Codecov](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=h1) Report > Merging [#2581](https://codecov.io/gh/apache/hudi/pull/2581?src=pr=desc) (ba652e8) into

[GitHub] [hudi] danny0405 commented on a change in pull request #2581: [HUDI-1624] The state based index should bootstrap from existing base…

2021-02-22 Thread GitBox
danny0405 commented on a change in pull request #2581: URL: https://github.com/apache/hudi/pull/2581#discussion_r580094188 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/HoodieIndexUtils.java ## @@ -37,6 +38,29 @@ */ public class

[GitHub] [hudi] kingkongpoon edited a comment on issue #2557: [SUPPORT]Container exited with a non-zero exit code 137

2021-02-22 Thread GitBox
kingkongpoon edited a comment on issue #2557: URL: https://github.com/apache/hudi/issues/2557#issuecomment-783209847 > To help investigate better > > * Can you post the configs you used to write to hudi. > * Can you post a screen shot of spark stages. So that we know where its

[GitHub] [hudi] kingkongpoon edited a comment on issue #2557: [SUPPORT]Container exited with a non-zero exit code 137

2021-02-22 Thread GitBox
kingkongpoon edited a comment on issue #2557: URL: https://github.com/apache/hudi/issues/2557#issuecomment-783209847 > To help investigate better > > * Can you post the configs you used to write to hudi. > * Can you post a screen shot of spark stages. So that we know where its

[GitHub] [hudi] kingkongpoon commented on issue #2557: [SUPPORT]Container exited with a non-zero exit code 137

2021-02-22 Thread GitBox
kingkongpoon commented on issue #2557: URL: https://github.com/apache/hudi/issues/2557#issuecomment-783209847 > To help investigate better > > * Can you post the configs you used to write to hudi. > * Can you post a screen shot of spark stages. So that we know where its failing

[GitHub] [hudi] kingkongpoon commented on issue #2557: [SUPPORT]Container exited with a non-zero exit code 137

2021-02-22 Thread GitBox
kingkongpoon commented on issue #2557: URL: https://github.com/apache/hudi/issues/2557#issuecomment-783182791 > > When I first run the cow table (SaveMode.Overwrite), it's very fast.(about 700MB data in hdfs). but when I run an increment(SaveMode.Append), it's very slowly,and throw error