[GitHub] [hudi] codecov-commenter edited a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-869795991 #

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378241#comment-17378241 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 53ce329dd8973ea83fdafb3e9522d62aaad9222d Azure:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378240#comment-17378240 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 4f3a389384a1fbf0deb654caa490f3b32c3b7e41 Azure:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378234#comment-17378234 ] ASF GitHub Bot commented on HUDI-1860: -- Samrat002 commented on a change in pull request #3184: URL:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378233#comment-17378233 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] Samrat002 commented on a change in pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
Samrat002 commented on a change in pull request #3184: URL: https://github.com/apache/hudi/pull/3184#discussion_r54583 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/SparkRDDWriteClient.java ## @@ -200,7 +200,8 @@ public HoodieWriteResult

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 4f3a389384a1fbf0deb654caa490f3b32c3b7e41 Azure:

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378232#comment-17378232 ] ASF GitHub Bot commented on HUDI-2087: -- hudi-bot edited a comment on pull request #3252: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3252: [HUDI-2087] Support Append only in Flink stream

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3252: URL: https://github.com/apache/hudi/pull/3252#issuecomment-877374804 ## CI report: * 701c28f6701201382ccdb911662a26b445595833 Azure:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378230#comment-17378230 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 4f3a389384a1fbf0deb654caa490f3b32c3b7e41 Azure:

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378229#comment-17378229 ] ASF GitHub Bot commented on HUDI-2087: -- hudi-bot commented on pull request #3252: URL:

[GitHub] [hudi] hudi-bot commented on pull request #3252: [HUDI-2087] Support Append only in Flink stream

2021-07-09 Thread GitBox
hudi-bot commented on pull request #3252: URL: https://github.com/apache/hudi/pull/3252#issuecomment-877374804 ## CI report: * 701c28f6701201382ccdb911662a26b445595833 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run travis`

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378226#comment-17378226 ] ASF GitHub Bot commented on HUDI-2087: -- yuzhaojing opened a new pull request #3252: URL:

[GitHub] [hudi] yuzhaojing opened a new pull request #3252: [HUDI-2087] Support Append only in Flink stream

2021-07-09 Thread GitBox
yuzhaojing opened a new pull request #3252: URL: https://github.com/apache/hudi/pull/3252 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contributing.html before opening a pull request.* ## What is the purpose of the

[hudi] branch master updated (3715267 -> b4562e8)

2021-07-09 Thread vinoth
This is an automated email from the ASF dual-hosted git repository. vinoth pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 3715267 [HUDI-2087] Support Append only in Flink stream (#3174) add b4562e8 Revert "[HUDI-2087] Support Append

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378224#comment-17378224 ] ASF GitHub Bot commented on HUDI-2087: -- vinothchandar merged pull request #3251: URL:

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378223#comment-17378223 ] ASF GitHub Bot commented on HUDI-2087: -- vinothchandar opened a new pull request #3251: URL:

[GitHub] [hudi] vinothchandar merged pull request #3251: Revert "[HUDI-2087] Support Append only in Flink stream (#3174)"

2021-07-09 Thread GitBox
vinothchandar merged pull request #3251: URL: https://github.com/apache/hudi/pull/3251 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] vinothchandar opened a new pull request #3251: Revert "[HUDI-2087] Support Append only in Flink stream (#3174)"

2021-07-09 Thread GitBox
vinothchandar opened a new pull request #3251: URL: https://github.com/apache/hudi/pull/3251 This reverts commit 371526789d663dee85041eb31c27c52c81ef87ef. ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review

[jira] [Commented] (HUDI-2087) Support Append only in Flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378221#comment-17378221 ] ASF GitHub Bot commented on HUDI-2087: -- vinothchandar commented on pull request #3174: URL:

[GitHub] [hudi] vinothchandar commented on pull request #3174: [HUDI-2087] Support Append only in Flink stream

2021-07-09 Thread GitBox
vinothchandar commented on pull request #3174: URL: https://github.com/apache/hudi/pull/3174#issuecomment-877369660 It failed on azure too. ``` [INFO] hudi-spark-bundle_2.11 . SUCCESS [ 2.656 s] [INFO] hudi-presto-bundle

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378212#comment-17378212 ] ASF GitHub Bot commented on HUDI-1483: -- hudi-bot edited a comment on pull request #3142: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-866996072 ## CI report: * f86f50e817a625dc30f35a39b7495a4f359e4da5 Azure:

[jira] [Commented] (HUDI-1077) Integration tests to validate clustering

2021-07-09 Thread satish (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378206#comment-17378206 ] satish commented on HUDI-1077: -- Hi [~codope], I havent added any thing related to clustering to test suite

[GitHub] [hudi] codecov-commenter edited a comment on pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3250: URL: https://github.com/apache/hudi/pull/3250#issuecomment-877313010 #

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378186#comment-17378186 ] ASF GitHub Bot commented on HUDI-1483: -- hudi-bot edited a comment on pull request #3142: URL:

[jira] [Commented] (HUDI-2144) Offline clustering(independent sparkJob) will cause insert action losing data

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378184#comment-17378184 ] ASF GitHub Bot commented on HUDI-2144: -- codope commented on a change in pull request #3240: URL:

[jira] [Commented] (HUDI-1828) Ensure All Tests Pass with ORC format

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378183#comment-17378183 ] ASF GitHub Bot commented on HUDI-1828: -- jintaoguan commented on a change in pull request #3237: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-866996072 ## CI report: * f86f50e817a625dc30f35a39b7495a4f359e4da5 Azure:

[GitHub] [hudi] codope commented on a change in pull request #3240: [HUDI-2144]Bug-Fix:Offline clustering(HoodieClusteringJob) will cause insert action losing data

2021-07-09 Thread GitBox
codope commented on a change in pull request #3240: URL: https://github.com/apache/hudi/pull/3240#discussion_r667097064 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/UpsertPartitioner.java ## @@ -146,7 +146,7 @@ private int

[GitHub] [hudi] jintaoguan commented on a change in pull request #3237: [HUDI-1828] Update unit tests to support ORC as the base file format

2021-07-09 Thread GitBox
jintaoguan commented on a change in pull request #3237: URL: https://github.com/apache/hudi/pull/3237#discussion_r667097170 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/HoodieReadClient.java ## @@ -144,7 +145,15 @@ private void

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378182#comment-17378182 ] ASF GitHub Bot commented on HUDI-1483: -- hudi-bot edited a comment on pull request #3142: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-866996072 ## CI report: * 890e9822855fcd45c8387f83740975f43474cddc Azure:

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378180#comment-17378180 ] ASF GitHub Bot commented on HUDI-1483: -- hudi-bot edited a comment on pull request #3142: URL:

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378181#comment-17378181 ] ASF GitHub Bot commented on HUDI-1483: -- codope commented on pull request #3142: URL:

[GitHub] [hudi] codope commented on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
codope commented on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-877330225 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] hudi-bot edited a comment on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-866996072 ## CI report: * 890e9822855fcd45c8387f83740975f43474cddc Azure:

[GitHub] [hudi] codope commented on a change in pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
codope commented on a change in pull request #3250: URL: https://github.com/apache/hudi/pull/3250#discussion_r667092538 ## File path: hudi-client/hudi-client-common/src/test/java/org/apache/hudi/config/TestHoodieWriteConfig.java ## @@ -43,6 +43,7 @@ public void

[jira] [Commented] (HUDI-2141) Integration flink metric in flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378167#comment-17378167 ] ASF GitHub Bot commented on HUDI-2141: -- codecov-commenter edited a comment on pull request #3235:

[GitHub] [hudi] codecov-commenter edited a comment on pull request #3235: [HUDI-2141] Integration flink metric in flink stream

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3235: URL: https://github.com/apache/hudi/pull/3235#issuecomment-875600178 #

[GitHub] [hudi] nsivabalan commented on a change in pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
nsivabalan commented on a change in pull request #3250: URL: https://github.com/apache/hudi/pull/3250#discussion_r667079580 ## File path: hudi-client/hudi-client-common/src/test/java/org/apache/hudi/config/TestHoodieWriteConfig.java ## @@ -43,6 +43,7 @@ public void

[GitHub] [hudi] codecov-commenter commented on pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
codecov-commenter commented on pull request #3250: URL: https://github.com/apache/hudi/pull/3250#issuecomment-877313010 # [Codecov](https://codecov.io/gh/apache/hudi/pull/3250?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[jira] [Commented] (HUDI-2141) Integration flink metric in flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378158#comment-17378158 ] ASF GitHub Bot commented on HUDI-2141: -- codecov-commenter edited a comment on pull request #3235:

[jira] [Commented] (HUDI-2144) Offline clustering(independent sparkJob) will cause insert action losing data

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378157#comment-17378157 ] ASF GitHub Bot commented on HUDI-2144: -- vinothchandar commented on pull request #3240: URL:

[GitHub] [hudi] codecov-commenter edited a comment on pull request #3235: [HUDI-2141] Integration flink metric in flink stream

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3235: URL: https://github.com/apache/hudi/pull/3235#issuecomment-875600178 #

[GitHub] [hudi] vinothchandar commented on pull request #3240: [HUDI-2144]Bug-Fix:Offline clustering(HoodieClusteringJob) will cause insert action losing data

2021-07-09 Thread GitBox
vinothchandar commented on pull request #3240: URL: https://github.com/apache/hudi/pull/3240#issuecomment-877306590 @satishkotha @codope Can one of you please help review? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[jira] [Commented] (HUDI-2151) Make performant out-of-box configs

2021-07-09 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378152#comment-17378152 ] Vinoth Chandar commented on HUDI-2151: -- Does this even help?   {code:java}  public static final

[GitHub] [hudi] codecov-commenter edited a comment on pull request #3235: [HUDI-2141] Integration flink metric in flink stream

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3235: URL: https://github.com/apache/hudi/pull/3235#issuecomment-875600178 #

[jira] [Commented] (HUDI-2141) Integration flink metric in flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378144#comment-17378144 ] ASF GitHub Bot commented on HUDI-2141: -- codecov-commenter edited a comment on pull request #3235:

[jira] [Created] (HUDI-2158) Upstream support for MOR tables.

2021-07-09 Thread Vinoth Chandar (Jira)
Vinoth Chandar created HUDI-2158: Summary: Upstream support for MOR tables. Key: HUDI-2158 URL: https://issues.apache.org/jira/browse/HUDI-2158 Project: Apache Hudi Issue Type: Sub-task

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378124#comment-17378124 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378125#comment-17378125 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667023267 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/utils/HiveBucketUtils.java ## @@ -0,0 +1,91 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667023026 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/utils/HiveBucketUtils.java ## @@ -0,0 +1,91 @@ +/* + * Licensed to the Apache

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378122#comment-17378122 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378123#comment-17378123 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667022566 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/utils/HiveBucketUtils.java ## @@ -0,0 +1,91 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667022284 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/commit/BucketInfo.java ## @@ -30,6 +30,10 @@ String fileIdPrefix;

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378121#comment-17378121 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667021453 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/keygen/SimpleAvroKeyGenerator.java ## @@ -30,19 +33,36 @@ public

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378114#comment-17378114 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378113#comment-17378113 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667017924 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieIndexConfig.java ## @@ -189,6 +189,13 @@

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667017924 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieIndexConfig.java ## @@ -189,6 +189,13 @@

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378112#comment-17378112 ] ASF GitHub Bot commented on HUDI-1951: -- leesf commented on a change in pull request #3173: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3173: URL: https://github.com/apache/hudi/pull/3173#discussion_r667017924 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieIndexConfig.java ## @@ -189,6 +189,13 @@

[jira] [Commented] (HUDI-2144) Offline clustering(independent sparkJob) will cause insert action losing data

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378111#comment-17378111 ] ASF GitHub Bot commented on HUDI-2144: -- zhangyue19921010 commented on pull request #3240: URL:

[GitHub] [hudi] zhangyue19921010 commented on pull request #3240: [HUDI-2144]Bug-Fix:Offline clustering(HoodieClusteringJob) will cause insert action losing data

2021-07-09 Thread GitBox
zhangyue19921010 commented on pull request #3240: URL: https://github.com/apache/hudi/pull/3240#issuecomment-877251884 Hi @leesf Thanks for your review. Yes, `when clustering plan contains the small files, the new insert should not get into small files` only works when users set

[jira] [Commented] (HUDI-1828) Ensure All Tests Pass with ORC format

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378109#comment-17378109 ] ASF GitHub Bot commented on HUDI-1828: -- leesf commented on a change in pull request #3237: URL:

[GitHub] [hudi] leesf commented on a change in pull request #3237: [HUDI-1828] Update unit tests to support ORC as the base file format

2021-07-09 Thread GitBox
leesf commented on a change in pull request #3237: URL: https://github.com/apache/hudi/pull/3237#discussion_r667013274 ## File path: hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/client/HoodieReadClient.java ## @@ -144,7 +145,15 @@ private void

[jira] [Commented] (HUDI-2144) Offline clustering(independent sparkJob) will cause insert action losing data

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378105#comment-17378105 ] ASF GitHub Bot commented on HUDI-2144: -- leesf commented on pull request #3240: URL:

[GitHub] [hudi] leesf commented on pull request #3240: [HUDI-2144]Bug-Fix:Offline clustering(HoodieClusteringJob) will cause insert action losing data

2021-07-09 Thread GitBox
leesf commented on pull request #3240: URL: https://github.com/apache/hudi/pull/3240#issuecomment-877245202 when clustering plan contains the small files, the new insert should not get into small files, so the new insert get into file slice2 is strange. -- This is an automated message

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378096#comment-17378096 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 4f3a389384a1fbf0deb654caa490f3b32c3b7e41 Azure:

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378090#comment-17378090 ] ASF GitHub Bot commented on HUDI-1951: -- hudi-bot edited a comment on pull request #3173: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-869653867 ## CI report: * 5afbbaabe333b8d290c79dfdeb10d8d3aaca11c7 Azure:

[GitHub] [hudi] codecov-commenter edited a comment on pull request #3248: [MINOR] Fix some wrong assert reasons

2021-07-09 Thread GitBox
codecov-commenter edited a comment on pull request #3248: URL: https://github.com/apache/hudi/pull/3248#issuecomment-877187703 #

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378069#comment-17378069 ] ASF GitHub Bot commented on HUDI-1951: -- hudi-bot edited a comment on pull request #3173: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-869653867 ## CI report: * 7aef86d8976d44c1427ccf944d3ecfa00b629159 Azure:

[jira] [Commented] (HUDI-1951) Hash Index for HUDI

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378066#comment-17378066 ] ASF GitHub Bot commented on HUDI-1951: -- hudi-bot edited a comment on pull request #3173: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-869653867 ## CI report: * 7aef86d8976d44c1427ccf944d3ecfa00b629159 Azure:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378064#comment-17378064 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 15ea5785bc8556cd17b6dc4da5cce7d542fbd896 Azure:

[GitHub] [hudi] codecov-commenter commented on pull request #3248: [MINOR] Fix some wrong assert reasons

2021-07-09 Thread GitBox
codecov-commenter commented on pull request #3248: URL: https://github.com/apache/hudi/pull/3248#issuecomment-877187703 # [Codecov](https://codecov.io/gh/apache/hudi/pull/3248?src=pr=h1_medium=referral_source=github_content=comment_campaign=pr+comments_term=The+Apache+Software+Foundation)

[jira] [Created] (HUDI-2157) Spark write the bucket index table

2021-07-09 Thread XiaoyuGeng (Jira)
XiaoyuGeng created HUDI-2157: Summary: Spark write the bucket index table Key: HUDI-2157 URL: https://issues.apache.org/jira/browse/HUDI-2157 Project: Apache Hudi Issue Type: New Feature

[GitHub] [hudi] hudi-bot edited a comment on pull request #3184: [HUDI-1860] Add INSERT_OVERWRITE and INSERT_OVERWRITE_TABLE support to DeltaStreamer

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3184: URL: https://github.com/apache/hudi/pull/3184#issuecomment-870410669 ## CI report: * 15ea5785bc8556cd17b6dc4da5cce7d542fbd896 Azure:

[jira] [Commented] (HUDI-1860) Add INSERT_OVERWRITE support to DeltaStreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378063#comment-17378063 ] ASF GitHub Bot commented on HUDI-1860: -- hudi-bot edited a comment on pull request #3184: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3250: URL: https://github.com/apache/hudi/pull/3250#issuecomment-877145194 ## CI report: * 0420f87e4e013d2c6c1fdff56b82e26d464c635c Azure:

[jira] [Created] (HUDI-2156) Cluster the table with bucket index

2021-07-09 Thread XiaoyuGeng (Jira)
XiaoyuGeng created HUDI-2156: Summary: Cluster the table with bucket index Key: HUDI-2156 URL: https://issues.apache.org/jira/browse/HUDI-2156 Project: Apache Hudi Issue Type: New Feature

[jira] [Updated] (HUDI-2155) Bulk insert support bucket index

2021-07-09 Thread XiaoyuGeng (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiaoyuGeng updated HUDI-2155: - Summary: Bulk insert support bucket index (was: Bulk insert support Hive index) > Bulk insert support

[jira] [Created] (HUDI-2155) Bulk insert support Hive index

2021-07-09 Thread XiaoyuGeng (Jira)
XiaoyuGeng created HUDI-2155: Summary: Bulk insert support Hive index Key: HUDI-2155 URL: https://issues.apache.org/jira/browse/HUDI-2155 Project: Apache Hudi Issue Type: New Feature

[jira] [Commented] (HUDI-2141) Integration flink metric in flink stream

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378037#comment-17378037 ] ASF GitHub Bot commented on HUDI-2141: -- hudi-bot edited a comment on pull request #3235: URL:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3235: [HUDI-2141] Integration flink metric in flink stream

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3235: URL: https://github.com/apache/hudi/pull/3235#issuecomment-875512974 ## CI report: * 12f9fc5e0391242aaddc4e0c09ada8ee3c745a47 Azure:

[GitHub] [hudi] hudi-bot edited a comment on pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3250: URL: https://github.com/apache/hudi/pull/3250#issuecomment-877145194 ## CI report: * 0420f87e4e013d2c6c1fdff56b82e26d464c635c Azure:

[GitHub] [hudi] hudi-bot commented on pull request #3250: [MINOR] Fix EXTERNAL_RECORD_AND_SCHEMA_TRANSFORMATION config

2021-07-09 Thread GitBox
hudi-bot commented on pull request #3250: URL: https://github.com/apache/hudi/pull/3250#issuecomment-877145194 ## CI report: * 0420f87e4e013d2c6c1fdff56b82e26d464c635c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run travis`

[GitHub] [hudi] hudi-bot edited a comment on pull request #3142: [HUDI-1483] Support async clustering for deltastreamer and Spark streaming

2021-07-09 Thread GitBox
hudi-bot edited a comment on pull request #3142: URL: https://github.com/apache/hudi/pull/3142#issuecomment-866996072 ## CI report: * 890e9822855fcd45c8387f83740975f43474cddc Azure:

[jira] [Commented] (HUDI-1483) async clustering for deltastreamer

2021-07-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378033#comment-17378033 ] ASF GitHub Bot commented on HUDI-1483: -- hudi-bot edited a comment on pull request #3142: URL:

<    1   2   3   >