[GitHub] [hudi] hudi-bot removed a comment on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057760060 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot commented on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057761850 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN * 86e65215ff5f069470d732f4dce80cd20426fb5c

[GitHub] [hudi] hudi-bot removed a comment on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057754873 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot commented on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057760060 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN * 86e65215ff5f069470d732f4dce80cd20426fb5c

[GitHub] [hudi] wangxianghu commented on a change in pull request #4918: [HUDI-3518] Make HiveSchemaProvider support AWS Glue Catalog

2022-03-02 Thread GitBox
wangxianghu commented on a change in pull request #4918: URL: https://github.com/apache/hudi/pull/4918#discussion_r818386566 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/HiveSchemaProvider.java ## @@ -52,12 +52,11 @@ private final Schema

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057724174 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * 4142034716ffdf23d99d3e6b36d2b90761610b47 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057758900 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * 4142034716ffdf23d99d3e6b36d2b90761610b47 UNKNOWN * afaceda3e445dfdbba44ac686fe2203e43cf14ec

[GitHub] [hudi] danny0405 commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
danny0405 commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057757106 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] wangxianghu commented on a change in pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
wangxianghu commented on a change in pull request #4930: URL: https://github.com/apache/hudi/pull/4930#discussion_r818383469 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/processor/JsonKafkaSourcePostProcessor.java ## @@ -0,0 +1,40 @@ +/* + *

[GitHub] [hudi] hudi-bot removed a comment on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057747910 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot commented on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057754873 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN * 86e65215ff5f069470d732f4dce80cd20426fb5c

[GitHub] [hudi] hanson2021 commented on issue #4943: [SUPPORT] NoClassDefFoundError: org/apache/hudi/org/apache/hadoop/hive/metastore/api/NoSuchObjectException

2022-03-02 Thread GitBox
hanson2021 commented on issue #4943: URL: https://github.com/apache/hudi/issues/4943#issuecomment-1057751997 I compile hudi-0.10.1 with mvn -Pflink-bundle-shade-hive3 ...,can not found this class in all generated jars -- This is an automated message from the Apache Git Service. To

[GitHub] [hudi] pratyakshsharma commented on pull request #4779: [HUDI-3264]: made schema registry urls configurable with MTDS

2022-03-02 Thread GitBox
pratyakshsharma commented on pull request #4779: URL: https://github.com/apache/hudi/pull/4779#issuecomment-1057748598 ack. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [hudi] hudi-bot removed a comment on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-976368835 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
hudi-bot commented on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057747910 ## CI report: * 9e64e88d819b6b6bf5ccc5811ea5f4714138fc9e UNKNOWN * e7e7f170612fcecc8b07839d296f2c06972f2f44 UNKNOWN * 86e65215ff5f069470d732f4dce80cd20426fb5c

[GitHub] [hudi] zhilinli123 commented on issue #4881: Full incremental Enable index loading to discover duplicate data(index.bootstrap.enabled)

2022-03-02 Thread GitBox
zhilinli123 commented on issue #4881: URL: https://github.com/apache/hudi/issues/4881#issuecomment-1057743257 > I'm using the code for the master branch where this problem occurred after 0.10.1 -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [hudi] hanson2021 opened a new issue #4943: [SUPPORT] NoClassDefFoundError: org/apache/hudi/org/apache/hadoop/hive/metastore/api/NoSuchObjectException

2022-03-02 Thread GitBox
hanson2021 opened a new issue #4943: URL: https://github.com/apache/hudi/issues/4943 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at

[GitHub] [hudi] zhilinli123 removed a comment on issue #4881: Full incremental Enable index loading to discover duplicate data(index.bootstrap.enabled)

2022-03-02 Thread GitBox
zhilinli123 removed a comment on issue #4881: URL: https://github.com/apache/hudi/issues/4881#issuecomment-1057734767 > 似乎以前修复了一个已知的错误,您的是否有此修复:#3925? I'm using the code for the master branch where this problem occurred after 0.10.1 -- This is an automated message from the

[GitHub] [hudi] hudi-bot commented on pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4930: URL: https://github.com/apache/hudi/pull/4930#issuecomment-1057736428 ## CI report: * b31e57b6d3c2981e72c164f9844c82ff9b87ec68 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4930: URL: https://github.com/apache/hudi/pull/4930#issuecomment-1057734795 ## CI report: * b31e57b6d3c2981e72c164f9844c82ff9b87ec68 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4930: URL: https://github.com/apache/hudi/pull/4930#issuecomment-1056455960 ## CI report: * b31e57b6d3c2981e72c164f9844c82ff9b87ec68 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4930: URL: https://github.com/apache/hudi/pull/4930#issuecomment-1057734795 ## CI report: * b31e57b6d3c2981e72c164f9844c82ff9b87ec68 Azure:

[GitHub] [hudi] zhilinli123 commented on issue #4881: Full incremental Enable index loading to discover duplicate data(index.bootstrap.enabled)

2022-03-02 Thread GitBox
zhilinli123 commented on issue #4881: URL: https://github.com/apache/hudi/issues/4881#issuecomment-1057734767 > 似乎以前修复了一个已知的错误,您的是否有此修复:#3925? I'm using the code for the master branch where this problem occurred after 0.10.1 -- This is an automated message from the Apache Git

[GitHub] [hudi] wangxianghu commented on pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
wangxianghu commented on pull request #4930: URL: https://github.com/apache/hudi/pull/4930#issuecomment-1057734099 > Can we add a test case by adding a test JsonKafkaPostProcessor and ensure it works. also add a test where you set some invalid class for the new config added. and assert

[GitHub] [hudi] wangxianghu commented on a change in pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
wangxianghu commented on a change in pull request #4930: URL: https://github.com/apache/hudi/pull/4930#discussion_r818363212 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java ## @@ -189,6 +189,11 @@ public static long

[GitHub] [hudi] wangxianghu commented on a change in pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
wangxianghu commented on a change in pull request #4930: URL: https://github.com/apache/hudi/pull/4930#discussion_r818363121 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java ## @@ -189,6 +189,11 @@ public static long

[GitHub] [hudi] wxplovecc commented on pull request #4679: [HUDI-3315] RFC-35 Part-1 Support bucket index in Flink writer

2022-03-02 Thread GitBox
wxplovecc commented on pull request #4679: URL: https://github.com/apache/hudi/pull/4679#issuecomment-1057724736 I have tryed and got an Exception: ```java Caused by: java.util.NoSuchElementException: No value present in Option at

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057724174 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * 4142034716ffdf23d99d3e6b36d2b90761610b47 UNKNOWN * afaceda3e445dfdbba44ac686fe2203e43cf14ec

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057678845 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] danny0405 edited a comment on pull request #4927: [HUDI-3524] - Docs for basic config page

2022-03-02 Thread GitBox
danny0405 edited a comment on pull request #4927: URL: https://github.com/apache/hudi/pull/4927#issuecomment-1057707941 List some options here that i think are basic enough from `FlinkOptions`: base options: `PATH` `TABLE_TYPE` read options: `READ_TASKS` `READ_AS_STREAMING`

[GitHub] [hudi] danny0405 commented on pull request #4927: [HUDI-3524] - Docs for basic config page

2022-03-02 Thread GitBox
danny0405 commented on pull request #4927: URL: https://github.com/apache/hudi/pull/4927#issuecomment-1057707941 List some options here that i think are basic enough from `FlinkOptions`: base options: `PATH` `TABLE_TYPE` read options: `READ_TASKS` `READ_AS_STREAMING`

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057656106 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057678845 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] pmgod8922 commented on issue #4929: [SUPPORT] SparkSession To Hudi Small files are not merged

2022-03-02 Thread GitBox
pmgod8922 commented on issue #4929: URL: https://github.com/apache/hudi/issues/4929#issuecomment-1057670318 thank you very much -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Created] (HUDI-3553) Support Hudi table as source for Kafka sink

2022-03-02 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-3553: Summary: Support Hudi table as source for Kafka sink Key: HUDI-3553 URL: https://issues.apache.org/jira/browse/HUDI-3553 Project: Apache Hudi Issue Type: Epic

[GitHub] [hudi] danny0405 commented on a change in pull request #4880: [HUDI-2752] The MOR DELETE block breaks the event time sequence of CDC

2022-03-02 Thread GitBox
danny0405 commented on a change in pull request #4880: URL: https://github.com/apache/hudi/pull/4880#discussion_r818311010 ## File path: hudi-common/src/main/java/org/apache/hudi/common/model/DeleteKey.java ## @@ -0,0 +1,87 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] danny0405 commented on a change in pull request #4880: [HUDI-2752] The MOR DELETE block breaks the event time sequence of CDC

2022-03-02 Thread GitBox
danny0405 commented on a change in pull request #4880: URL: https://github.com/apache/hudi/pull/4880#discussion_r818311010 ## File path: hudi-common/src/main/java/org/apache/hudi/common/model/DeleteKey.java ## @@ -0,0 +1,87 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057656106 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057654821 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057653197 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a UNKNOWN Bot commands @hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057654821 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057653197 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN * f2bdddf8d2247ee2a7132f5f53186a0fc749aa6a UNKNOWN Bot commands @hudi-bot supports

[GitHub] [hudi] hudi-bot removed a comment on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057651178 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4942: URL: https://github.com/apache/hudi/pull/4942#issuecomment-1057651178 ## CI report: * 3c10526aefdc9c8ed86c35fc830550fa85f7881f UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] vinothchandar commented on a change in pull request #4880: [HUDI-2752] The MOR DELETE block breaks the event time sequence of CDC

2022-03-02 Thread GitBox
vinothchandar commented on a change in pull request #4880: URL: https://github.com/apache/hudi/pull/4880#discussion_r817913665 ## File path: hudi-client/hudi-flink-client/src/main/java/org/apache/hudi/table/action/commit/FlinkWriteHelper.java ## @@ -105,7 +105,7 @@ public

[jira] [Updated] (HUDI-3552) Strength the NetworkUtils#getHostname by checking network interfaces first

2022-03-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3552: - Labels: pull-request-available (was: ) > Strength the NetworkUtils#getHostname by checking

[GitHub] [hudi] danny0405 opened a new pull request #4942: [HUDI-3552] Strength the NetworkUtils#getHostname by checking network…

2022-03-02 Thread GitBox
danny0405 opened a new pull request #4942: URL: https://github.com/apache/hudi/pull/4942 … interfaces first ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.*

[jira] [Created] (HUDI-3552) Strength the NetworkUtils#getHostname by checking network interfaces first

2022-03-02 Thread Danny Chen (Jira)
Danny Chen created HUDI-3552: Summary: Strength the NetworkUtils#getHostname by checking network interfaces first Key: HUDI-3552 URL: https://issues.apache.org/jira/browse/HUDI-3552 Project: Apache Hudi

[GitHub] [hudi] hudi-bot commented on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1057630398 ## CI report: * bfd1ab9455d5179eb2156bd39caf13116b64bd4e Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1057602680 ## CI report: * 0a55c4ad48c9557eed053d59fadd6a43f09c2381 Azure:

[GitHub] [hudi] liujinhui1994 commented on a change in pull request #3312: [HUDI-648][RFC-20] Implement error log/table for Datasource/DeltaStreamer/WriteClient/Compaction writes

2022-03-02 Thread GitBox
liujinhui1994 commented on a change in pull request #3312: URL: https://github.com/apache/hudi/pull/3312#discussion_r818273912 ## File path: hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/AbstractHoodieWriteClient.java ## @@ -315,6 +315,17 @@ public void

[GitHub] [hudi] hudi-bot commented on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057609983 ## CI report: * f648cc8727f9986deed050c6e2322eed23c1719b Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057570906 ## CI report: * a025412a3d2bb1a9010944a72c4ee3b558e8c49c Azure:

[GitHub] [hudi] danny0405 commented on pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
danny0405 commented on pull request #3771: URL: https://github.com/apache/hudi/pull/3771#issuecomment-1057606418 @test-wangxiaoyu can you rebase the master code and force push again, i'm planning to review this PR again ~ the `commands`: `git fetch upstream master` `git

[GitHub] [hudi] xiarixiaoyao commented on pull request #4786: [HUDI-3383] Sync column comments while syncing a hive table, especially using spark datasource

2022-03-02 Thread GitBox
xiarixiaoyao commented on pull request #4786: URL: https://github.com/apache/hudi/pull/4786#issuecomment-1057603794 @MrSleeping123 could you pls rebase this pr, thanks -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [hudi] guanziyue edited a comment on pull request #4880: [HUDI-2752] The MOR DELETE block breaks the event time sequence of CDC

2022-03-02 Thread GitBox
guanziyue edited a comment on pull request #4880: URL: https://github.com/apache/hudi/pull/4880#issuecomment-1057501670 Happy to see the discussion of this problem, I would like to share our solution of this problem. We chose to abandon delete block mechanism and treat all records as

[GitHub] [hudi] xiarixiaoyao commented on a change in pull request #4786: [HUDI-3383] Sync column comments while syncing a hive table, especially using spark datasource

2022-03-02 Thread GitBox
xiarixiaoyao commented on a change in pull request #4786: URL: https://github.com/apache/hudi/pull/4786#discussion_r818263065 ## File path: hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/ddl/HMSDDLExecutor.java ## @@ -247,6 +248,26 @@ public void

[GitHub] [hudi] stayrascal commented on pull request #4724: [HUDI-2815] add partial overwrite payload to support partial overwrit…

2022-03-02 Thread GitBox
stayrascal commented on pull request #4724: URL: https://github.com/apache/hudi/pull/4724#issuecomment-1057603548 > yeah, @LinMingQiang has mentioned this one above. From my understanding, if we want to enable "partial update" feature by defining customized payload class, it

[GitHub] [hudi] hudi-bot commented on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1057602680 ## CI report: * 0a55c4ad48c9557eed053d59fadd6a43f09c2381 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1057601192 ## CI report: * 0a55c4ad48c9557eed053d59fadd6a43f09c2381 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1057601192 ## CI report: * 0a55c4ad48c9557eed053d59fadd6a43f09c2381 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4926: URL: https://github.com/apache/hudi/pull/4926#issuecomment-1054998230 ## CI report: * 0a55c4ad48c9557eed053d59fadd6a43f09c2381 Azure:

[GitHub] [hudi] stayrascal commented on a change in pull request #4724: [HUDI-2815] add partial overwrite payload to support partial overwrit…

2022-03-02 Thread GitBox
stayrascal commented on a change in pull request #4724: URL: https://github.com/apache/hudi/pull/4724#discussion_r818260180 ## File path: hudi-common/src/main/java/org/apache/hudi/common/model/PartialOverwriteWithLatestAvroPayload.java ## @@ -0,0 +1,137 @@ +/* + * Licensed to

[GitHub] [hudi] scxwhite commented on a change in pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
scxwhite commented on a change in pull request #4926: URL: https://github.com/apache/hudi/pull/4926#discussion_r818259794 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/CustomizedThreadFactory.java ## @@ -0,0 +1,56 @@ +/* + * Licensed to the Apache

[GitHub] [hudi] xiaozhch5 commented on a change in pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
xiaozhch5 commented on a change in pull request #3771: URL: https://github.com/apache/hudi/pull/3771#discussion_r818259563 ## File path: hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/HiveSyncTool.java ## @@ -77,6 +77,10 @@ public HiveSyncTool(HiveSyncConfig cfg,

[GitHub] [hudi] scxwhite commented on a change in pull request #4926: add thread factory in BoundedInMemoryExecutor

2022-03-02 Thread GitBox
scxwhite commented on a change in pull request #4926: URL: https://github.com/apache/hudi/pull/4926#discussion_r818258322 ## File path: hudi-common/src/main/java/org/apache/hudi/common/util/queue/BoundedInMemoryExecutor.java ## @@ -48,8 +49,10 @@ private static final

[GitHub] [hudi] hudi-bot removed a comment on pull request #4907: [WIP][CI Test Only][HUDI-1180] Upgrade HBase to 2.4.9

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4907: URL: https://github.com/apache/hudi/pull/4907#issuecomment-1057538054 ## CI report: * 4cd46c8a580bd0e01d4a1bad28d5ad4a72181c3c UNKNOWN * 671d589463b6241bb6b1ffe971b366a621004737 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4907: [WIP][CI Test Only][HUDI-1180] Upgrade HBase to 2.4.9

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4907: URL: https://github.com/apache/hudi/pull/4907#issuecomment-1057587149 ## CI report: * 4cd46c8a580bd0e01d4a1bad28d5ad4a72181c3c UNKNOWN * 671d589463b6241bb6b1ffe971b366a621004737 UNKNOWN * 2a213efbf2894ff66599b8d1e09653e6811d1033

[GitHub] [hudi] nsivabalan commented on a change in pull request #4930: [HUDI-3525] Introduce JsonkafkaSourcePostProcessor to support data preprocess before it is transformed to DataSet

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4930: URL: https://github.com/apache/hudi/pull/4930#discussion_r818242568 ## File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java ## @@ -189,6 +189,11 @@ public static long

[GitHub] [hudi] XuQianJin-Stars commented on a change in pull request #4720: [HUDI-3221] Support querying a table as of a savepoint

2022-03-02 Thread GitBox
XuQianJin-Stars commented on a change in pull request #4720: URL: https://github.com/apache/hudi/pull/4720#discussion_r818241910 ## File path: hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/spark/sql/hudi/analysis/HoodieAnalysis.scala ## @@ -350,6 +353,35 @@ case

[GitHub] [hudi] hudi-bot commented on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057570906 ## CI report: * a025412a3d2bb1a9010944a72c4ee3b558e8c49c Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057569392 ## CI report: * a025412a3d2bb1a9010944a72c4ee3b558e8c49c Azure:

[GitHub] [hudi] melin commented on issue #4938: [SUPPORT]Add snapshot,  send event notification, user can customize listener

2022-03-02 Thread GitBox
melin commented on issue #4938: URL: https://github.com/apache/hudi/issues/4938#issuecomment-1057570270 > sorry, I am not very clear on the requirement. We already have commit callbacks. Would that suffice or are you looking for something else. can you throw some light please.

[GitHub] [hudi] hudi-bot commented on pull request #4941: [HUDI-3544] Fixing "populate meta fields" update to metadata table

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4941: URL: https://github.com/apache/hudi/pull/4941#issuecomment-1057569508 ## CI report: * 9718564c43d4b75f6021df5193df758f5464a138 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4941: [HUDI-3544] Fixing "populate meta fields" update to metadata table

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4941: URL: https://github.com/apache/hudi/pull/4941#issuecomment-1057532638 ## CI report: * 9718564c43d4b75f6021df5193df758f5464a138 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057569392 ## CI report: * a025412a3d2bb1a9010944a72c4ee3b558e8c49c Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4905: [HUDI-3548] Fix if user specify key "hoodie.datasource.clustering.async.enable" directly, async clustering not work

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4905: URL: https://github.com/apache/hudi/pull/4905#issuecomment-1057100672 ## CI report: * a025412a3d2bb1a9010944a72c4ee3b558e8c49c Azure:

[GitHub] [hudi] xiarixiaoyao commented on issue #4931: Use hive to query the XXX_ rt of Hudi

2022-03-02 Thread GitBox
xiarixiaoyao commented on issue #4931: URL: https://github.com/apache/hudi/issues/4931#issuecomment-1057567085 @LiangZuoXiang pls share your hive version thanks -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[jira] [Updated] (HUDI-2747) Fix hudi cli metadata commands

2022-03-02 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2747: Description: Fix hudi cli metadata commands.  Currently when running hudi cli metadata commands locally,

[GitHub] [hudi] nsivabalan commented on a change in pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4848: URL: https://github.com/apache/hudi/pull/4848#discussion_r818226210 ## File path: hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java ## @@ -922,4 +952,39 @@ public static int

[GitHub] [hudi] nsivabalan commented on a change in pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4848: URL: https://github.com/apache/hudi/pull/4848#discussion_r818222679 ## File path: hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java ## @@ -799,30 +824,20 @@ public static

[GitHub] [hudi] hudi-bot removed a comment on pull request #4940: [WIP] [HUDI-2871]Decouple metrics dependencies from hudi-client-common

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4940: URL: https://github.com/apache/hudi/pull/4940#issuecomment-1057520066 ## CI report: * eb916b3d8fd10f1343a3a454f1c0c47644f76c40 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4940: [WIP] [HUDI-2871]Decouple metrics dependencies from hudi-client-common

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4940: URL: https://github.com/apache/hudi/pull/4940#issuecomment-1057553755 ## CI report: * eb916b3d8fd10f1343a3a454f1c0c47644f76c40 Azure:

[GitHub] [hudi] test-wangxiaoyu commented on a change in pull request #3771: [HUDI-2402] Add Kerberos configuration options to Hive Sync

2022-03-02 Thread GitBox
test-wangxiaoyu commented on a change in pull request #3771: URL: https://github.com/apache/hudi/pull/3771#discussion_r818224037 ## File path: hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/HiveSyncTool.java ## @@ -77,6 +77,10 @@ public

[GitHub] [hudi] nsivabalan commented on a change in pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4848: URL: https://github.com/apache/hudi/pull/4848#discussion_r818219576 ## File path: hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java ## @@ -831,7 +828,7 @@ public static

[GitHub] [hudi] nsivabalan commented on a change in pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4848: URL: https://github.com/apache/hudi/pull/4848#discussion_r818218764 ## File path: hudi-common/src/main/java/org/apache/hudi/metadata/HoodieTableMetadataUtil.java ## @@ -187,94 +178,90 @@ public static void

[GitHub] [hudi] nsivabalan commented on a change in pull request #4848: [HUDI-3258] HoodieData for metadata index records, bloom and colstats init

2022-03-02 Thread GitBox
nsivabalan commented on a change in pull request #4848: URL: https://github.com/apache/hudi/pull/4848#discussion_r818218157 ## File path: hudi-common/src/main/java/org/apache/hudi/common/config/HoodieMetadataConfig.java ## @@ -165,6 +165,12 @@ + "used for pruning

[GitHub] [hudi] bhasudha commented on issue #4729: [SUPPORT] During compaction, can I merge only modified columns in a record and leave others unchanged

2022-03-02 Thread GitBox
bhasudha commented on issue #4729: URL: https://github.com/apache/hudi/issues/4729#issuecomment-1057541450 @soma17dec @nsivabalan I believe we dont have oob support for this yet like Shiva already mentioned. However, I know that some users in the community handle this via implementing

[GitHub] [hudi] hudi-bot removed a comment on pull request #4907: [WIP][CI Test Only][HUDI-1180] Upgrade HBase to 2.4.9

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4907: URL: https://github.com/apache/hudi/pull/4907#issuecomment-1057519990 ## CI report: * 4cd46c8a580bd0e01d4a1bad28d5ad4a72181c3c UNKNOWN * 671d589463b6241bb6b1ffe971b366a621004737 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #4907: [WIP][CI Test Only][HUDI-1180] Upgrade HBase to 2.4.9

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4907: URL: https://github.com/apache/hudi/pull/4907#issuecomment-1057538054 ## CI report: * 4cd46c8a580bd0e01d4a1bad28d5ad4a72181c3c UNKNOWN * 671d589463b6241bb6b1ffe971b366a621004737 UNKNOWN * 1f8fa758f993efbea4da161b965c87a30a94852b

[GitHub] [hudi] hudi-bot commented on pull request #4941: [HUDI-3544] Fixing "populate meta fields" update to metadata table

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4941: URL: https://github.com/apache/hudi/pull/4941#issuecomment-1057532638 ## CI report: * 9718564c43d4b75f6021df5193df758f5464a138 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4941: [HUDI-3544] Fixing "populate meta fields" update to metadata table

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4941: URL: https://github.com/apache/hudi/pull/4941#issuecomment-1057530660 ## CI report: * 9718564c43d4b75f6021df5193df758f5464a138 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4941: [HUDI-3544] Fixing "populate meta fields" update to metadata table

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4941: URL: https://github.com/apache/hudi/pull/4941#issuecomment-1057530660 ## CI report: * 9718564c43d4b75f6021df5193df758f5464a138 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3544) Reading from Metadata table fails w/ NPE

2022-03-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3544: - Labels: pull-request-available (was: ) > Reading from Metadata table fails w/ NPE >

[GitHub] [hudi] nsivabalan opened a new pull request #4941: [HUDI-3544] Fixing populateMeta fields update to metadata table

2022-03-02 Thread GitBox
nsivabalan opened a new pull request #4941: URL: https://github.com/apache/hudi/pull/4941 ## What is the purpose of the pull request PopulateMetafields value for metadata table will never get updated once metadata table get initialized for the first time. With 0.11, we are making

[GitHub] [hudi] Gatsby-Lee commented on issue #4839: Hudi upsert doesnt trigger compaction for MOR

2022-03-02 Thread GitBox
Gatsby-Lee commented on issue #4839: URL: https://github.com/apache/hudi/issues/4839#issuecomment-1057524586 @nsivabalan I see. So, in Spark Streaming, Async Table services are expected to run. Thank you -- This is an automated message from the Apache Git Service. To

[jira] [Created] (HUDI-3551) Add OCS StorageScheme to support Oracle Cloud

2022-03-02 Thread Rajesh (Jira)
Rajesh created HUDI-3551: Summary: Add OCS StorageScheme to support Oracle Cloud Key: HUDI-3551 URL: https://issues.apache.org/jira/browse/HUDI-3551 Project: Apache Hudi Issue Type: Improvement

[GitHub] [hudi] hudi-bot removed a comment on pull request #4940: [WIP] [HUDI-2871]Decouple metrics dependencies from hudi-client-common

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4940: URL: https://github.com/apache/hudi/pull/4940#issuecomment-1057518735 ## CI report: * eb916b3d8fd10f1343a3a454f1c0c47644f76c40 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4940: [WIP] [HUDI-2871]Decouple metrics dependencies from hudi-client-common

2022-03-02 Thread GitBox
hudi-bot commented on pull request #4940: URL: https://github.com/apache/hudi/pull/4940#issuecomment-1057520066 ## CI report: * eb916b3d8fd10f1343a3a454f1c0c47644f76c40 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4907: [WIP][CI Test Only][HUDI-1180] Upgrade HBase to 2.4.9

2022-03-02 Thread GitBox
hudi-bot removed a comment on pull request #4907: URL: https://github.com/apache/hudi/pull/4907#issuecomment-1056288769 ## CI report: * 4cd46c8a580bd0e01d4a1bad28d5ad4a72181c3c UNKNOWN * 671d589463b6241bb6b1ffe971b366a621004737 UNKNOWN *

  1   2   3   4   >