[GitHub] [hudi] nsivabalan opened a new pull request, #6878: [HUDI-3397] Guard repeated rdd triggers

2022-10-05 Thread GitBox
nsivabalan opened a new pull request, #6878: URL: https://github.com/apache/hudi/pull/6878 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performan

[jira] [Updated] (HUDI-3397) Make sure Spark RDDs triggering actual FS activity are only dereferenced once

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3397: - Labels: pull-request-available spark (was: spark) > Make sure Spark RDDs triggering actual FS act

[GitHub] [hudi] nsivabalan opened a new pull request, #6877: [HUDI-3397] Guard repeated rdd triggers

2022-10-05 Thread GitBox
nsivabalan opened a new pull request, #6877: URL: https://github.com/apache/hudi/pull/6877 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performan

[GitHub] [hudi] gtwuser commented on issue #6869: [SUPPORT] Incremental upsert or merge is not working

2022-10-05 Thread GitBox
gtwuser commented on issue #6869: URL: https://github.com/apache/hudi/issues/6869#issuecomment-1269410021 > so the problem is got two records with same key?@gtwuser Yes same record key -- This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [hudi] hudi-bot commented on pull request #6876: [MINOR] Handling null event time

2022-10-05 Thread GitBox
hudi-bot commented on PR #6876: URL: https://github.com/apache/hudi/pull/6876#issuecomment-1269392805 ## CI report: * 6ce255ff0537ecb4ecf9bf7cf7f2534f7021337b Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-10-05 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1269321846 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * f21eab07069aa87544e04b115e7463126cd9c472 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-10-05 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1269318141 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * f21eab07069aa87544e04b115e7463126cd9c472 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6836: [HUDI-4952] Fixing reading from metadata table when there are no inflight commits

2022-10-05 Thread GitBox
hudi-bot commented on PR #6836: URL: https://github.com/apache/hudi/pull/6836#issuecomment-1269280015 ## CI report: * e246d65957362860b850f1af9ef973b85bf1a4eb Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6745: Fix comment in RFC46

2022-10-05 Thread GitBox
hudi-bot commented on PR #6745: URL: https://github.com/apache/hudi/pull/6745#issuecomment-1269274673 ## CI report: * af8e58757bed12e53907076da02add1ba98b220c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6745: Fix comment in RFC46

2022-10-05 Thread GitBox
hudi-bot commented on PR #6745: URL: https://github.com/apache/hudi/pull/6745#issuecomment-1269271854 ## CI report: * af8e58757bed12e53907076da02add1ba98b220c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[jira] [Updated] (HUDI-4605) Upgrade hudi-presto-bundle version to 0.12.0

2022-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4605: - Fix Version/s: 0.12.2 > Upgrade hudi-presto-bundle version to 0.12.0 > ---

[GitHub] [hudi] hudi-bot commented on pull request #6876: [MINOR] Handling null event time

2022-10-05 Thread GitBox
hudi-bot commented on PR #6876: URL: https://github.com/apache/hudi/pull/6876#issuecomment-1269268926 ## CI report: * 6ce255ff0537ecb4ecf9bf7cf7f2534f7021337b Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[jira] [Updated] (HUDI-4522) [DOCS] Set presto session prop to use parquet column names in case of type mismatch

2022-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4522: - Fix Version/s: 0.13.0 (was: 0.12.0) > [DOCS] Set presto session prop to use parquet

[GitHub] [hudi] hudi-bot commented on pull request #6862: [HUDI-4989] fixing deltastreamer init failures

2022-10-05 Thread GitBox
hudi-bot commented on PR #6862: URL: https://github.com/apache/hudi/pull/6862#issuecomment-1269268848 ## CI report: * 149aec6ea8ff6d895da07b0226be1efdf920e3d8 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[jira] [Updated] (HUDI-4522) [DOCS] Set presto session prop to use parquet column names in case of type mismatch

2022-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-4522: - Fix Version/s: 0.12.0 (was: 0.13.0) > [DOCS] Set presto session prop to use parquet

[jira] [Updated] (HUDI-3210) [UMBRELLA] Native Presto connector for Hudi

2022-10-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3210: - Summary: [UMBRELLA] Native Presto connector for Hudi (was: [UMBRELLA] A new Presto connector for Hudi) >

[jira] [Closed] (HUDI-4988) Add Docs regarding Hudi RecordMerger

2022-10-05 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong closed HUDI-4988. Resolution: Fixed > Add Docs regarding Hudi RecordMerger > > >

[jira] [Assigned] (HUDI-4988) Add Docs regarding Hudi RecordMerger

2022-10-05 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong reassigned HUDI-4988: Assignee: Frank Wong > Add Docs regarding Hudi RecordMerger >

[jira] [Assigned] (HUDI-3217) RFC-46: Optimize Record Payload handling

2022-10-05 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong reassigned HUDI-3217: Assignee: Alexey Kudinkin (was: Frank Wong) > RFC-46: Optimize Record Payload handling > -

[jira] [Reopened] (HUDI-3217) RFC-46: Optimize Record Payload handling

2022-10-05 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong reopened HUDI-3217: -- > RFC-46: Optimize Record Payload handling > > > Ke

[jira] [Updated] (HUDI-3217) RFC-46: Optimize Record Payload handling

2022-10-05 Thread Frank Wong (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Frank Wong updated HUDI-3217: - Status: In Progress (was: Reopened) > RFC-46: Optimize Record Payload handling >

[hudi] branch master updated: [MINOR] Fix deploy script for flink 1.15 (#6872)

2022-10-05 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new fd8a947e61 [MINOR] Fix deploy script for flink 1

[GitHub] [hudi] xushiyan merged pull request #6872: [MINOR] Fix deploy script for flink 1.15

2022-10-05 Thread GitBox
xushiyan merged PR #6872: URL: https://github.com/apache/hudi/pull/6872 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[GitHub] [hudi] hudi-bot commented on pull request #6876: [MINOR] Handling null event time

2022-10-05 Thread GitBox
hudi-bot commented on PR #6876: URL: https://github.com/apache/hudi/pull/6876#issuecomment-1269234032 ## CI report: * 6ce255ff0537ecb4ecf9bf7cf7f2534f7021337b UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4986) Enhance hudi integ test readme for multi-writer tests

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4986: - Labels: pull-request-available (was: ) > Enhance hudi integ test readme for multi-writer tests >

[hudi] branch master updated: Enhancing README for multi-writer tests (#6870)

2022-10-05 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 280194d3b6 Enhancing README for multi-writer tests

[GitHub] [hudi] codope merged pull request #6870: [HUDI-4986] Enhancing README for multi-writer tests

2022-10-05 Thread GitBox
codope merged PR #6870: URL: https://github.com/apache/hudi/pull/6870 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.or

[hudi] branch master updated: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create (#6857)

2022-10-05 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new fb4f026580 [HUDI-4970] Update kafka-connect readme

[GitHub] [hudi] codope merged pull request #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
codope merged PR #6857: URL: https://github.com/apache/hudi/pull/6857 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.or

[GitHub] [hudi] codope commented on pull request #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
codope commented on PR #6857: URL: https://github.com/apache/hudi/pull/6857#issuecomment-1269231905 Landing it. Just a readme and test update. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[hudi] branch master updated: Revert "[HUDI-4915] improve avro serializer/deserializer (#6788)" (#6809)

2022-10-05 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 067cc24d88 Revert "[HUDI-4915] improve avro seri

[GitHub] [hudi] xushiyan merged pull request #6809: Revert "[HUDI-4915] improve avro serializer/deserializer (#6788)"

2022-10-05 Thread GitBox
xushiyan merged PR #6809: URL: https://github.com/apache/hudi/pull/6809 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[GitHub] [hudi] hudi-bot commented on pull request #6836: [HUDI-4952] Fixing reading from metadata table when there are no inflight commits

2022-10-05 Thread GitBox
hudi-bot commented on PR #6836: URL: https://github.com/apache/hudi/pull/6836#issuecomment-1269231396 ## CI report: * 23d923e6b8c75781053f3f7bbc811084141f7786 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] nsivabalan opened a new pull request, #6876: [MINOR] Handling null event time

2022-10-05 Thread GitBox
nsivabalan opened a new pull request, #6876: URL: https://github.com/apache/hudi/pull/6876 ### Change Logs Seeing noisy debug logs (`Fail to parse event time value`) with tests. Fixing null event time handling. ### Impact _Describe any public API or user-facing feature

[GitHub] [hudi] hudi-bot commented on pull request #6836: [HUDI-4952] Fixing reading from metadata table when there are no inflight commits

2022-10-05 Thread GitBox
hudi-bot commented on PR #6836: URL: https://github.com/apache/hudi/pull/6836#issuecomment-1269228467 ## CI report: * 23d923e6b8c75781053f3f7bbc811084141f7786 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-10-05 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1269225293 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * f21eab07069aa87544e04b115e7463126cd9c472 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] nsivabalan commented on a diff in pull request #6836: [HUDI-4952] Fixing reading from metadata table when there are no inflight commits

2022-10-05 Thread GitBox
nsivabalan commented on code in PR #6836: URL: https://github.com/apache/hudi/pull/6836#discussion_r988495539 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieFileIndex.scala: ## @@ -293,16 +293,20 @@ object HoodieFileIndex extends Logging { s

[GitHub] [hudi] hudi-bot commented on pull request #6862: [HUDI-4989] fixing deltastreamer init failures

2022-10-05 Thread GitBox
hudi-bot commented on PR #6862: URL: https://github.com/apache/hudi/pull/6862#issuecomment-1269183442 ## CI report: * 149aec6ea8ff6d895da07b0226be1efdf920e3d8 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[jira] [Updated] (HUDI-4989) Deltastreamer fails if table instantiation failed mid-way in prior attempt

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4989: - Labels: pull-request-available (was: ) > Deltastreamer fails if table instantiation failed mid-wa

[GitHub] [hudi] hudi-bot commented on pull request #6862: [HUDI-4989] fixing deltastreamer init failures

2022-10-05 Thread GitBox
hudi-bot commented on PR #6862: URL: https://github.com/apache/hudi/pull/6862#issuecomment-1269180039 ## CI report: * 149aec6ea8ff6d895da07b0226be1efdf920e3d8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4989) Deltastreamer fails if table instantiation failed mid-way in prior attempt

2022-10-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4989: -- Priority: Critical (was: Major) > Deltastreamer fails if table instantiation failed mid

[jira] [Created] (HUDI-4989) Deltastreamer fails if table instantiation failed mid-way in prior attempt

2022-10-05 Thread sivabalan narayanan (Jira)
sivabalan narayanan created HUDI-4989: - Summary: Deltastreamer fails if table instantiation failed mid-way in prior attempt Key: HUDI-4989 URL: https://issues.apache.org/jira/browse/HUDI-4989 Proj

[jira] [Updated] (HUDI-4989) Deltastreamer fails if table instantiation failed mid-way in prior attempt

2022-10-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-4989: -- Fix Version/s: 0.13.0 > Deltastreamer fails if table instantiation failed mid-way in pri

[jira] [Assigned] (HUDI-4989) Deltastreamer fails if table instantiation failed mid-way in prior attempt

2022-10-05 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-4989: - Assignee: sivabalan narayanan > Deltastreamer fails if table instantiation failed

[GitHub] [hudi] yihua commented on a diff in pull request #6003: [HUDI-1575][RFC-56] Early Conflict Detection For Multi-writer

2022-10-05 Thread GitBox
yihua commented on code in PR #6003: URL: https://github.com/apache/hudi/pull/6003#discussion_r988454552 ## rfc/rfc-56/rfc-56.md: ## @@ -0,0 +1,238 @@ + + +# RFC-56: Early Conflict Detection For Multi-writer + +## Proposers + +- @zhangyue19921010 + +## Approvers + +- @yihua + +#

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-10-05 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1269130978 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * 91655a8009d60f6337939f87d6e2e01922877848 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6358: [HUDI-4588][HUDI-4472] Fixing `HoodieParquetReader` to properly specify projected schema when reading Parquet file

2022-10-05 Thread GitBox
hudi-bot commented on PR #6358: URL: https://github.com/apache/hudi/pull/6358#issuecomment-1269127142 ## CI report: * 288d166c49602a4593b1e97763a467811903737d UNKNOWN * 91655a8009d60f6337939f87d6e2e01922877848 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6745: Fix comment in RFC46

2022-10-05 Thread GitBox
hudi-bot commented on PR #6745: URL: https://github.com/apache/hudi/pull/6745#issuecomment-1269065121 ## CI report: * af8e58757bed12e53907076da02add1ba98b220c Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] the-other-tim-brown commented on a diff in pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-10-05 Thread GitBox
the-other-tim-brown commented on code in PR #6806: URL: https://github.com/apache/hudi/pull/6806#discussion_r988360500 ## hudi-utilities/pom.xml: ## @@ -85,7 +85,6 @@ com.google.protobuf protobuf-java-util - test Review Comment: Should I revert that

[GitHub] [hudi] hudi-bot commented on pull request #6745: Fix comment in RFC46

2022-10-05 Thread GitBox
hudi-bot commented on PR #6745: URL: https://github.com/apache/hudi/pull/6745#issuecomment-1268931103 ## CI report: * 7525a09b2415fbf4e5e7de7c71cfffd8afc8c410 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6745: Fix comment in RFC46

2022-10-05 Thread GitBox
hudi-bot commented on PR #6745: URL: https://github.com/apache/hudi/pull/6745#issuecomment-1268925125 ## CI report: * 7525a09b2415fbf4e5e7de7c71cfffd8afc8c410 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6806: [HUDI-4905] Improve type handling in proto schema conversion

2022-10-05 Thread GitBox
hudi-bot commented on PR #6806: URL: https://github.com/apache/hudi/pull/6806#issuecomment-1268918966 ## CI report: * f03f9610cf4e2c490d33ca734ca9b3241b2be778 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1201

[GitHub] [hudi] hudi-bot commented on pull request #6858: [HUDI-4824]RANGE BUCKET index, base logic and test.

2022-10-05 Thread GitBox
hudi-bot commented on PR #6858: URL: https://github.com/apache/hudi/pull/6858#issuecomment-1265746656 ## CI report: * 76d14eb325d62a026248cc5c30de0d415e0c92a2 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] hudi-bot commented on pull request #6858: [HUDI-4824]RANGE BUCKET index, base logic and test.

2022-10-05 Thread GitBox
hudi-bot commented on PR #6858: URL: https://github.com/apache/hudi/pull/6858#issuecomment-1265741058 ## CI report: * 76d14eb325d62a026248cc5c30de0d415e0c92a2 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] wqwl611 commented on pull request #6636: [HUDI-4824]Add new index RANGE_BUCKET , when primary key is auto-increment like most mysql table

2022-10-05 Thread GitBox
wqwl611 commented on PR #6636: URL: https://github.com/apache/hudi/pull/6636#issuecomment-1265711269 I accidentally deleted my previous fork, so I opened a new PR, and modified the code according to the previous review requirements, please help me review it,thanks. @danny0405 @YuweiXiao

[GitHub] [hudi] wqwl611 commented on pull request #6858: [HUDI-4824]RANGE Bucket index, base logic and test.

2022-10-05 Thread GitBox
wqwl611 commented on PR #6858: URL: https://github.com/apache/hudi/pull/6858#issuecomment-1265709727 I accidentally deleted my previous fork, so I opened a new PR, and modified the code according to the previous review requirements, please help me review it,thanks. @danny0405 @YuweiXiao

[GitHub] [hudi] wqwl611 opened a new pull request, #6858: [HUDI-4824]RANGE Bucket index, base logic and test.

2022-10-05 Thread GitBox
wqwl611 opened a new pull request, #6858: URL: https://github.com/apache/hudi/pull/6858 ### Change Logs The rangeBucket is mainly used in the scenario of sync mysql tables to hudi in near real time, which avoids the disadvantage of the fixed number of buckets in simpleBucket. Usua

[GitHub] [hudi] hudi-bot commented on pull request #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
hudi-bot commented on PR #6857: URL: https://github.com/apache/hudi/pull/6857#issuecomment-1265637355 ## CI report: * d4f9276e7a3802fb2df5c3b7e28c224e4a1e7f15 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] codope commented on issue #6332: Avoid creating Configuration copies in Hudi

2022-10-05 Thread GitBox
codope commented on issue #6332: URL: https://github.com/apache/hudi/issues/6332#issuecomment-1265613884 Synced up with @pratyakshsharma regarding this issue. First of all, the issue affects hudi tables queries via presto-hive connector. We need to see if we can use the config provided by t

[GitHub] [hudi] codope closed issue #6332: Avoid creating Configuration copies in Hudi

2022-10-05 Thread GitBox
codope closed issue #6332: Avoid creating Configuration copies in Hudi URL: https://github.com/apache/hudi/issues/6332 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubs

[jira] [Updated] (HUDI-4970) hudi-kafka-connect-bundle: Could not initialize class org.apache.hadoop.security.UserGroupInformation

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4970: - Labels: pull-request-available (was: ) > hudi-kafka-connect-bundle: Could not initialize class >

[GitHub] [hudi] hudi-bot commented on pull request #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
hudi-bot commented on PR #6857: URL: https://github.com/apache/hudi/pull/6857#issuecomment-1265411902 ## CI report: * d4f9276e7a3802fb2df5c3b7e28c224e4a1e7f15 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] hudi-bot commented on pull request #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
hudi-bot commented on PR #6857: URL: https://github.com/apache/hudi/pull/6857#issuecomment-1265405407 ## CI report: * d4f9276e7a3802fb2df5c3b7e28c224e4a1e7f15 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6850: [Draft][HUDI-4964] inline all the getter methods that have no logic …

2022-10-05 Thread GitBox
hudi-bot commented on PR #6850: URL: https://github.com/apache/hudi/pull/6850#issuecomment-1265398895 ## CI report: * 4b1c2e6a4a256989d070a105cdd88ef02aaa8fc1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] codope opened a new pull request, #6857: [HUDI-4970] Update kafka-connect readme and refactor HoodieConfig#create

2022-10-05 Thread GitBox
codope opened a new pull request, #6857: URL: https://github.com/apache/hudi/pull/6857 ### Change Logs Update Kafka-connect setup with some details. Also, remove `HoodieConfig#create`, which is being used only in tests an unnecessarily class loader has to load `org.apache.hadoop.fs.F

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1265314521 ## CI report: * 3b01f5fd8a8be1d5b7dfca7adc882771f7fa787d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1197

[GitHub] [hudi] codope merged pull request #6846: [HUDI-4962] Move cloud dependencies to cloud modules

2022-10-05 Thread GitBox
codope merged PR #6846: URL: https://github.com/apache/hudi/pull/6846 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.or

[GitHub] [hudi] pramodbiligiri commented on pull request #6846: [HUDI-4962] Move cloud dependencies to cloud modules

2022-10-05 Thread GitBox
pramodbiligiri commented on PR #6846: URL: https://github.com/apache/hudi/pull/6846#issuecomment-1265279684 Tested that this build worked locally. Built this branch as follows and ran the sync. Results after sync pasted down below. Build: $ mvn -DskipTests -Dspark3.2 -Dscala-2.12 -Dche

[GitHub] [hudi] hudi-bot commented on pull request #6850: [Draft][HUDI-4964] inline all the getter methods that have no logic …

2022-10-05 Thread GitBox
hudi-bot commented on PR #6850: URL: https://github.com/apache/hudi/pull/6850#issuecomment-1265246778 ## CI report: * 13a464e9dca3394ed7d946c0e682ad02f7edfc43 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=119

[GitHub] [hudi] hudi-bot commented on pull request #6856: [HUDI-4968] Update misleading read.streaming.skip_compaction config

2022-10-05 Thread GitBox
hudi-bot commented on PR #6856: URL: https://github.com/apache/hudi/pull/6856#issuecomment-1265241676 ## CI report: * 7fbe39a558949b0e0e8938546aad96e5ba0c1956 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1196

[GitHub] [hudi] dragonH commented on issue #6832: [SUPPORT] AWS Glue 3.0 fail to write dataset with hudi (hive sync issue)

2022-10-05 Thread GitBox
dragonH commented on issue #6832: URL: https://github.com/apache/hudi/issues/6832#issuecomment-1265226296 @kazdy thanks for the information! wonder if there's better way to avoid this kind of issue e.g. - add a new config property `AWSGlueDataCataglogEnabled` and if it se

[GitHub] [hudi] hudi-bot commented on pull request #6850: [Draft][HUDI-4964] inline all the getter methods that have no logic …

2022-10-05 Thread GitBox
hudi-bot commented on PR #6850: URL: https://github.com/apache/hudi/pull/6850#issuecomment-1265171581 ## CI report: * 13a464e9dca3394ed7d946c0e682ad02f7edfc43 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=119

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1265165246 ## CI report: * b4875afb16a2a8bdd0bce03f518af4fee9ada2a7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1194

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1265159045 ## CI report: * b4875afb16a2a8bdd0bce03f518af4fee9ada2a7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1194

[GitHub] [hudi] hudi-bot commented on pull request #6850: [Draft][HUDI-4964] inline all the getter methods that have no logic …

2022-10-05 Thread GitBox
hudi-bot commented on PR #6850: URL: https://github.com/apache/hudi/pull/6850#issuecomment-1265159167 ## CI report: * e3aef767db19eed24222f8fff89ae4c59d0799c2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1195

[GitHub] [hudi] hudi-bot commented on pull request #6850: [Draft][HUDI-4964] inline all the getter methods that have no logic …

2022-10-05 Thread GitBox
hudi-bot commented on PR #6850: URL: https://github.com/apache/hudi/pull/6850#issuecomment-1265165358 ## CI report: * e3aef767db19eed24222f8fff89ae4c59d0799c2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1195

[GitHub] [hudi] hudi-bot commented on pull request #6856: [HUDI-4968] Update misleading read.streaming.skip_compaction config

2022-10-05 Thread GitBox
hudi-bot commented on PR #6856: URL: https://github.com/apache/hudi/pull/6856#issuecomment-1265078737 ## CI report: * 7fbe39a558949b0e0e8938546aad96e5ba0c1956 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1196

[GitHub] [hudi] voonhous opened a new pull request, #6856: [HUDI-4968] Update misleading read.streaming.skip_compaction config

2022-10-05 Thread GitBox
voonhous opened a new pull request, #6856: URL: https://github.com/apache/hudi/pull/6856 ### Change Logs Update misleading `read.streaming.skip_compaction` config. ### Impact _Describe any public API or user-facing feature change or any performance impact._ **Ris

[GitHub] [hudi] hkszn commented on issue #6718: [SUPPORT] Deltastreamer concurrent writes in continuous mode

2022-10-05 Thread GitBox
hkszn commented on issue #6718: URL: https://github.com/apache/hudi/issues/6718#issuecomment-1265049923 Thank you for your reply. > If you are interested, I can guide you on how to achieve this. Yes, I would like to try it. -- This is an automated message from the Apache Git

[GitHub] [hudi] hudi-bot commented on pull request #6856: [HUDI-4968] Update misleading read.streaming.skip_compaction config

2022-10-05 Thread GitBox
hudi-bot commented on PR #6856: URL: https://github.com/apache/hudi/pull/6856#issuecomment-1265072776 ## CI report: * 7fbe39a558949b0e0e8938546aad96e5ba0c1956 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] kazdy commented on issue #6832: [SUPPORT] AWS Glue 3.0 fail to write dataset with hudi (hive sync issue)

2022-10-05 Thread GitBox
kazdy commented on issue #6832: URL: https://github.com/apache/hudi/issues/6832#issuecomment-1265036326 Btw on emr you'll get the same error because glue client is the same. I got this error when running hudi on emr. -- This is an automated message from the Apache Git Service. To respond

[jira] [Updated] (HUDI-4968) Fix ambiguous stream read config

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4968: - Labels: pull-request-available (was: ) > Fix ambiguous stream read config > -

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1265000971 ## CI report: * b4875afb16a2a8bdd0bce03f518af4fee9ada2a7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1194

[GitHub] [hudi] voonhous opened a new pull request, #6855: [HUDI-4968] Update old config keys

2022-10-05 Thread GitBox
voonhous opened a new pull request, #6855: URL: https://github.com/apache/hudi/pull/6855 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any performance

[GitHub] [hudi] xushiyan commented on pull request #6852: [MINOR] Fix testUpdateRejectForClustering

2022-10-05 Thread GitBox
xushiyan commented on PR #6852: URL: https://github.com/apache/hudi/pull/6852#issuecomment-1264955767 Test fix works. CI failure is irrelevant. Landing this. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abov

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1264952448 ## CI report: * b4875afb16a2a8bdd0bce03f518af4fee9ada2a7 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] xushiyan commented on a diff in pull request #6732: [HUDI-4148] Add client for hudi table management service

2022-10-05 Thread GitBox
xushiyan commented on code in PR #6732: URL: https://github.com/apache/hudi/pull/6732#discussion_r985359891 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieWriteClient.java: ## @@ -1052,13 +875,29 @@ public void dropIndex(List partitionTypes) {

[GitHub] [hudi] xushiyan merged pull request #6852: [MINOR] Fix testUpdateRejectForClustering

2022-10-05 Thread GitBox
xushiyan merged PR #6852: URL: https://github.com/apache/hudi/pull/6852 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[GitHub] [hudi] hudi-bot commented on pull request #6838: [MINOR] Update azure image and balance CI jobs

2022-10-05 Thread GitBox
hudi-bot commented on PR #6838: URL: https://github.com/apache/hudi/pull/6838#issuecomment-1264955736 ## CI report: * b4875afb16a2a8bdd0bce03f518af4fee9ada2a7 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1194

[GitHub] [hudi] xushiyan commented on issue #6281: [SUPPORT] AwsGlueCatalogSyncTool -The number of partition keys do not match the number of partition values

2022-10-05 Thread GitBox
xushiyan commented on issue #6281: URL: https://github.com/apache/hudi/issues/6281#issuecomment-1264943872 @crutis closing this as explained by @yihua . let us know how it works -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] xushiyan closed issue #6281: [SUPPORT] AwsGlueCatalogSyncTool -The number of partition keys do not match the number of partition values

2022-10-05 Thread GitBox
xushiyan closed issue #6281: [SUPPORT] AwsGlueCatalogSyncTool -The number of partition keys do not match the number of partition values URL: https://github.com/apache/hudi/issues/6281 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to Git

[GitHub] [hudi] xushiyan merged pull request #6851: [HUDI-4966] Add a partition extractor to handle partition values with slashes

2022-10-05 Thread GitBox
xushiyan merged PR #6851: URL: https://github.com/apache/hudi/pull/6851 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[GitHub] [hudi] xushiyan commented on pull request #6732: [HUDI-4148] Add client for hudi table management service

2022-10-05 Thread GitBox
xushiyan commented on PR #6732: URL: https://github.com/apache/hudi/pull/6732#issuecomment-1264932668 @yuzhaojing please also fill up the PR description properly. as discussed, a class diagram to show the new hierarchy expedites the review. -- This is an automated message from the Apache

[GitHub] [hudi] zhangyue19921010 commented on a diff in pull request #6003: [HUDI-1575][RFC-56] Early Conflict Detection For Multi-writer

2022-10-05 Thread GitBox
zhangyue19921010 commented on code in PR #6003: URL: https://github.com/apache/hudi/pull/6003#discussion_r985349298 ## rfc/rfc-56/rfc-56.md: ## @@ -0,0 +1,238 @@ + + +# RFC-56: Early Conflict Detection For Multi-writer + +## Proposers + +- @zhangyue19921010 + +## Approvers + +-

[GitHub] [hudi] xushiyan commented on a diff in pull request #4309: [HUDI-3016][RFC-43] Proposal to implement Table Management Service

2022-10-05 Thread GitBox
xushiyan commented on code in PR #4309: URL: https://github.com/apache/hudi/pull/4309#discussion_r985348379 ## rfc/rfc-43/rfc-43.md: ## @@ -0,0 +1,369 @@ + + +# RFC-43: Implement Table Management ServiceTable Management Service for Hudi + +## Proposers + +- @yuzhaojing + +## App

[GitHub] [hudi] yesemsanthoshkumar commented on a diff in pull request #6726: [HUDI-4630] Add transformer capability to individual feeds in MultiTableDeltaStreamer

2022-10-05 Thread GitBox
yesemsanthoshkumar commented on code in PR #6726: URL: https://github.com/apache/hudi/pull/6726#discussion_r985346731 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieMultiTableDeltaStreamer.java: ## @@ -135,6 +135,7 @@ private void populateTableExe

[GitHub] [hudi] zhangyue19921010 commented on pull request #6003: [HUDI-1575][RFC-56] Early Conflict Detection For Multi-writer

2022-10-05 Thread GitBox
zhangyue19921010 commented on PR #6003: URL: https://github.com/apache/hudi/pull/6003#issuecomment-1264887516 Hi @yihua and @pratyakshsharma . Really appreciate for your attention here! Address the comments. PTAL :) -- This is an automated message from the Apache Git Service. To respond t

[GitHub] [hudi] dragonH commented on issue #6832: [SUPPORT] AWS Glue 3.0 fail to write dataset with hudi (hive sync issue)

2022-10-05 Thread GitBox
dragonH commented on issue #6832: URL: https://github.com/apache/hudi/issues/6832#issuecomment-1264882911 hi @codope sure, wiil also do the latest hudi testing with EMR and share th result here thanks for the help hi @kazdy thanks for the help i acknowledg

[GitHub] [hudi] hudi-bot commented on pull request #6854: [HUDI-4631] Adding retries to spark datasource writes on conflict failures:

2022-10-05 Thread GitBox
hudi-bot commented on PR #6854: URL: https://github.com/apache/hudi/pull/6854#issuecomment-1264756277 ## CI report: * 3fd99e92b8be748fa52e025f8bc6bbf6681df359 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1196

[jira] [Updated] (HUDI-4631) Enhance retries for failed writes w/ write conflicts in a multi writer scenarios

2022-10-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4631: - Labels: pull-request-available (was: ) > Enhance retries for failed writes w/ write conflicts in

  1   2   >