[GitHub] [hudi] rohit-m-99 commented on issue #6015: [SUPPORT] Building workload profile failing after upgrade to 0.11.0

2022-07-01 Thread GitBox
rohit-m-99 commented on issue #6015: URL: https://github.com/apache/hudi/issues/6015#issuecomment-1172007450 Was able to successfully run the job by 1. Downgrading from Spark 3.2.1 to 3.1.2 2. Using hadoop version 3.2.0 3. Using hudi-utilities bundle exclusively in the deltastre

[GitHub] [hudi] txl2017 commented on issue #6007: spark query partition field error

2022-07-01 Thread GitBox
txl2017 commented on issue #6007: URL: https://github.com/apache/hudi/issues/6007#issuecomment-1172021461 > @txl2017 could you also provide the full commands for Hive sync and spark-submit/spark-shell for reproducing the issue? 'partition-path-field'='systemdate' 'keygen-class'='Co

[GitHub] [hudi] txl2017 commented on issue #6007: spark query partition field error

2022-07-01 Thread GitBox
txl2017 commented on issue #6007: URL: https://github.com/apache/hudi/issues/6007#issuecomment-1172032212 > > @txl2017 could you also provide the full commands for Hive sync and spark-submit/spark-shell for reproducing the issue? > > 'partition-path-field'='systemdate' 'keygen-class'

[GitHub] [hudi] hudi-bot commented on pull request #5830: [HUDI-3981] Flink engine support for comprehensive schema evolution(RFC-33)

2022-07-01 Thread GitBox
hudi-bot commented on PR #5830: URL: https://github.com/apache/hudi/pull/5830#issuecomment-1172032707 ## CI report: * 88ce744bc98ae26b81b00276a5e289c43519 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9663

[GitHub] [hudi] minihippo commented on a diff in pull request #6010: [DOCS] Add Chinese community call page.

2022-07-01 Thread GitBox
minihippo commented on code in PR #6010: URL: https://github.com/apache/hudi/pull/6010#discussion_r911705064 ## website/i18n/cn/docusaurus-plugin-content-pages/syncs.md: ## @@ -0,0 +1,42 @@ +--- +sidebar_position: 2 +title: "Community Syncs" +toc: true +last_modified_at: 2020-09

[jira] [Created] (HUDI-4348) merge into will cause data quality in concurrent scene

2022-07-01 Thread KnightChess (Jira)
KnightChess created HUDI-4348: - Summary: merge into will cause data quality in concurrent scene Key: HUDI-4348 URL: https://issues.apache.org/jira/browse/HUDI-4348 Project: Apache Hudi Issue Type

[jira] [Assigned] (HUDI-4348) merge into will cause data quality in concurrent scene

2022-07-01 Thread KnightChess (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] KnightChess reassigned HUDI-4348: - Assignee: KnightChess > merge into will cause data quality in concurrent scene >

[GitHub] [hudi] KnightChess opened a new pull request, #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
KnightChess opened a new pull request, #6020: URL: https://github.com/apache/hudi/pull/6020 ## What is the purpose of the pull request Fix the data quality issue in concurrent scenarios when using MERGE INTO ## Brief change log Every executor thread uses its own HoodieAvroDeserializer cach
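
The per-thread idea in the change log above can be illustrated with a minimal sketch. This is not the code from PR #6020 (which is Scala); it only shows the general pattern of giving each thread its own instance of a stateful, non-thread-safe object. `Deserializer` and `get_deserializer` are hypothetical names introduced for illustration.

```python
import threading


class Deserializer:
    """Hypothetical stand-in for a stateful deserializer that is not thread-safe."""

    def __init__(self):
        # Internal scratch buffer that would be corrupted if shared across threads.
        self.buffer = []

    def deserialize(self, record):
        self.buffer.clear()
        self.buffer.extend(record)
        return tuple(self.buffer)


# Each thread sees its own attributes on this object.
_local = threading.local()


def get_deserializer():
    """Return the calling thread's deserializer, creating it on first use."""
    instance = getattr(_local, "deserializer", None)
    if instance is None:
        instance = Deserializer()
        _local.deserializer = instance
    return instance
```

Because every thread lazily creates and then reuses its own instance, no two threads ever write to the same scratch buffer, at the cost of one instance per thread.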

[jira] [Updated] (HUDI-4348) merge into will cause data quality in concurrent scene

2022-07-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4348: - Labels: pull-request-available (was: ) > merge into will cause data quality in concurrent scene >

[GitHub] [hudi] KnightChess commented on pull request #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
KnightChess commented on PR #6020: URL: https://github.com/apache/hudi/pull/6020#issuecomment-1172062602 Caused by #5825. @danny0405, can you help review it? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abov

[GitHub] [hudi] hudi-bot commented on pull request #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
hudi-bot commented on PR #6020: URL: https://github.com/apache/hudi/pull/6020#issuecomment-1172092263 ## CI report: * 5980587e9a5aaa22da33c220af7624b3588ca468 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
hudi-bot commented on PR #6020: URL: https://github.com/apache/hudi/pull/6020#issuecomment-1172095987 ## CI report: * 5980587e9a5aaa22da33c220af7624b3588ca468 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9665

[jira] [Created] (HUDI-4349) Recover table config in flink job startup

2022-07-01 Thread konwu (Jira)
konwu created HUDI-4349: --- Summary: Recover table config in flink job startup Key: HUDI-4349 URL: https://issues.apache.org/jira/browse/HUDI-4349 Project: Apache Hudi Issue Type: Bug Reporte

[jira] [Created] (HUDI-4350) reduce the shuffle work when we just insert but not update and delete

2022-07-01 Thread chenfengLiu (Jira)
chenfengLiu created HUDI-4350: - Summary: reduce the shuffle work when we just insert but not update and delete Key: HUDI-4350 URL: https://issues.apache.org/jira/browse/HUDI-4350 Project: Apache Hudi

[jira] [Assigned] (HUDI-4350) reduce the shuffle work when we just insert but not update and delete

2022-07-01 Thread chenfengLiu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chenfengLiu reassigned HUDI-4350: - Assignee: chenfengLiu > reduce the shuffle work when we just insert but not update and delete > -

[jira] [Closed] (HUDI-4349) Recover table config in flink job startup

2022-07-01 Thread konwu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] konwu closed HUDI-4349. --- Resolution: Not A Problem > Recover table config in flink job startup > - > >

[GitHub] [hudi] KnightChess commented on issue #6001: [SUPPORT] Cannot create again after deleting the Hudi external table using Spark SQL

2022-07-01 Thread GitBox
KnightChess commented on issue #6001: URL: https://github.com/apache/hudi/issues/6001#issuecomment-1172115069 Try `Create Table for an existing Hudi Table` if you did not use the `purge` parameter in the DROP statement: https://user-images.githubusercontent.com/20125927/176862535-cdc8c32a-3c70-41f1-b75b-6a8ee03

[GitHub] [hudi] KnightChess commented on pull request #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
KnightChess commented on PR #6020: URL: https://github.com/apache/hudi/pull/6020#issuecomment-1172140116 I added logging to check it: the joinSqlRecord sometimes contains certain fields from the target and source, and sometimes they are completely inconsistent (key columns too) ```scala

[GitHub] [hudi] codope commented on a diff in pull request #5695: [HUDI-3730][RFC-55] Improve metasync class design and simplify configs

2022-07-01 Thread GitBox
codope commented on code in PR #5695: URL: https://github.com/apache/hudi/pull/5695#discussion_r911823576 ## rfc/rfc-55/rfc-55.md: ## @@ -0,0 +1,148 @@ + +# RFC-55: Improve metasync class design and simplify configs + +## Proposers + +- @ +- @ + +## Approvers + + - @ + - @ + +##

[GitHub] [hudi] BruceKellan commented on pull request #5953: [HUDI-4314] Improve the performance of reading from the specified ins…

2022-07-01 Thread GitBox
BruceKellan commented on PR #5953: URL: https://github.com/apache/hudi/pull/5953#issuecomment-1172187660 @hudi-bot run azure

[GitHub] [hudi] hudi-bot commented on pull request #5953: [HUDI-4314] Improve the performance of reading from the specified ins…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5953: URL: https://github.com/apache/hudi/pull/5953#issuecomment-1172207806 ## CI report: * 167fc6a9792aef5818bcf5d6f92c993d0a5c8352 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9654

[GitHub] [hudi] hudi-bot commented on pull request #6020: [HUDI-4348] fix merge into sql data quality in concurrent scene

2022-07-01 Thread GitBox
hudi-bot commented on PR #6020: URL: https://github.com/apache/hudi/pull/6020#issuecomment-1172207950 ## CI report: * 5980587e9a5aaa22da33c220af7624b3588ca468 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9665

[GitHub] [hudi] hudi-bot commented on pull request #5953: [HUDI-4314] Improve the performance of reading from the specified ins…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5953: URL: https://github.com/apache/hudi/pull/5953#issuecomment-1172211262 ## CI report: * 167fc6a9792aef5818bcf5d6f92c993d0a5c8352 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9654

[GitHub] [hudi] hudi-bot commented on pull request #5445: [HUDI-3953]Flink Hudi module should support low-level source and sink…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5445: URL: https://github.com/apache/hudi/pull/5445#issuecomment-1172213823 ## CI report: * 2b169ed10813e97c04608f0ec5ddd070b28d06d0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9334

[GitHub] [hudi] hudi-bot commented on pull request #5953: [HUDI-4314] Improve the performance of reading from the specified ins…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5953: URL: https://github.com/apache/hudi/pull/5953#issuecomment-1172214284 ## CI report: * 167fc6a9792aef5818bcf5d6f92c993d0a5c8352 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9654

[GitHub] [hudi] hudi-bot commented on pull request #5445: [HUDI-3953]Flink Hudi module should support low-level source and sink…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5445: URL: https://github.com/apache/hudi/pull/5445#issuecomment-1172216827 ## CI report: * 2b169ed10813e97c04608f0ec5ddd070b28d06d0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9334

[GitHub] [hudi] JerryYue-M commented on pull request #5445: [HUDI-3953]Flink Hudi module should support low-level source and sink…

2022-07-01 Thread GitBox
JerryYue-M commented on PR #5445: URL: https://github.com/apache/hudi/pull/5445#issuecomment-117884 @danny0405 This PR is ready to merge

[GitHub] [hudi] todd5167 opened a new issue, #6021: [SUPPORT] flink write rollback error, [Cannot use marker based rollback strategy on completed instant ]

2022-07-01 Thread GitBox
todd5167 opened a new issue, #6021: URL: https://github.com/apache/hudi/issues/6021 flink version: flink1.13 hudi-version: 0.10.0-patch The flink job has been running normally for over 20 days. Has the bug been fixed? @danny0405 ``` 2022-07-01 17:16:41 org.apache.

[GitHub] [hudi] Nazerra opened a new issue, #6022: [SUPPORT] ALL PARQUET FILES FROM BASE PATH GOT DELETED BY CLEANER

2022-07-01 Thread GitBox
Nazerra opened a new issue, #6022: URL: https://github.com/apache/hudi/issues/6022 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at dev-subscr..

[hudi] branch revert-5998-asf-site created (now a0edab522a)

2022-07-01 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a change to branch revert-5998-asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git at a0edab522a Revert "[DOCS] Remove duplicate faq page (#5998)" This branch includes the following new commi

[hudi] 01/01: Revert "[DOCS] Remove duplicate faq page (#5998)"

2022-07-01 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch revert-5998-asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git commit a0edab522abdb813b3b6e6f5f24e99d9c808d43a Author: Sagar Sumit AuthorDate: Fri Jul 1 16:42:48 2022 +0530

[GitHub] [hudi] codope opened a new pull request, #6023: Revert "[DOCS] Remove duplicate faq page"

2022-07-01 Thread GitBox
codope opened a new pull request, #6023: URL: https://github.com/apache/hudi/pull/6023 Reverts apache/hudi#5998 Reason: recheck if all internal redirects are working fine.

[GitHub] [hudi] codope merged pull request #6023: Revert "[DOCS] Remove duplicate faq page"

2022-07-01 Thread GitBox
codope merged PR #6023: URL: https://github.com/apache/hudi/pull/6023

[hudi] branch asf-site updated: Revert "[DOCS] Remove duplicate faq page (#5998)" (#6023)

2022-07-01 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 8501c7206a Revert "[DOCS] Remove duplicate faq

[GitHub] [hudi] codope commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
codope commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r910986567 ## hudi-common/src/main/java/org/apache/hudi/common/config/HoodieConfig.java: ## @@ -133,6 +135,14 @@ public String getString(ConfigProperty configProperty) { return

[GitHub] [hudi] hudi-bot commented on pull request #5953: [HUDI-4314] Improve the performance of reading from the specified ins…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5953: URL: https://github.com/apache/hudi/pull/5953#issuecomment-1172270186 ## CI report: * 80640cedd8f087691b209c85c857c72ebf8fd855 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9666

[GitHub] [hudi] todd5167 closed issue #6021: [SUPPORT] flink write rollback error, [Cannot use marker based rollback strategy on completed instant ]

2022-07-01 Thread GitBox
todd5167 closed issue #6021: [SUPPORT] flink write rollback error, [Cannot use marker based rollback strategy on completed instant ] URL: https://github.com/apache/hudi/issues/6021

[GitHub] [hudi] hudi-bot commented on pull request #5445: [HUDI-3953]Flink Hudi module should support low-level source and sink…

2022-07-01 Thread GitBox
hudi-bot commented on PR #5445: URL: https://github.com/apache/hudi/pull/5445#issuecomment-1172311989 ## CI report: * a04b159f6d6d9ebac6d129f40a6679917fffbd80 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9667

[GitHub] [hudi] XuQianJin-Stars commented on pull request #6012: [HUDI-3505] Add call procedure for UpgradeOrDowngradeCommand

2022-07-01 Thread GitBox
XuQianJin-Stars commented on PR #6012: URL: https://github.com/apache/hudi/pull/6012#issuecomment-1172317791 @hudi-bot run azure

[GitHub] [hudi] chenshzh commented on pull request #5913: [HUDI-4287] Optimize Flink checkpoint meta mechanism to fix mistaken pending instants

2022-07-01 Thread GitBox
chenshzh commented on PR #5913: URL: https://github.com/apache/hudi/pull/5913#issuecomment-1172318760 @danny0405 I have updated the PR; please review it to see whether it's OK.

[GitHub] [hudi] hudi-bot commented on pull request #6012: [HUDI-3505] Add call procedure for UpgradeOrDowngradeCommand

2022-07-01 Thread GitBox
hudi-bot commented on PR #6012: URL: https://github.com/apache/hudi/pull/6012#issuecomment-1172320225 ## CI report: * 96817f1d66a9ba4ed15654bf660c57daacf24344 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9660

[GitHub] [hudi] hudi-bot commented on pull request #6012: [HUDI-3505] Add call procedure for UpgradeOrDowngradeCommand

2022-07-01 Thread GitBox
hudi-bot commented on PR #6012: URL: https://github.com/apache/hudi/pull/6012#issuecomment-1172324534 ## CI report: * 96817f1d66a9ba4ed15654bf660c57daacf24344 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9660

[GitHub] [hudi] kazdy commented on issue #5452: Schema Evolution: Missing column for previous records when new entry does not have the same while upsert.

2022-07-01 Thread GitBox
kazdy commented on issue #5452: URL: https://github.com/apache/hudi/issues/5452#issuecomment-1172376069 hi @santoshsb, take a look at this nice PR by @xiarixiaoyao: https://github.com/apache/hudi/pull/6017

[GitHub] [hudi] kazdy commented on issue #6022: [SUPPORT] ALL PARQUET FILES FROM BASE PATH GOT DELETED BY CLEANER

2022-07-01 Thread GitBox
kazdy commented on issue #6022: URL: https://github.com/apache/hudi/issues/6022#issuecomment-1172511714 As discussed in slack, OCC was not enabled. Let's try with OCC enabled since these two jobs run in parallel.
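
As a rough illustration of what "OCC enabled" could look like for two writers running in parallel, here is a minimal sketch of the write options, assuming a ZooKeeper-based lock provider. The thread does not say which lock provider the reporter would use, and the ZooKeeper host, port, lock key, and base path below are placeholder values, not taken from the issue.

```python
# Write options for optimistic concurrency control (OCC) between parallel writers.
# Assumption: a ZooKeeper-based lock provider; all ZooKeeper values are placeholders.
occ_options = {
    "hoodie.write.concurrency.mode": "optimistic_concurrency_control",
    "hoodie.cleaner.policy.failed.writes": "LAZY",
    "hoodie.write.lock.provider":
        "org.apache.hudi.client.transaction.lock.ZookeeperBasedLockProvider",
    "hoodie.write.lock.zookeeper.url": "zk-host.example.com",
    "hoodie.write.lock.zookeeper.port": "2181",
    "hoodie.write.lock.zookeeper.lock_key": "my_table",
    "hoodie.write.lock.zookeeper.base_path": "/hudi/locks",
}


def with_occ(base_options):
    """Return a copy of a writer's options with the OCC settings merged in."""
    merged = dict(base_options)
    merged.update(occ_options)
    return merged
```

Both jobs would need to write with the same lock settings; in PySpark the merged dictionary could be passed along the lines of `df.write.format("hudi").options(**with_occ(job_options))`, where `job_options` stands for each job's existing options.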

[GitHub] [hudi] danny0405 commented on pull request #5366: [HUDI-1176] Upgrade hudi to log4j2

2022-07-01 Thread GitBox
danny0405 commented on PR #5366: URL: https://github.com/apache/hudi/pull/5366#issuecomment-1172538092 After this change, I cannot see any logs when executing the tests locally in IDEA. Can you fix that? Without logs, I cannot see the execution details and so cannot debug effectively.

[GitHub] [hudi] Gatsby-Lee opened a new issue, #6024: [SUPPORT] DELETE_PARTITION causes AWS Athena Query failure

2022-07-01 Thread GitBox
Gatsby-Lee opened a new issue, #6024: URL: https://github.com/apache/hudi/issues/6024 **Describe the problem you faced** A clear and concise description of the problem. **To Reproduce** Steps to reproduce the behavior: 1. DELETE_PARTITION for non-existing partition

[jira] [Closed] (HUDI-4006) Fail on data loss semantics for deltastreamer Kafka sources

2022-07-01 Thread Ji Qi (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ji Qi closed HUDI-4006. --- Resolution: Fixed > Fail on data loss semantics for deltastreamer Kafka sources >

[GitHub] [hudi] XuQianJin-Stars merged pull request #5445: [HUDI-3953]Flink Hudi module should support low-level source and sink…

2022-07-01 Thread GitBox
XuQianJin-Stars merged PR #5445: URL: https://github.com/apache/hudi/pull/5445

[hudi] branch master updated (62a0c962ac -> bdf73b2650)

2022-07-01 Thread forwardxu
This is an automated email from the ASF dual-hosted git repository. forwardxu pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 62a0c962ac [HUDI-3634] Could read empty or partial HoodieCommitMetaData in downstream if using HDFS (#5048) add

[jira] [Resolved] (HUDI-3953) Flink Hudi module should support low-level read and write APIs

2022-07-01 Thread Forward Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Forward Xu resolved HUDI-3953. -- > Flink Hudi module should support low-level read and write APIs >

[GitHub] [hudi] leesf commented on a diff in pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-01 Thread GitBox
leesf commented on code in PR #6017: URL: https://github.com/apache/hudi/pull/6017#discussion_r912308590 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/BaseHoodieWriteClient.java: ## @@ -276,15 +276,21 @@ private void saveInternalSchema(HoodieTable table,

[GitHub] [hudi] LinMingQiang commented on a diff in pull request #5997: [HUDI-4338] resolve the data skew when using flink datastream write hudi

2022-07-01 Thread GitBox
LinMingQiang commented on code in PR #5997: URL: https://github.com/apache/hudi/pull/5997#discussion_r912314810 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/utils/Pipelines.java: ## @@ -330,17 +330,23 @@ public static DataStream hoodieStreamWrite(Confi

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912314998 ## hudi-common/src/main/java/org/apache/hudi/common/config/HoodieConfig.java: ## @@ -133,6 +135,14 @@ public String getString(ConfigProperty configProperty) { retur

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912315441 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieSyncConfig.java: ## @@ -165,24 +140,85 @@ public class HoodieSyncConfig extends HoodieConfig

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912315525 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieSyncConfig.java: ## @@ -165,24 +140,85 @@ public class HoodieSyncConfig extends HoodieConfig

[GitHub] [hudi] LinMingQiang commented on a diff in pull request #5997: [HUDI-4338] resolve the data skew when using flink datastream write hudi

2022-07-01 Thread GitBox
LinMingQiang commented on code in PR #5997: URL: https://github.com/apache/hudi/pull/5997#discussion_r912315747 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/utils/Pipelines.java: ## @@ -330,17 +330,23 @@ public static DataStream hoodieStreamWrite(Confi

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912315896 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieSyncClient.java: ## @@ -0,0 +1,150 @@ +/* + * Licensed to the Apache Software Foundation (ASF

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912316292 ## hudi-gcp/src/main/java/org/apache/hudi/gcp/bigquery/BigQuerySyncTool.java: ## @@ -106,14 +119,19 @@ private void syncCoWTable(HoodieBigQuerySyncClient bqSyncClient) {

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912316766 ## hudi-sync/hudi-adb-sync/src/main/java/org/apache/hudi/sync/adb/AdbSyncConfig.java: ## @@ -152,89 +109,101 @@ public class AdbSyncConfig extends HoodieSyncConfig {

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912317299 ## hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/ddl/QueryBasedDDLExecutor.java: ## @@ -90,11 +98,11 @@ public void createTable(String tableName, MessageType

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912320024 ## hudi-sync/hudi-adb-sync/src/main/java/org/apache/hudi/sync/adb/HoodieAdbJdbcClient.java: ## @@ -405,26 +423,69 @@ private String constructShowCreateDatabaseSql(String

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912321105 ## hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/ddl/HMSDDLExecutor.java: ## @@ -241,20 +248,20 @@ public void dropPartitionsToTable(String tableName, List

[GitHub] [hudi] xushiyan commented on a diff in pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
xushiyan commented on code in PR #5854: URL: https://github.com/apache/hudi/pull/5854#discussion_r912321557 ## hudi-gcp/src/main/java/org/apache/hudi/gcp/bigquery/BigQuerySyncTool.java: ## @@ -40,26 +47,28 @@ * * @Experimental */ -public class BigQuerySyncTool extends Abst

[GitHub] [hudi] hudi-bot commented on pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
hudi-bot commented on PR #5854: URL: https://github.com/apache/hudi/pull/5854#issuecomment-1172841917 ## CI report: * 5ed4a8fb1d9fe9c84f5c9078f8842579990903a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9646

[GitHub] [hudi] hudi-bot commented on pull request #5854: [HUDI-3730] Improve meta sync class design and hierarchies

2022-07-01 Thread GitBox
hudi-bot commented on PR #5854: URL: https://github.com/apache/hudi/pull/5854#issuecomment-1172842585 ## CI report: * 5ed4a8fb1d9fe9c84f5c9078f8842579990903a0 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9646

[jira] [Created] (HUDI-4351) Improve HoodieFlinkCompactor

2022-07-01 Thread Forward Xu (Jira)
Forward Xu created HUDI-4351: Summary: Improve HoodieFlinkCompactor Key: HUDI-4351 URL: https://issues.apache.org/jira/browse/HUDI-4351 Project: Apache Hudi Issue Type: Improvement Comp

[GitHub] [hudi] XuQianJin-Stars opened a new pull request, #6025: [HUDI-4351] Improve HoodieFlinkCompactor

2022-07-01 Thread GitBox
XuQianJin-Stars opened a new pull request, #6025: URL: https://github.com/apache/hudi/pull/6025 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Updated] (HUDI-4351) Improve HoodieFlinkCompactor

2022-07-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4351: - Labels: pull-request-available (was: ) > Improve HoodieFlinkCompactor > -

[GitHub] [hudi] hudi-bot commented on pull request #6025: [HUDI-4351] Improve HoodieFlinkCompactor

2022-07-01 Thread GitBox
hudi-bot commented on PR #6025: URL: https://github.com/apache/hudi/pull/6025#issuecomment-1172844426 ## CI report: * 9b3fc9d9e9767eee22c18ad4010788270828c42a UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Created] (HUDI-4352) Can't find valid constructor for FSDataOutputStream when compile hudi base on hadoop3.1.0

2022-07-01 Thread Leo zhang (Jira)
Leo zhang created HUDI-4352: --- Summary: Can't find valid constructor for FSDataOutputStream when compile hudi base on hadoop3.1.0 Key: HUDI-4352 URL: https://issues.apache.org/jira/browse/HUDI-4352 Project:

[jira] [Created] (HUDI-4353) Column stats data skipping for flink

2022-07-01 Thread Danny Chen (Jira)
Danny Chen created HUDI-4353: Summary: Column stats data skipping for flink Key: HUDI-4353 URL: https://issues.apache.org/jira/browse/HUDI-4353 Project: Apache Hudi Issue Type: New Feature

[GitHub] [hudi] danny0405 opened a new pull request, #6026: [HUDI-4353] Column stats data skipping for flink

2022-07-01 Thread GitBox
danny0405 opened a new pull request, #6026: URL: https://github.com/apache/hudi/pull/6026 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[jira] [Updated] (HUDI-4353) Column stats data skipping for flink

2022-07-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4353: - Labels: pull-request-available (was: ) > Column stats data skipping for flink > -

[GitHub] [hudi] danny0405 commented on issue #5460: org.apache.hudi.exception.HoodieRemoteException: status code: 500, reason phrase: Server Error

2022-07-01 Thread GitBox
danny0405 commented on issue #5460: URL: https://github.com/apache/hudi/issues/5460#issuecomment-1172846821 We recently added a retry for remote requests; maybe it can help you: https://github.com/apache/hudi/pull/5884#issuecomment-1171983624