[GitHub] [hudi] lanyuanxiaoyao commented on a diff in pull request #5677: [HUDI-4152] Flink offline compaction support compacting multi compaction plan at once

2022-07-05 Thread GitBox
lanyuanxiaoyao commented on code in PR #5677: URL: https://github.com/apache/hudi/pull/5677#discussion_r914479792 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/compact/strategy/InstantCompactionPlanSelectStrategy.java: ## @@ -0,0 +1,40 @@ +/* + * License

[GitHub] [hudi] RoderickAdriance commented on issue #5765: [SUPPORT] throw "java.lang.NoSuchMethodError: org.apache.hadoop.hdfs.client.HdfsDataInputStream.getReadStatistics()"

2022-07-05 Thread GitBox
RoderickAdriance commented on issue #5765: URL: https://github.com/apache/hudi/issues/5765#issuecomment-1175839617 @yihua I use Hadoop3 and spark2 this problem will be resolved. So I think this problem is caused by the incompatibility between Hudi jar package and spark3 package. -- Th

[GitHub] [hudi] RoderickAdriance commented on issue #5765: [SUPPORT] throw "java.lang.NoSuchMethodError: org.apache.hadoop.hdfs.client.HdfsDataInputStream.getReadStatistics()"

2022-07-05 Thread GitBox
RoderickAdriance commented on issue #5765: URL: https://github.com/apache/hudi/issues/5765#issuecomment-1175836187 @yihua I use Hadoop3 and spark2 this problem will be resolved. So I think HFile classes is not compatible with spark2. -- This is an automated message from the Apache Git S

[GitHub] [hudi] shqiprimbkodelabs opened a new issue, #6052: [SUPPORT] HoodieRollbackException when starting Flink Job on existing Hudi Table

2022-07-05 Thread GitBox
shqiprimbkodelabs opened a new issue, #6052: URL: https://github.com/apache/hudi/issues/6052 I am using Hudi with Flink and when I submit the job it fails to rollback. This is issue happens also if any of the checkpoints fails. `Caused by: org.apache.flink.util.FlinkException: Global fai

[jira] [Commented] (HUDI-4357) Support flink 1.15.x

2022-07-05 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17562957#comment-17562957 ] Danny Chen commented on HUDI-4357: -- Fixed via master branch: 7eeaff9ee0ee12e93e6bd7a6e8fa

[jira] [Resolved] (HUDI-4357) Support flink 1.15.x

2022-07-05 Thread Danny Chen (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Danny Chen resolved HUDI-4357. -- > Support flink 1.15.x > > > Key: HUDI-4357 > URL: http

[hudi] branch master updated (b18c32379f -> 7eeaff9ee0)

2022-07-05 Thread danny0405
This is an automated email from the ASF dual-hosted git repository. danny0405 pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from b18c32379f [HUDI-4219] Merge Into when update expression "col=s.col+2" on precombine cause exception (#5828) ad

[GitHub] [hudi] danny0405 merged pull request #6050: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
danny0405 merged PR #6050: URL: https://github.com/apache/hudi/pull/6050 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache

[GitHub] [hudi] hudi-bot commented on pull request #6051: [HUDI-4366] Synchronous cleaning for flink bounded source

2022-07-05 Thread GitBox
hudi-bot commented on PR #6051: URL: https://github.com/apache/hudi/pull/6051#issuecomment-1175784579 ## CI report: * f825ff8c71e8912a6b656bbb51789d05e49871ce Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9741

[GitHub] [hudi] hudi-bot commented on pull request #5995: [HUDI-4335] Bug fixes in AWSGlueCatalogSyncClient post schema evolution.

2022-07-05 Thread GitBox
hudi-bot commented on PR #5995: URL: https://github.com/apache/hudi/pull/5995#issuecomment-1175784512 ## CI report: * f06a5460500b4a12fa14b752de26b6eddc270ebe Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9610

[GitHub] [hudi] hudi-bot commented on pull request #6051: [HUDI-4366] Synchronous cleaning for flink bounded source

2022-07-05 Thread GitBox
hudi-bot commented on PR #6051: URL: https://github.com/apache/hudi/pull/6051#issuecomment-1175782571 ## CI report: * f825ff8c71e8912a6b656bbb51789d05e49871ce UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #5995: [HUDI-4335] Bug fixes in AWSGlueCatalogSyncClient post schema evolution.

2022-07-05 Thread GitBox
hudi-bot commented on PR #5995: URL: https://github.com/apache/hudi/pull/5995#issuecomment-1175782509 ## CI report: * f06a5460500b4a12fa14b752de26b6eddc270ebe Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9610

[GitHub] [hudi] yihua commented on issue #5979: [SUPPORT]the hudi's table of join can not handle delete operation.But simple table is ok.why?

2022-07-05 Thread GitBox
yihua commented on issue #5979: URL: https://github.com/apache/hudi/issues/5979#issuecomment-1175781714 cc @yuzhaojing -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To u

[GitHub] [hudi] yihua commented on issue #5984: [SUPPORT] Error on GlobalSortPartitioner using 0.9.0

2022-07-05 Thread GitBox
yihua commented on issue #5984: URL: https://github.com/apache/hudi/issues/5984#issuecomment-1175781226 cc @minihippo -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [hudi] yihua commented on issue #5985: [SUPPORT] Hudi upsert fails with java.lang.ClassCastException: optional binary xx (STRING) is not a group

2022-07-05 Thread GitBox
yihua commented on issue #5985: URL: https://github.com/apache/hudi/issues/5985#issuecomment-1175781069 cc @minihippo -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [hudi] yihua commented on issue #5989: [SUPPORT] Schema Evolution Issue - New columns are not showing up in Spark-SQL.

2022-07-05 Thread GitBox
yihua commented on issue #5989: URL: https://github.com/apache/hudi/issues/5989#issuecomment-1175780819 cc @minihippo @xiarixiaoyao -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

[GitHub] [hudi] yihua commented on issue #6007: spark query partition field error

2022-07-05 Thread GitBox
yihua commented on issue #6007: URL: https://github.com/apache/hudi/issues/6007#issuecomment-1175779565 cc @minihippo -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

[GitHub] [hudi] yihua commented on issue #6011: [SUPPORT] HoodieFlinkCompactor failed

2022-07-05 Thread GitBox
yihua commented on issue #6011: URL: https://github.com/apache/hudi/issues/6011#issuecomment-1175779210 cc @yuzhaojing -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To u

[GitHub] [hudi] yihua commented on issue #6014: [SUPPORT] High runtime for a batch in SparkWriteHelper stage

2022-07-05 Thread GitBox
yihua commented on issue #6014: URL: https://github.com/apache/hudi/issues/6014#issuecomment-1175779034 cc @minihippo @xiarixiaoyao -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

[GitHub] [hudi] kumudkumartirupati commented on pull request #5995: [HUDI-4335] Bug fixes in AWSGlueCatalogSyncClient post schema evolution.

2022-07-05 Thread GitBox
kumudkumartirupati commented on PR #5995: URL: https://github.com/apache/hudi/pull/5995#issuecomment-1175778645 Conflicts resolved -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [hudi] kumud-hs commented on pull request #5995: [HUDI-4335] Bug fixes in AWSGlueCatalogSyncClient post schema evolution.

2022-07-05 Thread GitBox
kumud-hs commented on PR #5995: URL: https://github.com/apache/hudi/pull/5995#issuecomment-1175778248 Conflicts resolved. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[jira] [Updated] (HUDI-4366) Synchronous cleaning for flink bounded source

2022-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4366: - Labels: pull-request-available (was: ) > Synchronous cleaning for flink bounded source >

[GitHub] [hudi] danny0405 opened a new pull request, #6051: [HUDI-4366] Synchronous cleaning for flink bounded source

2022-07-05 Thread GitBox
danny0405 opened a new pull request, #6051: URL: https://github.com/apache/hudi/pull/6051 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[jira] [Created] (HUDI-4366) Synchronous cleaning for flink bounded source

2022-07-05 Thread Danny Chen (Jira)
Danny Chen created HUDI-4366: Summary: Synchronous cleaning for flink bounded source Key: HUDI-4366 URL: https://issues.apache.org/jira/browse/HUDI-4366 Project: Apache Hudi Issue Type: New Featu

[GitHub] [hudi] danny0405 commented on pull request #6050: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
danny0405 commented on PR #6050: URL: https://github.com/apache/hudi/pull/6050#issuecomment-1175759080 The filed module `hudi-integ-test` has no relationship with this PR's change, and it succeed in the build history. -- This is an automated message from the Apache Git Service. To respond

[GitHub] [hudi] hudi-bot commented on pull request #6050: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6050: URL: https://github.com/apache/hudi/pull/6050#issuecomment-1175747079 ## CI report: * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9739

[GitHub] [hudi] Aiden-Dong commented on pull request #5963: [HUDI-4300] Add sync clean and archive for compaction service in Spark Env

2022-07-05 Thread GitBox
Aiden-Dong commented on PR #5963: URL: https://github.com/apache/hudi/pull/5963#issuecomment-1175746885 @danny0405 Hello, can you take a moment to check this pr? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] Aiden-Dong commented on pull request #5945: [HUDI-4308] READ_OPTIMIZED read mode will temporary loss of data when compaction

2022-07-05 Thread GitBox
Aiden-Dong commented on PR #5945: URL: https://github.com/apache/hudi/pull/5945#issuecomment-1175746309 @danny0405 Hello, can you take a moment to check this pr? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [hudi] liufangqi commented on a diff in pull request #5997: [HUDI-4338] resolve the data skew when using flink datastream write hudi

2022-07-05 Thread GitBox
liufangqi commented on code in PR #5997: URL: https://github.com/apache/hudi/pull/5997#discussion_r914375846 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/utils/Pipelines.java: ## @@ -330,17 +330,23 @@ public static DataStream hoodieStreamWrite(Configur

[GitHub] [hudi] xiarixiaoyao commented on pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-05 Thread GitBox
xiarixiaoyao commented on PR #6017: URL: https://github.com/apache/hudi/pull/6017#issuecomment-1175730234 @leesf fixed the comment and UT. could you pls help review again, thanks -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitH

[GitHub] [hudi] hudi-bot commented on pull request #6050: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6050: URL: https://github.com/apache/hudi/pull/6050#issuecomment-1175725791 ## CI report: * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] danny0405 opened a new pull request, #6050: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
danny0405 opened a new pull request, #6050: URL: https://github.com/apache/hudi/pull/6050 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the purpo

[GitHub] [hudi] danny0405 closed pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
danny0405 closed pull request #6036: [HUDI-4357] Support flink 1.15.x URL: https://github.com/apache/hudi/pull/6036 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscri

[GitHub] [hudi] hudi-bot commented on pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6036: URL: https://github.com/apache/hudi/pull/6036#issuecomment-1175719724 ## CI report: * ca25b34f04a0bf02daf2c2198fe3db5ede544129 UNKNOWN * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] danny0405 commented on a diff in pull request #5997: [HUDI-4338] resolve the data skew when using flink datastream write hudi

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5997: URL: https://github.com/apache/hudi/pull/5997#discussion_r914365747 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/utils/Pipelines.java: ## @@ -330,17 +330,23 @@ public static DataStream hoodieStreamWrite(Configur

[GitHub] [hudi] hudi-bot commented on pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6036: URL: https://github.com/apache/hudi/pull/6036#issuecomment-1175717646 ## CI report: * ca25b34f04a0bf02daf2c2198fe3db5ede544129 UNKNOWN * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914361984 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/format/mor/MergeOnReadTableState.java: ## @@ -82,6 +85,10 @@ public int getOperationPos() { r

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914361663 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/table/format/mor/MergeOnReadInputFormat.java: ## @@ -649,7 +656,8 @@ static class MergeIterator implemen

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914360633 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/configuration/FlinkOptions.java: ## @@ -283,6 +284,13 @@ private FlinkOptions() { .withDescriptio

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914360207 ## hudi-common/src/main/java/org/apache/hudi/common/util/HoodieRecordUtils.java: ## @@ -0,0 +1,68 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one +

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914359578 ## hudi-common/src/main/java/org/apache/hudi/common/table/log/AbstractHoodieLogRecordReader.java: ## @@ -160,6 +162,7 @@ protected AbstractHoodieLogRecordReader(FileSyste

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914359119 ## hudi-common/src/main/java/org/apache/hudi/common/table/HoodieTableConfig.java: ## @@ -154,6 +155,12 @@ public class HoodieTableConfig extends HoodieConfig { .wi

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914358079 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieRecord.java: ## @@ -169,15 +179,17 @@ public HoodieOperation getOperation() { return operation; }

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914357550 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieRecord.java: ## @@ -131,25 +131,35 @@ public String getFieldName() { */ private HoodieOperation ope

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914355923 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieMerge.java: ## @@ -0,0 +1,38 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914355454 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieAvroRecordMerge.java: ## @@ -0,0 +1,57 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under o

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914352505 ## hudi-client/hudi-client-common/src/test/java/org/apache/hudi/testutils/HoodieWriteableTestTable.java: ## @@ -44,13 +50,6 @@ import org.apache.hudi.io.storage.HoodieOr

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914352001 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieWriteHandle.java: ## @@ -103,6 +106,7 @@ protected HoodieWriteHandle(HoodieWriteConfig config, S

[GitHub] [hudi] danny0405 commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
danny0405 commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914350928 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieWriteConfig.java: ## @@ -123,6 +124,12 @@ public class HoodieWriteConfig extends HoodieConfig

[GitHub] [hudi] danny0405 commented on pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
danny0405 commented on PR #6036: URL: https://github.com/apache/hudi/pull/6036#issuecomment-1175694155 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. T

[hudi] branch master updated (3670e82af5 -> b18c32379f)

2022-07-05 Thread mengtao
This is an automated email from the ASF dual-hosted git repository. mengtao pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 3670e82af5 [HUDI-4356] Fix the error when sync hive in CTAS (#6029) add b18c32379f [HUDI-4219] Merge Into when upd

[GitHub] [hudi] xiarixiaoyao merged pull request #5828: [HUDI-4219] Merge Into when update expression "col=s.col+2" on precombine cause exception

2022-07-05 Thread GitBox
xiarixiaoyao merged PR #5828: URL: https://github.com/apache/hudi/pull/5828 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apa

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175632658 ## CI report: * 48dc9f122fb2e119ed8784cdd1bd8a8412e633bf Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9737

[GitHub] [hudi] xushiyan commented on pull request #5995: [HUDI-4335] Bug fixes in AWSGlueCatalogSyncClient post schema evolution.

2022-07-05 Thread GitBox
xushiyan commented on PR #5995: URL: https://github.com/apache/hudi/pull/5995#issuecomment-1175615060 @kumudkumartirupati thanks for the fix. could you resolve the conflict pls? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175599033 ## CI report: * 1c9a917927cf0339a0271e04d474e91ac89254c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9736

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175573690 ## CI report: * 1c9a917927cf0339a0271e04d474e91ac89254c8 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9736

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175570125 ## CI report: * 1c9a917927cf0339a0271e04d474e91ac89254c8 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9736

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175515606 ## CI report: * 1c9a917927cf0339a0271e04d474e91ac89254c8 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9736

[GitHub] [hudi] hudi-bot commented on pull request #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
hudi-bot commented on PR #6049: URL: https://github.com/apache/hudi/pull/6049#issuecomment-1175512074 ## CI report: * 1c9a917927cf0339a0271e04d474e91ac89254c8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[jira] [Updated] (HUDI-4365) Bulk Insert not URL encoding Partition Path properly

2022-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-4365: - Labels: pull-request-available (was: ) > Bulk Insert not URL encoding Partition Path properly > -

[GitHub] [hudi] alexeykudinkin opened a new pull request, #6049: [HUDI-4365] Fixing URL-encoding in Bulk Insert row-writing path

2022-07-05 Thread GitBox
alexeykudinkin opened a new pull request, #6049: URL: https://github.com/apache/hudi/pull/6049 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Updated] (HUDI-4365) Bulk Insert not URL encoding Partition Path properly

2022-07-05 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-4365: -- Description: Currently when using partition paths with slashes in it, Hudi lays out partitioned

[jira] [Updated] (HUDI-4365) Bulk Insert not URL encoding Partition Path properly

2022-07-05 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated HUDI-4365: -- Attachment: Screen Shot 2022-07-05 at 1.07.19 PM.png > Bulk Insert not URL encoding Partition Pa

[jira] [Created] (HUDI-4365) Bulk Insert not URL encoding Partition Path properly

2022-07-05 Thread Alexey Kudinkin (Jira)
Alexey Kudinkin created HUDI-4365: - Summary: Bulk Insert not URL encoding Partition Path properly Key: HUDI-4365 URL: https://issues.apache.org/jira/browse/HUDI-4365 Project: Apache Hudi Issu

[jira] [Updated] (HUDI-4346) Fix the params of bulkInsertAsRow not update BULKINSERT_ARE_PARTITIONER_RECORDS_SORTED

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-4346: Status: In Progress (was: Open) > Fix the params of bulkInsertAsRow not update > BULKINSERT_ARE_PARTITIONE

[jira] [Closed] (HUDI-4346) Fix the params of bulkInsertAsRow not update BULKINSERT_ARE_PARTITIONER_RECORDS_SORTED

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-4346. --- Resolution: Fixed > Fix the params of bulkInsertAsRow not update > BULKINSERT_ARE_PARTITIONER_RECORDS_SORTED

[jira] [Updated] (HUDI-4346) Fix the params of bulkInsertAsRow not update BULKINSERT_ARE_PARTITIONER_RECORDS_SORTED

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-4346: Status: Patch Available (was: In Progress) > Fix the params of bulkInsertAsRow not update > BULKINSERT_ARE

[jira] [Closed] (HUDI-4360) Fix HoodieDropPartitionsTool based on refactored meta sync

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-4360. --- Resolution: Fixed > Fix HoodieDropPartitionsTool based on refactored meta sync > -

[jira] [Updated] (HUDI-4360) Fix HoodieDropPartitionsTool based on refactored meta sync

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-4360: Status: Patch Available (was: In Progress) > Fix HoodieDropPartitionsTool based on refactored meta sync > -

[jira] [Updated] (HUDI-4360) Fix HoodieDropPartitionsTool based on refactored meta sync

2022-07-05 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-4360: Status: In Progress (was: Open) > Fix HoodieDropPartitionsTool based on refactored meta sync >

[GitHub] [hudi] alexeykudinkin commented on pull request #5828: [HUDI-4219] Merge Into when update expression "col=s.col+2" on precombine cause exception

2022-07-05 Thread GitBox
alexeykudinkin commented on PR #5828: URL: https://github.com/apache/hudi/pull/5828#issuecomment-1175364275 LGTM -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscr

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
alexeykudinkin commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914081837 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieMerge.java: ## @@ -0,0 +1,38 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one +

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #5627: [HUDI-3350][HUDI-3351] Support HoodieMerge API and Spark engine-specific HoodieRecord

2022-07-05 Thread GitBox
alexeykudinkin commented on code in PR #5627: URL: https://github.com/apache/hudi/pull/5627#discussion_r914081837 ## hudi-common/src/main/java/org/apache/hudi/common/model/HoodieMerge.java: ## @@ -0,0 +1,38 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one +

[GitHub] [hudi] alexeykudinkin commented on a diff in pull request #5664: [HUDI-4140] Fixing hive style partitioning and default partition with bulk insert row writer with SimpleKeyGen and virtual key

2022-07-05 Thread GitBox
alexeykudinkin commented on code in PR #5664: URL: https://github.com/apache/hudi/pull/5664#discussion_r914070359 ## hudi-spark-datasource/hudi-spark2/src/test/java/org/apache/hudi/internal/TestHoodieBulkInsertDataInternalWriter.java: ## @@ -109,6 +112,51 @@ public void testData

[GitHub] [hudi] hudi-bot commented on pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-05 Thread GitBox
hudi-bot commented on PR #6017: URL: https://github.com/apache/hudi/pull/6017#issuecomment-1175294900 ## CI report: * 572b3bd83c499348795f380004520f880506cf86 UNKNOWN * 65d15683ec3b8084330a6df7e121ca4218b83b2f UNKNOWN * 26f78b05748846a5724c2153d52a695cba641759 Azure: [SUCCES

[jira] [Commented] (HUDI-2749) Improve the streaming read for hudi

2022-07-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17562738#comment-17562738 ] Raymond Xu commented on HUDI-2749: -- [~danny0405] can you clean up the issues under this E

[hudi] branch master updated (8570c3aab4 -> 3670e82af5)

2022-07-05 Thread forwardxu
This is an automated email from the ASF dual-hosted git repository. forwardxu pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 8570c3aab4 [HUDI-4359] Support show_fs_path_detail command on Call Produce Command (#6042) add 3670e82af5 [HUDI

[GitHub] [hudi] XuQianJin-Stars merged pull request #6029: [HUDI-4356] Fix the error when sync hive in CTAS

2022-07-05 Thread GitBox
XuQianJin-Stars merged PR #6029: URL: https://github.com/apache/hudi/pull/6029 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.

[jira] [Updated] (HUDI-2749) Improve the streaming read for hudi

2022-07-05 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2749: - Fix Version/s: (was: 0.12.0) > Improve the streaming read for hudi > -

[hudi] branch master updated (23c9c5c296 -> 8570c3aab4)

2022-07-05 Thread forwardxu
This is an automated email from the ASF dual-hosted git repository. forwardxu pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 23c9c5c296 [HUDI-3836] Improve the way of fetching metadata partitions from table (#5286) add 8570c3aab4 [HUDI-

[GitHub] [hudi] XuQianJin-Stars merged pull request #6042: [HUDI-4359] Support show_fs_path_detail command on Call Produce Command

2022-07-05 Thread GitBox
XuQianJin-Stars merged PR #6042: URL: https://github.com/apache/hudi/pull/6042 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.

[GitHub] [hudi] hudi-bot commented on pull request #6025: [HUDI-4351] Improve HoodieFlinkCompactor

2022-07-05 Thread GitBox
hudi-bot commented on PR #6025: URL: https://github.com/apache/hudi/pull/6025#issuecomment-1175219792 ## CI report: * c71c6e8ea76e26fc528daa2ed5f05a6fc99dfe7f Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9731

[GitHub] [hudi] fengjian428 commented on issue #6038: [SUPPORT] MOR taking more time than COW using HoodieJavaWriteClient

2022-07-05 Thread GitBox
fengjian428 commented on issue #6038: URL: https://github.com/apache/hudi/issues/6038#issuecomment-1175214688 https://hudi.apache.org/community/get-involved just click the join group link -- This is an automated message from the Apache Git Service. To respond to the message, please log o

[GitHub] [hudi] noahtaite commented on issue #6048: [SUPPORT] S3 throttling while loading a table written with "hoodie.metadata.enable" = true

2022-07-05 Thread GitBox
noahtaite commented on issue #6048: URL: https://github.com/apache/hudi/issues/6048#issuecomment-1175210980 ![MicrosoftTeams-image (6)](https://user-images.githubusercontent.com/24283126/177366433-d21d3d16-31ed-41c8-93cf-aff1ae37b687.png) You can see the effect is a spike here after i

[GitHub] [hudi] tommss commented on issue #6038: [SUPPORT] MOR taking more time than COW using HoodieJavaWriteClient

2022-07-05 Thread GitBox
tommss commented on issue #6038: URL: https://github.com/apache/hudi/issues/6038#issuecomment-1175202995 I have sent request to add me to the slack group (https://github.com/apache/hudi/issues/143). Can you add me there -- This is an automated message from the Apache Git Service. To

[GitHub] [hudi] tommss commented on issue #143: Tracking ticket for folks to be added to slack group

2022-07-05 Thread GitBox
tommss commented on issue #143: URL: https://github.com/apache/hudi/issues/143#issuecomment-1175201665 Hi,Please add me to slack group Email: sheisher...@gmail.com Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] fengjian428 commented on issue #6038: [SUPPORT] MOR taking more time than COW using HoodieJavaWriteClient

2022-07-05 Thread GitBox
fengjian428 commented on issue #6038: URL: https://github.com/apache/hudi/issues/6038#issuecomment-1175192155 why don't you create a dataframe on top of rdd and then save it to hudi? btw have you joined hudi's slack? we can discuss this -- This is an automated message from the Apache Gi

[GitHub] [hudi] fengjian428 commented on issue #6038: [SUPPORT] MOR taking more time than COW using HoodieJavaWriteClient

2022-07-05 Thread GitBox
fengjian428 commented on issue #6038: URL: https://github.com/apache/hudi/issues/6038#issuecomment-1175179593 HoodieJavaWriteClient use HoodieJavaMergeOnReadTable for handle mor table, and HoodieJavaMergeOnReadTable does nothing but inherits function from HoodieJavaCopyOnWriteTable. So

[GitHub] [hudi] tommss commented on issue #6038: [SUPPORT] MOR taking more time than COW using HoodieJavaWriteClient

2022-07-05 Thread GitBox
tommss commented on issue #6038: URL: https://github.com/apache/hudi/issues/6038#issuecomment-1175174196 - I changed index to Bloom to see if it makes any difference, but it does not. - What do you mean by HoodieJavaMergeOnReadTable is unfinished ? - Below is what we are trying to achi

[GitHub] [hudi] hudi-bot commented on pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-05 Thread GitBox
hudi-bot commented on PR #6017: URL: https://github.com/apache/hudi/pull/6017#issuecomment-1175163203 ## CI report: * 572b3bd83c499348795f380004520f880506cf86 UNKNOWN * d98c0e31e8d401014fe7338207e45879e5828f99 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-05 Thread GitBox
hudi-bot commented on PR #6017: URL: https://github.com/apache/hudi/pull/6017#issuecomment-1175158251 ## CI report: * 572b3bd83c499348795f380004520f880506cf86 UNKNOWN * d98c0e31e8d401014fe7338207e45879e5828f99 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6046: [HUDI-4363] Support Clustering row writer to improve performance

2022-07-05 Thread GitBox
hudi-bot commented on PR #6046: URL: https://github.com/apache/hudi/pull/6046#issuecomment-1175153416 ## CI report: * 58cf2096e648ccc8c7e7c563003753ce89a90261 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9730

[GitHub] [hudi] hudi-bot commented on pull request #6017: [HUDI-4276] Reconcile schema-inject null values for missing fields and add new fields

2022-07-05 Thread GitBox
hudi-bot commented on PR #6017: URL: https://github.com/apache/hudi/pull/6017#issuecomment-1175153262 ## CI report: * 572b3bd83c499348795f380004520f880506cf86 UNKNOWN * d98c0e31e8d401014fe7338207e45879e5828f99 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[hudi] branch master updated (fbda4ad5bd -> 23c9c5c296)

2022-07-05 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from fbda4ad5bd [HUDI-4360] Fix HoodieDropPartitionsTool based on refactored meta sync (#6043) add 23c9c5c296 [HUDI-3836

[GitHub] [hudi] yihua merged pull request #5286: [HUDI-3836] Improve the way of fetching metadata partitions from table

2022-07-05 Thread GitBox
yihua merged PR #5286: URL: https://github.com/apache/hudi/pull/5286 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

[GitHub] [hudi] xiarixiaoyao commented on pull request #6042: [HUDI-4359] Support show_fs_path_detail command on Call Produce Command

2022-07-05 Thread GitBox
xiarixiaoyao commented on PR #6042: URL: https://github.com/apache/hudi/pull/6042#issuecomment-1175145943 nice work -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsub

[GitHub] [hudi] hudi-bot commented on pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6036: URL: https://github.com/apache/hudi/pull/6036#issuecomment-1175083192 ## CI report: * ca25b34f04a0bf02daf2c2198fe3db5ede544129 UNKNOWN * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6036: [HUDI-4357] Support flink 1.15.x

2022-07-05 Thread GitBox
hudi-bot commented on PR #6036: URL: https://github.com/apache/hudi/pull/6036#issuecomment-1175077768 ## CI report: * ca25b34f04a0bf02daf2c2198fe3db5ede544129 UNKNOWN * 64e3f11d32fc3dd5cb6bc8158913994e3b6a691f Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #6028: [HUDI-4355] Bulk insert As Row: Should also repartiiton records if populateMetaFields is false

2022-07-05 Thread GitBox
hudi-bot commented on PR #6028: URL: https://github.com/apache/hudi/pull/6028#issuecomment-1175077675 ## CI report: * ba6cd2a43d6a4f4e69594d434c34a859419fcff7 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=9729

[GitHub] [hudi] codope commented on pull request #5608: [HUDI-2150] Rename/Restructure configs for better modularity

2022-07-05 Thread GitBox
codope commented on PR #5608: URL: https://github.com/apache/hudi/pull/5608#issuecomment-1175069366 @liujinhui1994 I'll pick up the review this week. A couple of high-level questions: 1. Are there any default behaviour changes? 2. For the renames, is the backward compatibility handled?

  1   2   3   >