[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201417934 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/spark/sql/hudi/TestInsertTable.scala: ## @@ -599,138 +582,250 @@ class TestInsertTable extends HoodieSparkSq

[GitHub] [hudi] boneanxs commented on a diff in pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8076: URL: https://github.com/apache/hudi/pull/8076#discussion_r1201621093 ## hudi-spark-datasource/hudi-spark/src/test/scala/org/apache/hudi/functional/TestSparkDataSourceDAGExecution.scala: ## @@ -89,7 +89,7 @@ class TestSparkDataSourceDAGExec

[GitHub] [hudi] aajisaka commented on pull request #8162: [MINOR][DOCS] HMS mode with Spark datasource is already supported.

2023-05-22 Thread via GitHub
aajisaka commented on PR #8162: URL: https://github.com/apache/hudi/pull/8162#issuecomment-1558635066 Thank you @danny0405 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[hudi] annotated tag release-0.13.1 updated (7a654395836 -> c20cdb68418)

2023-05-22 Thread zhangyue19921010
This is an automated email from the ASF dual-hosted git repository. zhangyue19921010 pushed a change to annotated tag release-0.13.1 in repository https://gitbox.apache.org/repos/asf/hudi.git *** WARNING: tag release-0.13.1 was modified! *** from 7a654395836 (commit) to c20cdb68418 (t

[GitHub] [hudi] codope commented on pull request #8774: [HUDI-6246] Fixing restore for compaction commit

2023-05-22 Thread via GitHub
codope commented on PR #8774: URL: https://github.com/apache/hudi/pull/8774#issuecomment-1558614877 @hudi-bot run azure

[GitHub] [hudi] ad1happy2go commented on issue #8066: org.apache.hudi.exception.HoodieIOException: Could not check if hdfs:/user/hive/warehouse/xxx is a valid table

2023-05-22 Thread via GitHub
ad1happy2go commented on issue #8066: URL: https://github.com/apache/hudi/issues/8066#issuecomment-1558614367 @wangzhongz Closing this issue as it looks to be more of a cluster permission issue than a Hudi issue. Please reopen the ticket in case you are still facing the issue.

[hudi] branch master updated: [MINOR] Optimize code style (#8751)

2023-05-22 Thread yihua
yihua pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new dcc1a07dcfe [MINOR] Optimize code style (#8751) dcc

[GitHub] [hudi] yihua commented on pull request #8751: [MINOR] Optimize code style

2023-05-22 Thread via GitHub
yihua commented on PR #8751: URL: https://github.com/apache/hudi/pull/8751#issuecomment-1558613881 CI is green. https://github.com/apache/hudi/assets/2497195/b8287021-b256-4764-b75b-e8f8ed8a8ba8

[hudi] branch master updated (c3403ff5e09 -> 2b3cfa8a744)

2023-05-22 Thread yihua
yihua pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from c3403ff5e09 [HUDI-5584] Metasync update props when changed (#8775) add 2b3cfa8a744 [HUDI-6245] Automatically downgra

[GitHub] [hudi] yihua merged pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
yihua merged PR #8771: URL: https://github.com/apache/hudi/pull/8771

[hudi] 01/01: Create release branch for version 0.13.1

2023-05-22 Thread zhangyue19921010
zhangyue19921010 pushed a commit to branch release-0.13.1 in repository https://gitbox.apache.org/repos/asf/hudi.git commit 7a6543958368540d221ddc18e0c12b8d526b6859 Author: zhangyue19921010 AuthorDate: Tue May 23 14:21:44 2023 +

[hudi] branch release-0.13.1 created (now 7a654395836)

2023-05-22 Thread zhangyue19921010
zhangyue19921010 pushed a change to branch release-0.13.1 in repository https://gitbox.apache.org/repos/asf/hudi.git at 7a654395836 Create release branch for version 0.13.1 This branch includes the following new commits:

[GitHub] [hudi] yihua commented on pull request #8771: [HUDI-6245] Automatically downgrade table version of metadata table

2023-05-22 Thread via GitHub
yihua commented on PR #8771: URL: https://github.com/apache/hudi/pull/8771#issuecomment-1558603266 Azure CI is green. https://github.com/apache/hudi/assets/2497195/1802a267-722f-42f3-84b3-bb46d8146684

[GitHub] [hudi] hudi-bot commented on pull request #8076: [HUDI-5884] Support bulk_insert for insert_overwrite and insert_overwrite_table

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8076: URL: https://github.com/apache/hudi/pull/8076#issuecomment-1558599047 ## CI report: * 6a239ada8998fd440f19c0082b26d206ed589870 UNKNOWN * 851a1c3d1bbb317a01822b67ac14ea16ba49fa5e Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] xushiyan closed pull request #7719: [HUDI-5584] When the table to be synchronized already exists in hive,…

2023-05-22 Thread via GitHub
xushiyan closed pull request #7719: [HUDI-5584] When the table to be synchronized already exists in hive,… URL: https://github.com/apache/hudi/pull/7719

[hudi] branch master updated (b74e6ad2eb9 -> c3403ff5e09)

2023-05-22 Thread xushiyan
xushiyan pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from b74e6ad2eb9 [HUDI-6247] Add bundle validation for release candidates (#8779) add c3403ff5e09 [HUDI-5584] Metasyn

[GitHub] [hudi] xushiyan merged pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
xushiyan merged PR #8775: URL: https://github.com/apache/hudi/pull/8775

[jira] [Updated] (HUDI-6250) Refactor metasync client api to return sync result

2023-05-22 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-6250: - Fix Version/s: 0.14.0 0.12.4 > Refactor metasync client api to return sync result > ---

[GitHub] [hudi] xushiyan commented on a diff in pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
xushiyan commented on code in PR #8775: URL: https://github.com/apache/hudi/pull/8775#discussion_r1201583793 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieMetaSyncOperations.java: ## @@ -186,16 +188,20 @@ default void updateLastCommitTimeSynced(St

[jira] [Created] (HUDI-6250) Refactor metasync client api to return sync result

2023-05-22 Thread Raymond Xu (Jira)
Raymond Xu created HUDI-6250: Summary: Refactor metasync client api to return sync result Key: HUDI-6250 URL: https://issues.apache.org/jira/browse/HUDI-6250 Project: Apache Hudi Issue Type: Impr

[GitHub] [hudi] hudi-bot commented on pull request #8445: [HUDI-3088] Use Spark 3.2 as default Spark version

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8445: URL: https://github.com/apache/hudi/pull/8445#issuecomment-1558589844 ## CI report: * fe494c5e09f8c3a57446834c86ad82904bcda585 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1728

[GitHub] [hudi] boneanxs commented on a diff in pull request #8452: [HUDI-6077] Add more partition push down filters

2023-05-22 Thread via GitHub
boneanxs commented on code in PR #8452: URL: https://github.com/apache/hudi/pull/8452#discussion_r1201575964 ## hudi-common/src/main/java/org/apache/hudi/metadata/FileSystemBackedTableMetadata.java: ## @@ -96,11 +109,32 @@ public List getPartitionPathWithPathPrefixes(List relat

[jira] [Updated] (HUDI-4248) Upgrade Apache Avro version for hudi-flink

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4248: Fix Version/s: 0.14.0 (was: 0.13.1) > Upgrade Apache Avro version for hudi-flink > --

[jira] [Updated] (HUDI-3519) Make sure every public Hudi Client Method invokes necessary prologue

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3519: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure every public Hudi Client Method invok

[jira] [Updated] (HUDI-4245) Support nested fields in Column Stats Index

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-4245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-4245: Fix Version/s: 0.14.0 (was: 0.13.1) > Support nested fields in Column Stats Index > -

[jira] [Updated] (HUDI-3674) Remove unnecessary HBase-related dependencies from bundles if there is any

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3674: Fix Version/s: 0.14.0 (was: 0.13.1) > Remove unnecessary HBase-related dependencies f

[jira] [Updated] (HUDI-2767) Enable timeline server based marker type as default

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2767: Fix Version/s: 0.14.0 (was: 0.13.1) > Enable timeline server based marker type as def

[jira] [Updated] (HUDI-3879) Suppress exceptions that are not fatal in HoodieMetadataTableValidator

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3879: Fix Version/s: 0.14.0 (was: 0.13.1) > Suppress exceptions that are not fatal in Hoodi

[jira] [Updated] (HUDI-3115) Kafka Connect should not be packaged as a bundle

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3115: Fix Version/s: 0.14.0 (was: 0.13.1) > Kafka Connect should not be packaged as a bundl

[jira] [Updated] (HUDI-3531) Review and shade transitive dependencies in hudi bundle jar

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3531: Fix Version/s: 0.14.0 (was: 0.13.1) > Review and shade transitive dependencies in hud

[jira] [Updated] (HUDI-3321) HFileWriter, HFileReader and HFileDataBlock should avoid hardcoded key field name

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3321: Fix Version/s: 0.14.0 (was: 0.13.1) > HFileWriter, HFileReader and HFileDataBlock sho

[jira] [Updated] (HUDI-3317) Partition specific pointed lookup/reading strategy for metadata table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-3317: Fix Version/s: 0.14.0 (was: 0.13.1) > Partition specific pointed lookup/reading strat

[jira] [Updated] (HUDI-2737) Use earliest instant by default for compaction and clustering job

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2737: Fix Version/s: 0.14.0 (was: 0.13.1) > Use earliest instant by default for compaction

[jira] [Updated] (HUDI-2736) Redundant metadata table initialization by the metadata writer

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2736: Fix Version/s: 0.14.0 (was: 0.13.1) > Redundant metadata table initialization by the

[jira] [Updated] (HUDI-2458) Relax compaction in metadata being fenced based on inflight requests in data table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2458: Fix Version/s: 0.14.0 (was: 0.13.1) > Relax compaction in metadata being fenced based

[jira] [Updated] (HUDI-2388) Add test nodes for Spark SQL in integration test suite

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-2388: Fix Version/s: 0.14.0 (was: 0.13.1) > Add test nodes for Spark SQL in integration tes

[jira] [Updated] (HUDI-1101) Decouple Hive dependencies from hudi-spark

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-1101: Fix Version/s: 0.14.0 (was: 0.13.1) > Decouple Hive dependencies from hudi-spark > --

[jira] [Updated] (HUDI-6127) Flink Hudi Support Commit on empty batch

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6127: Issue Type: Improvement (was: New Feature) > Flink Hudi Support Commit on empty batch > --

[jira] [Updated] (HUDI-5517) HoodieTimeline support filter instants by state transition time

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5517: Fix Version/s: (was: 0.13.1) > HoodieTimeline support filter instants by state transition time > ---

[jira] [Updated] (HUDI-6091) Add Java 11 and 17 to bundle validation image

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6091: Issue Type: Improvement (was: New Feature) > Add Java 11 and 17 to bundle validation image > --

[jira] [Updated] (HUDI-5941) Support savepoint CALL procedure with table base path

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5941: Fix Version/s: 0.14.0 (was: 0.13.1) > Support savepoint CALL procedure with table bas

[jira] [Updated] (HUDI-6176) Fix flaky test testArchivalWithMultiWriters

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6176: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix flaky test testArchivalWithMultiWriters > -

[jira] [Updated] (HUDI-6138) HoodieAvroRecord - Fix Option get for empty values

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6138: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieAvroRecord - Fix Option get for empty val

[jira] [Updated] (HUDI-6061) NPE with nullable MapType and new hudi merger

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6061: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE with nullable MapType and new hudi merger >

[jira] [Updated] (HUDI-5904) support more than one update actions in merge into table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5904: Fix Version/s: 0.14.0 (was: 0.13.1) > support more than one update actions in merge i

[jira] [Updated] (HUDI-5890) Fix build failure of asf-site branch

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5890: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix build failure of asf-site branch >

[jira] [Updated] (HUDI-6011) Hudi CLI show archived commits is broken for replace commit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6011: Fix Version/s: (was: 0.13.1) > Hudi CLI show archived commits is broken for replace commit > ---

[jira] [Updated] (HUDI-5914) Fix for RowData class cast exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5914: Fix Version/s: (was: 0.13.1) > Fix for RowData class cast exception > --

[jira] [Updated] (HUDI-5968) Global index update partition for MOR creating duplicates

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5968: Fix Version/s: (was: 0.13.1) > Global index update partition for MOR creating duplicates > -

[jira] [Updated] (HUDI-6025) Incremental read with MOR doesn't give correct results

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-6025: Fix Version/s: 0.14.0 (was: 0.13.1) > Incremental read with MOR doesn't give correct

[jira] [Updated] (HUDI-5824) COMBINE_BEFORE_UPSERT=false option does not work for upsert

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5824: Fix Version/s: 0.14.0 (was: 0.13.1) > COMBINE_BEFORE_UPSERT=false option does not wor

[jira] [Updated] (HUDI-5867) Use commons.io v2.7+ for hbase-server

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5867: Fix Version/s: 0.14.0 (was: 0.13.1) > Use commons.io v2.7+ for hbase-server > ---

[jira] [Updated] (HUDI-5864) Update release notes regarding the HoodieMetadataFileSystemView regression

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5864: Fix Version/s: 0.14.0 (was: 0.13.1) > Update release notes regarding the HoodieMetada

[jira] [Updated] (HUDI-5866) Fix unnecessary log messages during bulk insert in Spark

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5866: Fix Version/s: (was: 0.13.1) > Fix unnecessary log messages during bulk insert in Spark > --

[GitHub] [hudi] codope commented on a diff in pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
codope commented on code in PR #8775: URL: https://github.com/apache/hudi/pull/8775#discussion_r1201549502 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieMetaSyncOperations.java: ## @@ -186,16 +188,20 @@ default void updateLastCommitTimeSynced(Stri

[jira] [Updated] (HUDI-5760) Make sure DeleteBlock doesn't use Kryo for serialization to disk

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5760: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure DeleteBlock doesn't use Kryo for seri

[jira] [Updated] (HUDI-5807) HoodieSparkParquetReader is not appending partition-path values

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5807: Fix Version/s: 0.14.0 (was: 0.13.1) > HoodieSparkParquetReader is not appending parti

[jira] [Updated] (HUDI-5759) Hudi do not support add column on mor table with log

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5759: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi do not support add column on mor table wit

[jira] [Updated] (HUDI-5769) Partitions created by Async indexer could be deleted by regular writers

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5769: Fix Version/s: 0.14.0 (was: 0.13.1) > Partitions created by Async indexer could be de

[jira] [Updated] (HUDI-5737) Fix Deletes issued without any prior commits

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5737: Fix Version/s: (was: 0.13.1) > Fix Deletes issued without any prior commits > --

[jira] [Updated] (HUDI-5733) TestHoodieDeltaStreamer.testHoodieIndexer failure

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5733: Fix Version/s: 0.14.0 (was: 0.13.1) > TestHoodieDeltaStreamer.testHoodieIndexer failu

[jira] [Updated] (HUDI-5731) Fix com.google.common classes still being relocated in Hudi Spark bundle

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5731: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix com.google.common classes still being reloc

[GitHub] [hudi] xushiyan commented on a diff in pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
xushiyan commented on code in PR #8775: URL: https://github.com/apache/hudi/pull/8775#discussion_r1201547833 ## hudi-sync/hudi-sync-common/src/main/java/org/apache/hudi/sync/common/HoodieMetaSyncOperations.java: ## @@ -186,16 +188,20 @@ default void updateLastCommitTimeSynced(St

[jira] [Updated] (HUDI-5670) Server-based markers creation times out

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5670: Fix Version/s: 0.14.0 (was: 0.13.1) > Server-based markers creation times out > -

[GitHub] [hudi] bvaradar commented on a diff in pull request #8303: [HUDI-5998] Speed up reads from bootstrapped tables in spark

2023-05-22 Thread via GitHub
bvaradar commented on code in PR #8303: URL: https://github.com/apache/hudi/pull/8303#discussion_r1201546844 ## hudi-spark-datasource/hudi-spark-common/src/main/scala/org/apache/hudi/HoodieBootstrapRelation.scala: ## @@ -188,11 +188,23 @@ case class HoodieBootstrapRelation(overr

[jira] [Updated] (HUDI-5711) NPE occurs when enabling metadata on table which does'nt has metadata previously

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5711: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE occurs when enabling metadata on table whic

[GitHub] [hudi] xushiyan commented on a diff in pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
xushiyan commented on code in PR #8775: URL: https://github.com/apache/hudi/pull/8775#discussion_r1201544493 ## hudi-sync/hudi-hive-sync/src/main/java/org/apache/hudi/hive/HiveSyncTool.java: ## @@ -280,83 +282,87 @@ protected void syncHoodieTable(String tableName, boolean useRe

[jira] [Updated] (HUDI-5697) Spark SQL re-lists Hudi table after every SQL operations

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5697: Fix Version/s: 0.14.0 (was: 0.13.1) > Spark SQL re-lists Hudi table after every SQL o

[jira] [Updated] (HUDI-5688) schema field of EmptyRelation subtype of BaseRelation should not be null

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5688: Fix Version/s: 0.14.0 (was: 0.13.1) > schema field of EmptyRelation subtype of BaseRe

[jira] [Updated] (HUDI-5716) Fix Partitioners to avoid assuming that parallelism is always present

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5716: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Partitioners to avoid assuming that paralle

[jira] [Updated] (HUDI-5609) Hudi table not queryable by SQL on Databricks Spark

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5609: Fix Version/s: 0.14.0 (was: 0.13.1) > Hudi table not queryable by SQL on Databricks S

[jira] [Updated] (HUDI-5619) Fix HoodieTableFileSystemView inefficient latest base-file lookups

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5619: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HoodieTableFileSystemView inefficient lates

[jira] [Updated] (HUDI-5597) Deltastreamer ingestion fails when consistent hashing index is used

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5597: Fix Version/s: 0.14.0 (was: 0.13.1) > Deltastreamer ingestion fails when consistent h

[jira] [Updated] (HUDI-5641) Streamline Advanced Schema Evolution flow

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5641: Fix Version/s: 0.14.0 (was: 0.13.1) > Streamline Advanced Schema Evolution flow > ---

[jira] [Updated] (HUDI-5602) Troubleshoot METADATA_ONLY bootstrapped table not being able to read back partition path

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5602: Fix Version/s: 0.14.0 (was: 0.13.1) > Troubleshoot METADATA_ONLY bootstrapped table n

[jira] [Updated] (HUDI-5608) Support decimals w/ precision > 30 in Column Stats

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5608: Fix Version/s: 0.14.0 (was: 0.13.1) > Support decimals w/ precision > 30 in Column St

[jira] [Updated] (HUDI-5575) Support any record key generation along w/ any partition path generation for row writer

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5575: Fix Version/s: 0.14.0 (was: 0.13.1) > Support any record key generation along w/ any

[jira] [Updated] (HUDI-5574) Support auto record key generation with Spark SQL

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5574: Fix Version/s: 0.14.0 (was: 0.13.1) > Support auto record key generation with Spark S

[jira] [Updated] (HUDI-5588) Fix Metadata table validator to deduce valid partitions when first commit where partition was added is failed

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5588: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix Metadata table validator to deduce valid pa

[jira] [Updated] (HUDI-5444) FileNotFound issue w/ metadata enabled

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5444: Fix Version/s: 0.14.0 (was: 0.13.1) > FileNotFound issue w/ metadata enabled > --

[jira] [Updated] (HUDI-5507) SparkSQL can not read the latest change data without execute "refresh table xxx"

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5507: Fix Version/s: 0.14.0 (was: 0.13.1) > SparkSQL can not read the latest change data wi

[jira] [Updated] (HUDI-5557) Wrong candidate files found in metadata table

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5557: Fix Version/s: 0.14.0 (was: 0.13.1) > Wrong candidate files found in metadata table

[jira] [Updated] (HUDI-5463) Apply rollback commits from data table as rollbacks in MDT instead of Delta commit

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5463: Fix Version/s: 0.14.0 (was: 0.13.1) > Apply rollback commits from data table as rollb

[jira] [Updated] (HUDI-5442) Fix HiveHoodieTableFileIndex to use lazy listing

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5442: Fix Version/s: 0.14.0 (was: 0.13.1) > Fix HiveHoodieTableFileIndex to use lazy listin

[jira] [Updated] (HUDI-5520) Fail MDT when list of log files grows unboundedly

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5520: Fix Version/s: 0.14.0 (was: 0.13.1) > Fail MDT when list of log files grows unbounded

[jira] [Updated] (HUDI-5436) Auto repair tool for MDT out of sync

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5436: Fix Version/s: 0.14.0 (was: 0.13.1) > Auto repair tool for MDT out of sync >

[jira] [Updated] (HUDI-5374) Use KeyGeneratorFactory class for instantiating a KeyGenerator

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5374: Fix Version/s: 0.14.0 (was: 0.13.1) > Use KeyGeneratorFactory class for instantiating

[jira] [Updated] (HUDI-5271) Inconsistent reader and writer schema in HoodieAvroDataBlock cause exception

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5271: Fix Version/s: 0.14.0 (was: 0.13.1) > Inconsistent reader and writer schema in Hoodie

[jira] [Updated] (HUDI-5364) Make sure Hudi's Column Stats are wired into Spark's relation stats

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5364: Fix Version/s: 0.14.0 (was: 0.13.1) > Make sure Hudi's Column Stats are wired into Sp

[jira] [Updated] (HUDI-5385) Make behavior of keeping File Writers open configurable

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5385: Fix Version/s: 0.14.0 (was: 0.13.1) > Make behavior of keeping File Writers open conf

[jira] [Updated] (HUDI-5322) Bulk-insert (row-writing) is not rewriting incoming dataset into Writer's schema

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5322: Fix Version/s: 0.14.0 (was: 0.13.1) > Bulk-insert (row-writing) is not rewriting inco

[jira] [Updated] (HUDI-5405) Avoid using Projections in generic Merge Into DMLs

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5405: Fix Version/s: 0.14.0 (was: 0.13.1) > Avoid using Projections in generic Merge Into D

[jira] [Updated] (HUDI-5361) Propagate Hudi properties set in Spark's SQLConf to Hudi

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5361: Fix Version/s: 0.14.0 (was: 0.13.1) > Propagate Hudi properties set in Spark's SQLCon

[jira] [Updated] (HUDI-5438) Benchmark calls w/ metadata enabled and ensure no calls to direct FS

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5438: Fix Version/s: 0.14.0 (was: 0.13.1) > Benchmark calls w/ metadata enabled and ensure

[jira] [Updated] (HUDI-5352) Jackson fails to serialize LocalDate when updating Delta Commit metadata

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5352: Fix Version/s: 0.14.0 (was: 0.13.1) > Jackson fails to serialize LocalDate when updat

[jira] [Updated] (HUDI-5319) NPE in Bloom Filter Index

2023-05-22 Thread Yue Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yue Zhang updated HUDI-5319: Fix Version/s: 0.14.0 (was: 0.13.1) > NPE in Bloom Filter Index > ---

[GitHub] [hudi] hudi-bot commented on pull request #8445: [HUDI-3088] Use Spark 3.2 as default Spark version

2023-05-22 Thread via GitHub
hudi-bot commented on PR #8445: URL: https://github.com/apache/hudi/pull/8445#issuecomment-1558546855 ## CI report: * fe494c5e09f8c3a57446834c86ad82904bcda585 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] xushiyan commented on a diff in pull request #8775: [HUDI-5584] Metasync update props when changed

2023-05-22 Thread via GitHub
xushiyan commented on code in PR #8775: URL: https://github.com/apache/hudi/pull/8775#discussion_r1201537869 ## hudi-aws/src/main/java/org/apache/hudi/aws/sync/AWSGlueCatalogSyncClient.java: ## @@ -477,13 +472,19 @@ private static Table getTable(AWSGlue awsGlue, String database
