[GitHub] [hudi] codope commented on a diff in pull request #9406: [DOCS] Update Metadata table and metadata indexing related pages

2023-08-11 Thread via GitHub
codope commented on code in PR #9406: URL: https://github.com/apache/hudi/pull/9406#discussion_r1292107760 ## website/docs/metadata.md: ## @@ -3,80 +3,173 @@ title: Metadata Table keywords: [ hudi, metadata, S3 file listings] --- -## Motivation for a Metadata Table +## Metad

[GitHub] [hudi] hudi-bot commented on pull request #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9434: URL: https://github.com/apache/hudi/pull/9434#issuecomment-1675703699 ## CI report: * e895bfb27350f497100c3cd50246badcba99f27d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[GitHub] [hudi] prashantwason commented on pull request #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
prashantwason commented on PR #9434: URL: https://github.com/apache/hudi/pull/9434#issuecomment-1675703170 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

[GitHub] [hudi] Zouxxyy commented on a diff in pull request #9416: [HUDI-6678] Fix the acquisition of clean&rollback instants to archive

2023-08-11 Thread via GitHub
Zouxxyy commented on code in PR #9416: URL: https://github.com/apache/hudi/pull/9416#discussion_r1291978221 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -452,107 +431,137 @@ private Stream getCommitInstantsToArchive()

[GitHub] [hudi] Riddle4045 opened a new issue, #9435: [SUPPORT] Trino can't read tables created by Flink Hudi conector

2023-08-11 Thread via GitHub
Riddle4045 opened a new issue, #9435: URL: https://github.com/apache/hudi/issues/9435 I am creating a hudi table using Flink Hudi connector ```sql CREATE TABLE flink.flink_hudi_hms3 ( uuid VARCHAR(20), name VARCHAR(10), age INT, ts TIMESTAMP(3), `pa

[GitHub] [hudi] danny0405 commented on a diff in pull request #9297: Generate test jars for hudi-utilities and hudi-hive-sync modules

2023-08-11 Thread via GitHub
danny0405 commented on code in PR #9297: URL: https://github.com/apache/hudi/pull/9297#discussion_r1291957282 ## hudi-sync/hudi-hive-sync/pom.xml: ## @@ -200,6 +200,9 @@ + + false + Review Comment: Thanks, can we m

[GitHub] [hudi] danny0405 commented on issue #9344: [SUPPORT] Getting error when writing to different HUDI tables in different threads in same job

2023-08-11 Thread via GitHub
danny0405 commented on issue #9344: URL: https://github.com/apache/hudi/issues/9344#issuecomment-1675598997 It seems the MDT encountered an inconsistency. Can you fix the errored table separately and restart the job? The table metadata can be re-bootstrapped through the HUDI CLI. -- This is an

[GitHub] [hudi] danny0405 commented on a diff in pull request #9416: [HUDI-6678] Fix the acquisition of clean&rollback instants to archive

2023-08-11 Thread via GitHub
danny0405 commented on code in PR #9416: URL: https://github.com/apache/hudi/pull/9416#discussion_r1291949979 ## hudi-client/hudi-client-common/src/main/java/org/apache/hudi/client/HoodieTimelineArchiver.java: ## @@ -452,107 +431,137 @@ private Stream getCommitInstantsToArchive

[GitHub] [hudi] danny0405 commented on issue #9402: [SUPPORT] HiveSync not support schema evolution

2023-08-11 Thread via GitHub
danny0405 commented on issue #9402: URL: https://github.com/apache/hudi/issues/9402#issuecomment-1675592849 Hive seems to have no good support for schema evolution. Step 7 may be caused by the position change of the columns. -- This is an automated message from the Apache Git Service. To res

[GitHub] [hudi] hudi-bot commented on pull request #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9434: URL: https://github.com/apache/hudi/pull/9434#issuecomment-1675552821 ## CI report: * e895bfb27350f497100c3cd50246badcba99f27d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[GitHub] [hudi] hudi-bot commented on pull request #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9433: URL: https://github.com/apache/hudi/pull/9433#issuecomment-1675545141 ## CI report: * 32538d15eea7ed25ca8f0f8efbdd2bd1c21fa41e Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[jira] [Updated] (HUDI-6687) S3/GCS incr job improvements

2023-08-11 Thread Lokesh Lingarajan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lokesh Lingarajan updated HUDI-6687: Description: 1. Current batched commit string is of the type "commit#key", given this if we

[GitHub] [hudi] lokesh-lingarajan-0310 commented on a diff in pull request #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
lokesh-lingarajan-0310 commented on code in PR #9433: URL: https://github.com/apache/hudi/pull/9433#discussion_r1291842978 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/IncrSourceHelper.java: ## @@ -219,7 +218,7 @@ public static Pair> filterAndGenera

[jira] [Created] (HUDI-6687) S3/GCS incr job improvements

2023-08-11 Thread Lokesh Lingarajan (Jira)
Lokesh Lingarajan created HUDI-6687: --- Summary: S3/GCS incr job improvements Key: HUDI-6687 URL: https://issues.apache.org/jira/browse/HUDI-6687 Project: Apache Hudi Issue Type: Improvement

[GitHub] [hudi] amrishlal commented on a diff in pull request #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
amrishlal commented on code in PR #9433: URL: https://github.com/apache/hudi/pull/9433#discussion_r1291826591 ## hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/IncrSourceHelper.java: ## @@ -219,7 +218,7 @@ public static Pair> filterAndGenerateChe }

[GitHub] [hudi] hudi-bot commented on pull request #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9434: URL: https://github.com/apache/hudi/pull/9434#issuecomment-1675411037 ## CI report: * e895bfb27350f497100c3cd50246badcba99f27d Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[GitHub] [hudi] hudi-bot commented on pull request #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9434: URL: https://github.com/apache/hudi/pull/9434#issuecomment-1675404896 ## CI report: * e895bfb27350f497100c3cd50246badcba99f27d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9433: URL: https://github.com/apache/hudi/pull/9433#issuecomment-1675404838 ## CI report: * 32538d15eea7ed25ca8f0f8efbdd2bd1c21fa41e Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[GitHub] [hudi] prashantwason closed pull request #9431: Bumping release candidate number 1

2023-08-11 Thread via GitHub
prashantwason closed pull request #9431: Bumping release candidate number 1 URL: https://github.com/apache/hudi/pull/9431 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

[GitHub] [hudi] hudi-bot commented on pull request #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9433: URL: https://github.com/apache/hudi/pull/9433#issuecomment-1675397975 ## CI report: * 32538d15eea7ed25ca8f0f8efbdd2bd1c21fa41e UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] prashantwason opened a new pull request, #9434: Dummy commit to trigger CI

2023-08-11 Thread via GitHub
prashantwason opened a new pull request, #9434: URL: https://github.com/apache/hudi/pull/9434 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any perfor

[jira] [Updated] (HUDI-6686) Handling empty commit for s3 Incr job

2023-08-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6686: - Labels: pull-request-available (was: ) > Handling empty commit for s3 Incr job >

[GitHub] [hudi] lokesh-lingarajan-0310 opened a new pull request, #9433: [HUDI-6686] - Handling empty commits after s3 applyFilter api

2023-08-11 Thread via GitHub
lokesh-lingarajan-0310 opened a new pull request, #9433: URL: https://github.com/apache/hudi/pull/9433 ### Change Logs Handling empty commit and returning current batch's endpoint to handle scenarios of customer configuring filters for specific objects in s3 among other objects.

[GitHub] [hudi] hudi-bot commented on pull request #9431: Bumping release candidate number 1

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9431: URL: https://github.com/apache/hudi/pull/9431#issuecomment-1675300491 ## CI report: * 0169afc0895d3d5fb7ad23a6ecf744cdf54a5a86 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[hudi] branch asf-site updated: [DOCS] Update Indexing and File Layout pages (#9346)

2023-08-11 Thread bhavanisudha
This is an automated email from the ASF dual-hosted git repository. bhavanisudha pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new b8a0e75494b [DOCS] Update Indexing and F

[GitHub] [hudi] bhasudha merged pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
bhasudha merged PR #9346: URL: https://github.com/apache/hudi/pull/9346 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[jira] [Created] (HUDI-6686) Handling empty commit for s3 Incr job

2023-08-11 Thread Lokesh Lingarajan (Jira)
Lokesh Lingarajan created HUDI-6686: --- Summary: Handling empty commit for s3 Incr job Key: HUDI-6686 URL: https://issues.apache.org/jira/browse/HUDI-6686 Project: Apache Hudi Issue Type: Tas

[GitHub] [hudi] nsivabalan commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
nsivabalan commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291705672 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Curre

[GitHub] [hudi] bhasudha commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
bhasudha commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291692336 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Current

[jira] [Updated] (HUDI-6685) Fix code typo in quick start guide under pyspark "Insert Overwrite" section.

2023-08-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-6685: - Labels: pull-request-available (was: ) > Fix code typo in quick start guide under pyspark "Insert

[GitHub] [hudi] amrishlal opened a new pull request, #9432: [HUDI-6685] Fix code typo in pyspark 'Insert Overwrite' section of Quick Start Guide.

2023-08-11 Thread via GitHub
amrishlal opened a new pull request, #9432: URL: https://github.com/apache/hudi/pull/9432 Fix code typo in pyspark 'Insert Overwrite' section of Quick Start Guide. ### Change Logs Fix code typo in pyspark 'Insert Overwrite' section of Quick Start Guide. ### Impact

[GitHub] [hudi] bhasudha commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
bhasudha commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291685815 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Current

[jira] [Created] (HUDI-6685) Fix code typo in quick start guide under pyspark "Insert Overwrite" section.

2023-08-11 Thread Amrish Lal (Jira)
Amrish Lal created HUDI-6685: Summary: Fix code typo in quick start guide under pyspark "Insert Overwrite" section. Key: HUDI-6685 URL: https://issues.apache.org/jira/browse/HUDI-6685 Project: Apache Hudi

[GitHub] [hudi] bhasudha commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
bhasudha commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291682097 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Current

[GitHub] [hudi] bhasudha commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
bhasudha commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291674781 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Current

[GitHub] [hudi] nsivabalan commented on a diff in pull request #9346: [DOCS] Update Indexing page with all index types and file layout page

2023-08-11 Thread via GitHub
nsivabalan commented on code in PR #9346: URL: https://github.com/apache/hudi/pull/9346#discussion_r1291627907 ## website/docs/indexing.md: ## @@ -20,34 +24,90 @@ _Figure: Comparison of merge cost for updates (yellow blocks) against base files ## Index Types in Hudi -Curre

[GitHub] [hudi] hudi-bot commented on pull request #9431: Bumping release candidate number 1

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9431: URL: https://github.com/apache/hudi/pull/9431#issuecomment-1675168634 ## CI report: * 0169afc0895d3d5fb7ad23a6ecf744cdf54a5a86 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1927

[GitHub] [hudi] bhasudha merged pull request #9429: [DOCS] Update clustering page with inline configs

2023-08-11 Thread via GitHub
bhasudha merged PR #9429: URL: https://github.com/apache/hudi/pull/9429 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

[hudi] branch asf-site updated: [DOCS] Update clustering page with inline configs (#9429)

2023-08-11 Thread bhavanisudha
This is an automated email from the ASF dual-hosted git repository. bhavanisudha pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new a0dc7e93b5e [DOCS] Update clustering pag

[GitHub] [hudi] yihua merged pull request #9428: [DOCS] Update compaction page

2023-08-11 Thread via GitHub
yihua merged PR #9428: URL: https://github.com/apache/hudi/pull/9428 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

[hudi] branch asf-site updated: [DOCS] Update compaction page (#9428)

2023-08-11 Thread yihua
This is an automated email from the ASF dual-hosted git repository. yihua pushed a commit to branch asf-site in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/asf-site by this push: new 735bfb16597 [DOCS] Update compaction page (#942

[GitHub] [hudi] hudi-bot commented on pull request #9431: Bumping release candidate number 1

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9431: URL: https://github.com/apache/hudi/pull/9431#issuecomment-1675118764 ## CI report: * 0169afc0895d3d5fb7ad23a6ecf744cdf54a5a86 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] prashantwason opened a new pull request, #9431: Bumping release candidate number 1

2023-08-11 Thread via GitHub
prashantwason opened a new pull request, #9431: URL: https://github.com/apache/hudi/pull/9431 ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any public API or user-facing feature change or any perfor

[hudi] 01/01: Bumping release candidate number 1

2023-08-11 Thread pwason
This is an automated email from the ASF dual-hosted git repository. pwason pushed a commit to branch release-0.14.0 in repository https://gitbox.apache.org/repos/asf/hudi.git commit 0169afc0895d3d5fb7ad23a6ecf744cdf54a5a86 Author: Prashant Wason AuthorDate: Tue Aug 8 14:12:17 2023 -0700 Bum

[hudi] branch release-0.14.0 updated (dddfe85f1c1 -> 0169afc0895)

2023-08-11 Thread pwason
This is an automated email from the ASF dual-hosted git repository. pwason pushed a change to branch release-0.14.0 in repository https://gitbox.apache.org/repos/asf/hudi.git discard dddfe85f1c1 Create release branch for version 0.14.0. new 0169afc0895 Bumping release candidate number 1 T

[hudi] branch master updated: [HUDI-6553] Speedup column stats and bloom index creation on large datasets. (#9223)

2023-08-11 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 05ac0113165 [HUDI-6553] Speedup column stats and b

[GitHub] [hudi] codope merged pull request #9223: [HUDI-6553] Speedup column stats and bloom index creation on large datasets.

2023-08-11 Thread via GitHub
codope merged PR #9223: URL: https://github.com/apache/hudi/pull/9223 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.or

[GitHub] [hudi] hudi-bot commented on pull request #9223: [HUDI-6553] Speedup column stats and bloom index creation on large datasets.

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9223: URL: https://github.com/apache/hudi/pull/9223#issuecomment-1674749373 ## CI report: * d773045840e52ecc767bfa8716a3a3287ee6aa93 UNKNOWN * 482f63ffe2df3fbaf0176a175b530082e0f31154 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2

[GitHub] [hudi] hudi-bot commented on pull request #9425: When invalidate the table in the spark sql query cache, verify if the…

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9425: URL: https://github.com/apache/hudi/pull/9425#issuecomment-1674643707 ## CI report: * fd8df70aa04ad91eb10da4cf1e94a7978626a40c Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1926

[GitHub] [hudi] bhasudha commented on pull request #9429: [DOCS] Update clustering page with inline configs

2023-08-11 Thread via GitHub
bhasudha commented on PR #9429: URL: https://github.com/apache/hudi/pull/9429#issuecomment-1674588209 Tested locally! ![Screenshot 2023-08-11 at 4 18 52 AM](https://github.com/apache/hudi/assets/2179254/a7174706-41a3-4415-b7e2-8cddd4e5e23f) ![Screenshot 2023-08-11 at 4 19 04 A

[GitHub] [hudi] bhasudha opened a new pull request, #9429: [DOCS] Update clustering page with inline configs

2023-08-11 Thread via GitHub
bhasudha opened a new pull request, #9429: URL: https://github.com/apache/hudi/pull/9429 ### Change Logs Update clustering page ### Impact Doc changes ### Risk level (write none, low medium or high below) Low ### Documentation Update _Describe any

[GitHub] [hudi] hudi-bot commented on pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
hudi-bot commented on PR #8437: URL: https://github.com/apache/hudi/pull/8437#issuecomment-1674563328 ## CI report: * 4fdb9dc536d97832f1dc16dd1c754ce7015b1bc6 UNKNOWN * 9c3005ffd94d8ed4d1d0e50ad1511b46365519ac UNKNOWN * 574867dbdf53d51424aae544ef7a3b25938ec6de Azure: [SUCCES

[GitHub] [hudi] bhasudha commented on pull request #9428: [DOCS] Update compaction page

2023-08-11 Thread via GitHub
bhasudha commented on PR #9428: URL: https://github.com/apache/hudi/pull/9428#issuecomment-1674525482 Tested locally. ![Screenshot 2023-08-11 at 3 18 01 AM](https://github.com/apache/hudi/assets/2179254/2792faf2-6061-4654-b65f-e38ac579f0e0) ![Screenshot 2023-08-11 at 3 18 21 AM](h

[GitHub] [hudi] bhasudha opened a new pull request, #9428: [DOCS] Update compaction page

2023-08-11 Thread via GitHub
bhasudha opened a new pull request, #9428: URL: https://github.com/apache/hudi/pull/9428 Summary: - Add high level context - Add inline configs with compaction strategies explained - Restructure flow ### Change Logs _Describe context and summary for this change. Highlig

[hudi] branch master updated: [HUDI-6663] New Parquet File Format remove broadcast to fix performance issue for complex file slices (#9409)

2023-08-11 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new f4832d9903f [HUDI-6663] New Parquet File Format re

[GitHub] [hudi] empcl commented on pull request #9425: When invalidate the table in the spark sql query cache, verify if the…

2023-08-11 Thread via GitHub
empcl commented on PR #9425: URL: https://github.com/apache/hudi/pull/9425#issuecomment-1674480250 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To u

[GitHub] [hudi] codope merged pull request #9409: [HUDI-6663] New Parquet File Format remove broadcast to fix performance issue for complex file slices

2023-08-11 Thread via GitHub
codope merged PR #9409: URL: https://github.com/apache/hudi/pull/9409 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.or

[GitHub] [hudi] xuzifu666 opened a new issue, #9427: [SUPPORT] bucket index mor table would list all files in merge into

2023-08-11 Thread via GitHub
xuzifu666 opened a new issue, #9427: URL: https://github.com/apache/hudi/issues/9427 **_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at dev-subscr

[GitHub] [hudi] SteNicholas commented on issue #9424: 'read.utc-timezone=false' has no effect on writes

2023-08-11 Thread via GitHub
SteNicholas commented on issue #9424: URL: https://github.com/apache/hudi/issues/9424#issuecomment-1674472732 @JingFengWang, you could use the `TIMESTAMP_LTZ` type to solve the above problem. I have tested with the `TIMESTAMP_LTZ` type and it worked well. -- This is an automated message from th

[GitHub] [hudi] Hans-Raintree opened a new issue, #9426: [SUPPORT] Reading CDC logs does not return partition path as a column.

2023-08-11 Thread via GitHub
Hans-Raintree opened a new issue, #9426: URL: https://github.com/apache/hudi/issues/9426 **Describe the problem you faced** When reading cdc logs the partition path is included in the before/after columns, but not as a top level column, so I can't filter to specific partitions before

[GitHub] [hudi] SteNicholas opened a new pull request, #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
SteNicholas opened a new pull request, #8437: URL: https://github.com/apache/hudi/pull/8437 ### Change Logs `HoodieTableSource` supports the implementation of `SupportsFilterPushDown` interface that pushes down the `CallExpression` filter into `FileIndex`. `HoodieTableSource` should

[GitHub] [hudi] SteNicholas closed pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
SteNicholas closed pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down URL: https://github.com/apache/hudi/pull/8437 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

[GitHub] [hudi] hudi-bot commented on pull request #9425: When invalidate the table in the spark sql query cache, verify if the…

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9425: URL: https://github.com/apache/hudi/pull/9425#issuecomment-1674371956 ## CI report: * fd8df70aa04ad91eb10da4cf1e94a7978626a40c Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=1926

[GitHub] [hudi] hudi-bot commented on pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
hudi-bot commented on PR #8437: URL: https://github.com/apache/hudi/pull/8437#issuecomment-1674369184 ## CI report: * 4fdb9dc536d97832f1dc16dd1c754ce7015b1bc6 UNKNOWN * 9c3005ffd94d8ed4d1d0e50ad1511b46365519ac UNKNOWN * c85fe5d97a90a9a93eac7d126b42657d65fb96b6 Azure: [SUCCES

[GitHub] [hudi] hudi-bot commented on pull request #9425: When invalidate the table in the spark sql query cache, verify if the…

2023-08-11 Thread via GitHub
hudi-bot commented on PR #9425: URL: https://github.com/apache/hudi/pull/9425#issuecomment-1674361960 ## CI report: * fd8df70aa04ad91eb10da4cf1e94a7978626a40c UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run the

[GitHub] [hudi] hudi-bot commented on pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
hudi-bot commented on PR #8437: URL: https://github.com/apache/hudi/pull/8437#issuecomment-1674360221 ## CI report: * 4fdb9dc536d97832f1dc16dd1c754ce7015b1bc6 UNKNOWN * 9c3005ffd94d8ed4d1d0e50ad1511b46365519ac UNKNOWN * c85fe5d97a90a9a93eac7d126b42657d65fb96b6 Azure: [SUCCES

[GitHub] [hudi] SteNicholas commented on a diff in pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
SteNicholas commented on code in PR #8437: URL: https://github.com/apache/hudi/pull/8437#discussion_r1291005999 ## hudi-flink-datasource/hudi-flink/src/test/java/org/apache/hudi/table/ITTestHoodieDataSource.java: ## @@ -1553,14 +1553,14 @@ void testBuiltinFunctionWithCatalog(Str

[GitHub] [hudi] empcl opened a new pull request, #9425: When invalidate the table in the spark sql query cache, verify if the…

2023-08-11 Thread via GitHub
empcl opened a new pull request, #9425: URL: https://github.com/apache/hudi/pull/9425 … hive-async database exists ### Change Logs When invalidate the table in the spark sql query cache, verify if the hive-async database exists ### Impact When invalidate the table in

[GitHub] [hudi] PhantomHunt commented on issue #9344: [SUPPORT] Getting error when writing to different HUDI tables in different threads in same job

2023-08-11 Thread via GitHub
PhantomHunt commented on issue #9344: URL: https://github.com/apache/hudi/issues/9344#issuecomment-1674331138 Yes, this file is not there in our S3 bucket. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [hudi] SteNicholas commented on pull request #8437: [HUDI-6066] HoodieTableSource supports parquet predicate push down

2023-08-11 Thread via GitHub
SteNicholas commented on PR #8437: URL: https://github.com/apache/hudi/pull/8437#issuecomment-1674329768 @danny0405, I have added a test case for the parquet predicate push down. PTAL. -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [hudi] empcl commented on a diff in pull request #9297: Generate test jars for hudi-utilities and hudi-hive-sync modules

2023-08-11 Thread via GitHub
empcl commented on code in PR #9297: URL: https://github.com/apache/hudi/pull/9297#discussion_r1290981444 ## hudi-sync/hudi-hive-sync/pom.xml: ## @@ -200,6 +200,9 @@ + + false + Review Comment: ` org.apa

[GitHub] [hudi] empcl commented on a diff in pull request #9297: Generate test jars for hudi-utilities and hudi-hive-sync modules

2023-08-11 Thread via GitHub
empcl commented on code in PR #9297: URL: https://github.com/apache/hudi/pull/9297#discussion_r1290981072 ## hudi-sync/hudi-hive-sync/pom.xml: ## @@ -200,6 +200,9 @@ + + false + Review Comment: org.apac

[GitHub] [hudi] leosanqing commented on a diff in pull request #9297: Generate test jars for hudi-utilities and hudi-hive-sync modules

2023-08-11 Thread via GitHub
leosanqing commented on code in PR #9297: URL: https://github.com/apache/hudi/pull/9297#discussion_r1290969842 ## hudi-sync/hudi-hive-sync/pom.xml: ## @@ -200,6 +200,9 @@ + + false + Review Comment: And when I add