Re: [PR] [HUDI-7198]Create nested node path if does not exist for zookeeper. [hudi]

2024-01-02 Thread via GitHub
rmahindra123 commented on PR #10438: URL: https://github.com/apache/hudi/pull/10438#issuecomment-1874959764 lgtm -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscr

Re: [PR] [MINOR] Fix usages of orElse [hudi]

2024-01-02 Thread via GitHub
yihua commented on code in PR #10435: URL: https://github.com/apache/hudi/pull/10435#discussion_r1440140227 ## hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala: ## @@ -107,23 +107,19 @@ object HoodieSparkUtils extends SparkAdapterSupport with

(hudi) branch master updated (12c26345f7c -> 1b74fc18fee)

2024-01-02 Thread vbalaji
This is an automated email from the ASF dual-hosted git repository. vbalaji pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 12c26345f7c [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql (#10414) add 1b74fc18fee [MI

Re: [PR] [MINOR] Fix ArchivalUtils Logger name [hudi]

2024-01-02 Thread via GitHub
bvaradar merged PR #10436: URL: https://github.com/apache/hudi/pull/10436 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apach

[PR] [HUDI-6207] spark support bucket index query for table with bucket index [hudi]

2024-01-02 Thread via GitHub
KnightChess opened a new pull request, #10191: URL: https://github.com/apache/hudi/pull/10191 ### Change Logs spark support query filter use bucket field if a bucket table query with appropriate expression( = 、in、and、or) ### Impact impore table query performance when use

Re: [PR] [HUDI-6207] spark support bucket index query for table with bucket index [hudi]

2024-01-02 Thread via GitHub
KnightChess closed pull request #10191: [HUDI-6207] spark support bucket index query for table with bucket index URL: https://github.com/apache/hudi/pull/10191 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[jira] [Updated] (HUDI-7265) Support schema evolution by Flink SQL using HoodieHiveCatalog

2024-01-02 Thread Jing Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhang updated HUDI-7265: - Component/s: flink-sql > Support schema evolution by Flink SQL using HoodieHiveCatalog > -

[jira] [Updated] (HUDI-7270) Support schema evolution by Flink SQL using HoodieCatalog

2024-01-02 Thread Jing Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhang updated HUDI-7270: - Component/s: flink-sql > Support schema evolution by Flink SQL using HoodieCatalog > -

[jira] [Updated] (HUDI-7270) Support schema evolution by Flink SQL using HoodieCatalog

2024-01-02 Thread Jing Zhang (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jing Zhang updated HUDI-7270: - Description: Since Flink 1.17, Flink SQL support more advanced alter table syntax. {code:sql} -- add a ne

[jira] [Created] (HUDI-7270) Support schema evolution by Flink SQL using HoodieCatalog

2024-01-02 Thread Jing Zhang (Jira)
Jing Zhang created HUDI-7270: Summary: Support schema evolution by Flink SQL using HoodieCatalog Key: HUDI-7270 URL: https://issues.apache.org/jira/browse/HUDI-7270 Project: Apache Hudi Issue Typ

[PR] Create nested node path if does not exist for zookeeper. [hudi]

2024-01-02 Thread via GitHub
harsh1231 opened a new pull request, #10438: URL: https://github.com/apache/hudi/pull/10438 Catch KeeperException if node already exist. ### Change Logs _Describe context and summary for this change. Highlight if any code was copied._ ### Impact _Describe any publ

[jira] [Closed] (HUDI-7261) Add TVF to query hudi file system view through spark-sql

2024-01-02 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit closed HUDI-7261. - Resolution: Done > Add TVF to query hudi file system view through spark-sql >

[jira] [Updated] (HUDI-7261) Add TVF to query hudi file system view through spark-sql

2024-01-02 Thread Sagar Sumit (Jira)
[ https://issues.apache.org/jira/browse/HUDI-7261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sagar Sumit updated HUDI-7261: -- Fix Version/s: 1.0.0 > Add TVF to query hudi file system view through spark-sql > --

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
codope merged PR #10414: URL: https://github.com/apache/hudi/pull/10414 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

(hudi) branch master updated: [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql (#10414)

2024-01-02 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 12c26345f7c [HUDI-7261] TVF to query hudi table's

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874929248 ## CI report: * 904c660c2821c6cf77dcb1e5a308391b14eecb53 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

(hudi) branch master updated (6a042255555 -> d15993b36be)

2024-01-02 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git from 6a04225 [HUDI-7198] Create nested node path if does not exist for zookeeper. (#10281) add d15993b36be Revert "

Re: [PR] Revert "[HUDI-7198]Create nested node path if does not exist for zookeeper." [hudi]

2024-01-02 Thread via GitHub
codope merged PR #10437: URL: https://github.com/apache/hudi/pull/10437 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

(hudi) branch master updated: [HUDI-7198] Create nested node path if does not exist for zookeeper. (#10281)

2024-01-02 Thread codope
This is an automated email from the ASF dual-hosted git repository. codope pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 6a04225 [HUDI-7198] Create nested node path if

Re: [PR] [HUDI-7198]Create nested node path if does not exist for zookeeper. [hudi]

2024-01-02 Thread via GitHub
codope merged PR #10281: URL: https://github.com/apache/hudi/pull/10281 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.

Re: [I] [SUPPORT] Failed to create marker file Exception when trying to write data on Hudi [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10432: URL: https://github.com/apache/hudi/issues/10432#issuecomment-1874904475 @gsudhanshu After setting these, it should not use timeline server. Do you still see references of TimelineServerBasedWriteMarkers in the stack trace? can you paste the new

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1874882886 ## CI report: * c64e1e3a9816b278606ee32aede728ffb928708c Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [I] [SUPPORT] Clean action failure triggers an exception while trying to check whether metadata is a table [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10127: URL: https://github.com/apache/hudi/issues/10127#issuecomment-1874852826 @shubhamn21 Thanks for the update. Yes, having multiple writer without lock provider can cause inconsistent behaviour and create this kind of issues. -- This is an automated me

Re: [I] Getting error while connecting to Hudi(CLI) 0.14.0 tables. [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10249: URL: https://github.com/apache/hudi/issues/10249#issuecomment-1874851371 @jjjigar Sorry for the delay here. I will work on this in this week. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitH

Re: [I] Getting error while connecting to Hudi(CLI) 0.14.0 tables. [hudi]

2024-01-02 Thread via GitHub
jjjigar commented on issue #10249: URL: https://github.com/apache/hudi/issues/10249#issuecomment-1874828878 Any update please? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

Re: [PR] [MINOR] Avoid resource leaks [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10345: URL: https://github.com/apache/hudi/pull/10345#issuecomment-1874828489 ## CI report: * c637ed283ad26d4b97d46c7ddedb3858a5744831 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=2

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874828373 ## CI report: * 6908f7cfde32ce14fbc3b73dee9ceace749a8abe Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [MINOR] Avoid resource leaks [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10345: URL: https://github.com/apache/hudi/pull/10345#issuecomment-1874825437 ## CI report: * c637ed283ad26d4b97d46c7ddedb3858a5744831 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=2

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874825301 ## CI report: * 6908f7cfde32ce14fbc3b73dee9ceace749a8abe Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1874800023 ## CI report: * 502d354dd4ddb15b8fe6e9c9a42973d8299fdb6d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874799885 ## CI report: * cd562d6b1f2ded014670a9a765248013f21d49c1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1874796616 ## CI report: * 502d354dd4ddb15b8fe6e9c9a42973d8299fdb6d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
bhat-vinay commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1874797572 Thanks for the review @bvaradar. @codope pointed that the failing tests could be fixed by https://github.com/apache/hudi/pull/10381. Rebased past it to see if I can get a clean run. -

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874796476 ## CI report: * cd562d6b1f2ded014670a9a765248013f21d49c1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874792952 ## CI report: * 6908f7cfde32ce14fbc3b73dee9ceace749a8abe Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

(hudi) branch master updated: [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark (#10381)

2024-01-02 Thread xushiyan
This is an automated email from the ASF dual-hosted git repository. xushiyan pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git The following commit(s) were added to refs/heads/master by this push: new 98616b1196b [HUDI-7244] Ensure HoodieFileGroupRe

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
xushiyan merged PR #10381: URL: https://github.com/apache/hudi/pull/10381 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apach

Re: [PR] [MINOR] Fix usages of orElse [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10435: URL: https://github.com/apache/hudi/pull/10435#issuecomment-1874763454 ## CI report: * 402d1eb5d0ea586d3a4afbf736dbb843809f7bb4 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=2

Re: [I] hoodie.bulkinsert.shuffle.parallelism Not activated [hudi]

2024-01-02 Thread via GitHub
zhangjw123321 commented on issue #10418: URL: https://github.com/apache/hudi/issues/10418#issuecomment-1874749412 通过这个链接下载https://dlcdn.apache.org/hudi/0.14.0/hudi-0.14.0.src.tgz,maven编辑的hudi-spark3.2-bundle_2.12-0.14.0.jar -- This is an automated message from the Apache Git Service.

Re: [I] hoodie.bulkinsert.shuffle.parallelism Not activated [hudi]

2024-01-02 Thread via GitHub
zhangjw123321 commented on issue #10418: URL: https://github.com/apache/hudi/issues/10418#issuecomment-1874749317 Which stage is deleting duplicate records,Other than the above configuration, no other configuration is manually set。 -- This is an automated message from the Apache Git Servi

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874710579 ## CI report: * 632755327f46883194b8da0f42c9b06d88a9cce4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874705088 ## CI report: * 632755327f46883194b8da0f42c9b06d88a9cce4 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [MINOR] Fix usages of orElse [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10435: URL: https://github.com/apache/hudi/pull/10435#issuecomment-1874674372 ## CI report: * 072ac266eb2cf4b81d62cafda0c2579b88b43d5b Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=2

Re: [PR] [MINOR] Fix usages of orElse [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10435: URL: https://github.com/apache/hudi/pull/10435#issuecomment-1874669320 ## CI report: * 072ac266eb2cf4b81d62cafda0c2579b88b43d5b Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=2

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
jonvex commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874662101 Azure CI is passsing: https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21791 -- This is an automated message from the Apache Git Servic

Re: [I] [SUPPORT] Clean action failure triggers an exception while trying to check whether metadata is a table [hudi]

2024-01-02 Thread via GitHub
shubhamn21 commented on issue #10127: URL: https://github.com/apache/hudi/issues/10127#issuecomment-1874657544 Hi @ad1happy2go, Thanks for responding. Yes, I did have multiple executors at one point writing to the table (kafka streaming job). But I recently limited my deployments to 1

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874624757 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ffcf47d27b84e8301b7a9b986d8df69257e220a3 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874572982 ## CI report: * e24ea448ae3743cc48798b4640a93a30a2e6270e Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874561587 ## CI report: * e24ea448ae3743cc48798b4640a93a30a2e6270e Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874542310 ## CI report: * e24ea448ae3743cc48798b4640a93a30a2e6270e Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [] CVE-2023-44487 Upgrade jetty and exclude older jetty [hudi]

2024-01-02 Thread via GitHub
CTTY commented on PR #10223: URL: https://github.com/apache/hudi/pull/10223#issuecomment-1874534132 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To u

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874480730 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * 9819ca4db7b4ab9f2476aecc753e3fcc09c7cb7a Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874430037 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * de4e4ccc30e75153d16bd322447311f3233d3579 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874421122 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * c96dd5a7fe1951aabe9df3ca283e7858153b21c2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874369688 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * c96dd5a7fe1951aabe9df3ca283e7858153b21c2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874359251 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * 8fd105afa86dc4d815dd94d7a55bca5bb85031d2 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874359461 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ebdeeb3f45cad66a7eaa120e5f6ecc5cc6e3ddd7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874359011 ## CI report: * cd562d6b1f2ded014670a9a765248013f21d49c1 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874311524 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ebdeeb3f45cad66a7eaa120e5f6ecc5cc6e3ddd7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874302126 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ebdeeb3f45cad66a7eaa120e5f6ecc5cc6e3ddd7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-4552][RFC-58] Integrate column stats index with all query engines [hudi]

2024-01-02 Thread via GitHub
pratyakshsharma commented on code in PR #6345: URL: https://github.com/apache/hudi/pull/6345#discussion_r1439625464 ## rfc/rfc-58/rfc-58.md: ## @@ -0,0 +1,69 @@ + +# RFC-58: Integrate column stats index with all query engines + + + +## Proposers + +- @pratyakshsharma + +## Appro

Re: [PR] [HUDI-4552][RFC-58] Integrate column stats index with all query engines [hudi]

2024-01-02 Thread via GitHub
pratyakshsharma commented on code in PR #6345: URL: https://github.com/apache/hudi/pull/6345#discussion_r1439623738 ## rfc/rfc-58/rfc-58.md: ## @@ -0,0 +1,69 @@ + +# RFC-58: Integrate column stats index with all query engines + + + +## Proposers + +- @pratyakshsharma + +## Appro

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874240503 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * fa4b20f1f5cabcd03bc488badaa4e97d26da49c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874228173 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ebdeeb3f45cad66a7eaa120e5f6ecc5cc6e3ddd7 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874227930 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * fa4b20f1f5cabcd03bc488badaa4e97d26da49c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874206870 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * 987684e6b4e397d92c14d523d933f735a8a75984 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [I] [SUPPORT] Hudi 0.13.1 on EMR, MOR table writer hangs intermittently with S3 read timeout error for column stats index [hudi]

2024-01-02 Thread via GitHub
ergophobiac commented on issue #10415: URL: https://github.com/apache/hudi/issues/10415#issuecomment-1874167209 Hey @ad1happy2go, we have a test case running, we'll observe till we're sure it's stable and let you know how it turns out. -- This is an automated message from the Apache Git S

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874152589 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ea196eb78876cb4761ccf181131a179ed1c25fa5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874152332 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * fa4b20f1f5cabcd03bc488badaa4e97d26da49c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874152069 ## CI report: * f40f52205a2da4f28b1c3c7300f2551a6699657d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7244] Ensure HoodieFileGroupReader.close() is called in spark [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10381: URL: https://github.com/apache/hudi/pull/10381#issuecomment-1874140814 ## CI report: * 33a87e77b985a8fd3fe0a6a997059ee20fbedb8b UNKNOWN * fa4b20f1f5cabcd03bc488badaa4e97d26da49c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1874140534 ## CI report: * f40f52205a2da4f28b1c3c7300f2551a6699657d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7265] Support schema evolution by Flink SQL using HoodieHiveCatalog [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10426: URL: https://github.com/apache/hudi/pull/10426#issuecomment-1874129904 ## CI report: * b4a68ad41cfe6d582dea52aea53d9f4b96341f26 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [I] [SUPPORT] EMR on EKS version 6.15.0, Spark 3.4.1 and Hudi 0.14.0 getting java.io.IOException: Failed to delete: /usr/lib/hudi/. [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10376: URL: https://github.com/apache/hudi/issues/10376#issuecomment-1874094831 @Lakshmi-Holla12 Were you able to resolve this issue? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [I] [SUPPORT] Hudi 0.13.1 on EMR, MOR table writer hangs intermittently with S3 read timeout error for column stats index [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10415: URL: https://github.com/apache/hudi/issues/10415#issuecomment-1874092759 @ergophobiac Did you got a chance to try this out? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

Re: [I] hoodie.bulkinsert.shuffle.parallelism Not activated [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10418: URL: https://github.com/apache/hudi/issues/10418#issuecomment-1874091034 @zhangjw123321 Its going in deduping records. For bulk insert it doesn't dedup with the default configs. Are you setting any other configs? -- This is an automated message from

Re: [I] [SUPPORT] Kafka connect sink to S3 authentification parameters [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10428: URL: https://github.com/apache/hudi/issues/10428#issuecomment-1874091735 @akolyaga Did this suggestion worked? Do let us know. Thanks. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874077677 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ea196eb78876cb4761ccf181131a179ed1c25fa5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [HUDI-6787] Implement the HoodieFileGroupReader API for Hive [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10422: URL: https://github.com/apache/hudi/pull/10422#issuecomment-1874068336 ## CI report: * 99517e23baa60a6a0602e9daf7f522f3c1dcfa1e UNKNOWN * ea196eb78876cb4761ccf181131a179ed1c25fa5 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [PR] [MINOR] Fix ArchivalUtils Logger name [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10436: URL: https://github.com/apache/hudi/pull/10436#issuecomment-1873996433 ## CI report: * 456b3982f0fd8622cf3099f4b982a91f9b739978 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1873996038 ## CI report: * f40f52205a2da4f28b1c3c7300f2551a6699657d Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [I] [SUPPORT] Failed to create marker file Exception when trying to write data on Hudi [hudi]

2024-01-02 Thread via GitHub
gsudhanshu commented on issue #10432: URL: https://github.com/apache/hudi/issues/10432#issuecomment-1873985101 @ad1happy2go thanks for your reply. I have added as following: ![image](https://github.com/apache/hudi/assets/45429552/a3516567-ba3c-47ec-aaff-56489d025cf8) but still g

Re: [I] [SUPPORT] Failed to create marker file Exception when trying to write data on Hudi [hudi]

2024-01-02 Thread via GitHub
ad1happy2go commented on issue #10432: URL: https://github.com/apache/hudi/issues/10432#issuecomment-1873953371 @gsudhanshu Can you try disabling the timeline server? hoodie.write.markers.type= 'direct', hoodie.embed.timeline.server= 'false' We had a silmilar issue (https://

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1873940685 ## CI report: * 502d354dd4ddb15b8fe6e9c9a42973d8299fdb6d Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7261] TVF to query hudi table's filesystem state through spark-sql [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10414: URL: https://github.com/apache/hudi/pull/10414#issuecomment-1873933395 ## CI report: * 502d354dd4ddb15b8fe6e9c9a42973d8299fdb6d UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [PR] [HUDI-7265] Support schema evolution by Flink SQL using HoodieHiveCatalog [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10426: URL: https://github.com/apache/hudi/pull/10426#issuecomment-1873894740 ## CI report: * 037c96b19f25dd81a2c924e7770a535a6a1843c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7265] Support schema evolution by Flink SQL using HoodieHiveCatalog [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10426: URL: https://github.com/apache/hudi/pull/10426#issuecomment-187388 ## CI report: * 037c96b19f25dd81a2c924e7770a535a6a1843c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1873886233 ## CI report: * a251c7e686efd96ac6d2a07f0b95e6383add2ad9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7265] Support schema evolution by Flink SQL using HoodieHiveCatalog [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10426: URL: https://github.com/apache/hudi/pull/10426#issuecomment-1873877758 ## CI report: * 037c96b19f25dd81a2c924e7770a535a6a1843c8 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7144] Build storage partition stats index and use it for data skipping [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10352: URL: https://github.com/apache/hudi/pull/10352#issuecomment-1873877321 ## CI report: * a251c7e686efd96ac6d2a07f0b95e6383add2ad9 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [HUDI-7208] Do writing stage should shutdown with error when insert failed to reduce user execute time and show error details [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10297: URL: https://github.com/apache/hudi/pull/10297#issuecomment-1873877127 ## CI report: * 05bed31829f2362de479344215d29ccca99bd449 UNKNOWN * 855ede2626b0c95d0649a939aad51de277562fa5 Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4

Re: [I] hoodie.bulkinsert.shuffle.parallelism Not activated [hudi]

2024-01-02 Thread via GitHub
zhangjw123321 commented on issue #10418: URL: https://github.com/apache/hudi/issues/10418#issuecomment-1873836778 ![image](https://github.com/apache/hudi/assets/154970920/084e9134-9356-4c15-b489-4a420cfcbec2) -- This is an automated message from the Apache Git Service. To respond to t

Re: [PR] [MINOR] Fix ArchivalUtils Logger name [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10436: URL: https://github.com/apache/hudi/pull/10436#issuecomment-1873829816 ## CI report: * 456b3982f0fd8622cf3099f4b982a91f9b739978 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=21

Re: [PR] [MINOR] Fix ArchivalUtils Logger name [hudi]

2024-01-02 Thread via GitHub
hudi-bot commented on PR #10436: URL: https://github.com/apache/hudi/pull/10436#issuecomment-1873821139 ## CI report: * 456b3982f0fd8622cf3099f4b982a91f9b739978 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure` re-run th

Re: [I] hoodie.bulkinsert.shuffle.parallelism Not activated [hudi]

2024-01-02 Thread via GitHub
zhangjw123321 commented on issue #10418: URL: https://github.com/apache/hudi/issues/10418#issuecomment-1873811228 ![image](https://github.com/apache/hudi/assets/154970920/fa708f0b-f7ad-45a7-8cb3-ddf538504668) ![image](https://github.com/apache/hudi/assets/154970920/6c9264cb-4821-4af5-b95

[PR] [MINOR] Fix ArchivalUtils Logger name [hudi]

2024-01-02 Thread via GitHub
eric9204 opened a new pull request, #10436: URL: https://github.com/apache/hudi/pull/10436 ### Change Logs None ### Impact None ### Risk level (write none, low medium or high below) None ### Documentation Update None - _The config descri

Re: [PR] [HUDI-7266] add clustering metric for flink [hudi]

2024-01-02 Thread via GitHub
stream2000 commented on code in PR #10420: URL: https://github.com/apache/hudi/pull/10420#discussion_r1439252905 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/clustering/ClusteringPlanOperator.java: ## @@ -88,10 +96,20 @@ public void notifyCheckpointComp

Re: [PR] [HUDI-7266] add clustering metric for flink [hudi]

2024-01-02 Thread via GitHub
stream2000 commented on code in PR #10420: URL: https://github.com/apache/hudi/pull/10420#discussion_r1439252905 ## hudi-flink-datasource/hudi-flink/src/main/java/org/apache/hudi/sink/clustering/ClusteringPlanOperator.java: ## @@ -88,10 +96,20 @@ public void notifyCheckpointComp